Dbfetch

ID   AK153249; SV 1; linear; mRNA; HTC; MUS; 3286 BP.
XX
AC   AK153249;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus bone marrow macrophage cDNA, RIKEN full-length enriched
DE   library, clone:I830129H23 product:valosin containing protein, full insert
DE   sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3286
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 9ac6d1197fafa1b8fa6df1101983d7f5.
DR   Ensembl-Gn; ENSMUSG00000028452; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0028196; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0028152; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0028104; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0028172; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0027894; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0028613; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0027334; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_FVBNJ_G0027976; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0028108; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0028649; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0027060; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0027414; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000030164; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0065244; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0065290; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0065186; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0065220; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0064895; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0065689; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0065206; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_FVBNJ_T0064901; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0065077; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0065865; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0064742; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0064104; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Tissues were provided by David A. Hume ( Depts. of Biochemistry
CC   and Microbiology/Parasitology Institute for Molecular Bioscience
CC   University of Queensland Brisbane,Q 4072 Australia ) whose
CC   assistance we gratefully acknowledge.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=I830129H23
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3286
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="I830129H23"
FT                   /cell_type="macrophage"
FT                   /tissue_type="bone marrow"
FT                   /db_xref="taxon:10090"
FT   CDS             301..2721
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="valosin containing protein (MGD|MGI:99919
FT                   GB|NM_009503, evidence: BLASTN, 99%, match=3242)"
FT                   /db_xref="GOA:Q01853"
FT                   /db_xref="InterPro:IPR003338"
FT                   /db_xref="InterPro:IPR003593"
FT                   /db_xref="InterPro:IPR003959"
FT                   /db_xref="InterPro:IPR003960"
FT                   /db_xref="InterPro:IPR004201"
FT                   /db_xref="InterPro:IPR005938"
FT                   /db_xref="InterPro:IPR009010"
FT                   /db_xref="InterPro:IPR015415"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="InterPro:IPR029067"
FT                   /db_xref="MGI:MGI:99919"
FT                   /db_xref="PDB:1E32"
FT                   /db_xref="PDB:1R7R"
FT                   /db_xref="PDB:1S3S"
FT                   /db_xref="PDB:2PJH"
FT                   /db_xref="PDB:3CF0"
FT                   /db_xref="PDB:3CF1"
FT                   /db_xref="PDB:3CF2"
FT                   /db_xref="PDB:3CF3"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q01853"
FT                   /protein_id="BAE31840.1"
FT                   /translation="MASGADSKGDDLSTAILKQKNRPNRLIVDEAINEDNSVVSLSQPK
FT                   MDELQLFRGDTVLLKGKKRREAVCIVLSDDTCSDEKIRMNRVVRNNLRVRLGDVISIQP
FT                   CPDVKYGKRIHVLPIDDTVEGITGNLFEVYLKPYFLEAYRPIRKGDIFLVRGGMRAVEF
FT                   KVVETDPSPYCIVAPDTVIHCEGEPIKREDEEESLNEVGYDDIGGCRKQLAQIKEMVEL
FT                   PLRHPALFKAIGVKPPRGILLYGPPGTGKTLIARAVANETGAFFFLINGPEIMSKLAGE
FT                   SESNLRKAFEEAEKNAPAIIFIDELDAIAPKREKTHGEVERRIVSQLLTLMDGLKQRAH
FT                   VIVMAATNRPNSIDPALRRFGRFDREVDIGIPDATGRLEILQIHTKNMKLADDVDLEQV
FT                   ANETHGHVGADLAALCSEAALQAIRKKMDLIDLEDETIDAEVMNSLAVTMDDFRWALSQ
FT                   SNPSALRETVVEVPQVTWEDIGGLEDVKRELQELVQYPVEHPDKFLKFGMTPSKGVLFY
FT                   GPPGCGKTLLAKAIANECQANFISIKGPELLTMWFGESEANVREIFDKARQAAPCVLFF
FT                   DELDSIAKARGGNIGDGGGAADRVINQILTEMDGMSTKKNVFIIGATNRPDIIDPAILR
FT                   PGRLDQLIYIPLPDEKSRVAILKANLRKSPVAKDVDLEFLAKMTNGFSGADLTEICQRA
FT                   CKLAIRESIESEIRRERERQTNPSAMEVEEDDPVPEIRRDHFEEAMRFARRSVSDNDIR
FT                   KYEMFAQTLQQSRGFGSFRFPSGNQGGAGPSQGSGGGTGGSVYTEDNDDDLYG"
FT   regulatory      3265..3270
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      3286
FT                   /note="putative"
XX
SQ   Sequence 3286 BP; 843 A; 736 C; 898 G; 809 T; 0 other;
     ggttagagca gctttccttc cgatgattcg gcttttctcg gttcagtctc cgtgaagcgt        60
     ttgcagccgt cgtttgatta gtcgctgccc ccctcgcgga ttaggagcta gcgtctcccg       120
     cccgcccgcc tgccgccccg gtgcccctgg gaggaagcga gagggaggct gccagagggt       180
     ttgtcactgc tgttgctcct ccgcctcagc gagtccagcc ccggcctagt cggtcgcctg       240
     cctttctcat agccgttacc ctcaggccgc cacagccgcc gaccgggaga ggcgcgcgcc       300
     atggcctctg gagccgattc aaaaggtgat gatttatcaa cagccattct caaacagaag       360
     aaccgaccca atcggttaat tgttgatgaa gccatcaatg aagataacag cgtggtgtcc       420
     ttgtcccagc ccaagatgga tgaactgcag ttgttccgag gtgacacggt gttgctaaaa       480
     ggaaagaaaa gacgggaagc tgtatgcatt gttctttctg atgacacgtg ttctgatgag       540
     aagattcgaa tgaatagagt tgttcggaat aacctccgag ttcgcctagg agatgtcatc       600
     agcatccagc catgccctga tgtaaagtat ggcaaacgta tccacgttct acccatcgat       660
     gacacagtgg aaggcatcac tggcaatctc tttgaggtat accttaagcc gtacttcctg       720
     gaagcttatc ggcccatccg taaaggagat atttttcttg tccggggtgg gatgcgtgct       780
     gtggagttca aagttgtaga gacagatccc agcccttact gtattgttgc tccagacaca       840
     gtgatccact gtgaggggga gccaatcaag cgagaggatg aggaggaatc cttgaatgaa       900
     gtaggctatg atgacatcgg tggttgcagg aagcagctag ctcagataaa ggagatggtg       960
     gagctgccac tgagacatcc tgcgctcttt aaggcgattg gtgtaaagcc tcctcgggga      1020
     atcttgttgt atgggcctcc tgggacaggg aagaccctga ttgctcgagc tgtggcaaat      1080
     gaaactggag ccttcttctt tctgatcaat ggtcctgaaa tcatgagcaa attggctggt      1140
     gagtctgaga gcaaccttcg taaagccttt gaggaagctg aaaagaatgc tcctgctatc      1200
     atcttcatcg atgagcttga tgccattgca cccaaaagag agaaaactca tggggaagtg      1260
     gagcgtcgca tcgtgtctca gttgttgacc ctcatggatg gcctaaagca gagagcacat      1320
     gtgatagtta tggcagcaac caatagaccc aacagcattg acccagccct acggcgattt      1380
     ggtcgctttg acagagaggt agatattgga atacctgatg ctacaggacg tttggagatt      1440
     cttcagatcc ataccaagaa catgaaactg gcagatgatg tggacttgga acaggtagcc      1500
     aatgagactc atggtcatgt tggtgctgat ttggcagccc tatgttcaga ggctgctctg      1560
     caggccatcc ggaaaaaaat ggacctcatt gacctagaag atgagaccat tgatgctgag      1620
     gtcatgaatt ccctggcagt tactatggat gacttccggt gggctttgag tcaaagcaac      1680
     ccatcagcac ttcgggaaac tgtggtagag gtgccacaag taacctggga agatattgga      1740
     ggcctggagg atgtcaaacg tgagcttcag gagttggttc agtatcctgt ggaacatcca      1800
     gacaaattcc tcaaatttgg catgactccc tccaaaggcg ttcttttcta tggacctcct      1860
     ggctgtggga aaaccttact ggctaaagcc attgctaatg aatgccaggc caacttcatc      1920
     tccatcaagg gtcctgagct gcttaccatg tggtttgggg aatctgaggc caatgtccgg      1980
     gaaatttttg acaaggcacg gcaagctgcc ccctgtgtac tcttctttga tgagttagat      2040
     tcaattgcca aggctcgagg tggtaatatt ggagatggtg gtggagctgc tgaccgagtc      2100
     atcaatcaga tcctgacaga aatggatggc atgtctacaa aaaagaatgt gtttatcatt      2160
     ggagctacca acaggcctga catcattgat cctgctatcc taagacctgg ccgtctagat      2220
     cagctcattt atatcccact tcctgatgag aagtcccgtg ttgccatcct aaaagccaat      2280
     ctgcgaaagt ccccagttgc caaggatgtg gatttggagt tcctggctaa gatgactaat      2340
     ggcttttctg gagctgattt gacagaaatt tgccaacggg cttgtaaact ggccattcgt      2400
     gaatctattg agagtgagat taggcgagaa cgagagaggc agacaaatcc atcggctatg      2460
     gaggtagaag aggacgatcc agtgcctgag atccgcagag atcactttga ggaagccatg      2520
     cgttttgccc gacgttctgt cagcgataat gacattcgga agtatgaaat gttcgcccag      2580
     acactgcagc agagtcgagg ttttggcagc ttcagattcc cttcagggaa ccagggtgga      2640
     gctggtccca gccagggcag tggaggtggc acaggtggca gtgtgtacac agaagacaat      2700
     gacgatgacc tgtatggcta agtgatgtgc cagcatgcag cgagctggcc tggctggacc      2760
     ttgttccctg ggggaggggg cgcttgccca agagggacca ggggtgtgcc catggcctgt      2820
     tccattcctc agtctgaaca gttcagcccc agtcagactc tggacagggg tttcctgttg      2880
     caaaaaaaaa aaattacaaa agcgataaaa taaaagtgat tttcatttgg gaggtgaaga      2940
     gtgaattacc agcaaggaat tgggtcttgg gcccacacgg tttctgtcgt agtttggggt      3000
     ggtgcaggta acctgtgtgg tgtgaaccaa ggcattgcca ccaccatcac cacagtaaag      3060
     catctatact caatgctgtc caagtcctcc cttaccctag ccaacctggg taggtggatg      3120
     aggggcctca gtttgctggg tgtttatata gaaagtaggt tgatttttat tttacatgct      3180
     tttgagttac tgttggaaga ttaatcataa gcagtttcta aaccaaaaaa aagaagaaaa      3240
     aaaaaagaca tgttgtaaaa ggacaataaa tgttgggtca aaatgg                     3286
//