ID   AK167794; SV 1; linear; mRNA; HTC; MUS; 2858 BP.
XX
AC   AK167794;
XX
DT   09-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 10)
XX
DE   Mus musculus TIB-55 BB88 cDNA, RIKEN full-length enriched library,
DE   clone:I730026E20 product:valosin containing protein, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2858
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (14-APR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 54a73d0ef547bf454dae8a99cc674652.
DR   Ensembl-Gn; ENSMUSG00000028452; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0028196; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0028152; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0028104; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0028172; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0027894; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0028613; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0027334; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_FVBNJ_G0027976; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0028108; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0028649; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0027060; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0027414; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000030164; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0065244; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0065290; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0065186; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0065220; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0064895; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0065689; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0065206; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_FVBNJ_T0064901; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0065077; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0065865; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0064742; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0064104; mus_musculus_wsbeij.
XX
CC   The cDNA library was prepared and sequenced in the Mouse Genome
CC   Encyclopedia Project of the Genome Exploration Research Group at the
CC   RIKEN Genomic Sciences Center and the Genome Science Laboratory at RIKEN.
CC   The Division of Experimental Animal Research at RIKEN contributed by
CC   preparing mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=I730026E20
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2858
FT                   /organism="Mus musculus"
FT                   /strain="BALB/C"
FT                   /mol_type="mRNA"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="I730026E20"
FT                   /cell_line="TIB-55 BB88"
FT                   /db_xref="taxon:10090"
FT   CDS             278..2698
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="valosin containing protein (MGD|MGI:99919
FT                   GB|BC049114, evidence: BLASTN, 99%, match=2855)"
FT                   /db_xref="GOA:Q01853"
FT                   /db_xref="InterPro:IPR003338"
FT                   /db_xref="InterPro:IPR003593"
FT                   /db_xref="InterPro:IPR003959"
FT                   /db_xref="InterPro:IPR003960"
FT                   /db_xref="InterPro:IPR004201"
FT                   /db_xref="InterPro:IPR005938"
FT                   /db_xref="InterPro:IPR009010"
FT                   /db_xref="InterPro:IPR015415"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="InterPro:IPR029067"
FT                   /db_xref="MGI:MGI:99919"
FT                   /db_xref="PDB:1E32"
FT                   /db_xref="PDB:1R7R"
FT                   /db_xref="PDB:1S3S"
FT                   /db_xref="PDB:2PJH"
FT                   /db_xref="PDB:3CF0"
FT                   /db_xref="PDB:3CF1"
FT                   /db_xref="PDB:3CF2"
FT                   /db_xref="PDB:3CF3"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q01853"
FT                   /protein_id="BAE39824.1"
FT                   /translation="MASGADSKGDDLSTAILKQKNRPNRLIVDEAINEDNSVVSLSQPK
FT                   MDELQLFRGDTVLLKGKKRREAVCIVLSDDTCSDEKIRMNRVVRNNLRVRLGDVISIQP
FT                   CPDVKYGKRIHVLPIDDTVEGITGNLFEVYLKPYFLEAYRPIRKGDIFLVRGGMRAVEF
FT                   KVVETDPSPYCIVAPDTVIHCEGEPIKREDEEESLYEVGYDDIGGCRKQLAQIKEMVEL
FT                   PLRHPALFKAIGVKPPRGILLYGPPGTGKTLIARAVANETGAFFFLINGPEIMSKLAGE
FT                   SESNLRKAFEEAEKNAPAIIFIDELDAIAPKREKTHGEVERRIVSQLLTLMDGLKQRAH
FT                   VIVMAATNRPNSIDPALRRFGRFDREVDIGIPDATGRLEILQIHTKNMKLADDVDLEQV
FT                   ANETHGHVGADLAALCSEAALQAIRKKMDLIDLEDETIDAEVMNSLAVTMDDFRWALSQ
FT                   SNPSALRETVVEVPQVTWEDIGGLEDVKRELQELVQYPVEHPDKFLKFGMTPSKGVLFY
FT                   GPPGCGKTLLAKAIANECQANFISIKGPELLTMWFGESEANVREIFDKARQAAPCVLFF
FT                   DELDSIAKARGGNIGDGGGAADRVINQILTEMDGMSTKKNVFIIGATNRPDIIDPAILR
FT                   PGRLDQLIYIPLPDEKSRVAILKANLRKSPVAKDVDLEFLAKMTNGFSGADLTEICQRA
FT                   CKLAIRESIESEIRRERERQTNPSAMEVEEDDPVPEIRRDHFEEAMRFARRSVSDNDIR
FT                   KYEMFAQTLQQSRGFGSFRFPSGNQGGAGPSQGSGGGTGGSVYTEDNDDDLYG"
XX
SQ   Sequence 2858 BP; 702 A; 666 C; 797 G; 693 T; 0 other;
     ggcttcggct tttctcggtt cagtctccgt gaagcgtttg cagccgtcgt ttgattagtc        60
     gctgcccccc tcgcggatta ggagctagcg tctcccgccc gcccgcctgc cgccccggtg       120
     cccctgggag gaagcgagag ggaggctgcc agagggtttg tcactgctgt tgctcctccg       180
     cctcagcgag tccagccccg gcctagtcgg tcgcctgcct ttctcatagc cgttaccctc       240
     aggccgccac agccgccgac cgggagaggc gcgcgccatg gcctctggag ccgattcaaa       300
     aggtgatgat ttatcaacag ccattctcaa acagaagaac cgacccaatc ggttaattgt       360
     tgatgaagcc atcaatgaag ataacagcgt ggtgtccttg tcccagccca agatggatga       420
     actgcagttg ttccgaggtg acacggtgtt gctaaaagga aagaaaagac gggaagctgt       480
     atgcattgtt ctttctgatg acacgtgttc tgatgagaag attcgaatga atagagttgt       540
     tcggaataac ctccgagttc gcctaggaga tgtcatcagc atccagccat gccctgatgt       600
     aaagtatggc aaacgtatcc acgttctacc catcgatgac acagtggaag gcatcactgg       660
     caatctcttt gaggtatacc ttaagccgta cttcctggaa gcttatcggc ccatccgtaa       720
     aggagatatt tttcttgtcc ggggtgggat gcgtgctgtg gagttcaaag ttgtagagac       780
     agatccaagc ccttactgta ttgttgctcc agacacagtg atccactgtg agggggagcc       840
     aatcaagcga gaggatgagg aggaatcctt gtatgaagta ggctatgatg acatcggtgg       900
     ttgcaggaag cagctagctc agataaagga gatggtggag ctgccactga gacatcctgc       960
     gctctttaag gcgattggtg taaagcctcc tcggggaatc ttgttgtatg ggcctcctgg      1020
     gacagggaag accctgattg ctcgagctgt ggcaaatgaa actggagcct tcttctttct      1080
     gatcaatggt cctgaaatca tgagcaaatt ggctggtgag tctgagagca accttcgtaa      1140
     agcctttgag gaagctgaaa agaatgctcc tgctatcatc ttcatcgatg agcttgatgc      1200
     cattgcaccc aaaagagaga aaactcatgg ggaagtggag cgtcgcatcg tgtctcagtt      1260
     gttgaccctc atggatggcc taaagcagag agcacatgtg atagttatgg cagcaaccaa      1320
     tagacccaac agcattgacc cagccctacg gcgatttggt cgctttgaca gagaggtaga      1380
     tattggaata cctgatgcta caggacgttt ggagattctt cagatccata ccaagaacat      1440
     gaaactggca gatgatgtgg acttggaaca ggtagccaat gagactcatg gtcatgttgg      1500
     tgctgatttg gcagccctat gttcagaggc tgctctgcag gccatccgga aaaaaatgga      1560
     cctcattgac ctagaagatg agaccattga tgctgaggtc atgaattccc tggcagttac      1620
     tatggatgac ttccggtggg ctttgagtca aagcaaccca tcagcacttc gggaaactgt      1680
     ggtagaggtg ccacaagtaa cctgggaaga tattggaggc ctggaggatg tcaaacgtga      1740
     gcttcaggag ttggttcagt atcctgtgga acatccagac aaattcctca aatttggcat      1800
     gactccctcc aaaggcgttc ttttctatgg acctcctggc tgtgggaaaa ccttactggc      1860
     taaagccatt gctaatgaat gccaggccaa cttcatctcc atcaagggtc ctgagctgct      1920
     taccatgtgg tttggggaat ctgaggccaa tgtccgggaa atttttgaca aggcacggca      1980
     agctgccccc tgtgtactct tctttgatga gttagattca attgccaagg ctcgaggtgg      2040
     taatattgga gatggtggtg gagctgctga ccgagtcatc aatcagatcc tgacagaaat      2100
     ggatggcatg tctacaaaaa agaatgtgtt tatcattgga gctaccaaca ggcctgacat      2160
     cattgatcct gctatcctaa gacctggccg tctagatcag ctcatttata tcccacttcc      2220
     tgatgagaag tcccgtgttg ccatcctaaa agccaatctg cgaaagtccc cagttgccaa      2280
     ggatgtggat ttggagttcc tggctaagat gactaatggc ttttctggag ctgatttgac      2340
     agaaatttgc caacgggctt gtaaactggc cattcgtgaa tctattgaga gtgagattag      2400
     gcgagaacga gagaggcaga caaatccatc ggctatggag gtagaagagg acgatccagt      2460
     gcctgagatc cgcagagatc actttgagga agccatgcgt tttgcccgac gttctgtcag      2520
     cgataatgac attcggaagt atgaaatgtt cgcccagaca ctgcagcaga gtcgaggttt      2580
     tggcagcttc agattccctt cagggaacca gggtggagct ggtcccagcc agggcagtgg      2640
     aggtggcaca ggtggcagtg tgtacacaga agacaatgac gatgacctgt atggctaagt      2700
     gatgtgccag catgcagcga gctggcctgg ctggaccttg ttccctgggg gagggggcgc      2760
     ttgcccaaga gggaccaggg gtgtgcccat ggcctgttcc attcctcagt ctgaacagtt      2820
     cagccccagt cagactctgg acaggggttt cctgttgc                              2858
//