Dbfetch

ID   AK030751; SV 1; linear; mRNA; HTC; MUS; 3091 BP.
XX
AC   AK030751;
XX
DT   18-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 15)
XX
DE   Mus musculus 8 days embryo whole body cDNA, RIKEN full-length enriched
DE   library, clone:5730519N02 product:valosin containing protein, full insert
DE   sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3091
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-JUL-2001) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 8ba6a50448dd73d91a31f47f74dd4bdb.
DR   Ensembl-Gn; ENSMUSG00000028452; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000030164; mus_musculus.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=5730519N02
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3091
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="8 days embryo"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="5730519N02"
FT                   /tissue_type="whole body"
FT                   /db_xref="taxon:10090"
FT   CDS             161..2581
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="valosin containing protein (MGD|MGI:99919
FT                   GB|NM_009503, evidence: BLASTN, 99%, match=3069)"
FT                   /db_xref="GOA:Q01853"
FT                   /db_xref="InterPro:IPR003338"
FT                   /db_xref="InterPro:IPR003593"
FT                   /db_xref="InterPro:IPR003959"
FT                   /db_xref="InterPro:IPR003960"
FT                   /db_xref="InterPro:IPR004201"
FT                   /db_xref="InterPro:IPR005938"
FT                   /db_xref="InterPro:IPR009010"
FT                   /db_xref="InterPro:IPR015415"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="InterPro:IPR029067"
FT                   /db_xref="MGI:MGI:99919"
FT                   /db_xref="PDB:1E32"
FT                   /db_xref="PDB:1R7R"
FT                   /db_xref="PDB:1S3S"
FT                   /db_xref="PDB:2PJH"
FT                   /db_xref="PDB:3CF0"
FT                   /db_xref="PDB:3CF1"
FT                   /db_xref="PDB:3CF2"
FT                   /db_xref="PDB:3CF3"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q01853"
FT                   /protein_id="BAC27119.1"
FT                   /translation="MASGADSKGDDLSTAILKQKNRPNRLIVDEAINEDNSVVSLSQPK
FT                   MDELQLFRGDTVLLKGKKRREAVCIVLYDDTCSDEKIRMNRVVRNNLRVRLGDVISIQP
FT                   CPDVKYGKRIHVLPIDDTVEGITGNLFEVYLKPYFLEAYRPIRKGDIFLVRGGMRAVEF
FT                   KVVETDPSPYCIVAPDTVIHCEGEPIKREDEEESLNEVGYDDIGGCRKQLAQIKEMVEL
FT                   PLRHPALFKAIGVKPPRGILLYGPPGTGKTLIARAVANETGAFFFLINGPEIMSKLAGE
FT                   SESNLRKAFEEAEKNAPAIIFIDELDAIAPKREKTHGEVERRIVSQLLTLMDGLKQRAH
FT                   VIVMAATNRPNSIDPALRQFGRFDREVDIGIPDATGRLEILQIHTKNMKLADDVDLEQV
FT                   ANETHGHVGADLAALCSEAALQAIRKKMDLIDLEDETIDAEVMNSLAVTMDDFRWALSQ
FT                   SNPSALRETVVEVPQVTWEDIGGLEDVKRELQELVQYPVEHPDKFLKFGMTPSKGVLFY
FT                   GPPGCGKTLLAKAIANECQANFISIKGPELLTMWFGESEANVREIFDKARQAAPCVLFF
FT                   DELDSIAKARGGNIGDGGGAADRVINQILTEMDGMSTKKNVFIIGATNRPDIIDPAILR
FT                   PGRLDQLIYIPLPDEKSRVAILKANLRKSPVAKDVDLEFLAKMTNGFSGADLTEICQRA
FT                   CKLAIRESIESEIRRERERQTNPSAMEVEEDDPVPEIRRDHFEEAMRFARRSVSDNDIR
FT                   KYEMFAQTLQQSRGFGSFRFPSGNQGGAGPSQGSGGGTGGSVYTEDNDDDLYG"
XX
SQ   Sequence 3091 BP; 800 A; 683 C; 848 G; 760 T; 0 other;
     ggcccctggg aggaagcgag agggaggctg ccagagggtg ttgtcactgc tgttgctcct        60
     ccgcctcagc gagtccagcc ccggcctagt cggtcgcctg cctttctcat agccgttacc       120
     ctcaggccgc cacagccgcc gaccgggaga ggcgcgcgcc atggcctctg gagccgattc       180
     aaaaggtgat gatttatcaa cagccattct caaacagaag aaccgaccca atcggttaat       240
     tgttgatgaa gccatcaatg aagataacag cgtggtgtcc ttgtcccagc ccaagatgga       300
     tgaactgcag ttgttccgag gtgacacggt gttgctaaaa ggaaagaaaa gacgggaagc       360
     tgtatgcatt gttctttatg atgacacgtg ttctgatgag aagattcgaa tgaatagagt       420
     tgttcggaat aacctccgag ttcgcctagg agatgtcatc agcatccagc catgccctga       480
     tgtaaagtat ggcaaacgta tccacgttct acccatcgat gacacagtgg aaggcatcac       540
     tggcaatctc tttgaggtat accttaagcc gtacttcctg gaagcttatc ggcccatccg       600
     taaaggagat atttttcttg tccggggtgg gatgcgtgct gtggagttca aagttgtaga       660
     gacagatccc agcccttact gtattgttgc tccagacaca gtgatccact gtgaggggga       720
     gccaatcaag cgagaggatg aggaggaatc cttgaatgaa gtaggctatg atgacatcgg       780
     tggttgcagg aagcagctag ctcagataaa ggagatggtg gagctgccac tgagacatcc       840
     tgcgctcttt aaggcgattg gtgtaaagcc tcctcgggga atcttgttgt atgggcctcc       900
     tgggacaggg aagaccctga ttgctcgagc tgtggcaaat gaaactggag ccttcttctt       960
     tctgatcaat ggtcctgaaa tcatgagcaa attggctggt gagtctgaga gcaaccttcg      1020
     taaagccttt gaggaagctg aaaagaatgc tcctgctatc atcttcatcg atgagcttga      1080
     tgccattgca cccaaaagag agaaaactca tggggaagtg gagcgtcgca tcgtgtctca      1140
     gttgttgacc ctcatggatg gcctaaagca gagagcacat gtgatagtta tggcagcaac      1200
     caatagaccc aacagcattg acccagccct acggcaattt ggtcgctttg acagagaggt      1260
     agatattgga atacctgatg ctacaggacg tttggagatt cttcagatcc ataccaagaa      1320
     catgaaactg gcagatgatg tggacttgga acaggtagcc aatgagactc atggtcatgt      1380
     tggtgctgat ttggcagccc tatgttcaga ggctgctctg caggccatcc ggaaaaaaat      1440
     ggacctcatt gacctagaag atgagaccat tgatgctgag gtcatgaatt ccctggcagt      1500
     tactatggat gacttccggt gggctttgag tcaaagcaac ccatcagcac ttcgggaaac      1560
     tgtggtagag gtgccacaag taacctggga agatattgga ggcctggagg atgtcaaacg      1620
     tgagcttcag gagttggttc agtatcctgt ggaacatcca gacaaattcc tcaaatttgg      1680
     catgactccc tccaaaggcg ttcttttcta tggacctcct ggctgtggga aaaccttact      1740
     ggctaaagcc attgctaatg aatgccaggc caacttcatc tccatcaagg gtcctgagct      1800
     gcttaccatg tggtttgggg aatctgaggc caatgtccgg gaaatttttg acaaggcacg      1860
     gcaagctgcc ccctgtgtac tcttctttga tgagttagat tcaattgcca aggctcgagg      1920
     tggtaatatt ggagatggtg gtggagctgc tgaccgagtc atcaatcaga tcctgacaga      1980
     aatggatggc atgtctacaa aaaagaatgt gtttatcatt ggagctacca acaggcctga      2040
     catcattgat cctgctatcc taagacctgg ccgtctagat cagctcattt atatcccact      2100
     tcctgatgag aagtcccgtg ttgccatcct aaaagccaat ctgcgaaagt ccccagttgc      2160
     caaggatgtg gatttggagt tcctggctaa gatgactaat ggcttttctg gagctgattt      2220
     gacagaaatt tgccaacggg cttgtaaact ggccattcgt gaatctattg agagtgagat      2280
     taggcgagaa cgagagaggc agacaaatcc atcggctatg gaggtagaag aggacgatcc      2340
     agtgcctgag atccgcagag atcactttga ggaagccatg cgttttgccc gacgttctgt      2400
     cagcgataat gacattcgga agtatgaaat gttcgcccag acactgcagc agagtcgagg      2460
     ttttggcagc ttcagattcc cttcagggaa ccagggtgga gctggtccca gccagggcag      2520
     tggaggtggc acaggtggca gtgtgtacac agaagacaat gacgatgacc tgtatggcta      2580
     agtgatgtgc cagcatgcag cgagctggcc tggctggacc ttgttccctg ggggaggggg      2640
     cgcttgccca agagggacca ggggtgtgcc catggcctgt tccattcctc agtctgaaca      2700
     gttcagcccc agtcagactc tggacagggg tttcctgttg caaaaaaaaa aaattacaaa      2760
     agcgataaaa taaaagtgat tttcatttgg gaggtgaaga gtgaattacc agcaaggaat      2820
     tgggtcttgg gcccacacgg tttctgtcgt agtttggggt ggtgcaggta acctgtgtgg      2880
     tgtgaaccaa ggcattgcca ccaccatcac cacagtaaag catctatact caatgctgtc      2940
     caagtcctcc cttaccctag ccaacctggg taggtggatg aggggcctca gtttgctggg      3000
     tgtttatata gaaagtaggt tgatttttat tttacatgct tttgagttac tgttggaaga      3060
     ttaatcataa gcagtttcta aaccaaaaaa g                                     3091
//