ID   AK151109; SV 1; linear; mRNA; HTC; MUS; 3210 BP.
XX
AC   AK151109;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus bone marrow macrophage cDNA, RIKEN full-length enriched
DE   library, clone:I830025E07 product:valosin containing protein, full insert
DE   sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3210
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
RL   URL:http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 1b9cf6cdf45ab8d0c8dd7e76726ceec0.
DR   Ensembl-Gn; ENSMUSG00000028452; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0028196; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0028152; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0028104; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0028172; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0027894; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0028613; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0027334; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_FVBNJ_G0027976; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0028108; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0028649; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0027060; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0027414; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000030164; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0065244; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0065290; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0065186; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0065220; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0064895; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0065689; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0065206; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_FVBNJ_T0064901; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0065077; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0065865; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0064742; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0064104; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Tissues were provided by David A. Hume ( Depts. of Biochemistry
CC   and Microbiology/Parasitology Institute for Molecular Bioscience
CC   University of Queensland Brisbane,Q 4072 Australia ) whose
CC   assistance we gratefully acknowledge.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=I830025E07
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3210
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="I830025E07"
FT                   /cell_type="macrophage"
FT                   /tissue_type="bone marrow"
FT                   /db_xref="taxon:10090"
FT   CDS             224..2644
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="putative"
FT                   /note="valosin containing protein (MGD|MGI:99919
FT                   GB|NM_009503, evidence: BLASTN, 99%, match=3210)"
FT                   /db_xref="GOA:Q01853"
FT                   /db_xref="InterPro:IPR003338"
FT                   /db_xref="InterPro:IPR003593"
FT                   /db_xref="InterPro:IPR003959"
FT                   /db_xref="InterPro:IPR003960"
FT                   /db_xref="InterPro:IPR004201"
FT                   /db_xref="InterPro:IPR005938"
FT                   /db_xref="InterPro:IPR009010"
FT                   /db_xref="InterPro:IPR015415"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="InterPro:IPR029067"
FT                   /db_xref="MGI:MGI:99919"
FT                   /db_xref="PDB:1E32"
FT                   /db_xref="PDB:1R7R"
FT                   /db_xref="PDB:1S3S"
FT                   /db_xref="PDB:2PJH"
FT                   /db_xref="PDB:3CF0"
FT                   /db_xref="PDB:3CF1"
FT                   /db_xref="PDB:3CF2"
FT                   /db_xref="PDB:3CF3"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q01853"
FT                   /protein_id="BAE30119.1"
FT                   /translation="MASGADSKGDDLSTAILKQKNRPNRLIVDEAINEDNSVVSLSQPK
FT                   MDELQLFRGDTVLLKGKKRREAVCIVLSDDTCSDEKIRMNRVVRNNLRVRLGDVISIQP
FT                   CPDVKYGKRIHVLPIDDTVEGITGNLFEVYLKPYFLEAYRPIRKGDIFLVRGGMRAVEF
FT                   KVVETDPSPYCIVAPDTVIHCEGEPIKREDEEESLNEVGYDDIGGCRKQLAQIKEMVEL
FT                   PLRHPALFKAIGVKPPRGILLYGPPGTGKTLIARAVANETGAFFFLINGPEIMSKLAGE
FT                   SESNLRKAFEEAEKNAPAIIFIDELDAIAPKREKTHGEVERRIVSQLLTLMDGLKQRAH
FT                   VIVMAATNRPNSIDPALRRFGRFDREVDIGIPDATGRLEILQIHTKNMKLADDVDLEQV
FT                   ANETHGHVGADLAALCSEAALQAIRKKMDLIDLEDETIDAEVMNSLAVTMDDFRWALSQ
FT                   SNPSALRETVVEVPQVTWEDIGGLEDVKRELQELVQYPVEHPDKFLKFGMTPSKGVLFY
FT                   GPPGCGKTLLAKAIANECQANFISIKGPELLTMWFGESEANVREIFDKARQAAPCVLFF
FT                   DELDSIAKARGGNIGDGGGAADRVINQILTEMDGMSTKKNVFIIGATNRPDIIDPAILR
FT                   PGRLDQLIYIPLPDEKSRVAILKANLRKSPVAKDVDLEFLAKMTNGFSGADLTEICQRA
FT                   CKLAIRESIESEIRRERERQTNPSAMEVEEDDPVPEIRRDHFEEAMRFARRSVSDNDIR
FT                   KYEMFAQTLQQSRGFGSFRFPSGNQGGAGPSQGSGGGTGGSVYTEDNDDDLYG"
FT   regulatory      3189..3194
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      3210
FT                   /note="putative"
XX
SQ   Sequence 3210 BP; 834 A; 717 C; 877 G; 782 T; 0 other;
     ttagtcgctg cccccctcgc ggattaggag ctagcgtctc ccgcccgccc gcctgccgcc        60
     ccggtgcccc tgggaggaag cgagagggag gctgccagag ggtttgtcac tgctgttgct       120
     cctccgcctc agcgagtcca gccccggcct agtcggtcgc ctgcctttct catagccgtt       180
     accctcaggc cgccacagcc gccgaccggg agaggcgcgc gccatggcct ctggagccga       240
     ttcaaaaggt gatgatttat caacagccat tctcaaacag aagaaccgac ccaatcggtt       300
     aattgttgat gaagccatca atgaagataa cagcgtggtg tccttgtccc agcccaagat       360
     ggatgaactg cagttgttcc gaggtgacac ggtgttgcta aaaggaaaga aaagacggga       420
     agctgtatgc attgttcttt ctgatgacac gtgttctgat gagaagattc gaatgaatag       480
     agttgttcgg aataacctcc gagttcgcct aggagatgtc atcagcatcc agccatgccc       540
     tgatgtaaag tatggcaaac gtatccacgt tctacccatc gatgacacag tggaaggcat       600
     cactggcaat ctctttgagg tataccttaa gccgtacttc ctggaagctt atcggcccat       660
     ccgtaaagga gatatttttc ttgtccgggg tgggatgcgt gctgtggagt tcaaagttgt       720
     agagacagat cccagccctt actgtattgt tgctccagac acagtgatcc actgtgaggg       780
     ggagccaatc aagcgagagg atgaggagga atccttgaat gaagtaggct atgatgacat       840
     cggtggttgc aggaagcagc tagctcagat aaaggagatg gtggagctgc cactgagaca       900
     tcctgcgctc tttaaggcga ttggtgtaaa gcctcctcgg ggaatcttgt tgtatgggcc       960
     tcctgggaca gggaagaccc tgattgctcg agctgtggca aatgaaactg gagccttctt      1020
     ctttctgatc aatggtcctg aaatcatgag caaattggct ggtgagtctg agagcaacct      1080
     tcgtaaagcc tttgaggaag ctgaaaagaa tgctcctgct atcatcttca tcgatgagct      1140
     tgatgccatt gcacccaaaa gagagaaaac tcatggggaa gtggagcgtc gcatcgtgtc      1200
     tcagttgttg accctcatgg atggcctaaa gcagagagca catgtgatag ttatggcagc      1260
     aaccaataga cccaacagca ttgacccagc cctacggcga tttggtcgct ttgacagaga      1320
     ggtagatatt ggaatacctg atgctacagg acgtttggag attcttcaga tccataccaa      1380
     gaacatgaaa ctggcagatg atgtggactt ggaacaggta gccaatgaga ctcatggtca      1440
     tgttggtgct gatttggcag ccctatgttc agaggctgct ctgcaggcca tccggaaaaa      1500
     aatggacctc attgacctag aagatgagac cattgatgct gaggtcatga attccctggc      1560
     agttactatg gatgacttcc ggtgggcttt gagtcaaagc aacccatcag cacttcggga      1620
     aactgtggta gaggtgccac aagtaacctg ggaagatatt ggaggcctgg aggatgtcaa      1680
     acgtgagctt caggagttgg ttcagtatcc tgtggaacat ccagacaaat tcctcaaatt      1740
     tggcatgact ccctccaaag gcgttctttt ctatggacct cctggctgtg ggaaaacctt      1800
     actggctaaa gccattgcta atgaatgcca ggccaacttc atctccatca agggtcctga      1860
     gctgcttacc atgtggtttg gggaatctga ggccaatgtc cgggaaattt ttgacaaggc      1920
     acggcaagct gccccctgtg tactcttctt tgatgagtta gattcaattg ccaaggctcg      1980
     aggtggtaat attggagatg gtggtggagc tgctgaccga gtcatcaatc agatcctgac      2040
     agaaatggat ggcatgtcta caaaaaagaa tgtgtttatc attggagcta ccaacaggcc      2100
     tgacatcatt gatcctgcta tcctaagacc tggccgtcta gatcagctca tttatatccc      2160
     acttcctgat gagaagtccc gtgttgccat cctaaaagcc aatctgcgaa agtccccagt      2220
     tgccaaggat gtggatttgg agttcctggc taagatgact aatggctttt ctggagctga      2280
     tttgacagaa atttgccaac gggcttgtaa actggccatt cgtgaatcta ttgagagtga      2340
     gattaggcga gaacgagaga ggcagacaaa tccatcggct atggaggtag aagaggacga      2400
     tccagtgcct gagatccgca gagatcactt tgaggaagcc atgcgttttg cccgacgttc      2460
     tgtcagcgat aatgacattc ggaagtatga aatgttcgcc cagacactgc agcagagtcg      2520
     aggttttggc agcttcagat tcccttcagg gaaccagggt ggagctggtc ccagccaggg      2580
     cagtggaggt ggcacaggtg gcagtgtgta cacagaagac aatgacgatg acctgtatgg      2640
     ctaagtgatg tgccagcatg cagcgagctg gcctggctgg accttgttcc ctgggggagg      2700
     gggcgcttgc ccaagaggga ccaggggtgt gcccatggcc tgttccattc ctcagtctga      2760
     acagttcagc cccagtcaga ctctggacag gggtttcctg ttgcaaaaaa aaaaaaatta      2820
     caaaagcgat aaaataaaag tgattttcat ttgggaggtg aagagtgaat taccagcaag      2880
     gaattgggtc ttgggcccac acggtttctg tcgtagtttg gggtggtgca ggtaacctgt      2940
     gtggtgtgaa ccaaggcatt gccaccacca tcaccacagt aaagcatcta tactcaatgc      3000
     tgtccaagtc ctcccttacc ctagccaacc tgggtaggtg gatgaggggc ctcagtttgc      3060
     tgggtgttta tatagaaagt aggttgattt ttattttaca tgcttttgag ttactgttgg      3120
     aagattaatc ataagcagtt tctaaaccaa aaaaaagaag aaaaaaaaaa gacatgttgt      3180
     aaaaggacaa taaatgttgg gtcaaaatgg                                       3210
//