Dbfetch

ID   AK160487; SV 1; linear; mRNA; HTC; MUS; 2033 BP.
XX
AC   AK160487;
XX
DT   09-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 10)
XX
DE   Mus musculus adult male stomach cDNA, RIKEN full-length enriched library,
DE   clone:2210414C06 product:albumin 1, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2033
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (14-APR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; f90b4e23230344619e1f661050067bbb.
DR   Ensembl-Gn; ENSMUSG00000029368; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0029721; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0029687; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0029633; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0029697; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0029420; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0030151; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0028830; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0029390; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0029534; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0029493; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0029622; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0029522; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0030185; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0028549; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0028909; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000031314; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0072543; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0072622; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0072536; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0072549; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0072192; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0073016; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0072736; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0072149; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0072280; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0072183; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0072336; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0072185; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0073230; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0072204; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0071339; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=2210414C06
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2033
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /sex="male"
FT                   /dev_stage="adult"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="2210414C06"
FT                   /tissue_type="stomach"
FT                   /db_xref="taxon:10090"
FT   CDS             39..1865
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="albumin 1 (MGD|MGI:87991 GB|NM_009654, evidence:
FT                   BLASTN, 99%, match=2028)"
FT                   /note="putative"
FT                   /db_xref="GOA:P07724"
FT                   /db_xref="InterPro:IPR000264"
FT                   /db_xref="InterPro:IPR014760"
FT                   /db_xref="InterPro:IPR020857"
FT                   /db_xref="InterPro:IPR020858"
FT                   /db_xref="InterPro:IPR021177"
FT                   /db_xref="MGI:MGI:87991"
FT                   /db_xref="UniProtKB/Swiss-Prot:P07724"
FT                   /protein_id="BAE35818.1"
FT                   /translation="MKWVTFLLLLFVSGSAFSRGVFRREAHKSEIAHRYNDLGEQHFKG
FT                   LVLIAFSQYLQKCSYDEHAKLVQEVTDFAKTCVADESAANCDKSLHTLFGDKLCAIPNL
FT                   RENYGELADCCTKQEPERNECFLQHKDDNPSLPPFERPEAEAMCTSFKENPTTFMGHYL
FT                   HEVARRHPYFYAPELLYYAEQYNEILTQCCAEADKESCLTPKLDGVKEKALVSSVRQRM
FT                   KCSSMQKFGERAFKAWAVARLSQTFPNADFAEITKLATDLTKVNKECCHGDLLECADDR
FT                   AELAKYMCENQATISSKLQTCCDKPLLKKAHCLSEVEHDTMPADLPAIAADFVEDQEVC
FT                   KNYAEAKDVFLGTFLYEYSRRHPDYSVSLLLRLAKKYEATLEKCCAEANPPACYGTVLA
FT                   EFQPLVEEPKNLVKTNCDLYEKLGEYGFQNAILVRYTQKAPQVSTPTLVEAARNLGRVG
FT                   TKCCTLPEDQRLPCVEDYLSAILNRVCLLHEKTPVSEHVTKCCSGSLVERRPCFSALTV
FT                   DETYVPKEFKAETFTFHSDICTLPEKEKQIKKQTALAELVKHKHKATAEQLKTVMDDFA
FT                   QFLDTCCKAADKDTCFSTEGPNLVTRCKDALA"
FT   regulatory      2013..2018
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      2033
FT                   /note="putative"
XX
SQ   Sequence 2033 BP; 577 A; 496 C; 469 G; 491 T; 0 other;
     gatcaccttt cctatcaacc ccactagcct ctggcaaaat gaagtgggta acctttctcc        60
     tcctcctctt cgtctccggc tctgcttttt ccaggggtgt gtttcgccga gaagcacaca       120
     agagtgagat cgcccatcgg tataatgatt tgggagaaca acatttcaaa ggcctagtcc       180
     tgattgcctt ttcccagtat ctccagaaat gctcatacga tgagcatgcc aaattagtgc       240
     aggaagtaac agactttgca aagacgtgtg ttgccgatga gtctgccgcc aactgtgaca       300
     aatcccttca cactcttttt ggagataagt tgtgtgccat tccaaacctc cgtgaaaact       360
     atggtgaact ggctgactgc tgtacaaaac aagagcccga aagaaacgaa tgtttcctgc       420
     aacacaaaga tgacaacccc agcctgccac catttgaaag gccagaggct gaggccatgt       480
     gcacctcctt taaggaaaac ccaaccacct ttatgggaca ctatttgcat gaagttgcca       540
     gaagacatcc ttatttctat gccccagaac ttctttacta tgctgagcag tacaatgaga       600
     ttctgaccca gtgttgtgca gaggctgaca aggaaagctg cctgaccccg aagcttgatg       660
     gtgtgaagga gaaagcattg gtctcatctg tccgtcagag aatgaagtgc tccagtatgc       720
     agaagtttgg agagagagct tttaaagcat gggcagtagc tcgtctgagc cagacattcc       780
     ccaatgctga ctttgcagaa atcaccaaat tggcaacaga cctgaccaaa gtcaacaagg       840
     agtgctgcca tggtgacctg ctggaatgcg cagatgacag ggcggaactt gccaagtaca       900
     tgtgtgaaaa ccaggcgact atctccagca aactgcagac ttgctgcgat aaaccactgt       960
     tgaagaaagc ccactgtctt agtgaggtgg agcatgacac catgcctgct gatctgcctg      1020
     ccattgctgc tgattttgtt gaggaccagg aagtgtgcaa gaactatgct gaggccaagg      1080
     atgtcttcct gggcacgttc ttgtatgaat attcaagaag acaccctgat tactctgtat      1140
     ccctgttgct gagacttgct aagaaatatg aagccactct ggaaaagtgc tgcgctgaag      1200
     ccaatcctcc cgcatgctac ggcacagtgc ttgctgaatt tcagcctctt gtagaagagc      1260
     ctaagaactt ggtcaaaacc aactgtgatc tttacgagaa gcttggagaa tatggattcc      1320
     aaaatgccat tctagttcgc tacacccaga aagcacctca ggtgtcaacc ccaactctcg      1380
     tggaggctgc aagaaaccta ggaagagtgg gcaccaagtg ttgtacactt cctgaagatc      1440
     agagactgcc ttgtgtggaa gactatctgt ctgcaatcct gaaccgtgtg tgtctgctgc      1500
     atgagaagac cccagtgagt gagcatgtta ccaagtgctg tagtggatcc ctggtggaaa      1560
     ggcggccatg cttctctgct ctgacagttg atgaaacata tgtccccaaa gagtttaaag      1620
     ctgagacctt caccttccac tctgatatct gcacacttcc agagaaggag aagcagatta      1680
     agaaacaaac ggctcttgct gagctggtga agcacaagca caaggctaca gcggagcaac      1740
     tgaagactgt catggatgac tttgcacagt tcctggatac atgttgcaag gctgctgaca      1800
     aggacacctg cttctcgact gagggtccaa accttgtcac tagatgcaaa gacgccttag      1860
     cctaaacaca tcacaaccac aaccttctca ggctaccctg agaaaaaaag acatgaagac      1920
     tcaggactca tcttttctgt tggtgtaaaa tcaacaccct aaggaacaca aatttcttta      1980
     aacatttgac ttcttgtctc tgtgctgcaa ttaataaaaa atggaaagaa tct             2033
//