Dbfetch

ID   AK134770; SV 1; linear; mRNA; HTC; MUS; 2867 BP.
XX
AC   AK134770;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus adult male medulla oblongata cDNA, RIKEN full-length enriched
DE   library, clone:6330555J11 product:protein-O-mannosyltransferase 1, full
DE   insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2867
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; a8e1010b520cf4779aea2240ffc4a22d.
DR   Ensembl-Gn; ENSMUSG00000039254; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0025598; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0025576; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0025545; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0025572; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0025333; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0026017; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0024795; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0025310; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0025443; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0025406; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0025529; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0025436; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0026075; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0024542; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0024863; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000036473; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0054043; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0054058; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0054012; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0054002; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0053733; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0054488; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0053903; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0053679; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0053787; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0053748; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0053894; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0053738; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0054628; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0053508; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0053007; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=6330555J11
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2867
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /sex="male"
FT                   /dev_stage="adult"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="6330555J11"
FT                   /tissue_type="medulla oblongata"
FT                   /db_xref="taxon:10090"
FT   CDS             68..2308
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="protein-O-mannosyltransferase 1 (MGD|MGI:2138994
FT                   GB|NM_145145, evidence: BLASTN, 99%, match=2823)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q8R2R1"
FT                   /db_xref="InterPro:IPR003342"
FT                   /db_xref="InterPro:IPR016093"
FT                   /db_xref="InterPro:IPR027005"
FT                   /db_xref="InterPro:IPR032421"
FT                   /db_xref="MGI:MGI:2138994"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q8R2R1"
FT                   /protein_id="BAE22274.1"
FT                   /translation="MGSHSTGLEETLGVLPSWLFCKMLRFLKRPLVVTVDINLNLVALT
FT                   GLGLLTRLWQLSYPRAVVFDEVYYGQYISFYMKRIFFLDDSGPPFGHMLLALGGWLGGF
FT                   DGNFLWNRIGAEYSSNVPIWSLRLLPALAGALSVPMAYQIVLELHFSHGAAIGAALLML
FT                   IENALITQSRLMLLESILIFFNLLAVLSYLKFFNSQTHSPFSVHWWLWLLLTGVSCSCA
FT                   VGIKYMGIFTYLLVLGIAAVHAWNLIGDQTLSNMRVLSHLLARIVALLVVPVFLYLLFF
FT                   YVHLMLLYRSGPHDQIMSSAFQASLEGGLARITQGQHLEVAFGSQVTLKSVSGKPLPCW
FT                   LHSHKNTYPMIYENGRGSSHQQQVTCYPFKDINNWWIVKDPGRHQLVVNNPPRPVRHGD
FT                   IVQLVHGMTTRLLNTHDVAAPLSPHSQEVSCYIDYNISMPAQNLWKLDIVNRESNRDTW
FT                   KTILSEVRFVHVNTSAILKLSGAHLPDWGFRQLEVVGEKLSPGYHESMVWNVEEHRYGK
FT                   SHEQKERELELHSPTQLDISRNLSFMARFSELQWKMLTLKNEDLEHQYSSTPLEWLTLD
FT                   TNIAYWLHPRTSAQIHLLGNIVIWTSASLATVVYTLLFFWYLLRRRRSICDLPEDAWSR
FT                   WVLAGALCTGGWALNYLPFFLMERVLFLYHYLPALTFQILLLPIVLQHASDHLCRSQLQ
FT                   RNVFSALVVAWYSSACHVSNMLRPLTYGDTSLSPGELRALRWKDSWDILIRK"
FT   regulatory      2847..2852
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      2867
FT                   /note="putative"
XX
SQ   Sequence 2867 BP; 633 A; 837 C; 734 G; 663 T; 0 other;
     gagagtaatc gagcggcctc gcccccaacg gtcgatcctg cctggcggtt cgcaggcctg        60
     gctccacatg gggagccact ctacgggact cgaagaaacg ctcggagtcc tcccgagctg       120
     gcttttctgc aaaatgttaa gatttttgaa acggcctcta gtggtgactg ttgacatcaa       180
     tttgaacttg gtagctctga ctggcctggg actacttacc cgactatggc aactctccta       240
     ccctcgggct gtggttttcg atgaagtata ttatgggcag tacatttcct tctacatgaa       300
     gcgcatcttc tttctggatg acagtgggcc accatttggc cacatgctac tggccttagg       360
     aggttggtta gggggattcg atggtaactt tctgtggaac cgaattggag cagagtacag       420
     tagcaatgtg cctatatggt ccttacgcct gctgccagcg cttgccgggg ccctgtcagt       480
     gcccatggcc taccagatag tgctagagct ccacttttcc cacggtgctg ccattggagc       540
     cgccctgctg atgctcattg agaacgccct gatcactcag tccaggctca tgctgttgga       600
     gtccatactg atatttttta acctgttggc tgtgttgtcc tatctgaagt tcttcaactc       660
     ccagacacac agccctttct cagtgcactg gtggctgtgg ctactgctga ccggggtctc       720
     ttgttcctgt gcagttggga tcaaatacat ggggattttc acctacctgc ttgtgctcgg       780
     cattgcagct gtccacgcgt ggaatctgat cggagaccag accttgtcaa atatgcgcgt       840
     gctcagtcac ttgctcgcca gaatcgtggc tctgctggtc gtcccagtct tcctgtactt       900
     actcttcttc tatgtccacc tgatgctgct ctaccgctct gggccccatg accaaatcat       960
     gtccagtgcc ttccaggcca gcttggaggg agggcttgcc cgcatcaccc aaggccagca      1020
     cctggaggtg gcctttggtt cccaggtcac tctgaagagc gtctctggca aacccttgcc      1080
     ctgctggctt cattcgcaca agaacaccta tcccatgata tatgagaatg gccgtggcag      1140
     ctcccaccag caacaggtga cctgttatcc cttcaaagac atcaataact ggtggatcgt      1200
     caaggatcct gggcgacacc agctggtggt aaacaaccct ccccggcctg taagacatgg      1260
     agacattgta cagctcgttc acggcatgac cacccgcctc cttaacacgc acgatgtcgc      1320
     agccccactg agcccccatt ctcaagaagt ctcctgctac attgactata acatctccat      1380
     gcctgcccag aacctctgga aactggacat tgtgaacaga gagtccaacc gggatacctg      1440
     gaagactatc ttgtcggaag tgcgctttgt acatgtgaac acatccgcca tcttgaagct      1500
     gagcggggct cacctccctg actgggggtt ccggcagttg gaggtagttg gggagaagtt      1560
     gtcaccgggc taccacgaga gcatggtgtg gaatgtggaa gaacaccggt atggcaaaag      1620
     ccatgagcag aaggagaggg agctggagct ccactcgccc actcagctcg atatcagcag      1680
     gaacctcagc ttcatggcca gattctcaga gttacagtgg aagatgctga cgctgaagaa      1740
     tgaggacttg gaacaccagt acagctccac cccgctggag tggctcacgc tggacaccaa      1800
     catcgcctat tggctacatc ccaggaccag tgctcagatc cacttgcttg ggaacattgt      1860
     gatctggact tcagccagcc ttgccacagt ggtctacact ctactcttct tctggtacct      1920
     gctccgccgg cgaaggagca tctgtgacct ccctgaggat gcctggtccc gctgggtgct      1980
     ggctggagcc ctgtgtactg gcggctgggc actcaactac ctgcccttct tcctgatgga      2040
     gagggtgctc ttcctctacc actacttgcc ggcactcacc ttccagatcc tgctgctccc      2100
     gattgtcctg cagcacgcca gcgaccatct gtgcaggtcc cagctgcaga ggaatgtctt      2160
     cagtgccctg gttgtagcat ggtattcctc cgcatgccat gtgtccaaca tgctacgccc      2220
     actaacctat ggggacacgt cactctcacc aggcgagctc cgggcccttc gctggaaaga      2280
     cagctgggat attctgatcc gaaagtaata gagaacaaga acacagaaga caagcacaca      2340
     ggacaaagcc tcaaagatgt gtttgtctcc caccaacagg agcctcagca ggcaggactg      2400
     ccagggtcca ggaggaactc cagggactaa ttccaatttc acctcaagag ccctgtccac      2460
     tggttccttg tttgaagcaa ttgatttctc ttcacacagt gaagaatgtg cccagccaca      2520
     gcgttaccca tgaggcccaa ctctgaccca gccagagttt gagctgccag tgtaggaacc      2580
     accaaggcag gaggggcacc cagccaggga aggagtgggg gggactcagg acgagctgcg      2640
     ggcctactat agggccttag ccctgtcatt tatggggccc acagtgccac acctcattgg      2700
     gcacaggcac agccaccctc tgtaaaccct gaaagctgcc agccatccac agactcctga      2760
     gccaactcta aagagtcctg ggagactgca gccaccaaac tgccacggcc aaggtgtcgt      2820
     ccattcactt ccttaccttt aatgtaaata aaacaggaca aattgtt                    2867
//