Dbfetch

ID   AK153984; SV 1; linear; mRNA; HTC; MUS; 2921 BP.
XX
AC   AK153984;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus 2 days neonate thymus thymic cells cDNA, RIKEN full-length
DE   enriched library, clone:E430019B20 product:protein-O-mannosyltransferase 1,
DE   full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2921
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; cc227c3e2a424fb0c89b6550225e1db3.
DR   Ensembl-Gn; ENSMUSG00000039254; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0025598; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0025576; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0025545; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0025572; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0025333; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0026017; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0024795; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0025310; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0025443; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0025406; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0025529; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0025436; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0026075; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0024542; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0024863; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000036473; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0054043; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0054058; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0054012; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0054002; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0053733; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0054488; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0053903; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0053679; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0053787; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0053748; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0053894; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0053738; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0054628; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0053508; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0053007; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=E430019B20
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2921
FT                   /organism="Mus musculus"
FT                   /strain="NOD"
FT                   /mol_type="mRNA"
FT                   /dev_stage="2 days neonate"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="E430019B20"
FT                   /cell_type="thymic cells"
FT                   /tissue_type="thymus"
FT                   /db_xref="taxon:10090"
FT   CDS             123..2363
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="protein-O-mannosyltransferase 1 (MGD|MGI:2138994
FT                   GB|NM_145145, evidence: BLASTN, 100%, match=2822)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q8R2R1"
FT                   /db_xref="InterPro:IPR003342"
FT                   /db_xref="InterPro:IPR016093"
FT                   /db_xref="InterPro:IPR027005"
FT                   /db_xref="InterPro:IPR032421"
FT                   /db_xref="InterPro:IPR036300"
FT                   /db_xref="MGI:MGI:2138994"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q8R2R1"
FT                   /protein_id="BAE32295.1"
FT                   /translation="MGSHSTGLEETLGVLPSWLFCKMLRFLKRPLVVTVDINLNLVALT
FT                   GLGLLTRLWQLSYPRAVVFDEVYYGQYISFYMKRIFFLDDSGPPFGHMLLALGGWLGGF
FT                   DGNFLWNRIGAEYSSNVPIWSLRLLPALAGALSVPMAYQIVLELHFSHGAAIGAALLML
FT                   IENALITQSRLMLLESILIFFNLLAVLSYLKFFNSQTHSPFSVHWWLWLLLTGVSCSCA
FT                   VGIKYMGIFTYLLVLGIAAVHAWNLIGDQTLSNMRVLSHLLARIVALLVVPVFLYLLFF
FT                   YVHLMLLYRSGPHDQIMSSAFQASLEGGLARITQGQPLEVAFGSQVTLKSVSGKPLPCW
FT                   LHSHKNTYPMIYENGRGSSHQQQVTCYPFKDINNWWIVKDPGRHQLVVNNPPRPVRHGD
FT                   IVQLVHGMTTRLLNTHDVAAPLSPHSQEVSCYIDYNISMPAQNLWKLDIVNRESNRDTW
FT                   KTILSEVRFVHVNTSAILKLSGAHLPDWGFRQLEVVGEKLSPGYHESMVWNVEEHRYGK
FT                   SHEQKERELELHSPTQLDISRNLSFMARFSELQWKMLTLKNEDLEHQYSSTPLEWLTLD
FT                   TNIAYWLHPRTSAQIHLLGNIVIWTSASLATVVYTLLFFWYLLRRRRSICDLPEDAWSR
FT                   WVLAGALCTGGWALNYLPFFLMERVLFLYHYLPALTFQILLLPIVLQHASDHLCRSQLQ
FT                   RNVFSALVVAWYSSACHVSNMLRPLTYGDTSLSPGELRALRWKDSWDILIRK"
FT   regulatory      2902..2907
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      2921
FT                   /note="putative"
XX
SQ   Sequence 2921 BP; 641 A; 854 C; 752 G; 674 T; 0 other;
     gagagtctgt cggccgcgta gacacaacta gttgagcgtc gatccgtggc cgctgcagag        60
     taatcgagcg gcctcgcccc caacggtcga tcctgcctgg cggttcgcag gcctggctcc       120
     acatggggag ccactctacg ggactcgaag aaacgctcgg agtcctcccg agctggcttt       180
     tctgcaaaat gttaagattt ttgaaacggc ctctagtggt gactgttgac atcaatttga       240
     acttggtagc tctgactggc ctgggactac ttacccgact atggcaactc tcctaccctc       300
     gggctgtggt tttcgatgaa gtatattatg ggcagtacat ttccttctac atgaagcgca       360
     tcttctttct ggatgacagt gggccaccat ttggccacat gctactggcc ttaggaggtt       420
     ggttaggggg attcgatggt aactttctgt ggaaccgaat tggagcagag tacagtagca       480
     atgtgcctat atggtcctta cgcctgctgc cagcgcttgc cggggccctg tcagtgccca       540
     tggcctacca gatagtgcta gagctccact tttcccacgg tgctgccatt ggagccgccc       600
     tgctgatgct cattgagaac gccctgatca ctcagtccag gctcatgctg ttggagtcca       660
     tactgatatt ttttaacctg ttggctgtgt tgtcctatct gaagttcttc aactcccaga       720
     cacacagccc tttctcagtg cactggtggc tgtggctact gctgaccggg gtctcttgtt       780
     cctgtgcagt tgggatcaaa tacatgggga ttttcaccta cctgcttgtg ctcggcattg       840
     cagctgtcca cgcgtggaat ctgatcggag accagacctt gtcaaatatg cgcgtgctca       900
     gtcacttgct cgccagaatc gtggctctgc tggtcgtccc agtcttcctg tacttactct       960
     tcttctatgt ccacctgatg ctgctctacc gctctgggcc ccatgaccaa atcatgtcca      1020
     gtgccttcca ggccagcttg gagggagggc ttgcccgcat cacccaaggc cagcccctgg      1080
     aggtggcctt tggttcccag gtcactctga agagcgtctc tggcaaaccc ttgccctgct      1140
     ggcttcattc gcacaagaac acctatccca tgatatatga gaatggccgt ggcagctccc      1200
     accagcaaca ggtgacctgt tatcccttca aagacatcaa taactggtgg atcgtcaagg      1260
     atcctgggcg acaccagctg gtggtaaaca accctccccg gcctgtaaga catggagaca      1320
     ttgtacagct cgttcacggc atgaccaccc gcctccttaa cacgcacgat gtcgcagccc      1380
     cactgagccc ccattctcaa gaagtctcct gctacattga ctataacatc tccatgcctg      1440
     cccagaacct ctggaaactg gacattgtga acagagagtc caaccgggat acctggaaga      1500
     ctatcttgtc ggaagtgcgc tttgtacatg tgaacacatc cgccatcttg aagctgagcg      1560
     gggctcacct ccctgactgg gggttccggc agttggaggt agttggggag aagttgtcac      1620
     cgggctacca cgagagcatg gtgtggaatg tggaagaaca ccggtatggc aaaagccatg      1680
     agcagaagga gagggagctg gagctccact cgcccactca gctcgatatc agcaggaacc      1740
     tcagcttcat ggccagattc tcagagttac agtggaagat gctgacgctg aagaatgagg      1800
     acttggaaca ccagtacagc tccaccccgc tggagtggct cacgctggac accaacatcg      1860
     cctattggct acatcccagg accagtgctc agatccactt gcttgggaac attgtgatct      1920
     ggacttcagc cagccttgcc acagtggtct acactctact cttcttctgg tacctgctcc      1980
     gccggcgaag gagcatctgt gacctccctg aggatgcctg gtcccgctgg gtgctggctg      2040
     gagccctgtg tactggcggc tgggcactca actacctgcc cttcttcctg atggagaggg      2100
     tgctcttcct ctaccactac ttgccggcac tcaccttcca gatcctgctg ctcccgattg      2160
     tcctgcagca cgccagcgac catctgtgca ggtcccagct gcagaggaat gtcttcagtg      2220
     ccctggttgt agcatggtat tcctccgcat gccatgtgtc caacatgcta cgcccactaa      2280
     cctatgggga cacgtcactc tcaccaggcg agctccgggc ccttcgctgg aaagacagct      2340
     gggatattct gatccgaaag taatagagaa caagaacaca gaagacaagc acacaggaca      2400
     aagcctcaaa gatgtgtttg tctcccacca acaggagcct cagcaggcag gactgccagg      2460
     gtccaggagg aactccaggg actaattcca atttcacctc aagagccctg tccactggtt      2520
     ccttgtttga agcaattgat ttctcttcac acagtgaaga atgtgcccag ccacagcgtt      2580
     acccatgagg cccaactctg acccagccag agtttgagct gccagtgtag gaaccaccaa      2640
     ggcaggaggg gcacccagcc agggaaggag tgggggggac tcaggacgag ctgcgggcct      2700
     actatagggc cttagccctg tcatttatgg ggcccacagt gccacacctc attgggcaca      2760
     ggcacagcca ccctctgtaa accctgaaag ctgccagcca tccacagact cctgagccaa      2820
     ctctaaagag tcctgggaga ctgcagccac ctaactgcca cggccaaggt gtcgtccatt      2880
     cacttcctta cctttaatgt aaataaaaca ggacaaattg t                          2921
//