Dbfetch

ID   AK132327; SV 1; linear; mRNA; HTC; MUS; 3433 BP.
XX
AC   AK132327;
XX
DT   06-SEP-2005 (Rel. 85, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 11)
XX
DE   Mus musculus 15 days embryo head cDNA, RIKEN full-length enriched library,
DE   clone:4022415M04 product:Exostosin-1 (EC 2.4.1.224) (EC 2.4.1.225)
DE   (Glucuronosyl-N- acetylglucosaminyl-proteoglycan/N-
DE   acetylglucosaminyl-proteoglycan 4- alpha-N-acetylglucosaminyltransferase)
DE   (Multiple exostoses protein 1 homolog), full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3433
RA   Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA   Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA   Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA   Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA   Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RT   ;
RL   Submitted (30-MAR-2004) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 8d0e1d8c77b8cd990753b5e9f6f68a12.
DR   Ensembl-Gn; ENSMUSG00000061731; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0021843; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0021803; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0021784; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0021810; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0021584; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0022249; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0021101; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0021553; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0021677; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0021655; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0021747; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0021679; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0022272; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0020846; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0021156; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000077273; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0040935; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0040895; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0040864; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0040894; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0040629; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0041361; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0040687; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0040546; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0040697; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0040660; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0040767; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0040642; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0041408; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0040279; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0040030; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=4022415M04
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3433
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /dev_stage="15 days embryo"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="4022415M04"
FT                   /tissue_type="head"
FT                   /db_xref="taxon:10090"
FT   CDS             798..3038
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="Exostosin-1 (EC 2.4.1.224) (EC 2.4.1.225)
FT                   (Glucuronosyl-N- acetylglucosaminyl-proteoglycan/N-
FT                   acetylglucosaminyl-proteoglycan 4-
FT                   alpha-N-acetylglucosaminyltransferase) (Multiple exostoses
FT                   protein 1 homolog) (UniProt|P97464, evidence: FASTY,
FT                   100%ID, 100%length, match=2238)"
FT                   /note="putative"
FT                   /db_xref="GOA:Q3V1P4"
FT                   /db_xref="InterPro:IPR004263"
FT                   /db_xref="InterPro:IPR015338"
FT                   /db_xref="InterPro:IPR027670"
FT                   /db_xref="InterPro:IPR029044"
FT                   /db_xref="MGI:MGI:894663"
FT                   /db_xref="UniProtKB/TrEMBL:Q3V1P4"
FT                   /protein_id="BAE21106.1"
FT                   /translation="MQAKKRYFILLSAGSCLALLFYFGGVQFRASRSHSRREEHSGRNG
FT                   LHQPSPDHFWPRFPDALRPFFPWDQLENEDSSVHISPRQKRDANSSIYKGKKCRMESCF
FT                   DFTLCKKNGFKVYVYPQQKGEKIAESYQNILAAIEGSRFYTSDPSQACLFVLSLDTLDR
FT                   DQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFDIGQAMLAKASIS
FT                   TENFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYLTGIGSDTRN
FT                   ALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRGR
FT                   RLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD
FT                   KILALRQQTQFLWEAYFSSVEKIVLTTLEIIQDRIFKHISRNSLIWNKHPGGLFVLPQY
FT                   SSYLGDFPYYYANLGLKPPSKFTAVIHAVTPLVSQSQPVLKLLVAAAKSQYCAQIIVLW
FT                   NCDKPLPAKHRWPATAVPVIVIEGESKVMSSRFLPYDNIITDAVLSLDEDTVLSTTEVD
FT                   FAFTVWQSFPERIVGYPARSHFWDNSKERWGYTSKWTNDYSMVLTGAAIYHKYYHYLYS
FT                   HYLPASLKNMVDQLANCEDILMNFLVSAVTKLPPIKVTQKKQYKETMMGQTSRASRWAD
FT                   PDHFAQRQSCMNTFASWFGYMPLIHSQMRLDPVLFKDQVSILRKKYRDIERL"
FT   regulatory      3410..3415
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      3433
FT                   /note="putative"
XX
SQ   Sequence 3433 BP; 876 A; 864 C; 879 G; 814 T; 0 other;
     ggtgtaagtg taacattcag aaccgggtaa cattcggcga ccgaacgcgg cggtcggcag        60
     tgatagcgcg gggcctgcga agcaccgctc ggggccggca ctgcccgcgg ggaggacgcg       120
     ccgccgccgc cgccccgcgc cgcctcgtgc cgccggccgg gccgccgcgc gcccggggag       180
     ccggccccgt gagcgcagga gtaaacaccg gcggagtgtg ggagccgctg cagaagggaa       240
     taaagagaca tgcagggatt tgtgaggtta cggcgcccca gctgcaagat gcactagccg       300
     gctgaagccg gggcggattg acttgttgga accgtagtgc tctgcaccgg agtgtggatg       360
     agttgaagtt gctttccccg ggctcgttct ccacgccgcc gagaggaatc cgagcggaga       420
     ggcagtcact tcgtcttgcc attgattggg tatcgggagc tttttcttcc ctgctctctt       480
     tcttttcctc cgtcttgttg catgcaagaa aattacagtc cgctgttcgc ccgccctggg       540
     tgcgagatct tctgcccggc tctctcccgt gcattgtgca gcccaaagat gaaagaccga       600
     aggggagaaa gttaaagaaa tcgcccacat gcgctggatc agtccacggc ttggggaaag       660
     gcatccagag aaggtgggag cggagagttt gaagtcttta caggcgggaa gatggcggac       720
     tggagctgaa agtgttgatt gggaaacttg ggtgattctt gtgtttattt acaatcctct       780
     tgacccaggc aggacacatg caggccaaaa aacgctattt catcctgctc tcagctggct       840
     cttgtctcgc ccttttgttt tattttggag gcgtgcagtt tagggcatcg aggagccaca       900
     gccggagaga agagcacagt ggtcggaatg gcttgcacca gcccagtccg gatcatttct       960
     ggccccgctt cccggacgct ctgcgccctt tctttccttg ggatcaattg gaaaacgagg      1020
     attccagcgt gcacatttcc ccccggcaga agcgagacgc caactcgagc atctacaaag      1080
     gcaagaagtg ccgcatggag tcctgcttcg atttcaccct ttgcaagaaa aacggcttca      1140
     aagtctacgt gtacccgcag cagaaagggg agaaaatcgc cgaaagttac caaaacattc      1200
     tagcggccat cgagggctcc aggttctaca cctcggaccc cagccaggcg tgcctctttg      1260
     tcttgagtct ggatacttta gacagagacc agttatcacc tcagtatgtg cacaatttga      1320
     gatccaaagt gcagagtctc cacttgtgga acaatggtag gaatcattta atttttaatt      1380
     tatattctgg cacttggcct gactacactg aggacgtggg gtttgacatc ggccaggcga      1440
     tgctggccaa agccagcatc agtactgaaa acttccgacc aaactttgat gtttctattc      1500
     ccctcttttc taaggatcat cccaggacag gaggggagag ggggtttttg aaatttaaca      1560
     ccatccctcc tctcaggaag tacatgctgg tattcaaggg gaagcggtac ctgacaggga      1620
     tagggtcaga caccaggaat gccttatatc acgtccataa cggggaggac gtcttgctcc      1680
     tcaccacctg caagcatggc aaagactggc aaaagcacaa ggattctcgc tgtgacagag      1740
     acaacaccga gtatgagaaa tatgattatc gggaaatgct gcacaatgcc actttctgtc      1800
     tggttcctcg tggtcgcagg cttgggtcct tcagattcct ggaggctttg caggctgcct      1860
     gtgtccctgt aatgctcagc aacggatggg agttgccatt ctccgaagtg attaattgga      1920
     accaagctgc cgtcataggc gatgagagat tgctattaca gattccttct acaatcaggt      1980
     ctattcatca ggataaaatc ctagcactta gacagcagac acagttcttg tgggaggctt      2040
     atttttcttc agttgagaag attgtattaa ctacactaga gattattcag gacagaatat      2100
     tcaagcacat atcacgtaac agtttaatat ggaacaaaca tcctggagga ttgttcgtcc      2160
     taccgcagta ttcatcttac ctgggagatt tcccttacta ctatgctaat ttaggtttaa      2220
     agcccccctc caaattcact gcagtcatcc atgctgtgac tcccctggtc tctcagtccc      2280
     agccagtgtt gaagcttctt gtggctgcag ccaaatccca gtactgtgcg cagatcatag      2340
     ttctgtggaa ttgtgacaag cctctaccag ccaaacatcg ctggcctgcc actgccgtgc      2400
     ctgtcatcgt cattgaagga gaaagcaagg ttatgagcag ccggtttctg ccctatgaca      2460
     acatcatcac tgatgctgtg ctcagcctgg atgaggacac tgtgctttca actacggaag      2520
     tggattttgc cttcaccgta tggcagagct tcccagagag gattgtggga tatcctgctc      2580
     gcagtcattt ctgggataac tcaaaggagc ggtggggata tacatccaag tggacgaatg      2640
     actactccat ggtgttgaca ggagctgcta tctaccacaa atattatcac tacctgtatt      2700
     cccattacct gccagccagc ctgaagaaca tggtagacca actggccaac tgtgaggaca      2760
     ttctcatgaa tttcctggtg tctgctgtga caaaattgcc tccaatcaaa gtgacccaga      2820
     agaaacagta taaggagaca atgatgggac agacttcccg agcatcccgc tgggccgacc      2880
     ctgaccactt tgcccagcga cagagctgca tgaatacatt tgccagctgg tttggctaca      2940
     tgccgctgat ccattctcag atgaggctgg acccggtcct ctttaaagac caagtctcaa      3000
     ttctgaggaa gaaatacaga gacattgaac gactttgagg aagcccaccg agtgggggag      3060
     gggaagcaag acgggcgtcc agctgctctc tcctccttcc caatgcagat ccgctcacgc      3120
     ccagcagtgg agccagactg tgccaagtat caaaaaatca aaaaaaatca aaaaacaaaa      3180
     aaaacaaaaa agacaaaaaa agaaaaagga aaaaataaac agcttagatg aacacagtga      3240
     caaaactcgg ctggaatcct ggctcctggg gctgcgccag actgctcaat tcacctcact      3300
     ggcttctgtg tcccacaact aggttgtgta cagtttaatt atggaacatt aaataattat      3360
     ttttgaaatg attgctatgc aggtttaaac ttttttaatg atcaaaacta ttaaaaacca      3420
     gagttctttc ttt                                                         3433
//