Dbfetch

ID   AK033515; SV 1; linear; mRNA; HTC; MUS; 2937 BP.
XX
AC   AK033515;
XX
DT   18-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 14)
XX
DE   Mus musculus adult male colon cDNA, RIKEN full-length enriched library,
DE   clone:9030417M10 product:UDP-GALNAC:POLYPEPTIDE
DE   N-ACETYLGALACTOSAMINYLTRANSFERASE T9 (EC 2.4.1.41) homolog [Rattus
DE   norvegicus], full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2937
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-JUL-2001) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; e222984331092f3f80b27883fe0913d7.
DR   Ensembl-Gn; ENSMUSG00000020520; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000066987; mus_musculus.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=9030417M10
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2937
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /sex="male"
FT                   /dev_stage="adult"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="9030417M10"
FT                   /tissue_type="colon"
FT                   /db_xref="taxon:10090"
FT   CDS             <1..1587
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="UDP-GALNAC:POLYPEPTIDE
FT                   N-ACETYLGALACTOSAMINYLTRANSFERASE T9 (EC 2.4.1.41) homolog
FT                   [Rattus norvegicus] (SPTR|Q925R7, evidence: FASTY, 98.9%ID,
FT                   87.5%length, match=1584)"
FT                   /note="putative"
FT                   /note="start codon is not identified"
FT                   /db_xref="GOA:Q6P9S7"
FT                   /db_xref="InterPro:IPR000772"
FT                   /db_xref="InterPro:IPR001173"
FT                   /db_xref="InterPro:IPR029044"
FT                   /db_xref="MGI:MGI:1890480"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q6P9S7"
FT                   /protein_id="BAC28334.1"
FT                   /translation="NKEAIRRDAQRVGYGEQGKPYPMTDAERVDQAYRENGFNIYVSDK
FT                   ISLNRSLPDIRHPNCNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAE
FT                   IVLVDDFSDREHLKKPLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVVTFLDS
FT                   HCEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRI
FT                   PIPPELQKADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMC
FT                   GGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEYIYQRRPEYRHL
FT                   SAGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCTDT
FT                   KLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPGDPQHTKKFCFDAVSHTSPV
FT                   TLYDCHSMKGNQLWKYRKDKTLYHPVSGSCMDCSESDHRVFMNTCNPSSLTQQWLFEHT
FT                   NSTVLENFNKN"
XX
SQ   Sequence 2937 BP; 683 A; 819 C; 835 G; 600 T; 0 other;
     aacaaggagg ctatcaggag ggacgcacag cgcgtaggat atggagaaca ggggaagcct        60
     taccccatga ccgatgccga gagagtggac caggcgtacc gggaaaatgg atttaacatc       120
     tacgtcagtg ataaaatctc cctgaatcgc tctctccccg atatccggca cccaaactgt       180
     aacagcaagc tctacctgga gacgctcccc aacaccagca tcatcatccc cttccacaac       240
     gagggctggt cctcactgct gcgcactgtt cacagcgtgc tcaaccgctc gcccccagag       300
     ctagtggcgg agattgtgtt ggtcgatgac ttcagtgatc gagagcacct gaagaagccc       360
     ctggaggact acatggccct tttccccagc gtgaggattc tccggaccaa gaaacgggaa       420
     ggcctgataa ggacccgaat gctgggggca tcagctgcca ccggagatgt cgtcactttc       480
     ttagattcac actgtgaagc caacgtcaac tggctccccc ctttgcttga ccgcattgct       540
     cggaaccgga agaccatcgt gtgcccaatg atcgatgtca tcgaccacga tgacttccgg       600
     tatgagactc aggctgggga cgccatgcgt ggtgccttcg actgggaaat gtattacaaa       660
     cggatcccaa tccctccaga gctgcagaag gctgacccca gtgacccatt tgagtctccc       720
     gtgatggctg gagggttgtt cgctgtggac cggaaatggt tctgggagtt gggtggctat       780
     gaccctggct tggagatctg gggaggagag cagtatgaga tctccttcaa ggtgtggatg       840
     tgtgggggcc gcatggagga catcccctgc tccagagtgg gccacatcta caggaagtat       900
     gtgccctaca aggtccctgc cggagtcagc ctggcccgga acctgaagcg agtggcagag       960
     gtatggatgg atgaatacgc agaatacatc taccagcgca ggccggagta ccgccacctc      1020
     tcagctggag atgtcgtggc ccagaaaaag ctccgagtct ccctcaactg taagagcttc      1080
     aagtggttta tgaccaaaat tgcctgggac ctgcccaagt tctacccacc cgtggaacct      1140
     ccggctgcgg catgggggga gattcgcaat gtgggcacag gactgtgcac agacacgaag      1200
     cttggcacac tgggctcccc actgaggctc gagacctgca tccggggccg aggcgaggct      1260
     gcttggaaca gtatgcaggt ctttaccttc acctggcggg aggacatccg gcctggagac      1320
     cctcagcaca ccaagaagtt ttgcttcgac gctgtctccc acaccagccc agtcaccctc      1380
     tacgactgcc acagcatgaa gggcaaccag ctgtggaaat accgcaagga caagacgctg      1440
     taccaccctg tgagcggcag ctgcatggac tgcagcgaga gtgaccacag ggtcttcatg      1500
     aacacctgca atccctcctc cctcacccag cagtggctct ttgaacacac caactcgacg      1560
     gtcttagaga atttcaacaa gaactgagtc cttagacctt gacaaacccc tcaggttcct      1620
     gtggagctca taaaccttcc tcctgtggga gaagaggcat cggtgggcac caaggtgctg      1680
     ggttcttgaa ggagtgacat ggtggggcca cggggagaac agaccatacc aatggctctc      1740
     caagaaggcg gagcctgctc acatcatagc agagctcacc agcctgtcag catgttccct      1800
     aagtgttagg aatcgcctgg gcagccttga gccatgaggt tgcccttgca gacaggacag      1860
     tgccctagaa aggaaggtgg tatcggcctt gggacagctt aaacctgtgt gccagctgct      1920
     ggcaaggaag tctgcctgtt cttgagggaa ctgctagaat tgcccagctt ctgtgttggt      1980
     ccagggcaat gagaggccat tgggacttcc gagtgctcca tagctagcac tggccagcaa      2040
     ccaggcgagg ggccccttcc tctgcagcca gatgtaaagg atgtctcccc tggctctctg      2100
     ggtatttcag atgcccgttc tagtttcagg gctacctggg caccaggtcg tcagggctag      2160
     ttcagccggc taagtagacc tccagaccga acagctccca gccaatgctt ccaaagccct      2220
     ttctccgtgc ttttccttgg cggctccctg ccttggagag gaaccggtgc tgtgagggat      2280
     cacctccaga gtctcctggg gggtggcttc ccccagataa tgactgctgt ggttccaacc      2340
     tcacagaacc ctggagtttc tggaaagttc tgtggttctg ttgaaagcat agatgagcca      2400
     tggcacgtgt gggcatataa cgcaagcagc caggtgtcag gcaacccctg acccaagcag      2460
     aggcggtagg gcaccagaag cttcggtcca acccaatgcc gtcacagcca cagcctcgtg      2520
     aggtgtgagg acatcacctt tagccccatg tctcaaacat gaactcaccc tccaaggagc      2580
     tggtcacttg ccagtgaagt aactgagaca tggtcctgag gccagagctc tgccttagag      2640
     atgctgggat cggggtcatc gggtccccag gctcgtcggg ttggtgctgt agaaccacct      2700
     ccacgtagcc cctgtgctgt cacagctcac ttgtggattt ctagtgccct tttccatatg      2760
     agcactctgt ctgacaagtg gccggctggg aaggagtcaa aaaggggagt gggtgggtgt      2820
     ccagaatcca gcagggatgc gtgttgcagc caaggggagt tgagtcctca gagagcaatg      2880
     cggccttgtt tcaataaaaa ccgtgccttt tacaaagaga aaaaaaaaaa aaaaaag         2937
//