Dbfetch

ID   AK034371; SV 1; linear; mRNA; HTC; MUS; 4427 BP.
XX
AC   AK034371;
XX
DT   18-DEC-2002 (Rel. 74, Created)
DT   07-OCT-2010 (Rel. 106, Last updated, Version 16)
XX
DE   Mus musculus adult male diencephalon cDNA, RIKEN full-length enriched
DE   library, clone:9330184D19 product:UDP-Gal:betaGlcNAc beta
DE   1,3-galactosyltransferase, polypeptide 2, full insert sequence.
XX
KW   CAP trapper; HTC; HTC_FLI.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-4427
RA   Adachi J., Aizawa K., Akimura T., Arakawa T., Bono H., Carninci P.,
RA   Fukuda S., Furuno M., Hanagaki T., Hara A., Hashizume W., Hayashida K.,
RA   Hayatsu N., Hiramoto K., Hiraoka T., Hirozane T., Hori F., Imotani K.,
RA   Ishii Y., Itoh M., Kagawa I., Kasukawa T., Katoh H., Kawai J., Kojima Y.,
RA   Kondo S., Konno H., Kouda M., Koya S., Kurihara C., Matsuyama T.,
RA   Miyazaki A., Murata M., Nakamura M., Nishi K., Nomura K., Numazaki R.,
RA   Ohno M., Ohsato N., Okazaki Y., Saito R., Saitoh H., Sakai C., Sakai K.,
RA   Sakazume N., Sano H., Sasaki D., Shibata K., Shinagawa A., Shiraki T.,
RA   Sogabe Y., Tagami M., Tagawa A., Takahashi F., Takaku-Akahira S.,
RA   Takeda Y., Tanaka T., Tomaru A., Toya T., Yasunishi A., Muramatsu M.,
RA   Hayashizaki Y.;
RT   ;
RL   Submitted (16-JUL-2001) to the INSDC.
RL   Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical
RL   Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22
RL   Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL   
RL   :http://www.osc.riken.jp/
XX
RN   [2]
RX   PUBMED; 16141072.
RG   The FANTOM Consortium, Riken Genome Exploration Research Group and Genome
RG   Science Group (Genome Network Project Core Group)
RA   ;
RT   "The Transcriptional Landscape of the Mammalian Genome";
RL   Science, e1252229 309(5740):1559-1563(2005).
XX
RN   [3]
RX   DOI; 10.1126/science.1112009.
RX   PUBMED; 16141073.
RG   RIKEN Genome Exploration Research Group and Genome Science Group (Genome
RG   Network Project Core Group) and the FANTOM Consortium
RA   ;
RT   "Antisense Transcription in the Mammalian Transcriptome";
RL   Science, e1252229 309(5740):1564-1566(2005).
XX
RN   [4]
RX   PUBMED; 12466851.
RG   The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase
RG   I and II Team
RA   ;
RT   "Analysis of the mouse transcriptome based on functional annotation of
RT   60,770 full-length cDNAs";
RL   Nature 420(6915):563-573(2002).
XX
RN   [5]
RX   PUBMED; 11217851.
RG   The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM
RG   Consortium
RA   ;
RT   "Functional annotation of a full-length mouse cDNA collection";
RL   Nature 409(6821):685-690(2001).
XX
RN   [6]
RX   DOI; 10.1016/S0076-6879(99)03004-9.
RX   PUBMED; 10349636.
RA   Carninci P., Hayashizaki Y.;
RT   "High-efficiency full-length cDNA cloning";
RL   Meth. Enzymol. 303:19-44(1999).
XX
RN   [7]
RX   DOI; 10.1101/gr.145100.
RX   PUBMED; 11042159.
RA   Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA   Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT   "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT   full-length cDNA libraries for rapid discovery of new genes";
RL   Genome Res. 10(10):1617-1630(2000).
XX
RN   [8]
RX   DOI; 10.1101/gr.152600.
RX   PUBMED; 11076861.
RA   Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA   Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA   Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA   Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA   Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA   Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA   Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT   "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT   pipeline with 384 multicapillary sequencer";
RL   Genome Res. 10(11):1757-1771(2000).
XX
DR   MD5; 7167835774321485efdd303e6a6cd088.
DR   Ensembl-Gn; ENSMUSG00000033849; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0016561; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0016544; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0016509; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0016504; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0016329; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0016965; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0015908; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0016303; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0016409; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0016407; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0016480; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0016433; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0017001; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0015694; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0015971; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000038252; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0022489; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0022459; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0022426; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0022429; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0022239; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0022941; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0021766; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0022179; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0022304; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0022314; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0022406; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0022283; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0022979; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0021492; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0021807; mus_musculus_wsbeij.
XX
CC   cDNA library was prepared and sequenced in Mouse Genome
CC   Encyclopedia Project of Genome Exploration Research Group in Riken
CC   Genomic Sciences Center and Genome Science Laboratory in RIKEN.
CC   Division of Experimental Animal Research in Riken contributed to
CC   prepare mouse tissues.
CC   Please visit our web site for further details.
CC   URL:http://www.osc.riken.jp/
CC   URL:http://fantom.gsc.riken.jp/
CC   clone information is available at:
CC   http://fantom.gsc.riken.jp/3/db/annotate/
CC   main.cgi?masterid=9330184D19
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4427
FT                   /organism="Mus musculus"
FT                   /strain="C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /sex="male"
FT                   /dev_stage="adult"
FT                   /clone_lib="RIKEN full-length enriched mouse cDNA library"
FT                   /clone="9330184D19"
FT                   /tissue_type="diencephalon"
FT                   /db_xref="taxon:10090"
FT   CDS             725..1993
FT                   /codon_start=1
FT                   /transl_table=1
FT                   /note="UDP-Gal:betaGlcNAc beta 1,3-galactosyltransferase,
FT                   polypeptide 2 (MGD|MGI:1349461 GB|NM_020025, evidence:
FT                   BLASTN, 99%, match=1269)"
FT                   /note="putative"
FT                   /db_xref="GOA:O54905"
FT                   /db_xref="InterPro:IPR002659"
FT                   /db_xref="MGI:MGI:1349461"
FT                   /db_xref="UniProtKB/Swiss-Prot:O54905"
FT                   /protein_id="BAC28688.1"
FT                   /translation="MLQWRRRHCCFAKMTWSPKRSLLRTPLTGVLSLVFLFAMFLFFNH
FT                   HDWLPGRPGFKENPVTYTFRGFRSTKSETNHSSLRTIWKEVAPQTLRPHTASNSSNTEL
FT                   SPQGVTGLQNTLSANGSIYNEKGTGHPNSYHFKYIINEPEKCQEKSPFLILLIAAEPGQ
FT                   IEARRAIRQTWGNETLAPGIQIIRVFLLGISIKLNGYLQHAIQEESRQYHDIIQQEYLD
FT                   TYYNLTIKTLMGMNWVATYCPHTPYVMKTDSDMFVNTEYLIHKLLKPDLPPRHNYFTGY
FT                   LMRGYAPNRNKDSKWYMPPDLYPSERYPVFCSGTGYVFSGDLAEKIFKVSLGIRRLHLE
FT                   DVYVGICLAKLRVDPVPPPNEFVFNRWRVSYSSCKYSHLITSHQFQPSELIKYWNHLQQ
FT                   NKHNACANAAKEKAGRYRHRKLH"
FT   regulatory      4409..4414
FT                   /note="putative"
FT                   /regulatory_class="polyA_signal_sequence"
FT   polyA_site      4427
FT                   /note="putative"
XX
SQ   Sequence 4427 BP; 1570 A; 781 C; 766 G; 1310 T; 0 other;
     gaaaactgct gcctgcctgt gcagcacgga agaacggtgc atttcacatt acagaccaaa        60
     accccaggaa aacagctcaa aaatccttcc tgcagctgcc aggcaaccac gacggagaaa       120
     ggcaaagcct ttttttttcc cccaatgcaa ctgaaacact aaaccacagc tctgctgctt       180
     aacattgcag ctcagcgcta ttactagaaa tatggatact gagaagagaa tacagcactg       240
     cattgtccag ccgggaatac agcagatgta aagagcttca atgcatcaac tgtcggaaag       300
     agtcaactgt gcaccaaata caacagacag ctacagctct tttgtttagt gaaagagaga       360
     aaatgaaaga aaggaaaaat ctctgaagac tataagatat agacatatga acaagaaggg       420
     taacttgaag acaccccgac agatggacac tttggatact gtgaaaagca atcacaggag       480
     gcagactgtt gggggatgtg cgcatgttcg atagcatcgt tttttgctga agtgatggcg       540
     tgccaaaagt attttcagta gacataatcc tctccatcta atggtctgac caagaaagaa       600
     agaatgacat cgagggacat gtacctgaac cagaagacga tgaatcaagc gcagtattga       660
     ctgaggacgg aacaacagtg tttttggcca cagacatcca ttactgctac tggatactta       720
     caacatgctt cagtggagaa gacgacactg ctgctttgca aaaatgacct ggagccctaa       780
     gaggtctctg ctccggactc cccttacggg tgtgctttct ctagtgtttc tctttgctat       840
     gttcttgttt ttcaatcatc atgactggtt accaggtaga ccaggattca aagaaaatcc       900
     tgtgacatac actttccgag gatttcgttc tacaaaaagt gagacaaacc atagctccct       960
     tcggaccatc tggaaagaag tagctcctca gactctgagg cctcacacag caagcaactc      1020
     cagtaacacc gagctatcac cacagggagt cacagggctg cagaacactc tcagtgccaa      1080
     tggcagcatt tataatgaaa aaggaactgg acatccaaac tcttaccatt tcaaatatat      1140
     tatcaatgag cctgaaaaat gccaagagaa aagtccattt ttaatactat taatagctgc      1200
     agaacctgga caaatcgaag caagaagagc tatacggcaa acttggggca atgaaacttt      1260
     ggcacctggc atccaaatca tacgggtttt tttgttgggc ataagtatta agctaaatgg      1320
     ctatcttcaa catgcaattc aagaagaaag cagacagtat catgatataa ttcagcagga      1380
     atatttagat acatactata atctgaccat taaaacacta atgggtatga actgggttgc      1440
     aacatactgt ccacatactc cctatgttat gaaaacggac agtgacatgt ttgtcaacac      1500
     agaatactta atacacaagt tactaaagcc agacctgcct cctagacata actattttac      1560
     tggctatcta atgagaggat atgcaccgaa cagaaacaaa gacagtaagt ggtacatgcc      1620
     accagacctt tacccaagtg agcgctaccc tgtcttctgc tcaggaactg gttatgtgtt      1680
     ttctggggat ctggcagaga agatatttaa ggtttcttta ggtatccgtc gtttgcactt      1740
     ggaagatgta tatgtaggga tctgtcttgc caagttgaga gttgatcctg tgccccctcc      1800
     caatgagttc gtgttcaatc gctggcgagt ttcttattca agctgtaaat acagccacct      1860
     aattacctct catcagttcc aacctagtga actgataaaa tactggaacc atttacaaca      1920
     aaataagcac aacgcctgtg ccaatgcagc aaaggaaaag gcaggcaggt atcgacaccg      1980
     caaactacac tagaagacta tttttgttca aatgtggagt ctgtaaatat tgcttaaagc      2040
     atgtatagtt aaaaacttga ttatatacat aggacaagtt ttagttcaac tcatcacata      2100
     aaggaattca aagctatttt ttaaattttc tgaataagat aattcataca attgcaaatt      2160
     atgacaaaaa ggtatcccaa aagagtctat ttaaataact gttatgagga gattctctat      2220
     attaacatgc aataataagc atgcatacat aaatggttca agacttacat tagggaccaa      2280
     tacaatgtat ctgcatacat tttctatata aatcttaaga aatgaagaca gtaaagagat      2340
     tcctagattt acttttgatt tcatcatata actaaatgta aataagacag tactattgat      2400
     tttaaaggaa ctttgtaatt gtgcaatgaa caagttttct gacctgactc agttgcaata      2460
     agatttagtt aagttattcc ataaattcat ttatagcatt caggtgtttg agcaatgcaa      2520
     ttctcattca agaatatact tttaaaaata atttataatt attttaattt cttttattaa      2580
     tacttatcta tactgggaaa attattttga catgatgtga taaatgtgaa aaattaatgt      2640
     gtctcaggct caagttttta taaaatgaat tattaaaggt atcaaaatag ataaattttg      2700
     cttcctcatt ggtattaagg gtcaagacca tatttcaatg gattttgcct ttaaaatatt      2760
     acactcttgg ggagtgcttt tatcttcaac tatacctgac ttaaagtatg tctttaaaat      2820
     agaatcatat tcctctagct gaaatacaag tagtaacata acagcacagg ggtaatgtgt      2880
     ggtcttaaat aaggaagaaa aaaaggaata caaatagtca tttcattcaa tctattttaa      2940
     agttgtatat ctgccaaaca gcgagatata ggaaaagctt atattctgag agcagtgtac      3000
     gtcagacaac atttaatcat tttatggcac ttatttcttt ctctcagtgt caagtttata      3060
     aaccttctgt ttttaatttg gtagatgaca aatattcgca tcaccaattc aaaaccattt      3120
     agagaatggt ggggagaaag ctacatatgt ttgataatca tttgctttta gagtggacaa      3180
     tatacttaat gcactttaac caaggtacta ttggttttta aaaggtaaac aacaaagccc      3240
     tgaactgaat gctatacaaa tgttctcata tttcatttgg ccctatcatg aattgtaaca      3300
     tcaataaaca caagcccagt aactttgaaa cagtgtatcc tatgtatatt tataaatttt      3360
     aattgagtca gaatatcttc tatcagaatt ctaatattcg tttttatcac taaatctagt      3420
     atgaaaggct aaattgacag acacacaaaa agctttattt attagtaata cagagttcta      3480
     cttatattaa atgtatagct cattttttaa ataagaagcc taagtattct attccacacc      3540
     aaatggttat gaataatcag ctataaaaca tatccagctc ttagctaatg tttttttaat      3600
     ttattctatg taattgtagt gaacagaatt gcctcacaaa aagacataca cagaggataa      3660
     caccaaacca caccagtgat caaagacaga gagagaccct gacagtgtac tgagtctctc      3720
     tgatttatac cacaaaggtt taaaatgaaa atgggaacta tgtatgtttg tgagcgttcc      3780
     tgcacaagta tgtaaacaaa aagagagctc ctaagtgata tttaaaaact ataatagtat      3840
     aggattactg ggattaaaat actgacttaa aaaacacaga gttcttaaaa atacttaagt      3900
     gcttactaaa attatttcca ttaatttgaa aaaataattt aatcatggtg tgagtcctca      3960
     gagagcatta aaccagtcaa gttattaatg tcaaaagtct gtacttttga ctttgggaga      4020
     acacacgcta gcatagcttc agcacttgga aaataaggca ggaagaaaga gagttcaagg      4080
     caattttagg ttaaacaata aaatcctaag aaaaccatca aaccttactt taaaataaaa      4140
     attctattgt taatttattt ttacatttaa ttttcaataa tataaatgct aactttataa      4200
     gctgctatat aatgtctcac cagttgttaa ttatatcttg tttaaataaa atttttgtat      4260
     gtaagaaatg tttgaaaatg actatattat attgatttga ccaaaagaaa gtttacagaa      4320
     tcaactgtat aacactaaac tcaaagtgtt aatagacatt tcaaaaggaa aatgcaaact      4380
     tcaatattta caacattagt ttgtttgaaa taaaattaga agtaatg                    4427
//