ID   BC046322; SV 1; linear; mRNA; STD; MUS; 2594 BP.
XX
AC   BC046322;
XX
DT   14-FEB-2003 (Rel. 74, Created)
DT   21-OCT-2008 (Rel. 97, Last updated, Version 8)
XX
DE   Mus musculus UDP-Gal:betaGlcNAc beta 1,3-galactosyltransferase, polypeptide
DE   2, mRNA (cDNA clone MGC:54549 IMAGE:6392262), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2594
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2594
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (31-JAN-2003) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 7e60921220e1b4dddad80306d4aa7ade.
DR   Ensembl-Gn; ENSMUSG00000033849; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0016561; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0016544; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0016509; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0016504; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0016329; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0016965; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0015908; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0016303; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0016409; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0016407; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0016480; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0016433; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0017001; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0015694; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0015971; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000038252; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0022489; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0022459; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0022426; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0022429; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0022239; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0022941; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0021766; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0022179; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0022304; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0022314; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0022406; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0022283; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0022979; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0021492; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0021807; mus_musculus_wsbeij.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Susan L. Sullivan, PhD.
CC   cDNA Library Preparation: ResGen, Invitrogen Corp
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: National Institutes of Health Intramural
CC   Sequencing Center (NISC),
CC   Gaithersburg, Maryland;
CC   Web site: http://www.nisc.nih.gov/
CC   Contact: nisc_mgc@nhgri.nih.gov
CC   Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
CC   Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
CC   Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
CC   Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
CC   Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
CC   McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
CC   Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
CC   Young,A., Zhang,L.-H. and Green,E.D.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 100 Row: g Column: 12
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 31542172.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2594
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_129"
FT                   /clone="MGC:54549 IMAGE:6392262"
FT                   /tissue_type="Olfactory epithelium, neonatal mouse,
FT                   C57Bl/6"
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..2594
FT                   /gene="B3galt2"
FT   CDS             605..1873
FT                   /codon_start=1
FT                   /gene="B3galt2"
FT                   /product="UDP-Gal:betaGlcNAc beta
FT                   1,3-galactosyltransferase, polypeptide 2"
FT                   /db_xref="GOA:O54905"
FT                   /db_xref="InterPro:IPR002659"
FT                   /db_xref="MGI:MGI:1349461"
FT                   /db_xref="UniProtKB/Swiss-Prot:O54905"
FT                   /protein_id="AAH46322.1"
FT                   /translation="MLQWRRRHCCFAKMTWSPKRSLLRTPLTGVLSLVFLFAMFLFFNH
FT                   HDWLPGRPGFKENPVTYTFRGFRSTKSETNHSSLRTIWKEVAPQTLRPHTASNSSNTEL
FT                   SPQGVTGLQNTLSANGSIYNEKGTGHPNSYHFKYIINEPEKCQEKSPFLILLIAAEPGQ
FT                   IEARRAIRQTWGNETLAPGIQIIRVFLLGISIKLNGYLQHAIQEESRQYHDIIQQEYLD
FT                   TYYNLTIKTLMGMNWVATYCPHTPYVMKTDSDMFVNTEYLIHKLLKPDLPPRHNYFTGY
FT                   LMRGYAPNRNKDSKWYMPPDLYPSERYPVFCSGTGYVFSGDLAEKIFKVSLGIRRLHLE
FT                   DVYVGICLAKLRVDPVPPPNEFVFNHWRVSYSSCKYSHLITSHQFQPSELIKYWNHLQQ
FT                   NKHNACANAAKEKAGRYRHRKLH"
FT   misc_difference 2576..2594
FT                   /gene="B3galt2"
FT                   /note="polyA tail: 19 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 2594 BP; 901 A; 490 C; 482 G; 721 T; 0 other;
     ggcaaagcct ttttttttcc cccaatgcaa ctgaaacact aaaccacagc tctgctgctt        60
     aacattgcag ctcagcgcta ttactagaaa tatggatact gagaagagaa tacagcactg       120
     cattgtccag ccgggaatac agcagatgta aagagcttca atgcatcaac tgtcggaaag       180
     agtcaactgt gcaccaaata caacagacag ctacagctct tttgtttagt gaaagagaga       240
     aaatgaaaga aaggaaaaat ctctgaagac tataagatat agacatatga acaagaaggg       300
     taacttgaag acaccccgac agatggacac tttggatact gtgaaaagca atcacaggag       360
     gcagactgtt gggggatgtg cgcatgttcg atagcatcgt tttttgctga agtgatggcg       420
     tgccaaaagt attttcagta gacataatcc tctccatcta atggtctgac caagaaagaa       480
     agaatgacat cgagggacat gtacctgaac cagaagacga tgaatcaagc gcagtattga       540
     ctgaggacgg aacaacagtg tttttggcca cagacatcca ttactgctac tggatactta       600
     caacatgctt cagtggagaa gacgacactg ctgctttgca aaaatgacct ggagccctaa       660
     gaggtctctg ctccggactc cccttacggg tgtgctttct ctagtgtttc tctttgctat       720
     gttcttgttt ttcaatcatc atgactggtt accaggtaga ccaggattca aagaaaatcc       780
     tgtgacatac actttccgag gatttcgttc tacaaaaagt gagacaaacc atagctccct       840
     tcggaccatc tggaaagaag tagctcctca gactctgagg cctcacacag caagcaactc       900
     cagtaacacc gagctatcac cacagggagt cacagggctg cagaacactc tcagtgccaa       960
     tggcagcatt tataatgaaa aaggaactgg acatccaaac tcttaccatt tcaaatatat      1020
     tatcaatgag cctgaaaaat gccaagagaa aagtccattt ttaatactat taatagctgc      1080
     agaacctgga caaatcgaag caagaagagc tatacggcaa acttggggca atgaaacttt      1140
     ggcacctggc atccaaatca tacgggtttt tttgttgggc ataagtatta agctaaatgg      1200
     ctatcttcaa catgcaattc aagaagaaag cagacagtat catgatataa ttcagcagga      1260
     atatttagat acatactata atctgaccat taaaacacta atgggtatga actgggttgc      1320
     aacatactgt ccacatactc cctatgttat gaaaacggac agtgacatgt ttgtcaacac      1380
     agaatactta atacacaagt tactaaagcc agacctgcct cctagacata actattttac      1440
     tggctatcta atgagaggat atgcaccgaa cagaaacaaa gacagtaagt ggtacatgcc      1500
     accagacctt tacccaagtg agcgctaccc tgtcttctgc tcaggaactg gttatgtgtt      1560
     ttctggggat ctggcagaga agatatttaa ggtttcttta ggtatccgtc gtttgcactt      1620
     ggaagatgta tatgtaggga tctgtcttgc caagttgaga gttgatcctg tgccccctcc      1680
     caatgagttc gtgttcaatc actggcgagt ttcttattca agctgtaaat acagccacct      1740
     aattacctct catcagttcc aacctagtga actgataaaa tactggaacc atttacaaca      1800
     aaataagcac aacgcctgtg ccaatgcagc aaaggaaaag gcaggcaggt atcgacaccg      1860
     caaactacac tagaagacta tttttgttca aatgtggagt ctgtaaatat tgcttaaagc      1920
     atgtatagtt aaaaacttga ttatatacat aggacaagtt ttagttcaac tcatcacata      1980
     aaggaattca aagctatttt ttaaattttc tgaataagat aattcataca attgcaaatt      2040
     atgacaaaaa ggtatcccaa aagagtctat ttaaataact gttatgagga gattctctat      2100
     attaacatgc aataataagc atgcatacat aaatggttca agacttacat tagggaccaa      2160
     tacaatgtat ctgcatacat tttctatata aatcttaaga aatgaagaca gtaaaaagat      2220
     tcctagattt acttttgatt tcatcatata actaaatgta aataagacag tactattgat      2280
     tttaaaggaa ctttgtaatt gtgcaatgaa caagttttct gacctgactc agttgcaata      2340
     agatttagtt aagttattcc ataaactcat ttatagcatt caggtgtttg agcaatgcaa      2400
     ttctcattca agaatatact tttaaaaata atttataatt attttaattt cttttattaa      2460
     tacttatcta tactgggaaa attattttga catgatgtga taaatgtgaa aaattaatgt      2520
     gtctcaggct caagttttta taaaatgaat tattaaaggt atcaaaatag ataaaaaaaa      2580
     aaaaaaaaaa aaaa                                                        2594
//