ID   BC003835; SV 1; linear; mRNA; STD; MUS; 1850 BP.
XX
AC   BC003835;
XX
DT   17-MAR-2001 (Rel. 67, Created)
DT   16-JUL-2006 (Rel. 88, Last updated, Version 16)
XX
DE   Mus musculus UDP-GalNAc:betaGlcNAc beta 1,3-galactosaminyltransferase,
DE   polypeptide 1, mRNA (cDNA clone MGC:6335 IMAGE:3486016), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-1850
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-1850
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (28-FEB-2001) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 86d00a958a3b61ba3e8da762001d623b.
DR   Ensembl-Gn; ENSMUSG00000043300; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0027296; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0027258; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0027226; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0027269; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0027013; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0027715; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0026460; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0026988; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0027124; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0027092; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0027232; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0027111; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0027773; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0026194; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0026538; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000061826; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0061187; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0061211; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0061147; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0061136; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0060830; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0061614; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0061124; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0060806; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0060902; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0060887; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0061045; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0060856; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0061800; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0060679; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0060104; mus_musculus_wsbeij.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Lothar Hennighausen Ph.D., Robin Humphreys
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Sequencing Group at the Stanford Human Genome
CC   Center, Stanford University School of Medicine, Stanford, CA  94305
CC   Web site:       http://www-shgc.stanford.edu
CC   Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
CC   Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
CC   R. M.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 7 Row: b Column: 7
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 31560375.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1850
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="mix FVB/N, C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Mam5"
FT                   /clone="MGC:6335 IMAGE:3486016"
FT                   /tissue_type="Mammary tumor. WAP-TGF alpha model. 7 months
FT                   old, gross tissue."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..1850
FT                   /gene="B3galnt1"
FT                   /note="synonym: b3GT3"
FT   misc_difference 1..9
FT                   /gene="B3galnt1"
FT                   /note="9 bases at the 5' end do not align to the mouse
FT                   genome."
FT   misc_difference 11
FT                   /gene="B3galnt1"
FT                   /note="'C' in cDNA is 'T' in the mouse genome."
FT   misc_difference 12^13
FT                   /gene="B3galnt1"
FT                   /note="1 base in the mouse genome, T, is not found in
FT                   cDNA."
FT   CDS             66..1061
FT                   /codon_start=1
FT                   /gene="B3galnt1"
FT                   /product="UDP-GalNAc:betaGlcNAc beta
FT                   1,3-galactosaminyltransferase, polypeptide 1"
FT                   /db_xref="GOA:Q920V1"
FT                   /db_xref="InterPro:IPR002659"
FT                   /db_xref="MGI:MGI:1349405"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q920V1"
FT                   /protein_id="AAH03835.3"
FT                   /translation="MAPAVLTALPNRMSLRSLKWSLLLLSLLSFLVIWYLSLPHYNVIE
FT                   RVNWMYFYEYEPIYRQDFRFTLREHSNCSHQNPFLVILVTSRPSDVKARQAIRVTWGEK
FT                   KSWWGYEVLTFFLLGQQAEREDKTLALSLEDEHVLYGDIIRQDFLDTYNNLTLKTIMAF
FT                   RWVMEFCPNAKYIMKTDTDVFINTGNLVKYLLNLNHSEKFFTGYPLIDNYSYRGFFHKN
FT                   HISYQEYPFKVFPPYCSGLGYIMSGDLVPRVYEMMSHVKPIKFEDVYVGICLNLLKVDI
FT                   HIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLRNTTCHY"
FT   misc_difference 593
FT                   /gene="B3galnt1"
FT                   /note="'T' in cDNA is 'C' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 1149
FT                   /gene="B3galnt1"
FT                   /note="'G' in cDNA is 'A' in the mouse genome."
FT   misc_difference 1485..1486
FT                   /gene="B3galnt1"
FT                   /note="2 bases in cDNA are not found in the mouse genome."
FT   misc_difference 1716
FT                   /gene="B3galnt1"
FT                   /note="'A' in cDNA is 'G' in the mouse genome."
FT   misc_difference 1837..1850
FT                   /gene="B3galnt1"
FT                   /note="polyA tail: 14 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 1850 BP; 529 A; 379 C; 395 G; 547 T; 0 other;
     cccacgcgtc cggacagccc ctgtggttaa gaacctgtgc cttctgaact tctctgctgc        60
     tgtggatggc cccggctgtc ttgactgccc ttcccaatag gatgtcactg agatccctca       120
     agtggagcct gctgctgctg tccctgctga gcttcctggt gatatggtac ctcagcctcc       180
     cccactacaa tgtgatcgag cgcgtcaact ggatgtactt ctatgagtat gagccaattt       240
     acagacaaga ctttcgcttc acgctccgcg aacactcgaa ctgttctcat cagaacccat       300
     tcctggtcat cctggtgacc tcacgcccct cagatgtgaa agccagacaa gccattagag       360
     ttacttgggg tgagaaaaag tcctggtggg gatatgaggt gctcacgttt ttcttactag       420
     gccagcaggc tgaaagggaa gacaaaacgt tagccctgtc cttggaggat gagcacgttc       480
     tctatggtga tattatacgg caagactttc tagacacata taataacttg accttgaaaa       540
     ccattatggc tttcaggtgg gtaatggagt tttgccccaa tgccaagtat attatgaaaa       600
     cagacactga tgttttcatc aacactggca atttagtcaa gtatctttta aacctaaacc       660
     actcagagaa gtttttcacg ggctatcctc taattgataa ctattcctat agaggatttt       720
     tccataaaaa ccacatttca taccaagagt accccttcaa ggtgttccct ccctactgca       780
     gcgggctggg ctacattatg tccggcgacc tggtgcccag ggtctacgag atgatgagtc       840
     acgtgaagcc catcaagttt gaagacgttt atgttggcat ctgtttgaat ttgttaaaag       900
     tggacattca tattccagaa gacacaaacc ttttctttct gtacagaatc cacttggatg       960
     tatgtcagct cagacgcgtg attgcagccc atggcttttc ttccaaggag atcatcacat      1020
     tctggcaggt gatgctgagg aacaccacat gccattacta agcccatcca tcacccatct      1080
     aggcaaggga ggaaggacac tgtagacagg gtcagagtta gcactgtggg aaactcggga      1140
     agttgagcgt gctggtttgc ctgggctgaa actcatggag ctccctagac aggagtcaag      1200
     gcctgaactt agtgattgct ttcacagaat ttaacttggg tcgtttaaag gtgacagagg      1260
     agtcaaacat aatgcaaacg aagagttttg ctaaccaaat caaagggtca gacagtctgg      1320
     atggctcagg ctgtagatta gaatttctta agatttccct aaaagaaaaa ctaactagtc      1380
     cacagccaga atacagtgtg gagttgtatt tattagacag tatagttaca taatggttca      1440
     gtgtgtatct taagtggtta ttatgtaaag atatatatat atatatttct tgtaaaaagc      1500
     tttacagagt tatattgaaa acattcattt gtattgtttt catttgcaag gtactcaatg      1560
     atatggtacg taagagttat caaatcaagt attatttaat gtcatttcaa ttttctaaat      1620
     gtttaaaggt tatgtatggt ctcagtatta tatgatgaat tgcctttcac atttgagctt      1680
     tgttttattc accactgatc agtttaatta atgtgacgtt aatgtgacat caaaggttat      1740
     tactgactgt ccgtaatctg ttgaactctg ttgcactgta aacatagaga attaaagcaa      1800
     gaaaattcaa gctttgtata gttttttaaa aatgtaaaaa aaaaaaaaaa                 1850
//