spacer
spacer

EBI Dbfetch

ID   BC022180; SV 1; linear; mRNA; STD; MUS; 1671 BP.
XX
AC   BC022180;
XX
DT   29-JAN-2002 (Rel. 70, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 12)
XX
DE   Mus musculus beta-1,4-N-acetyl-galactosaminyl transferase 1, mRNA (cDNA
DE   clone IMAGE:4006866), complete cds.
XX
KW   .
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus.
XX
RN   [1]
RP   1-1671
RX   DOI; 10.1073/pnas.242603899
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-1671
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (25-JAN-2002) to the EMBL/GenBank/DDBJ databases.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   ASTD; TRAN00000155040.
DR   Ensembl-Gn; ENSMUSG00000006731; Mus_musculus.
DR   Ensembl-Tr; ENSMUST00000006914; Mus_musculus.
DR   ImaGenes; IMAGp998N199238Q.
DR   ImaGenes; IRAKp961E1418Q.
DR   ImaGenes; IRAVp968C0322D.
DR   ImaGenes; IRAVp968E066Q.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Gilbert Smith, Ph.D.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Institute for Systems Biology
CC   http://www.systemsbiology.org
CC   contact: amadan@systemsbiology.org
CC   Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
CC   Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 18 Row: e Column: 14
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 42476102
CC   This clone has the following problem: The cds is short compared to
CC   the longest cds in the locus.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1671
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="Czech II"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Lu30"
FT                   /clone="IMAGE:4006866"
FT                   /tissue_type="Mammary tumor metastatized to lung.
FT                   MMTV-LTR/Wnt1 model. Expression driven by an MMTV-LTR
FT                   enhancer."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..1671
FT                   /gene="B4galnt1"
FT                   /note="synonyms: GalNAcT, Gal-NAc-T, GalNAc-T"
FT   misc_difference 1..10
FT                   /gene="B4galnt1"
FT                   /note="10 bases at the 5' end do not align to the mouse
FT                   genome; Differences found between this sequence and the
FT                   mouse C57BL/6J genome (build 35) are described in
FT                   misc_difference features below"
FT   CDS             154..888
FT                   /codon_start=1
FT                   /gene="B4galnt1"
FT                   /product="B4galnt1 protein"
FT                   /db_xref="GOA:Q8VDF0"
FT                   /db_xref="MGI:1342057"
FT                   /db_xref="UniProtKB/TrEMBL:Q8VDF0"
FT                   /protein_id="AAH22180.1"
FT                   /translation="MRLDRRALYALVLLLACASLGLLYSSTRNAPSLPNPLALWSPPQG
FT                   PPRLDLLDLAPEPRYAHIPVRIKEQVVGLLAQNNCSCESKGGSLPLPFLRQVRAVDLTK
FT                   AFDAEELRAVSVAREQEYQAFLARSRSLADQLLIAPANSPLQYPLQGVEVQPLRSILVP
FT                   GLSLQEASVQEIYQVNLSASLGTWDVAGEVTGVTLTGEGQPDLTLASPVLDKLNRQLQL
FT                   VTYSSRSYQANTADTGGRPEEG"
FT   misc_difference 1181
FT                   /gene="B4galnt1"
FT                   /note="1 base in cDNA is not found in the mouse genome;
FT                   Differences found between this sequence and the mouse
FT                   C57BL/6J genome (build 35) are described in misc_difference
FT                   features below"
FT   misc_difference 1203
FT                   /gene="B4galnt1"
FT                   /note="'G' in cDNA is 'A' in the mouse genome; Differences
FT                   found between this sequence and the mouse C57BL/6J genome
FT                   (build 35) are described in misc_difference features below"
FT   misc_difference 1335
FT                   /gene="B4galnt1"
FT                   /note="'G' in cDNA is 'A' in the mouse genome; Differences
FT                   found between this sequence and the mouse C57BL/6J genome
FT                   (build 35) are described in misc_difference features below"
FT   misc_difference 1353
FT                   /gene="B4galnt1"
FT                   /note="'T' in cDNA is 'C' in the mouse genome; Differences
FT                   found between this sequence and the mouse C57BL/6J genome
FT                   (build 35) are described in misc_difference features below"
FT   misc_difference 1389
FT                   /gene="B4galnt1"
FT                   /note="'G' in cDNA is 'A' in the mouse genome; Differences
FT                   found between this sequence and the mouse C57BL/6J genome
FT                   (build 35) are described in misc_difference features below"
FT   misc_difference 1433
FT                   /gene="B4galnt1"
FT                   /note="'C' in cDNA is 'T' in the mouse genome; Differences
FT                   found between this sequence and the mouse C57BL/6J genome
FT                   (build 35) are described in misc_difference features below"
FT   misc_difference 1509..1671
FT                   /gene="B4galnt1"
FT                   /note="polyA tail: 163 bases do not align to the mouse
FT                   genome; Differences found between this sequence and the
FT                   mouse C57BL/6J genome (build 35) are described in
FT                   misc_difference features below"
XX
SQ   Sequence 1671 BP; 481 A; 448 C; 402 G; 340 T; 0 other;
     ccacgcgtcg cggatccccg acactgtcgg gctccctccc gcccagctcc cgcatcaggc        60
     cgtcgccggc ctcatgagga ccccccggcc ggggcggaga accgggcact gcccggaccg       120
     cgatcctgcc gcggccttag aacgttagac aggatgcggc tagaccgccg ggccctctat       180
     gcgctagtct tgctgctcgc ctgcgcctcg ctgggtctcc tgtactccag cacccgaaac       240
     gcgccaagtc tcccgaaccc tctggcattg tggtcgcccc cacaaggtcc cccgagactc       300
     gatctgctag accttgcccc tgagcctcgc tacgcacaca tcccggtcag gatcaaggag       360
     caagtggtgg ggctgctggc tcagaacaac tgcagttgtg aatccaaggg aggaagcctt       420
     cccttgccct ttctgagaca ggttcgggcg gttgacctca ctaaagcctt tgacgctgag       480
     gagctgaggg ctgtttctgt cgccagggag caggaatacc aggccttcct tgcaaggagc       540
     cggtccctgg ctgaccagct gctgatagct cccgccaact cccccttaca gtaccccctg       600
     cagggtgtgg aggttcagcc cctcaggagc atcctggtgc cagggctaag tctgcaggaa       660
     gcttctgtcc aggagatata ccaggtgaac ctgagtgctt ctctaggcac ctgggacgtg       720
     gcaggggaag taacaggagt gactctcacg ggagaggggc aaccggatct cacccttgcc       780
     agcccagttc tggataaact caaccggcag ctgcaactgg tgacttacag cagccggagc       840
     taccaggcca acacagcaga cacaggtggg aggcccgagg aggggtaggg acagactgag       900
     agaacagaaa gaaccagaaa agcaaatgac cacgtgtggg aggccctcgg gcaatggcca       960
     gaacagccta gacacacaat gaatgttgca agtgggccag cagggccagg agtcagagga      1020
     gcctttgttt atctagggcc aaaacaggcc agggagaaac cagagctgga gagcattccc      1080
     acttctgtct gtcatcacgg ctccaggaag tgacctgttc ctggctgttt ctagcccaac      1140
     acttaccttg ccaagctccc tcacccttcc tccttttttt tctccaaaac tggcttttct      1200
     gagcctgcgc ctaccttagg gcctttgcat ctgccgtttt caggaatgtt tttgcctaag      1260
     tttgtttgtt tgctttcaag gttggctccc tcttgatgtt tacatctggg ctcagacgtc      1320
     tcctgtttgc ttgagacaat gtcccttgcc tctgctttta cactagacgt ttcttattta      1380
     ctcagcctga gctttgacgt tgtaaaatat ggacaattta cttttctgtg cccaagctta      1440
     gtttcctgca atagtttatc tagtaagatt accatgaaga ttaaatggct tcatctattt      1500
     agaacacaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa      1560
     aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa      1620
     aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a               1671
//


  
spacer
spacer