ID   BC004741; SV 1; linear; mRNA; STD; MUS; 2562 BP.
XX
AC   BC004741;
XX
DT   25-MAR-2001 (Rel. 67, Created)
DT   15-OCT-2008 (Rel. 97, Last updated, Version 15)
XX
DE   Mus musculus exostoses (multiple) 1, mRNA (cDNA clone MGC:5903
DE   IMAGE:3499743), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2562
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2562
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (21-MAR-2001) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 72466b1716750be19259a85ad64d581f.
DR   Ensembl-Gn; ENSMUSG00000061731; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0021843; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0021803; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0021784; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0021810; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0021584; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0022249; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0021101; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0021553; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0021677; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0021655; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0021747; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0021679; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0022272; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0020846; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0021156; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000077273; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0040935; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0040895; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0040864; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0040894; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0040629; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0041361; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0040687; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0040546; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0040697; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0040660; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0040767; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0040642; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0041408; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0040279; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0040030; mus_musculus_wsbeij.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Jeffrey Green M.D.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Baylor College of Medicine Human Genome
CC   Sequencing Center
CC   Center code: BCM-HGSC
CC   Web site: http://www.hgsc.bcm.tmc.edu/cdna/
CC   Contact: amg@bcm.tmc.edu
CC   Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
CC   Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
CC   A.N., Gibbs, R.A.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 9 Row: f Column: 23
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 7106308.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2562
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="FVB/N"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Mam6"
FT                   /clone="MGC:5903 IMAGE:3499743"
FT                   /tissue_type="Mammary tumor. C3(1)-Tag model. Infiltrating
FT                   ductal carcinoma. 5 month old virgin mouse."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..2562
FT                   /gene="Ext1"
FT   misc_difference 1..14
FT                   /gene="Ext1"
FT                   /note="14 bases at the 5' end do not align to the mouse
FT                   genome."
FT   misc_difference 16
FT                   /gene="Ext1"
FT                   /note="'C' in cDNA is 'T' in the mouse genome."
FT   misc_difference 20
FT                   /gene="Ext1"
FT                   /note="1 base in cDNA is not found in the mouse genome."
FT   misc_difference 22
FT                   /gene="Ext1"
FT                   /note="'C' in cDNA is 'T' in the mouse genome."
FT   CDS             185..2425
FT                   /codon_start=1
FT                   /gene="Ext1"
FT                   /product="exostoses (multiple) 1"
FT                   /db_xref="GOA:P97464"
FT                   /db_xref="InterPro:IPR004263"
FT                   /db_xref="InterPro:IPR015338"
FT                   /db_xref="InterPro:IPR027670"
FT                   /db_xref="InterPro:IPR029044"
FT                   /db_xref="MGI:MGI:894663"
FT                   /db_xref="UniProtKB/Swiss-Prot:P97464"
FT                   /protein_id="AAH04741.1"
FT                   /translation="MQAKKRYFILLSAGSCLALLFYFGGVQFRASRSHSRREEHSGRNG
FT                   LHQPSPDHFWPRFPDALRPFFPWDQLENEDSSVHISPRQKRDANSSIYKGKKCRMESCF
FT                   DFTLCKKNGFKVYVYPQQKGEKIAESYQNILAAIEGSRFYTSDPSQACLFVLSLDTLDR
FT                   DQLSPQYVHNLRSKVQSLHLWNNGRNHLIFNLYSGTWPDYTEDVGFDIGQAMLAKASIS
FT                   TENFRPNFDVSIPLFSKDHPRTGGERGFLKFNTIPPLRKYMLVFKGKRYLTGIGSDTRN
FT                   ALYHVHNGEDVLLLTTCKHGKDWQKHKDSRCDRDNTEYEKYDYREMLHNATFCLVPRGR
FT                   RLGSFRFLEALQAACVPVMLSNGWELPFSEVINWNQAAVIGDERLLLQIPSTIRSIHQD
FT                   KILALRQQTQFLWEAYFSSVEKIVLTTLEIIQDRIFKHISRNSLIWNKHPGGLFVLPQY
FT                   SSYLGDFPYYYANLGLKPPSKFTAVIHAVTPLVSQSQPVLKLLVAAAKSQYCAQIIVLW
FT                   NCDKPLPAKHRWPATAVPVIVIEGESKVMSSRFLPYDNIITDAVLSLDEDTVLSTTEVD
FT                   FAFTVWQSFPERIVGYPARSHFWDNSKERWGYTSKWTNDYSMVLTGAAIYHKYYHYLYS
FT                   HYLPASLKNMVDQLANCEDILMNFLVSAVTKLPPIKVTQKKQYKETMMGQTSRASRWAD
FT                   PDHFAQRQSCMNTFASWFGYMPLIHSQMRLDPVLFKDQVSILRKKYRDIERL"
FT   misc_difference 1990
FT                   /gene="Ext1"
FT                   /note="'T' in cDNA is 'A' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 2459
FT                   /gene="Ext1"
FT                   /note="'T' in cDNA is 'C' in the mouse genome."
FT   misc_difference 2555..2562
FT                   /gene="Ext1"
FT                   /note="polyA tail: 8 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 2562 BP; 665 A; 641 C; 630 G; 626 T; 0 other;
     cccacgcgtc cgccacgcgt ccggatcagt ccacggcttg gggaaaggca tccagagaag        60
     gtgggagcgg agagtttgaa gtctttacag gcgggaagat ggcggactgg agctgaaagt       120
     gttgattggg aaacttgggt gattcttgtg tttatttaca atcctcttga cccaggcagg       180
     acacatgcag gccaaaaaac gctatttcat cctgctctca gctggctctt gtctcgccct       240
     tttgttttat tttggaggcg tgcagtttag ggcatcgagg agccacagcc ggagagaaga       300
     gcacagtggt cggaatggct tgcaccagcc cagtccggat catttctggc cccgcttccc       360
     ggacgctctg cgccctttct ttccttggga tcaattggaa aacgaggatt ccagcgtgca       420
     catttccccc cggcagaagc gagacgccaa ctcgagcatc tacaaaggca agaagtgccg       480
     catggagtcc tgcttcgatt tcaccctttg caagaaaaac ggcttcaaag tctacgtgta       540
     cccgcagcag aaaggggaga aaatcgccga aagttaccaa aacattctag cggccatcga       600
     gggctccagg ttctacacct cggaccccag ccaggcgtgc ctctttgtct tgagtctgga       660
     tactttagac agagaccagt tatcacctca gtatgtgcac aatttgagat ccaaagtgca       720
     gagtctccac ttgtggaaca atggtaggaa tcatttaatt tttaatttat attctggcac       780
     ttggcctgac tacactgagg acgtggggtt tgacatcggc caggcgatgc tggccaaagc       840
     cagcatcagt actgaaaact tccgaccaaa ctttgatgtt tctattcccc tcttttctaa       900
     ggatcatccc aggacaggag gggagagggg gtttttgaaa tttaacacca tccctcctct       960
     caggaagtac atgctggtat tcaaggggaa gcggtacctg acagggatag ggtcagacac      1020
     caggaatgcc ttatatcacg tccataacgg ggaggacgtc ttgctcctca ccacctgcaa      1080
     gcatggcaaa gactggcaaa agcacaagga ttctcgctgt gacagagaca acaccgagta      1140
     tgagaaatat gattatcggg aaatgctgca caatgccact ttctgtctgg ttcctcgtgg      1200
     tcgcaggctt gggtccttca gattcctgga ggctttgcag gctgcctgtg tccctgtaat      1260
     gctcagcaac ggatgggagt tgccattctc cgaagtgatt aattggaacc aagctgccgt      1320
     cataggcgat gagagattgc tattacagat tccttctaca atcaggtcta ttcatcagga      1380
     taaaatccta gcacttagac agcagacaca gttcttgtgg gaggcttatt tttcttcagt      1440
     tgagaagatt gtattaacta cactagagat tattcaggac agaatattca agcacatatc      1500
     acgtaacagt ttaatatgga acaaacatcc tggaggattg ttcgtcctac cgcagtattc      1560
     atcttacctg ggagatttcc cttactacta tgctaattta ggtttaaagc ccccctccaa      1620
     attcactgca gtcatccatg ctgtgactcc cctggtctct cagtcccagc cagtgttgaa      1680
     gcttcttgtg gctgcagcca aatcccagta ctgtgcgcag atcatagttc tgtggaattg      1740
     tgacaagcct ctaccagcca aacatcgctg gcctgccact gccgtgcctg tcatcgtcat      1800
     tgaaggagaa agcaaggtta tgagcagccg gtttctgccc tatgacaaca tcatcactga      1860
     tgctgtgctc agcctggatg aggacactgt gctttcaact acggaagtgg attttgcctt      1920
     caccgtatgg cagagcttcc cagagaggat tgtgggatat cctgctcgca gtcatttctg      1980
     ggataactct aaggagcggt ggggatatac atccaagtgg acgaatgact actccatggt      2040
     gttgacagga gctgctatct accacaaata ttatcactac ctgtattccc attacctgcc      2100
     agccagcctg aagaacatgg tagaccaact ggccaactgt gaggacattc tcatgaattt      2160
     cctggtgtct gctgtgacaa aattgcctcc aatcaaagtg acccagaaga aacagtataa      2220
     ggagacaatg atgggacaga cttcccgagc atcccgctgg gccgaccctg accactttgc      2280
     ccagcgacag agctgcatga atacatttgc cagctggttt ggctacatgc cgctgatcca      2340
     ttctcagatg aggctggacc cggtcctctt taaagaccaa gtctcaattc tgaggaagaa      2400
     atacagagac attgaacgac tttgaggaag cccaccgagt gggggagggg aagcaagatg      2460
     ggcgtccagc tgctctctcc tccttcccaa tgcagatccg ctcacgccca gcagtggagc      2520
     cagactgtgc caagtatcaa aaaatcaaaa aaaaaaaaaa aa                         2562
//