ID   BC036172; SV 1; linear; mRNA; STD; MUS; 3144 BP.
XX
AC   BC036172;
XX
DT   24-SEP-2002 (Rel. 73, Created)
DT   15-OCT-2008 (Rel. 97, Last updated, Version 15)
XX
DE   Mus musculus src homology 2 domain-containing transforming protein C1, mRNA
DE   (cDNA clone MGC:37600 IMAGE:4988907), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-3144
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-3144
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (31-JUL-2002) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; c88cdbe2394284179f8588a116b6234d.
DR   Ensembl-Gn; ENSMUSG00000042626; mus_musculus.
DR   Ensembl-Gn; MGP_AJ_G0027403; mus_musculus_aj.
DR   Ensembl-Gn; MGP_BALBcJ_G0027412; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0027858; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0026600; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0027128; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0027266; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0027917; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0026334; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0026682; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000039110; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000094378; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000107417; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000191485; mus_musculus.
DR   Ensembl-Tr; MGP_AJ_T0061917; mus_musculus_aj.
DR   Ensembl-Tr; MGP_BALBcJ_T0061844; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0062320; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0061831; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0061503; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0061605; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0062511; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0061384; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0060811; mus_musculus_wsbeij.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Jeffrey E. Green, M.D.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Sequencing Group at the Stanford Human Genome
CC   Center, Stanford University School of Medicine, Stanford, CA  94305
CC   Web site:       http://www-shgc.stanford.edu
CC   Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
CC   Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
CC   R. M.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 58 Row: f Column: 9
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 31543699.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3144
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="FVB/N"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Co24"
FT                   /clone="MGC:37600 IMAGE:4988907"
FT                   /tissue_type="Colon, normal. 5 month old male mouse."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..3144
FT                   /gene="Shc1"
FT                   /note="synonyms: Shc, p66, ShcA"
FT   misc_difference 1
FT                   /gene="Shc1"
FT                   /note="1 base at the 5' end does not align to the mouse
FT                   genome."
FT   misc_difference 5
FT                   /gene="Shc1"
FT                   /note="'C' in cDNA is 'G' in the mouse genome."
FT   misc_difference 8..10
FT                   /gene="Shc1"
FT                   /note="in the mouse genome : CCA."
FT   misc_difference 13..14
FT                   /gene="Shc1"
FT                   /note="2 bases in cDNA are not found in the mouse genome."
FT   CDS             114..1523
FT                   /codon_start=1
FT                   /gene="Shc1"
FT                   /product="src homology 2 domain-containing transforming
FT                   protein C1"
FT                   /db_xref="GOA:P98083"
FT                   /db_xref="InterPro:IPR000980"
FT                   /db_xref="InterPro:IPR006019"
FT                   /db_xref="InterPro:IPR006020"
FT                   /db_xref="InterPro:IPR011993"
FT                   /db_xref="InterPro:IPR029586"
FT                   /db_xref="MGI:MGI:98296"
FT                   /db_xref="UniProtKB/Swiss-Prot:P98083"
FT                   /protein_id="AAH36172.1"
FT                   /translation="MNKLSGGGGRRTRVEGGQLGGEEWTRHGSFVNKPTRGWLHPNDKV
FT                   MGPGVSYLVRYMGCVEVLQSMRALDFNTRTQVTREAISLVCEAVPGAKGATRRRKPCSR
FT                   PLSSILGRSNLKFAGMPITLTVSTSSLNLMAADCKQIIANHHMQSISFASGGDPDTAEY
FT                   VAYVAKDPVNQRACHILECPEGLAQDVISTIGQAFELRFKQYLRNPPKLVTPHDRMAGF
FT                   DGSAWDEEEEEPPDHQYYNDFPGKEPPLGGVVDMRLREGAARPTLPSAQMSSHLGATLP
FT                   IGQHAAGDHEVRKQMLPPPPCPGRELFDDPSYVNIQNLDKARQAGGGAGPPNPSLNGSA
FT                   PRDLFDMKPFEDALRVPPPPQSMSMAEQLQGEPWFHGKLSRREAEALLQLNGDFLVRES
FT                   TTTPGQYVLTGLQSGQPKHLLLVDPEGVVRTKDHRFESVSHLISYHMDNHLPIISAGSE
FT                   LCLQQPVDRKV"
FT   misc_difference 3094..3144
FT                   /gene="Shc1"
FT                   /note="polyA tail: 51 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 3144 BP; 738 A; 807 C; 836 G; 763 T; 0 other;
     cggacgcgtg ggtcctgggg tgaaagttgg ggcggtgact taagcagaca gttgcgtgat        60
     ccggaaccag atcggcccgc ggtgcggtgc ggagactcca tgagaccctg gacatgaaca       120
     agctgagtgg aggcggcggg cgcaggactc gggtagaagg gggccagctg gggggcgagg       180
     agtggaccag acacgggagc tttgtcaata agcccacacg aggctggctg catcccaacg       240
     acaaagtcat gggacctggg gtttcctact tggttcggta catgggctgt gtggaggtct       300
     tacagtcaat gcgagccctt gacttcaata cccggactca ggtcaccagg gaggccatca       360
     gtttggtgtg tgaagctgtg cctggtgcca aaggggcgac aaggaggaga aagccttgta       420
     gccgcccact cagctccatc ctggggagga gtaacctgaa gtttgctgga atgccaatca       480
     ctctcactgt gtctaccagc agccttaacc tcatggcagc cgactgcaaa cagatcattg       540
     ccaaccatca catgcaatct atctctttcg cgtccggtgg ggatccggac acagctgagt       600
     atgttgccta tgttgccaaa gaccctgtga atcagagagc ctgccatatc ctggagtgtc       660
     ctgaagggct tgctcaggat gtcatcagca ccatcgggca ggcctttgag ttgcgcttca       720
     aacagtatct caggaatcca ccgaagctgg tcacccccca tgacaggatg gctggctttg       780
     atggctcagc ttgggatgag gaggaagaag agccccctga ccatcagtac tacaatgact       840
     ttccagggaa ggaaccccct cttggtgggg tggtagatat gaggcttcgg gaaggggctg       900
     ctcgacccac tctgcctagt gcccagatgt ccagccactt gggagctaca ctgcctatag       960
     ggcagcatgc tgcaggagac catgaagtcc gtaaacagat gttgcctccg ccgccttgcc      1020
     caggcagaga actcttcgat gacccctcct atgtcaacat ccagaatcta gacaaggccc      1080
     ggcaggctgg gggtggggct gggcccccaa atccttctct taatggcagt gcaccccgag      1140
     acctttttga catgaagccc tttgaagatg cacttcgggt gccaccccca ccgcagtcca      1200
     tgtccatggc tgagcagctg caaggggagc cctggttcca cgggaagctg agccggaggg      1260
     aggccgaggc gctgctgcag ctcaatggtg acttcttggt gcgagagagc acgaccacgc      1320
     ctggccagta tgtgctcact ggcctgcaga gtgggcagcc caagcacttg ctgctggtgg      1380
     accctgaagg tgtggttcgg acaaaggatc accgctttga gagtgtcagt cacctgatca      1440
     gctaccacat ggacaatcac ttgcccatca tctctgcggg cagcgaactg tgcctacagc      1500
     aacccgtgga tcggaaagtg tgatccttct cagcttctcc aacaggatgc tctccatttc      1560
     cgtctcccgt attctctaac ttgtgggacc tctgttttgt gggtctggcc ttgggtggga      1620
     actgggagca acgaggacat gggtttagtg cccacttgag agagagaaaa agagggtttc      1680
     agtaaggagc ctggggtagc atcctgcctc tggccaaact tcaccaaagt attaatgtgc      1740
     agagtggtcc cttgtctggg ccttgcctgt gccaacctga tgcccttccc ccccaaaggg      1800
     tgggttctta taatggaaaa tgccctgtga tgataggccc agtggagcaa ctgccctttg      1860
     ggggaaggga aataattata cctctggttt actcctgggt cttcagggta ccccagatcc      1920
     cgcataacat atcccactcc ctctgcttcc ccttaaactt tgtgcctttg actatcatag      1980
     gtctgcagat acttaatgca gagttctcag gcccttcacg tgtggacagg ggttactgcc      2040
     accttggctt ctggagccct gtcctattca gcaccccttc ctgtgtctag ggagaatagg      2100
     gacaggagtg gccgctatct gctctgcctt tcggatgtgc agcccttaag agattgcccc      2160
     aagcctgaat atggtggcgc acgcctttaa tcacagcact caggaggcag agacaggagg      2220
     aattgtgagg ccatccgatc tacaacagag tgagttccag gacagccagg gctatggaga      2280
     gagaacctgt ctccaaaaac caaaaagagc gattacccca gagccttctt cctgatggta      2340
     gcggggaggg gcaggactgg acccatcttg ctcagtgcct cctgacctca atgcctttcc      2400
     tccaaggggt ctgtatacat ttctcaagcc tgctcctccc atgtttgcat gtgtgttata      2460
     gtctacagcc aaagtatagc cctcactgta accccatcct gcctccctcc tttgggatag      2520
     gtgtgtgcgt ctgacttggg cctccagggt gtgtacagtc agtgtgggtt ttgtggaggc      2580
     aataagactg aagcagtaga caatccccaa taccatttgc aggtctggaa ctgcactctc      2640
     ttttttaaaa aacatgtata cattttaggg ctgtagattt attttcctgg ttttgttttt      2700
     cattgctgac ttttgagcac agaattatga taatcaatta catttataca tcacctcgat      2760
     gacttttcca aacttttatt ttttttttta aacaactgtt gggattttac tccctggcct      2820
     taactaggac aggattgtac cccactcctc ccccccccct ttttcttttt cgccaagaca      2880
     actgagcaga aatttggctg agcagtgttg tgggactatg atgtgatagt tttagatcct      2940
     accttctgct ttcgggcagc tgcagccagc acagaaacct tgcaagctca ctctgtgtgt      3000
     aggctttctg gacaaggaat ggtcgccaaa tttttggttt ggatgtctta taccaaaggg      3060
     aaatagtctt cattaaagtt cgtatttctt ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa      3120
     aaaaaaaaaa aaaaaaaaaa aaaa                                             3144
//