spacer
spacer

EBI Dbfetch

ID   BC026163; SV 1; linear; mRNA; STD; HUM; 2361 BP.
XX
AC   BC026163;
XX
DT   09-APR-2002 (Rel. 71, Created)
DT   15-OCT-2008 (Rel. 97, Last updated, Version 14)
XX
DE   Homo sapiens sperm adhesion molecule 1 (PH-20 hyaluronidase, zona pellucida
DE   binding), mRNA (cDNA clone MGC:26532 IMAGE:4838230), complete cds.
XX
KW   MGC.
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-2361
RX   DOI; 10.1073/pnas.242603899
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2361
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (02-APR-2002) to the EMBL/GenBank/DDBJ databases.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   ASTD; TRAN00000086219.
DR   Ensembl-Gn; ENSG00000106304; Homo_sapiens.
DR   Ensembl-Tr; ENST00000223028; Homo_sapiens.
DR   H-InvDB; HIT000040121.
DR   ImaGenes; IMAGp998N2310771Q.
DR   ImaGenes; IRAKp961N1634Q.
DR   ImaGenes; IRATp970A1228D.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
CC   cDNA Library Preparation: Michael J. Brownstein (NHGRI) &  Shiraki
CC   Toshiyuki and Piero Carninci (RIKEN)
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Institute for Systems Biology
CC   http://www.systemsbiology.org
CC   contact: amadan@systemsbiology.org
CC   Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
CC   Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 34 Row: n Column: 16
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 23510416.
CC   Differences found between this sequence and the human reference
CC   genome (build 36) are described in misc_difference features below
CC   and these differences were also compared to chimpanzee genomic
CC   sequences available as of 09/15/2004.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2361
FT                   /organism="Homo sapiens"
FT                   /lab_host="DH10B"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_97"
FT                   /clone="MGC:26532 IMAGE:4838230"
FT                   /tissue_type="Testis"
FT                   /note="Vector: pBluescriptR"
FT                   /db_xref="taxon:9606"
FT   gene            1..2361
FT                   /gene="SPAM1"
FT                   /note="synonyms: SPAG15, HYAL3, HYA1, HYAL1, PH20,
FT                   MGC26532, PH-20"
FT   misc_difference 3
FT                   /gene="SPAM1"
FT                   /note="'C' in cDNA is 'T' in the human genome. The
FT                   chimpanzee genome agrees with the cDNA sequence, suggesting
FT                   that this difference is unlikely to be due to an artifact."
FT   misc_difference 5
FT                   /gene="SPAM1"
FT                   /note="'G' in cDNA is 'T' in the human genome. The
FT                   chimpanzee genome agrees with the cDNA sequence, suggesting
FT                   that this difference is unlikely to be due to an artifact."
FT   CDS             363..1898
FT                   /codon_start=1
FT                   /gene="SPAM1"
FT                   /product="sperm adhesion molecule 1 (PH-20 hyaluronidase,
FT                   zona pellucida binding)"
FT                   /db_xref="GOA:Q8TC30"
FT                   /db_xref="HGNC:11217"
FT                   /db_xref="HSSP:1FCQ"
FT                   /db_xref="InterPro:IPR018155"
FT                   /db_xref="UniProtKB/TrEMBL:Q8TC30"
FT                   /protein_id="AAH26163.1"
FT                   /translation="MGVLKFKHIFFRSFVKSSGVSQIVFTFLLIPCCLTLNFRAPPVIP
FT                   NVPFLWAWNAPSEFCLGKFDEPLDMSLFSFIGSPRINATGQGVTIFYVDRLGYYPYIDS
FT                   ITGVTVNGGIPQKISLQDHLDKAKKDITFYMPVDNLGMAVIDWEEWRPTWARNWKPKDV
FT                   YKNRSIELVQQQNVQLSLTEATEKAKQEFEKAGKDFLVETIKLGKLLRPNHLWGYYLFP
FT                   DCYNHHYKKPGYNGSCFNVEIKRNDDLSWLWNESTALYPSIYLNTQQSPVAATLYVRNR
FT                   VREAIRVSKIPDAKSPLPVFAYTRIVFTDQVLKFLSQDELVYTFGETVALGASGIVIWG
FT                   TLSIMRSMKSCLLLDNYMETILNPYIINVTLAAKMCSQVLCQEQGVCIRKNWNSSDYLH
FT                   LNPDNFAIQLEKGGKFTVRGKPTLEDLEQFSEKFYCSCYSTLSCKEKADVKDTDAVDVC
FT                   IADGVCIDAFLKPPMETEEPQIFYNASPSTLSATMFIWRLEVWDQGISRIGFF"
FT   misc_difference 944
FT                   /gene="SPAM1"
FT                   /note="'A' in cDNA is 'G' in the human genome; no amino
FT                   acid change. The chimpanzee genome agrees with the human
FT                   genomic sequence and not the cDNA."
FT   misc_difference 2351..2361
FT                   /gene="SPAM1"
FT                   /note="polyA tail: 11 bases do not align to the human
FT                   genome."
XX
SQ   Sequence 2361 BP; 756 A; 441 C; 467 G; 697 T; 0 other;
     agcggaggtg gttagcagca cctcataagg tccttcctag caagggatgc taatgactag        60
     ccaatgctct aggaagacat tgagaccagc caacttcttg ccttgataac tactgaagag       120
     acattgggtg gctggatttt gaaagcagac ttctggttat aggtgatgca acttgaaaaa       180
     caatcctgaa acatgaaaca agaataataa tatttaaatg taacttaatc attatacctc       240
     tttatccatc aaagtgaatt cattccattc cctttcatct gtgctcatac tttgcatcag       300
     atattgggta aaccaaagtg tgtaggaaga aataaatgtt ttcatagtca ttactcttta       360
     caatgggagt gctaaaattc aagcacatct ttttcagaag ctttgttaaa tcaagtggag       420
     tatcccagat agttttcacc ttccttctga ttccatgttg cttgactctg aatttcagag       480
     cacctcctgt tattccaaat gtgcctttcc tctgggcctg gaatgcccca agtgaatttt       540
     gtcttggaaa atttgatgag ccactagata tgagcctctt ctctttcata ggaagccccc       600
     gaataaacgc caccgggcaa ggtgttacaa tattttatgt tgatagactt ggctactatc       660
     cttacataga ttcaatcaca ggagtaactg tgaatggagg aatcccccag aagatttcct       720
     tacaagacca tctggacaaa gctaagaaag acattacatt ttatatgcca gtagacaatt       780
     tgggaatggc tgttattgac tgggaagaat ggagacccac ttgggcaaga aactggaaac       840
     ctaaagatgt ttacaagaat aggtctattg aattggttca gcaacaaaat gtacaactta       900
     gtctcacaga ggccactgag aaagcaaaac aagaatttga aaaagcaggg aaggatttcc       960
     tggtagagac tataaaattg ggaaaattac ttcggccaaa tcacttgtgg ggttattatc      1020
     tttttccgga ttgttacaac catcactata agaaacccgg ttacaatgga agttgcttca      1080
     atgtagaaat aaaaagaaat gatgatctca gctggttgtg gaatgaaagc actgctcttt      1140
     acccatccat ttatttgaac actcagcagt ctcctgtagc tgctacactc tatgtgcgca      1200
     atcgagttcg ggaagccatc agagtttcca aaatacctga tgcaaaaagt ccacttccgg      1260
     tttttgcata tacccgcata gtttttactg atcaagtttt gaaattcctt tctcaagatg      1320
     aacttgtgta tacatttggc gaaactgttg ctctgggtgc ttctggaatt gtaatatggg      1380
     gaaccctcag tataatgcga agtatgaaat cttgcttgct cctagacaat tacatggaga      1440
     ctatactgaa tccttacata atcaacgtca cactagcagc caaaatgtgt agccaagtgc      1500
     tttgccagga gcaaggagtg tgtataagga aaaactggaa ttcaagtgac tatcttcacc      1560
     tcaacccaga taattttgct attcaacttg agaaaggtgg aaagttcaca gtacgtggaa      1620
     aaccgacact tgaagacctg gagcaatttt ctgaaaaatt ttattgcagc tgttatagca      1680
     ccttgagttg taaggagaaa gctgatgtaa aagacactga tgctgttgat gtgtgtattg      1740
     ctgatggtgt ctgtatagat gcttttctaa aacctcccat ggagacagaa gaacctcaaa      1800
     ttttctacaa tgcttcaccc tccacactat ctgccacaat gttcatttgg aggctggaag      1860
     tctgggatca aggtattagc agaattggtt tcttctgaga gtcatgaggg aaaaatgtgt      1920
     ttcaggcctc ttcccttggc ttacaggaaa tgaaaaaacc atgactatca tcaccaacat      1980
     ccttgggtat taagtgcagt cactctccta gatgctgtgg ggagaaggca agttacaaag      2040
     atagaccttc cctcaagata atcagatttt catggtatta tccttaacct ttttgacatc      2100
     atggaggctt tgggaatctg atgaagccta tcaattttct tccagaagat atttatataa      2160
     gattataaga aaaattatgt acacagctta ttttattgca ttggatcaaa atgccattta      2220
     taaagaatta tgccttttcc atcaatttta gcatggaaaa ataatttcag gcaatatgct      2280
     taaaaattgg gggaagacaa aagaaatcca tatcgtgtaa ataaaaataa attttggttt      2340
     tgctcaaaaa aaaaaaaaaa a                                                2361
//


  
spacer
spacer