![]() |
EBI DbfetchID BC026163; SV 1; linear; mRNA; STD; HUM; 2361 BP. XX AC BC026163; XX DT 09-APR-2002 (Rel. 71, Created) DT 15-OCT-2008 (Rel. 97, Last updated, Version 14) XX DE Homo sapiens sperm adhesion molecule 1 (PH-20 hyaluronidase, zona pellucida DE binding), mRNA (cDNA clone MGC:26532 IMAGE:4838230), complete cds. XX KW MGC. XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. XX RN [1] RP 1-2361 RX DOI; 10.1073/pnas.242603899 RX PUBMED; 12477932. RG Mammalian Gene Collection Program Team RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D., RA Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F., RA Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H., RA Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K., RA Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F., RA Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S., RA Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J., RA Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J., RA Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M., RA Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., RA Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., RA Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G., RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C., RA Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I., RA Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.; RT "Generation and initial analysis of more than 15,000 full-length human and RT mouse cDNA sequences"; RL Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002). XX RN [2] RC NIH-MGC Project URL: http://mgc.nci.nih.gov RP 1-2361 RG NIH MGC Project RA ; RT ; RL Submitted (02-APR-2002) to the EMBL/GenBank/DDBJ databases. RL National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, RL MD 20892-2590, USA XX DR ASTD; TRAN00000086219. DR Ensembl-Gn; ENSG00000106304; Homo_sapiens. DR Ensembl-Tr; ENST00000223028; Homo_sapiens. DR H-InvDB; HIT000040121. DR ImaGenes; IMAGp998N2310771Q. DR ImaGenes; IRAKp961N1634Q. DR ImaGenes; IRATp970A1228D. XX CC Contact: MGC help desk CC Email: cgapbs-r@mail.nih.gov CC Tissue Procurement: Miklos Palkovits, M.D., Ph.D. CC cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki CC Toshiyuki and Piero Carninci (RIKEN) CC cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) CC DNA Sequencing by: Institute for Systems Biology CC http://www.systemsbiology.org CC contact: amadan@systemsbiology.org CC Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha CC Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting CC Clone distribution: MGC clone distribution information can be found CC through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov CC Series: IRAK Plate: 34 Row: n Column: 16 CC This clone was selected for full length sequencing because it CC passed the following selection criteria: matched mRNA gi: 23510416. CC Differences found between this sequence and the human reference CC genome (build 36) are described in misc_difference features below CC and these differences were also compared to chimpanzee genomic CC sequences available as of 09/15/2004. XX FH Key Location/Qualifiers FH FT source 1..2361 FT /organism="Homo sapiens" FT /lab_host="DH10B" FT /mol_type="mRNA" FT /clone_lib="NIH_MGC_97" FT /clone="MGC:26532 IMAGE:4838230" FT /tissue_type="Testis" FT /note="Vector: pBluescriptR" FT /db_xref="taxon:9606" FT gene 1..2361 FT /gene="SPAM1" FT /note="synonyms: SPAG15, HYAL3, HYA1, HYAL1, PH20, FT MGC26532, PH-20" FT misc_difference 3 FT /gene="SPAM1" FT /note="'C' in cDNA is 'T' in the human genome. The FT chimpanzee genome agrees with the cDNA sequence, suggesting FT that this difference is unlikely to be due to an artifact." FT misc_difference 5 FT /gene="SPAM1" FT /note="'G' in cDNA is 'T' in the human genome. The FT chimpanzee genome agrees with the cDNA sequence, suggesting FT that this difference is unlikely to be due to an artifact." FT CDS 363..1898 FT /codon_start=1 FT /gene="SPAM1" FT /product="sperm adhesion molecule 1 (PH-20 hyaluronidase, FT zona pellucida binding)" FT /db_xref="GOA:Q8TC30" FT /db_xref="HGNC:11217" FT /db_xref="HSSP:1FCQ" FT /db_xref="InterPro:IPR018155" FT /db_xref="UniProtKB/TrEMBL:Q8TC30" FT /protein_id="AAH26163.1" FT /translation="MGVLKFKHIFFRSFVKSSGVSQIVFTFLLIPCCLTLNFRAPPVIP FT NVPFLWAWNAPSEFCLGKFDEPLDMSLFSFIGSPRINATGQGVTIFYVDRLGYYPYIDS FT ITGVTVNGGIPQKISLQDHLDKAKKDITFYMPVDNLGMAVIDWEEWRPTWARNWKPKDV FT YKNRSIELVQQQNVQLSLTEATEKAKQEFEKAGKDFLVETIKLGKLLRPNHLWGYYLFP FT DCYNHHYKKPGYNGSCFNVEIKRNDDLSWLWNESTALYPSIYLNTQQSPVAATLYVRNR FT VREAIRVSKIPDAKSPLPVFAYTRIVFTDQVLKFLSQDELVYTFGETVALGASGIVIWG FT TLSIMRSMKSCLLLDNYMETILNPYIINVTLAAKMCSQVLCQEQGVCIRKNWNSSDYLH FT LNPDNFAIQLEKGGKFTVRGKPTLEDLEQFSEKFYCSCYSTLSCKEKADVKDTDAVDVC FT IADGVCIDAFLKPPMETEEPQIFYNASPSTLSATMFIWRLEVWDQGISRIGFF" FT misc_difference 944 FT /gene="SPAM1" FT /note="'A' in cDNA is 'G' in the human genome; no amino FT acid change. The chimpanzee genome agrees with the human FT genomic sequence and not the cDNA." FT misc_difference 2351..2361 FT /gene="SPAM1" FT /note="polyA tail: 11 bases do not align to the human FT genome." XX SQ Sequence 2361 BP; 756 A; 441 C; 467 G; 697 T; 0 other; agcggaggtg gttagcagca cctcataagg tccttcctag caagggatgc taatgactag 60 ccaatgctct aggaagacat tgagaccagc caacttcttg ccttgataac tactgaagag 120 acattgggtg gctggatttt gaaagcagac ttctggttat aggtgatgca acttgaaaaa 180 caatcctgaa acatgaaaca agaataataa tatttaaatg taacttaatc attatacctc 240 tttatccatc aaagtgaatt cattccattc cctttcatct gtgctcatac tttgcatcag 300 atattgggta aaccaaagtg tgtaggaaga aataaatgtt ttcatagtca ttactcttta 360 caatgggagt gctaaaattc aagcacatct ttttcagaag ctttgttaaa tcaagtggag 420 tatcccagat agttttcacc ttccttctga ttccatgttg cttgactctg aatttcagag 480 cacctcctgt tattccaaat gtgcctttcc tctgggcctg gaatgcccca agtgaatttt 540 gtcttggaaa atttgatgag ccactagata tgagcctctt ctctttcata ggaagccccc 600 gaataaacgc caccgggcaa ggtgttacaa tattttatgt tgatagactt ggctactatc 660 cttacataga ttcaatcaca ggagtaactg tgaatggagg aatcccccag aagatttcct 720 tacaagacca tctggacaaa gctaagaaag acattacatt ttatatgcca gtagacaatt 780 tgggaatggc tgttattgac tgggaagaat ggagacccac ttgggcaaga aactggaaac 840 ctaaagatgt ttacaagaat aggtctattg aattggttca gcaacaaaat gtacaactta 900 gtctcacaga ggccactgag aaagcaaaac aagaatttga aaaagcaggg aaggatttcc 960 tggtagagac tataaaattg ggaaaattac ttcggccaaa tcacttgtgg ggttattatc 1020 tttttccgga ttgttacaac catcactata agaaacccgg ttacaatgga agttgcttca 1080 atgtagaaat aaaaagaaat gatgatctca gctggttgtg gaatgaaagc actgctcttt 1140 acccatccat ttatttgaac actcagcagt ctcctgtagc tgctacactc tatgtgcgca 1200 atcgagttcg ggaagccatc agagtttcca aaatacctga tgcaaaaagt ccacttccgg 1260 tttttgcata tacccgcata gtttttactg atcaagtttt gaaattcctt tctcaagatg 1320 aacttgtgta tacatttggc gaaactgttg ctctgggtgc ttctggaatt gtaatatggg 1380 gaaccctcag tataatgcga agtatgaaat cttgcttgct cctagacaat tacatggaga 1440 ctatactgaa tccttacata atcaacgtca cactagcagc caaaatgtgt agccaagtgc 1500 tttgccagga gcaaggagtg tgtataagga aaaactggaa ttcaagtgac tatcttcacc 1560 tcaacccaga taattttgct attcaacttg agaaaggtgg aaagttcaca gtacgtggaa 1620 aaccgacact tgaagacctg gagcaatttt ctgaaaaatt ttattgcagc tgttatagca 1680 ccttgagttg taaggagaaa gctgatgtaa aagacactga tgctgttgat gtgtgtattg 1740 ctgatggtgt ctgtatagat gcttttctaa aacctcccat ggagacagaa gaacctcaaa 1800 ttttctacaa tgcttcaccc tccacactat ctgccacaat gttcatttgg aggctggaag 1860 tctgggatca aggtattagc agaattggtt tcttctgaga gtcatgaggg aaaaatgtgt 1920 ttcaggcctc ttcccttggc ttacaggaaa tgaaaaaacc atgactatca tcaccaacat 1980 ccttgggtat taagtgcagt cactctccta gatgctgtgg ggagaaggca agttacaaag 2040 atagaccttc cctcaagata atcagatttt catggtatta tccttaacct ttttgacatc 2100 atggaggctt tgggaatctg atgaagccta tcaattttct tccagaagat atttatataa 2160 gattataaga aaaattatgt acacagctta ttttattgca ttggatcaaa atgccattta 2220 taaagaatta tgccttttcc atcaatttta gcatggaaaa ataatttcag gcaatatgct 2280 taaaaattgg gggaagacaa aagaaatcca tatcgtgtaa ataaaaataa attttggttt 2340 tgctcaaaaa aaaaaaaaaa a 2361 // ![]() |