EBI Dbfetch

ID   BC057983; SV 1; linear; mRNA; STD; MUS; 4696 BP.
XX
AC   BC057983;
XX
DT   18-SEP-2003 (Rel. 77, Created)
DT   21-OCT-2008 (Rel. 97, Last updated, Version 8)
XX
DE   Mus musculus pregnancy zone protein, mRNA (cDNA clone MGC:65284
DE   IMAGE:5134629), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-4696
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-4696
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (08-SEP-2003) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; c56b95fb8883389298c09873c43881f9.
DR   Ensembl-Gn; ENSMUSG00000030359; mus_musculus.
DR   Ensembl-Tr; ENSMUST00000112132; mus_musculus.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Jeffrey E. Green, M.D.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: National Institutes of Health Intramural
CC   Sequencing Center (NISC),
CC   Gaithersburg, Maryland;
CC   Web site: http://www.nisc.nih.gov/
CC   Contact: nisc_mgc@nhgri.nih.gov
CC   Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
CC   Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
CC   Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
CC   Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
CC   Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
CC   McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
CC   Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
CC   Young,A., Zhang,L.-H. and Green,E.D.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 123 Row: h Column: 6
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 6680607.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4696
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="FVB/N"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Li9"
FT                   /clone="MGC:65284 IMAGE:5134629"
FT                   /tissue_type="Liver, normal. 5 month old male mouse."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..4696
FT                   /gene="Pzp"
FT                   /note="synonyms: A1m, MAM"
FT   misc_difference 1..25
FT                   /gene="Pzp"
FT                   /note="25 bases at the 5' end do not align to the mouse
FT                   genome."
FT   CDS             57..4544
FT                   /codon_start=1
FT                   /gene="Pzp"
FT                   /product="pregnancy zone protein"
FT                   /db_xref="GOA:Q61838"
FT                   /db_xref="InterPro:IPR001599"
FT                   /db_xref="InterPro:IPR002890"
FT                   /db_xref="InterPro:IPR008930"
FT                   /db_xref="InterPro:IPR009048"
FT                   /db_xref="InterPro:IPR011625"
FT                   /db_xref="InterPro:IPR011626"
FT                   /db_xref="InterPro:IPR014756"
FT                   /db_xref="InterPro:IPR019565"
FT                   /db_xref="InterPro:IPR019742"
FT                   /db_xref="MGI:MGI:87854"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q61838"
FT                   /protein_id="AAH57983.1"
FT                   /translation="MRRNQLPTPAFLLLFLLLPRDATTATAKPQYVVLVPSEVYSGIPE
FT                   KACVSLNHVNETVMLSLTLEYAMQQTKLLTDQAVDKDSFYCSPFTISGSPLPYTFITVE
FT                   IKGPTQRFIKKKSIQIIKAESPVFVQTDKPIYKPGQIVKFRVVSVDISFRPLNETFPVV
FT                   YIETPKRNRIFQWQNIHLAGGLHQLSFPLSVEPALGIYKVVVQKDSGKKIEHSFEVKEY
FT                   VLPKFEVIIKMQKTMAFLEEELPITACGVYTYGKPVPGLVTLRVCRKYSRYRSTCHNQN
FT                   SMSICEEFSQQADDKGCFRQVVKTKVFQLRQKGHDMKIEVEAKIKEEGTGIELTGIGSC
FT                   EIANALSKLKFTKVNTNYRPGLPFSGQVLLVDEKGKPIPNKNITSVVSPLGYLSIFTTD
FT                   EHGLANISIDTSNFTAPFLRVVVTYKQNHVCYDNWWLDEFHTQADHSATLVFSPSQSYI
FT                   QLELVFGTLACGQTQEIRIHYLLNEDIMKNEKDLTFYYLIKARGSIFNLGSHVLSLEQG
FT                   NMKGVFSLPIQVEPGMAPEAQLLVYAILPNEELVADAQNFEIEKCFANKVNLSFPSAQS
FT                   LPASDTHLKVKAAPLSLCALTAVDQSVLLLKPEAKLSPQSIYNLLPGKTVQGAFFGVPV
FT                   YKDHENCISGEDITHNGIVYTPKHSLGDNDAHSIFQSVGINIFTNSKIHKPRFCQEFQH
FT                   YPAMGGVAPQALAVAASGPGSSFRAMGVPMMGLDYSDEINQVVEVRETVRKYFPETWIW
FT                   DLVPLDVSGDGELAVKVPDTITEWKASAFCLSGTTGLGLSSTISLQAFQPFFLELTLPY
FT                   SVVRGEAFTLKATVLNYMSHCIQIRVDLEISPDFLAVPVGGHENSHCICGNERKTVSWA
FT                   VTPKSLGEVNFTATAEALQSPELCGNKLTEVPALVHKDTVVKSVIVEPEGIEKEQTYNT
FT                   LLCPQDTELQDNWSLELPPNVVEGSARATHSVLGDILGSAMQNLQNLLQMPYGCGEQNM
FT                   VLFVPNIYVLNYLNETQQLTEAIKSKAINYLISGYQRQLNYQHSDGSYSTFGNHGGGNT
FT                   PGNTWLTAFVLKAFAQAQSHIFIEKTHITNAFNWLSMKQKENGCFQQSGYLLNNAMKGG
FT                   VDDEVTLSAYITIALLEMPLPVTHSVVRNALFCLETAWASISQSQESHVYTKALLAYAF
FT                   ALAGNKAKRSELLESLNKDAVKEEDSLHWQRPGDVQKVKALSFYQPRAPSAEVEMTAYV
FT                   LLAYLTSESSRPTRDLSSSDLSTASKIVKWISKQQNSHGGFSSTQDTVVALQALSKYGA
FT                   ATFTRSQKEVLVTIESSGTFSKTFHVNSGNRLLLQEVRLPDLPGNYVTKGSGSGCVYLQ
FT                   TSLKYNILPVADGKAPFALQVNTLPLNFDKAGDHRTFQIRINVSYTGERPSSNMVIVDV
FT                   KMVSGFIPMKPSVKKLQDQPNIQRTEVNTNHVLIYIEKLTNQTLGFSFAVEQDIPVKNL
FT                   KPAPIKVYDYYETDEFTVEEYSAPFSDGSEQGNA"
FT   misc_difference 183
FT                   /gene="Pzp"
FT                   /note="'A' in cDNA is 'G' in the mouse genome; amino acid
FT                   difference: 'I' in cDNA, 'V' in the mouse genome."
FT   misc_difference 521
FT                   /gene="Pzp"
FT                   /note="'A' in cDNA is 'G' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 1040
FT                   /gene="Pzp"
FT                   /note="'G' in cDNA is 'A' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 1268
FT                   /gene="Pzp"
FT                   /note="'C' in cDNA is 'T' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 1677
FT                   /gene="Pzp"
FT                   /note="'G' in cDNA is 'A' in the mouse genome; amino acid
FT                   difference: 'V' in cDNA, 'I' in the mouse genome."
FT   misc_difference 3362
FT                   /gene="Pzp"
FT                   /note="'C' in cDNA is 'T' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 3434
FT                   /gene="Pzp"
FT                   /note="'C' in cDNA is 'T' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 3454
FT                   /gene="Pzp"
FT                   /note="'T' in cDNA is 'C' in the mouse genome; amino acid
FT                   difference: 'V' in cDNA, 'A' in the mouse genome."
FT   misc_difference 3461
FT                   /gene="Pzp"
FT                   /note="'C' in cDNA is 'A' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 3470
FT                   /gene="Pzp"
FT                   /note="'G' in cDNA is 'A' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 3554
FT                   /gene="Pzp"
FT                   /note="'T' in cDNA is 'C' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 3632
FT                   /gene="Pzp"
FT                   /note="'A' in cDNA is 'G' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 4160
FT                   /gene="Pzp"
FT                   /note="'A' in cDNA is 'C' in the mouse genome; no amino
FT                   acid change."
FT   misc_difference 4620
FT                   /gene="Pzp"
FT                   /note="'T' in cDNA is 'C' in the mouse genome."
FT   misc_difference 4682..4696
FT                   /gene="Pzp"
FT                   /note="polyA tail: 15 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 4696 BP; 1345 A; 1173 C; 1052 G; 1126 T; 0 other;
     gacccacgcg tccgcccacg cgtccggctc agacgttctt ctctgccctc tccaccatga        60
     ggagaaacca gctgcccaca ccagcttttc ttttactgtt cctgcttctt cccagagatg       120
     ccaccacagc tactgcaaaa ccacaatatg tggtgctggt cccgtcagag gtctattcag       180
     gaatccctga aaaggcctgt gtcagcctca accatgtgaa tgagactgtg atgctcagct       240
     taactctaga gtatgcaatg cagcaaacga agctcctcac agaccaggct gtggataagg       300
     actccttcta ctgcagcccc ttcacgatct caggttcacc tttaccctac acattcatta       360
     ctgtggagat aaaaggacca acgcagcgct tcataaagaa gaagtcaata caaataataa       420
     aagctgagag cccagtcttt gtccagacag acaaacccat atacaaacca ggacagatag       480
     tgaaattccg agttgtttct gtggacatca gttttcgccc attgaatgaa acgttccctg       540
     tcgtttatat tgagactccc aagaggaacc gaatttttca atggcaaaat atccatctgg       600
     caggaggact ccaccagctc tctttcccac tgtctgttga gccagctctg ggtatctaca       660
     aggttgtagt gcagaaggac tcagggaaga aaatagaaca ctcctttgag gtgaaggaat       720
     acgttttacc caaatttgag gtgataataa aaatgcagaa gactatggct ttcctggaag       780
     aggaacttcc tataactgct tgtggcgtat acacatatgg aaagcctgtt cccggtctgg       840
     tgacattgag agtgtgcaga aaatattcac gataccgctc cacctgccat aaccaaaact       900
     caatgagtat ctgtgaagaa ttcagccaac aggcagatga taaaggatgt ttcagacaag       960
     ttgtaaaaac caaagtattc cagctcagac aaaaaggcca cgacatgaag atagaggtgg      1020
     aagccaaaat caaagaggag ggaacaggaa tagaactaac tggtattgga tcatgtgaaa      1080
     tagcaaatgc cttaagcaaa ctgaaattta ctaaagtaaa tacaaattac aggcctgggc      1140
     tacctttctc tggacaggtt cttcttgttg atgagaaggg taaaccaatc cccaacaaaa      1200
     acataacttc cgtcgtgtct ccacttgggt acctgtccat ttttactacg gatgagcatg      1260
     gcttggccaa catttccatt gacacttcca acttcacagc tccgtttttg agagttgtgg      1320
     tcacctacaa gcagaaccat gtctgctatg ataactggtg gcttgatgaa tttcacacac      1380
     aggcagacca ttctgcaact ctagtctttt ctccaagcca gagttatatt caacttgaac      1440
     ttgtttttgg tactttggcc tgtgggcaaa ctcaggagat tcggatacac tacctcttga      1500
     atgaagatat catgaagaat gaaaaagact taacctttta ctatctgatc aaagcaaggg      1560
     gaagcatctt caacttagga agccatgtgt tgtcgcttga acaaggaaac atgaaaggag      1620
     tcttttccct cccaattcaa gtggagccag gcatggctcc cgaggctcag ctgctcgttt      1680
     atgctatttt acctaatgaa gaacttgtcg ctgatgctca gaattttgaa atcgagaagt      1740
     gttttgccaa caaggtaaat ttgagtttcc catcagcaca gagcctgcca gcctctgaca      1800
     cccacctgaa ggtcaaagcc gcgcctctgt ccctctgtgc cctcactgca gtagaccaga      1860
     gtgtgctgct actgaagccc gaagccaagc tctcacctca gtcaatctac aatttgctgc      1920
     caggaaagac tgtccagggt gccttctttg gtgtcccagt gtacaaagac catgagaact      1980
     gcatcagcgg agaagacatc actcacaatg ggatcgtgta cacgccaaag cactctctgg      2040
     gcgataatga tgcacacagc atttttcagt ctgtaggaat aaatattttt accaactcca      2100
     aaatccacaa accacgcttt tgtcaagagt ttcaacacta tccggcaatg ggaggagtgg      2160
     cacctcaagc cttagctgtg gctgcttcag gcccaggatc cagcttcaga gcaatgggcg      2220
     tgccgatgat ggggttggac tactctgatg aaattaatca ggtggtggaa gtaagagaga      2280
     cagtgcggaa gtacttccct gagacctgga tctgggacct ggtgccactg gacgtatccg      2340
     gggacggtga attggcggta aaggtccccg acaccatcac tgagtggaag gccagtgcat      2400
     tctgcttgtc tggaaccact ggccttggcc tctcctccac catctccctt caagccttcc      2460
     agcccttctt cttggagctc actctcccat actctgtggt gcgaggggaa gcctttaccc      2520
     tcaaagccac cgtgctcaat tacatgtctc actgcattca gatccgagtg gacctagaga      2580
     tttctcctga tttcctggca gtcccagtgg ggggccatga aaactctcat tgcatctgtg      2640
     gaaatgaaag gaaaaccgtg tcctgggctg tgaccccaaa gtcgctgggg gaggtgaact      2700
     tcacagctac cgcagaagcc ttgcagtctc cagaattgtg tggcaataag ttgacagaag      2760
     tgccagccct tgtacacaag gacactgtgg tgaagtccgt aatagttgag cctgaaggaa      2820
     ttgagaagga gcaaacgtac aacacactgt tatgcccaca agatactgag ttacaagata      2880
     attggtcact ggagcttcca cccaatgtgg ttgaaggatc tgccagggct acacattccg      2940
     ttttgggtga tatactgggc tctgcaatgc aaaacctcca gaatcttctc cagatgccct      3000
     atggctgtgg ggaacaaaac atggtccttt ttgtccctaa catctatgtt ctgaactatc      3060
     tgaacgagac acagcagctg acagaggcga tcaagtccaa agccattaac tacctcatca      3120
     gtgggtacca gaggcagctg aactatcagc acagtgatgg ctcgtacagc acattcggga      3180
     accatggtgg tggcaacact ccgggaaata cttggctcac tgcgtttgtg ctcaaggcct      3240
     ttgctcaagc tcagtcgcac atctttatag agaagacaca catcacaaat gctttcaact      3300
     ggctctccat gaaacaaaag gagaatggct gtttccaaca gtctggatat ctgcttaaca      3360
     acgcgatgaa gggtggtgtg gatgatgaag tgactctctc tgcctacatc accattgctc      3420
     tgctggagat gcccctgcct gtcacgcaca gtgttgtgcg caatgctctg ttttgcctgg      3480
     aaacggcctg ggcctccatc tcacagagcc aggaaagtca tgtctatacc aaggcattgc      3540
     tggcctatgc ctttgccctg gcaggaaaca aggccaagag aagcgaactg cttgaatccc      3600
     taaacaaaga tgctgtgaag gaagaggatt cactgcactg gcaacgccct ggggatgttc      3660
     agaaagtgaa ggccttatct ttctatcaac ctcgggcccc ttctgccgaa gtggagatga      3720
     cagcttacgt gcttctcgcc tatctaacct ctgagtcttc ccggcccaca cgggacctgt      3780
     cttcatcaga cctgtccaca gcatccaaga ttgtgaagtg gatcagcaag caacaaaact      3840
     cccacggggg cttctcctcc actcaggata cagtggtggc tctccaagcc ctctccaaat      3900
     atggagctgc cacttttacg agaagtcaga aagaagtgtt ggtcaccatc gagtcttcag      3960
     ggaccttctc taagactttc catgttaaca gtggcaatcg cctgctgctg caggaagtca      4020
     ggctgccaga tctgcccgga aattacgtca ccaaagggtc aggatcagga tgtgtgtacc      4080
     ttcagacatc tctaaagtac aacatcctcc cagtggcaga tggaaaagca cccttcgctc      4140
     tgcaagtcaa cactctccca ctaaactttg acaaggcagg agatcacaga acattccaga      4200
     ttcgcatcaa cgtaagctac actggagaac gacccagctc caacatggtc attgttgatg      4260
     tgaagatggt atcaggcttc atacctatga agccatccgt gaaaaagctc caagaccagc      4320
     ctaacattca gaggactgaa gtgaacacca accatgttct aatctacatt gaaaagctaa      4380
     ccaatcaaac cctcggtttc tccttcgcgg tggaacaaga catcccagta aagaacttaa      4440
     aaccagcccc cataaaagtc tatgattatt atgagacaga tgaattcacc gttgaagaat      4500
     acagtgcccc tttcagtgat ggctctgaac aaggaaatgc ttaaaagtcg gaacctatag      4560
     atgtcctcag aaggactcct ccacacatga agaaccaaga agaaagacag caagataaat      4620
     ggaagggaat taaaaaacca taacatttat atgttgaata aagttatgac atttacatct      4680
     gaaaaaaaaa aaaaaa                                                      4696
//

