ID   BC057983; SV 2; linear; mRNA; STD; MUS; 4670 BP.
XX
AC   BC057983;
XX
DT   18-SEP-2003 (Rel. 77, Created)
DT   23-JAN-2018 (Rel. 135, Last updated, Version 9)
XX
DE   Mus musculus pregnancy zone protein, mRNA (cDNA clone MGC:65284
DE   IMAGE:5134629), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-4670
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-4670
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (08-SEP-2003) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
RN   [3]
RC   Sequence update by database staff to remove vector contamination
RP   1-4670
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (19-JAN-2018) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 11510f55b106adf8d86bc32ce7e9e86f.
DR   Ensembl-Gn; ENSMUSG00000030359; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0031328; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AKRJ_G0031227; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0031306; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0031030; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0031765; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0030411; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0030991; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0031147; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0031097; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0031227; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0031135; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0031798; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0030126; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0030496; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000112132; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0080375; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AKRJ_T0080376; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0080392; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0080015; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0080858; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0080594; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0079966; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0080122; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0080001; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0080160; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0080018; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0081125; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0080017; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0079109; mus_musculus_wsbeij.
XX
CC   On Jan 19, 2018 this sequence version replaced BC057983.1.
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Jeffrey E. Green, M.D.
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: National Institutes of Health Intramural
CC   Sequencing Center (NISC),
CC   Gaithersburg, Maryland;
CC   Web site: http://www.nisc.nih.gov/
CC   Contact: nisc_mgc@nhgri.nih.gov
CC   Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
CC   Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
CC   Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
CC   Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
CC   Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
CC   McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
CC   Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
CC   Young,A., Zhang,L.-H. and Green,E.D.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 123 Row: h Column: 6
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 6680607.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4670
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="FVB/N"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Li9"
FT                   /clone="MGC:65284 IMAGE:5134629"
FT                   /tissue_type="Liver, normal. 5 month old male mouse."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..4670
FT                   /gene="Pzp"
FT                   /gene_synonym="A1m"
FT                   /gene_synonym="MAM"
FT   CDS             31..4518
FT                   /codon_start=1
FT                   /gene="Pzp"
FT                   /gene_synonym="A1m"
FT                   /gene_synonym="MAM"
FT                   /product="pregnancy zone protein"
FT                   /db_xref="GOA:Q61838"
FT                   /db_xref="InterPro:IPR001599"
FT                   /db_xref="InterPro:IPR002890"
FT                   /db_xref="InterPro:IPR008930"
FT                   /db_xref="InterPro:IPR009048"
FT                   /db_xref="InterPro:IPR011625"
FT                   /db_xref="InterPro:IPR011626"
FT                   /db_xref="InterPro:IPR013783"
FT                   /db_xref="InterPro:IPR014756"
FT                   /db_xref="InterPro:IPR019565"
FT                   /db_xref="InterPro:IPR019742"
FT                   /db_xref="InterPro:IPR036595"
FT                   /db_xref="MGI:MGI:87854"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q61838"
FT                   /protein_id="AAH57983.1"
FT                   /translation="MRRNQLPTPAFLLLFLLLPRDATTATAKPQYVVLVPSEVYSGIPE
FT                   KACVSLNHVNETVMLSLTLEYAMQQTKLLTDQAVDKDSFYCSPFTISGSPLPYTFITVE
FT                   IKGPTQRFIKKKSIQIIKAESPVFVQTDKPIYKPGQIVKFRVVSVDISFRPLNETFPVV
FT                   YIETPKRNRIFQWQNIHLAGGLHQLSFPLSVEPALGIYKVVVQKDSGKKIEHSFEVKEY
FT                   VLPKFEVIIKMQKTMAFLEEELPITACGVYTYGKPVPGLVTLRVCRKYSRYRSTCHNQN
FT                   SMSICEEFSQQADDKGCFRQVVKTKVFQLRQKGHDMKIEVEAKIKEEGTGIELTGIGSC
FT                   EIANALSKLKFTKVNTNYRPGLPFSGQVLLVDEKGKPIPNKNITSVVSPLGYLSIFTTD
FT                   EHGLANISIDTSNFTAPFLRVVVTYKQNHVCYDNWWLDEFHTQADHSATLVFSPSQSYI
FT                   QLELVFGTLACGQTQEIRIHYLLNEDIMKNEKDLTFYYLIKARGSIFNLGSHVLSLEQG
FT                   NMKGVFSLPIQVEPGMAPEAQLLVYAILPNEELVADAQNFEIEKCFANKVNLSFPSAQS
FT                   LPASDTHLKVKAAPLSLCALTAVDQSVLLLKPEAKLSPQSIYNLLPGKTVQGAFFGVPV
FT                   YKDHENCISGEDITHNGIVYTPKHSLGDNDAHSIFQSVGINIFTNSKIHKPRFCQEFQH
FT                   YPAMGGVAPQALAVAASGPGSSFRAMGVPMMGLDYSDEINQVVEVRETVRKYFPETWIW
FT                   DLVPLDVSGDGELAVKVPDTITEWKASAFCLSGTTGLGLSSTISLQAFQPFFLELTLPY
FT                   SVVRGEAFTLKATVLNYMSHCIQIRVDLEISPDFLAVPVGGHENSHCICGNERKTVSWA
FT                   VTPKSLGEVNFTATAEALQSPELCGNKLTEVPALVHKDTVVKSVIVEPEGIEKEQTYNT
FT                   LLCPQDTELQDNWSLELPPNVVEGSARATHSVLGDILGSAMQNLQNLLQMPYGCGEQNM
FT                   VLFVPNIYVLNYLNETQQLTEAIKSKAINYLISGYQRQLNYQHSDGSYSTFGNHGGGNT
FT                   PGNTWLTAFVLKAFAQAQSHIFIEKTHITNAFNWLSMKQKENGCFQQSGYLLNNAMKGG
FT                   VDDEVTLSAYITIALLEMPLPVTHSVVRNALFCLETAWASISQSQESHVYTKALLAYAF
FT                   ALAGNKAKRSELLESLNKDAVKEEDSLHWQRPGDVQKVKALSFYQPRAPSAEVEMTAYV
FT                   LLAYLTSESSRPTRDLSSSDLSTASKIVKWISKQQNSHGGFSSTQDTVVALQALSKYGA
FT                   ATFTRSQKEVLVTIESSGTFSKTFHVNSGNRLLLQEVRLPDLPGNYVTKGSGSGCVYLQ
FT                   TSLKYNILPVADGKAPFALQVNTLPLNFDKAGDHRTFQIRINVSYTGERPSSNMVIVDV
FT                   KMVSGFIPMKPSVKKLQDQPNIQRTEVNTNHVLIYIEKLTNQTLGFSFAVEQDIPVKNL
FT                   KPAPIKVYDYYETDEFTVEEYSAPFSDGSEQGNA"
XX
SQ   Sequence 4670 BP; 1342 A; 1159 C; 1045 G; 1124 T; 0 other;
     gctcagacgt tcttctctgc cctctccacc atgaggagaa accagctgcc cacaccagct        60
     tttcttttac tgttcctgct tcttcccaga gatgccacca cagctactgc aaaaccacaa       120
     tatgtggtgc tggtcccgtc agaggtctat tcaggaatcc ctgaaaaggc ctgtgtcagc       180
     ctcaaccatg tgaatgagac tgtgatgctc agcttaactc tagagtatgc aatgcagcaa       240
     acgaagctcc tcacagacca ggctgtggat aaggactcct tctactgcag ccccttcacg       300
     atctcaggtt cacctttacc ctacacattc attactgtgg agataaaagg accaacgcag       360
     cgcttcataa agaagaagtc aatacaaata ataaaagctg agagcccagt ctttgtccag       420
     acagacaaac ccatatacaa accaggacag atagtgaaat tccgagttgt ttctgtggac       480
     atcagttttc gcccattgaa tgaaacgttc cctgtcgttt atattgagac tcccaagagg       540
     aaccgaattt ttcaatggca aaatatccat ctggcaggag gactccacca gctctctttc       600
     ccactgtctg ttgagccagc tctgggtatc tacaaggttg tagtgcagaa ggactcaggg       660
     aagaaaatag aacactcctt tgaggtgaag gaatacgttt tacccaaatt tgaggtgata       720
     ataaaaatgc agaagactat ggctttcctg gaagaggaac ttcctataac tgcttgtggc       780
     gtatacacat atggaaagcc tgttcccggt ctggtgacat tgagagtgtg cagaaaatat       840
     tcacgatacc gctccacctg ccataaccaa aactcaatga gtatctgtga agaattcagc       900
     caacaggcag atgataaagg atgtttcaga caagttgtaa aaaccaaagt attccagctc       960
     agacaaaaag gccacgacat gaagatagag gtggaagcca aaatcaaaga ggagggaaca      1020
     ggaatagaac taactggtat tggatcatgt gaaatagcaa atgccttaag caaactgaaa      1080
     tttactaaag taaatacaaa ttacaggcct gggctacctt tctctggaca ggttcttctt      1140
     gttgatgaga agggtaaacc aatccccaac aaaaacataa cttccgtcgt gtctccactt      1200
     gggtacctgt ccatttttac tacggatgag catggcttgg ccaacatttc cattgacact      1260
     tccaacttca cagctccgtt tttgagagtt gtggtcacct acaagcagaa ccatgtctgc      1320
     tatgataact ggtggcttga tgaatttcac acacaggcag accattctgc aactctagtc      1380
     ttttctccaa gccagagtta tattcaactt gaacttgttt ttggtacttt ggcctgtggg      1440
     caaactcagg agattcggat acactacctc ttgaatgaag atatcatgaa gaatgaaaaa      1500
     gacttaacct tttactatct gatcaaagca aggggaagca tcttcaactt aggaagccat      1560
     gtgttgtcgc ttgaacaagg aaacatgaaa ggagtctttt ccctcccaat tcaagtggag      1620
     ccaggcatgg ctcccgaggc tcagctgctc gtttatgcta ttttacctaa tgaagaactt      1680
     gtcgctgatg ctcagaattt tgaaatcgag aagtgttttg ccaacaaggt aaatttgagt      1740
     ttcccatcag cacagagcct gccagcctct gacacccacc tgaaggtcaa agccgcgcct      1800
     ctgtccctct gtgccctcac tgcagtagac cagagtgtgc tgctactgaa gcccgaagcc      1860
     aagctctcac ctcagtcaat ctacaatttg ctgccaggaa agactgtcca gggtgccttc      1920
     tttggtgtcc cagtgtacaa agaccatgag aactgcatca gcggagaaga catcactcac      1980
     aatgggatcg tgtacacgcc aaagcactct ctgggcgata atgatgcaca cagcattttt      2040
     cagtctgtag gaataaatat ttttaccaac tccaaaatcc acaaaccacg cttttgtcaa      2100
     gagtttcaac actatccggc aatgggagga gtggcacctc aagccttagc tgtggctgct      2160
     tcaggcccag gatccagctt cagagcaatg ggcgtgccga tgatggggtt ggactactct      2220
     gatgaaatta atcaggtggt ggaagtaaga gagacagtgc ggaagtactt ccctgagacc      2280
     tggatctggg acctggtgcc actggacgta tccggggacg gtgaattggc ggtaaaggtc      2340
     cccgacacca tcactgagtg gaaggccagt gcattctgct tgtctggaac cactggcctt      2400
     ggcctctcct ccaccatctc ccttcaagcc ttccagccct tcttcttgga gctcactctc      2460
     ccatactctg tggtgcgagg ggaagccttt accctcaaag ccaccgtgct caattacatg      2520
     tctcactgca ttcagatccg agtggaccta gagatttctc ctgatttcct ggcagtccca      2580
     gtggggggcc atgaaaactc tcattgcatc tgtggaaatg aaaggaaaac cgtgtcctgg      2640
     gctgtgaccc caaagtcgct gggggaggtg aacttcacag ctaccgcaga agccttgcag      2700
     tctccagaat tgtgtggcaa taagttgaca gaagtgccag cccttgtaca caaggacact      2760
     gtggtgaagt ccgtaatagt tgagcctgaa ggaattgaga aggagcaaac gtacaacaca      2820
     ctgttatgcc cacaagatac tgagttacaa gataattggt cactggagct tccacccaat      2880
     gtggttgaag gatctgccag ggctacacat tccgttttgg gtgatatact gggctctgca      2940
     atgcaaaacc tccagaatct tctccagatg ccctatggct gtggggaaca aaacatggtc      3000
     ctttttgtcc ctaacatcta tgttctgaac tatctgaacg agacacagca gctgacagag      3060
     gcgatcaagt ccaaagccat taactacctc atcagtgggt accagaggca gctgaactat      3120
     cagcacagtg atggctcgta cagcacattc gggaaccatg gtggtggcaa cactccggga      3180
     aatacttggc tcactgcgtt tgtgctcaag gcctttgctc aagctcagtc gcacatcttt      3240
     atagagaaga cacacatcac aaatgctttc aactggctct ccatgaaaca aaaggagaat      3300
     ggctgtttcc aacagtctgg atatctgctt aacaacgcga tgaagggtgg tgtggatgat      3360
     gaagtgactc tctctgccta catcaccatt gctctgctgg agatgcccct gcctgtcacg      3420
     cacagtgttg tgcgcaatgc tctgttttgc ctggaaacgg cctgggcctc catctcacag      3480
     agccaggaaa gtcatgtcta taccaaggca ttgctggcct atgcctttgc cctggcagga      3540
     aacaaggcca agagaagcga actgcttgaa tccctaaaca aagatgctgt gaaggaagag      3600
     gattcactgc actggcaacg ccctggggat gttcagaaag tgaaggcctt atctttctat      3660
     caacctcggg ccccttctgc cgaagtggag atgacagctt acgtgcttct cgcctatcta      3720
     acctctgagt cttcccggcc cacacgggac ctgtcttcat cagacctgtc cacagcatcc      3780
     aagattgtga agtggatcag caagcaacaa aactcccacg ggggcttctc ctccactcag      3840
     gatacagtgg tggctctcca agccctctcc aaatatggag ctgccacttt tacgagaagt      3900
     cagaaagaag tgttggtcac catcgagtct tcagggacct tctctaagac tttccatgtt      3960
     aacagtggca atcgcctgct gctgcaggaa gtcaggctgc cagatctgcc cggaaattac      4020
     gtcaccaaag ggtcaggatc aggatgtgtg taccttcaga catctctaaa gtacaacatc      4080
     ctcccagtgg cagatggaaa agcacccttc gctctgcaag tcaacactct cccactaaac      4140
     tttgacaagg caggagatca cagaacattc cagattcgca tcaacgtaag ctacactgga      4200
     gaacgaccca gctccaacat ggtcattgtt gatgtgaaga tggtatcagg cttcatacct      4260
     atgaagccat ccgtgaaaaa gctccaagac cagcctaaca ttcagaggac tgaagtgaac      4320
     accaaccatg ttctaatcta cattgaaaag ctaaccaatc aaaccctcgg tttctccttc      4380
     gcggtggaac aagacatccc agtaaagaac ttaaaaccag cccccataaa agtctatgat      4440
     tattatgaga cagatgaatt caccgttgaa gaatacagtg cccctttcag tgatggctct      4500
     gaacaaggaa atgcttaaaa gtcggaacct atagatgtcc tcagaaggac tcctccacac      4560
     atgaagaacc aagaagaaag acagcaagat aaatggaagg gaattaaaaa accataacat      4620
     ttatatgttg aataaagtta tgacatttac atctgaaaaa aaaaaaaaaa                 4670
//