Dbfetch

ID   BC031429; SV 1; linear; mRNA; STD; MUS; 2525 BP.
XX
AC   BC031429;
XX
DT   28-JUN-2002 (Rel. 72, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 16)
XX
DE   Mus musculus protease, serine, 12 neurotrypsin (motopsin), mRNA (cDNA clone
DE   MGC:18371 IMAGE:3665834), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi; Muroidea;
OC   Muridae; Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2525
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2525
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (06-JUN-2002) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; 4dde0ac95cc2a4adb80e532b1642e83f.
DR   Ensembl-Gn; ENSMUSG00000027978; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0027850; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0027809; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0027774; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0027821; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0027551; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0028266; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0027006; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0027526; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0027667; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0027635; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0027777; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0027665; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0028322; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0026732; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0027086; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000029603; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0063589; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0063619; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0063553; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0063545; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0063231; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0064033; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0063531; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0063190; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0063304; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0063269; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0063445; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0063262; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0064246; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0063075; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0062488; mus_musculus_wsbeij.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: Lothar Hennighausen Ph.D., Robin Humphreys
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Baylor College of Medicine Human Genome
CC   Sequencing Center
CC   Center code: BCM-HGSC
CC   Web site: http://www.hgsc.bcm.tmc.edu/cdna/
CC   Contact: amg@bcm.tmc.edu
CC   Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
CC   Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
CC   A.N., Gibbs, R.A.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 23 Row: f Column: 1
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 6679484.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2525
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /strain="mix FVB/N, C57BL/6J"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NCI_CGAP_Mam5"
FT                   /clone="MGC:18371 IMAGE:3665834"
FT                   /tissue_type="Mammary tumor. WAP-TGF alpha model. 7 months
FT                   old, gross tissue."
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..2525
FT                   /gene="Prss12"
FT                   /note="synonym: Bssp-3"
FT   CDS             149..2434
FT                   /codon_start=1
FT                   /gene="Prss12"
FT                   /product="protease, serine, 12 neurotrypsin (motopsin)"
FT                   /db_xref="GOA:O08762"
FT                   /db_xref="InterPro:IPR000001"
FT                   /db_xref="InterPro:IPR001190"
FT                   /db_xref="InterPro:IPR001254"
FT                   /db_xref="InterPro:IPR001314"
FT                   /db_xref="InterPro:IPR009003"
FT                   /db_xref="InterPro:IPR013806"
FT                   /db_xref="InterPro:IPR017448"
FT                   /db_xref="InterPro:IPR018056"
FT                   /db_xref="InterPro:IPR018114"
FT                   /db_xref="InterPro:IPR033116"
FT                   /db_xref="MGI:MGI:1100881"
FT                   /db_xref="UniProtKB/Swiss-Prot:O08762"
FT                   /protein_id="AAH31429.1"
FT                   /translation="MALARCVLAVILGALSVVARADPVSRSPLHRPHPSPPRSQHAHYL
FT                   PSSRRPPRTPRFPLPLRIPAAQRPQVLSTGHTPPTIPRRCGAGESWGNATNLGVPCLHW
FT                   DEVPPFLERSPPASWAELRGQPHNFCRSPDGSGRPWCFYRNAQGKVDWGYCDCGQGPAL
FT                   PVIRLVGGNSGHEGRVELYHAGQWGTICDDQWDNADADVICRQLGLSGIAKAWHQAHFG
FT                   EGSGPILLDEVRCTGNELSIEQCPKSSWGEHNCGHKEDAGVSCVPLTDGVIRLAGGKST
FT                   HEGRLEVYYKGQWGTVCDDGWTEMNTYVACRLLGFKYGKQSSVNHFDGSNRPIWLDDVS
FT                   CSGKEVSFIQCSRRQWGRHDCSHREDVGLTCYPDSDGHRLSPGFPIRLVDGENKKEGRV
FT                   EVFVNGQWGTICDDGWTDKHAAVICRQLGYKGPARARTMAYFGEGKGPIHMDNVKCTGN
FT                   EKALADCVKQDIGRHNCRHSEDAGVICDYLEKKASSSGNKEMLSSGCGLRLLHRRQKRI
FT                   IGGNNSLRGAWPWQASLRLRSAHGDGRLLCGATLLSSCWVLTAAHCFKRYGNNSRSYAV
FT                   RVGDYHTLVPEEFEQEIGVQQIVIHRNYRPDRSDYDIALVRLQGPGEQCARLSTHVLPA
FT                   CLPLWRERPQKTASNCHITGWGDTGRAYSRTLQQAAVPLLPKRFCKERYKGLFTGRMLC
FT                   AGNLQEDNRVDSCQGDSGGPLMCEKPDESWVVYGVTSWGYGCGVKDTPGVYTRVPAFVP
FT                   WIKSVTSL"
FT   misc_difference 2505..2525
FT                   /gene="Prss12"
FT                   /note="polyA tail: 21 bases do not align to the mouse
FT                   genome."
XX
SQ   Sequence 2525 BP; 612 A; 642 C; 763 G; 508 T; 0 other;
     cccggtgtga tcctccagct gccccggggg ctgggacagc agggcggcgg cgcgagcgtg        60
     ggagggggct ctaggactct gccggccccg ccccgccccc tccgcgggga cccggagccc       120
     agcatggacc acactcggcg ccgcagccat ggcgctcgcc cgctgcgtgc tggctgtgat       180
     tttaggggca ctgtctgtag tggcccgcgc tgatccggtc tcgcgctctc cccttcaccg       240
     cccgcatccg tccccaccgc gttcccaaca cgcgcactac cttcccagct cgcggcggcc       300
     acccaggacc ccgcgcttcc cgctcccgct gcggatcccc gctgcccagc gcccgcaggt       360
     cctcagcacc gggcacacgc ccccgacgat tccacgccgc tgcggggcag gagagtcgtg       420
     gggcaatgcc accaacctcg gcgtcccgtg tctacactgg gacgaggtgc cgcccttcct       480
     ggagcggtcg cccccggcca gttgggctga gctgcgaggg cagccgcaca acttctgccg       540
     gagcccggat ggctcgggca gaccttggtg cttctatcgg aatgcccagg gcaaagtaga       600
     ctggggctac tgcgattgtg gtcaaggccc ggcgttgccc gtcattcgcc ttgttggtgg       660
     gaacagtggg catgaaggtc gagtggagct gtaccacgct ggccagtggg ggaccatctg       720
     tgacgaccaa tgggacaatg cagacgcaga cgtcatctgt aggcagctgg ggctcagtgg       780
     cattgccaaa gcatggcatc aggcacattt tggggaagga tctggcccaa tattgttgga       840
     tgaagtacgc tgcaccggaa acgagctgtc aattgagcaa tgtccaaaga gttcctgggg       900
     cgaacataac tgtggccata aagaagatgc tggagtgtct tgtgttcctc taacagatgg       960
     tgtcatcaga ctggcaggag gaaaaagtac ccatgaaggt cgcctggagg tctactacaa      1020
     ggggcagtgg gggacagtct gtgatgatgg ctggactgag atgaacacat acgtggcttg      1080
     tcgactgctg ggatttaaat acggcaaaca gtcctctgtg aaccattttg atggcagcaa      1140
     caggcccata tggctggatg acgtcagctg ctcaggaaaa gaagtcagct tcattcagtg      1200
     ttccaggaga cagtggggaa ggcatgactg cagccataga gaagatgtgg gcctcacctg      1260
     ctatcctgac agcgatggac ataggctttc tccaggtttt cccatcagac tagtggatgg      1320
     agagaataag aaggaaggac gagtggaggt ttttgtcaat ggccaatggg gaacaatctg      1380
     cgatgacgga tggaccgata agcatgcagc tgtgatctgc cggcagcttg gctataaggg      1440
     tcctgccaga gcaaggacta tggcttattt tggggaagga aaaggcccca tccacatgga      1500
     taatgtgaag tgcacaggaa atgagaaggc cctggctgac tgtgtcaaac aagacattgg      1560
     aaggcacaac tgccgccaca gtgaggatgc aggagtcatc tgtgactatt tagagaagaa      1620
     agcatcaagt agtggtaata aagagatgct ctcatctgga tgtggactga ggttactgca      1680
     ccgtcggcag aaacggatca ttggtgggaa caattcttta aggggtgcct ggccttggca      1740
     ggcttccctc aggctgaggt cggcccatgg agacggcagg ctgctttgtg gagctaccct      1800
     tctgagtagc tgctgggtcc tgacagctgc acactgcttc aaaaggtacg gaaacaactc      1860
     gaggagctat gcagttcgag ttggggatta tcatactctg gtaccagagg agtttgaaca      1920
     agaaataggg gttcaacaga ttgtgattca caggaactac aggccagaca gaagcgacta      1980
     tgacattgcc ctggttagat tgcaaggacc aggggagcaa tgtgccagac taagcaccca      2040
     cgttttgcca gcctgtttac ctctatggag agagaggcca cagaaaacag cctccaactg      2100
     tcacataaca ggatggggag acacaggtcg tgcctactca agaactctac aacaagctgc      2160
     tgtgcctctg ttacccaaga ggttttgtaa agagaggtac aagggactat ttactgggag      2220
     aatgctctgt gctgggaacc tccaagaaga caaccgtgtg gacagctgcc agggagacag      2280
     tggaggacca ctcatgtgtg aaaagcctga tgagtcctgg gttgtgtatg gggtgacttc      2340
     ctgggggtat ggatgtggag tcaaagacac tcctggagtt tataccagag tccccgcctt      2400
     tgtaccttgg ataaaaagtg tcaccagtct gtaacttatg gaaagctcaa gaaaatagta      2460
     aaacagtaac cattcagtct tcatacttgg caccatgcca gaaaaaaaaa aaaaaaaaaa      2520
     aaaaa                                                                  2525
//