Dbfetch

ID   BC024820; SV 1; linear; mRNA; STD; MUS; 2279 BP.
XX
AC   BC024820;
XX
DT   14-MAR-2002 (Rel. 71, Created)
DT   24-SEP-2008 (Rel. 97, Last updated, Version 16)
XX
DE   Mus musculus tripeptidyl peptidase I, mRNA (cDNA clone MGC:36086
DE   IMAGE:5360558), complete cds.
XX
KW   MGC.
XX
OS   Mus musculus (house mouse)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC   Murinae; Mus; Mus.
XX
RN   [1]
RP   1-2279
RX   DOI; 10.1073/pnas.242603899.
RX   PUBMED; 12477932.
RG   Mammalian Gene Collection Program Team
RA   Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., Klausner R.D.,
RA   Collins F.S., Wagner L., Shenmen C.M., Schuler G.D., Altschul S.F.,
RA   Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., Hopkins R.F., Jordan H.,
RA   Moore T., Max S.I., Wang J., Hsieh F., Diatchenko L., Marusina K.,
RA   Farmer A.A., Rubin G.M., Hong L., Stapleton M., Soares M.B., Bonaldo M.F.,
RA   Casavant T.L., Scheetz T.E., Brownstein M.J., Usdin T.B., Toshiyuki S.,
RA   Carninci P., Prange C., Raha S.S., Loquellano N.A., Peters G.J.,
RA   Abramson R.D., Mullahy S.J., Bosak S.A., McEwan P.J., McKernan K.J.,
RA   Malek J.A., Gunaratne P.H., Richards S., Worley K.C., Hale S., Garcia A.M.,
RA   Gay L.J., Hulyk S.W., Villalon D.K., Muzny D.M., Sodergren E.J., Lu X.,
RA   Gibbs R.A., Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S.,
RA   Sanchez A., Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA   Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., Rodriguez A.C.,
RA   Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., Krzywinski M.I.,
RA   Skalska U., Smailus D.E., Schnerch A., Schein J.E., Jones S.J., Marra M.A.;
RT   "Generation and initial analysis of more than 15,000 full-length human and
RT   mouse cDNA sequences";
RL   Proc. Natl. Acad. Sci. U.S.A. 99(26):16899-16903(2002).
XX
RN   [2]
RC   NIH-MGC Project URL: http://mgc.nci.nih.gov
RP   1-2279
RG   NIH MGC Project
RA   ;
RT   ;
RL   Submitted (01-MAR-2002) to the INSDC.
RL   National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda,
RL   MD 20892-2590, USA
XX
DR   MD5; c8b48a566d4c1f69ac6075ace2f0ce7a.
DR   Ensembl-Gn; ENSMUSG00000030894; mus_musculus.
DR   Ensembl-Gn; MGP_129S1SvImJ_G0032764; mus_musculus_129s1svimj.
DR   Ensembl-Gn; MGP_AJ_G0032744; mus_musculus_aj.
DR   Ensembl-Gn; MGP_AKRJ_G0032677; mus_musculus_akrj.
DR   Ensembl-Gn; MGP_BALBcJ_G0032750; mus_musculus_balbcj.
DR   Ensembl-Gn; MGP_C3HHeJ_G0032459; mus_musculus_c3hhej.
DR   Ensembl-Gn; MGP_C57BL6NJ_G0033255; mus_musculus_c57bl6nj.
DR   Ensembl-Gn; MGP_CASTEiJ_G0031789; mus_musculus_casteij.
DR   Ensembl-Gn; MGP_CBAJ_G0032433; mus_musculus_cbaj.
DR   Ensembl-Gn; MGP_DBA2J_G0032586; mus_musculus_dba2j.
DR   Ensembl-Gn; MGP_FVBNJ_G0032539; mus_musculus_fvbnj.
DR   Ensembl-Gn; MGP_LPJ_G0032677; mus_musculus_lpj.
DR   Ensembl-Gn; MGP_NODShiLtJ_G0032571; mus_musculus_nodshiltj.
DR   Ensembl-Gn; MGP_NZOHlLtJ_G0033276; mus_musculus_nzohlltj.
DR   Ensembl-Gn; MGP_PWKPhJ_G0031502; mus_musculus_pwkphj.
DR   Ensembl-Gn; MGP_WSBEiJ_G0031906; mus_musculus_wsbeij.
DR   Ensembl-Tr; ENSMUST00000033184; mus_musculus.
DR   Ensembl-Tr; MGP_129S1SvImJ_T0085705; mus_musculus_129s1svimj.
DR   Ensembl-Tr; MGP_AJ_T0085775; mus_musculus_aj.
DR   Ensembl-Tr; MGP_AKRJ_T0085721; mus_musculus_akrj.
DR   Ensembl-Tr; MGP_BALBcJ_T0085744; mus_musculus_balbcj.
DR   Ensembl-Tr; MGP_C3HHeJ_T0085302; mus_musculus_c3hhej.
DR   Ensembl-Tr; MGP_C57BL6NJ_T0086245; mus_musculus_c57bl6nj.
DR   Ensembl-Tr; MGP_CASTEiJ_T0085822; mus_musculus_casteij.
DR   Ensembl-Tr; MGP_CBAJ_T0085246; mus_musculus_cbaj.
DR   Ensembl-Tr; MGP_DBA2J_T0085441; mus_musculus_dba2j.
DR   Ensembl-Tr; MGP_FVBNJ_T0085293; mus_musculus_fvbnj.
DR   Ensembl-Tr; MGP_LPJ_T0085484; mus_musculus_lpj.
DR   Ensembl-Tr; MGP_NODShiLtJ_T0085335; mus_musculus_nodshiltj.
DR   Ensembl-Tr; MGP_NZOHlLtJ_T0086531; mus_musculus_nzohlltj.
DR   Ensembl-Tr; MGP_PWKPhJ_T0085235; mus_musculus_pwkphj.
DR   Ensembl-Tr; MGP_WSBEiJ_T0084359; mus_musculus_wsbeij.
XX
CC   Contact: MGC help desk
CC   Email: cgapbs-r@mail.nih.gov
CC   Tissue Procurement: The Cepko Laboratory
CC   cDNA Library Preparation: Life Technologies, Inc.
CC   cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC   DNA Sequencing by: Baylor College of Medicine Human Genome
CC   Sequencing Center
CC   Center code: BCM-HGSC
CC   Web site: http://www.hgsc.bcm.tmc.edu/cdna/
CC   Contact: amg@bcm.tmc.edu
CC   Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
CC   Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
CC   A.N., Gibbs, R.A.
CC   Clone distribution: MGC clone distribution information can be found
CC   through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
CC   Series: IRAK Plate: 54 Row: m Column: 24
CC   This clone was selected for full length sequencing because it
CC   passed the following selection criteria: matched mRNA gi: 31542406.
CC   Differences found between this sequence and the mouse C57BL/6J
CC   genome (build 36) are described in misc_difference features below.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2279
FT                   /organism="Mus musculus"
FT                   /lab_host="DH10B"
FT                   /mol_type="mRNA"
FT                   /clone_lib="NIH_MGC_94"
FT                   /clone="MGC:36086 IMAGE:5360558"
FT                   /tissue_type="Eye, retina, mouse strain C57Bl\6"
FT                   /note="Vector: pCMV-SPORT6"
FT                   /db_xref="taxon:10090"
FT   gene            1..2279
FT                   /gene="Tpp1"
FT                   /note="synonym: TPP-I"
FT   misc_difference 1..12
FT                   /gene="Tpp1"
FT                   /note="12 bases at the 5' end do not align to the mouse
FT                   genome."
FT   CDS             23..1711
FT                   /codon_start=1
FT                   /gene="Tpp1"
FT                   /product="tripeptidyl peptidase I"
FT                   /db_xref="GOA:O89023"
FT                   /db_xref="InterPro:IPR000209"
FT                   /db_xref="InterPro:IPR009020"
FT                   /db_xref="InterPro:IPR015366"
FT                   /db_xref="InterPro:IPR030400"
FT                   /db_xref="MGI:MGI:1336194"
FT                   /db_xref="UniProtKB/Swiss-Prot:O89023"
FT                   /protein_id="AAH24820.1"
FT                   /translation="MGLQARLLGLLALVIAGKCTYNPEPDQRWMLPPGWVSLGRVDPEE
FT                   ELSLTFALKQRNLERLSELVQAVSDPSSPQYGKYLTLEDVAELVQPSPLTLLTVQKWLS
FT                   AAGARNCDSVTTQDFLTCWLSVRQAELLLPGAEFHRYVGGPTKTHVIRSPHPYQLPQAL
FT                   APHVDFVGGLHRFPPSSPRQRPEPQQVGTVSLHLGVTPSVLRQRYNLTAKDVGSGTTNN
FT                   SQACAQFLEQYFHNSDLTEFMRLFGGSFTHQASVAKVVGKQGRGRAGIEASLDVEYLMS
FT                   AGANISTWVYSSPGRHEAQEPFLQWLLLLSNESSLPHVHTVSYGDDEDSLSSIYIQRVN
FT                   TEFMKAAARGLTLLFASGDTGAGCWSVSGRHKFRPSFPASSPYVTTVGGTSFKNPFLIT
FT                   DEVVDYISGGGFSNVFPRPPYQEEAVAQFLKSSSHLPPSSYFNASGRAYPDVAALSDGY
FT                   WVVSNMVPIPWVSGTSASTPVFGGILSLINEHRILNGRPPLGFLNPRLYQQHGTGLFDV
FT                   THGCHESCLNEEVEGQGFCSGPGWDPVTGWGTPNFPALLKTLLNP"
FT   misc_difference 2277..2279
FT                   /gene="Tpp1"
FT                   /note="3 bases at the 3' end do not align to the mouse
FT                   genome."
XX
SQ   Sequence 2279 BP; 515 A; 654 C; 533 G; 577 T; 0 other;
     cggacgcgtg gggaaagcca aaatgggact ccaagcccgc ctcctagggc tccttgctct        60
     cgtcatcgcc ggcaaatgca cttacaaccc tgagccggac cagcggtgga tgctgcctcc       120
     aggctgggtg tccctgggcc gcgtggatcc cgaggaagag ctgagtctca cttttgcgct       180
     gaaacagcgg aacctggaaa gactctcgga gctggtgcag gctgtgtcgg atcctagctc       240
     tcctcaatat ggaaagtacc taaccctgga ggatgtagct gagctggttc aaccatcacc       300
     cctgaccctc ctcactgtcc aaaagtggct ctcagcagct ggagcccgga actgcgattc       360
     agtgaccacc caggactttc tgacttgctg gctgagtgtc cgacaggctg agctgctgct       420
     cccaggagct gagtttcatc gctatgtagg gggacctaca aagacccatg ttataaggtc       480
     cccacatccc taccagcttc cccaggcctt ggcccctcat gtggattttg tgggggggct       540
     gcaccgtttc cccccttcat ctccaagaca acgtccagaa ccacaacagg taggaactgt       600
     tagcctgcac ttgggagtga ctccgtctgt gctccgtcag cgatacaacc tgacagccaa       660
     agatgtgggc tcaggcacca ccaacaatag ccaggcctgt gcccagttcc tggaacagta       720
     cttccataac tcggatctga ctgagttcat gcgcctattc ggtggcagtt ttacacacca       780
     ggcctcagta gcaaaagttg ttggaaagca agggcgaggc cgagctggga tcgaggccag       840
     tctagatgtg gaatacctga tgagtgctgg tgccaatatc tccacttggg tctacagtag       900
     ccctggccgc catgaggcac aggagccctt cttacaatgg ctcctgcttc ttagcaatga       960
     gtcatctttg ccacatgtac atactgtgag ttacggagac gatgaagact ccctcagcag      1020
     catctacatc cagagagtca acactgagtt catgaaggct gctgctcggg gtctcaccct      1080
     cctttttgcc tcaggtgaca ctggagctgg gtgttggtct gtctccggaa gacacaagtt      1140
     ccgccctagc ttccctgctt ccagccccta tgttactaca gttggaggaa cctccttcaa      1200
     gaatcctttc ctcatcacag atgaagtagt tgactatatc agtggtggag gcttcagcaa      1260
     tgttttccca cggcctccct accaggagga agcagtggcc cagttcttga aatccagctc      1320
     tcatctacca ccatccagtt acttcaatgc tagtggccgt gcctacccag atgttgccgc      1380
     actatctgat ggctactggg tggtcagcaa catggtcccc attccatggg tatctggaac      1440
     ctcggcctct actccagtgt ttgggggaat tttatccttg ataaatgagc acagaatcct      1500
     caatggccgc cctcctcttg gctttctcaa ccccaggctc tatcagcagc atgggacagg      1560
     actctttgat gtaacccacg gctgccatga gtcctgtctg aatgaagaag tggagggtca      1620
     gggtttctgc tctggtcctg gctgggatcc tgtgacaggt tggggaacac ccaacttccc      1680
     agccctactg aagaccctgc tcaacccttg accctttcgt gccatgacga gaaagcagaa      1740
     ctgttccctg tactaaaagg gaaggctcag tttcttgtta ttcctcgata gaagccctgc      1800
     tgaactcctg ttgcctgctg cagatagctt ctccctaacc ctcagatgct gtgaacagga      1860
     ctcaactctc aatcctactg tgtgccatca aactcaggtc tccaaacttc tacttcaaga      1920
     tcctcaacaa gatgctataa ccagcatatt ttgtctcacc ccaaccccat ctctccttcc      1980
     tctttccagc ttgagatgtg aaagcagggc aagaaggttc agtcttccat tactgacact      2040
     agcaggtcca cccaacgctt accacctctg cactgaccgt acactctatt tctcttcggg      2100
     tttgcttttc cgttcactga agtgagacct ttgactaatc gttttgtctt tcttctctcg      2160
     gcactgaagt acaatggtct ccccaatgtt ttatccagtt ataccctttt cagtgtttgt      2220
     tttatgggtt ttcttattta agaacaggtt gtcaaaaaac cattaaaaaa aaaaaaaaa       2279
//