ID EU493092; SV 1; circular; genomic DNA; STD; VRL; 7020 BP. XX AC EU493092; XX DT 08-NOV-2008 (Rel. 97, Created) DT 27-FEB-2009 (Rel. 100, Last updated, Version 2) XX DE Caretta caretta papillomavirus 1, complete genome. XX KW . XX OS Caretta caretta papillomavirus 1 OC Viruses; dsDNA viruses, no RNA stage; Papillomaviridae; OC Dyozetapapillomavirus. XX RN [1] RP 1-7020 RX DOI; 10.1016/j.virol.2008.09.022. RX PUBMED; 18973915. RA Herbst L.H., Lenz J., Van Doorslaer K., Chen Z., Stacy B.A., RA Wellehan J.F.Jr., Manire C.A., Burk R.D.; RT "Genomic characterization of two novel reptilian papillomaviruses, Chelonia RT mydas papillomavirus 1 and Caretta caretta papillomavirus 1"; RL Virology 383(1):131-135(2009). XX RN [2] RP 1-7020 RA Herbst L.H., Lenz J., Van Doorslaer K., Chen Z., Stacy B.A., RA Wellehan J.F.X.Jr., Manire C.A., Burk R.D.; RT ; RL Submitted (14-FEB-2008) to the INSDC. RL Pathology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, RL Bronx, NY 10461, USA XX FH Key Location/Qualifiers FH FT source 1..7020 FT /organism="Caretta caretta papillomavirus 1" FT /host="Caretta caretta" FT /mol_type="genomic DNA" FT /country="USA" FT /db_xref="taxon:485241" FT gene 1..258 FT /gene="E6" FT CDS 1..258 FT /codon_start=1 FT /gene="E6" FT /product="E6" FT /db_xref="UniProtKB/TrEMBL:B6RUP8" FT /protein_id="ACD39812.1" FT /translation="MPPCSASLHAAFSRYAKLRKLCKRLRIDIGQLHVSCVYCKKQLSE FT LEVLHLLEDGGPFAVTKGKKKKERRCFAACCRCRLFLDNA" FT gene 251..586 FT /gene="E7" FT CDS 251..586 FT /codon_start=1 FT /gene="E7" FT /product="E7" FT /db_xref="UniProtKB/TrEMBL:B6RUP9" FT /protein_id="ACD39813.1" FT /translation="MHNKGQPIKGYSGKYMCACCRKDVSFAQQEICIMLSEQLGMLVCM FT NCEVVVPSNQIQDALGTADLAEVTVSCGIGLVDEGSDLASYSELETDTDSDNEGEAFLA FT GTDSEAV" FT gene 589..2274 FT /gene="E1" FT CDS 589..2274 FT /codon_start=1 FT /gene="E1" FT /product="E1 replication protein" FT /db_xref="GOA:B6RUQ0" FT /db_xref="InterPro:IPR001177" FT /db_xref="InterPro:IPR014015" FT /db_xref="UniProtKB/TrEMBL:B6RUQ0" FT /protein_id="ACD39814.1" FT /translation="MADYPESPDSPPGTYSDLFDNEATEVYASGDDEVQEEESDETLAG FT GQGCKRRLPSTPGSATDVGEALSPQFDALQLSRKPRSKKCRRKLNMTLDSECGNSTCAA FT SSTPAMSALAFSFEKRSQRIKQQALFKDSFMVSFSDISRQFKSNKTQSNGWVFALFDYC FT MDITHLHNALKETCTSILTDHNPTYRSYFFLCDFISRKSRESLLHLIRPFGVAAEDVHL FT CEPPNTRSVPAAVFFTKVTLLHGMLPLWIQQLTAIGDSNADQFKLVEMIQWAYDNNYTE FT ESRIAYEYAKLATEDTNAKAWLKNNSQAKYVKDCAVMVRLYKKGEVQAMSLSDLIVDRC FT EHFTVYDPDGWKNILLLLRFQQIPLAEFLKALKDCLHCVPKKCCLAFVGVPDSGKSMLC FT MSLIEFLEGRVLSFSNARSHFWLQPLGECKIALIDDATKPCWQYIETYMRNALDGNPVC FT LDAKYKAPVQIRCPPILITSNVDIRQATTDADGMAQESDFKYLLNRISIFPFCRPIPVR FT EGRLRFTVQTSDWKSFFLTYSSALEFDLTDYQDGCEHETDGNAG" FT gene 2237..3454 FT /gene="E2" FT CDS 2237..3454 FT /codon_start=1 FT /gene="E2" FT /product="E2 regulatory protein" FT /db_xref="GOA:B6RUQ1" FT /db_xref="InterPro:IPR000427" FT /db_xref="InterPro:IPR001866" FT /db_xref="InterPro:IPR012677" FT /db_xref="UniProtKB/TrEMBL:B6RUQ1" FT /protein_id="ACD39815.1" FT /translation="MDANTRLMEMQDKQMTIIEKDDQSLDDILEYYFAVKDEYLILAAA FT RKAGVAHIGLQRVPSLQVSESKYREAVTMIIIVQSLQNSQFKNVNFTLRDLQYQLVMQA FT PAFTIKKGPKTVYVTYSGPDKLTVTHTKWKDIYYQRDEQWRSSKDLQHTHDPNWFRAHT FT LTDSKGLFYVDIYGDVDYYVLFNSGEPQNATRKGQWEISTSPPDARTSTDTPSRKTPER FT HIVAITPPIRNPRNPAYQSKPTHPTPTKPTHTSTVTTRRGNGRGRGLGGRGGGGGGAGG FT STTRRPFAEKQSPVSAEEVGRTRETVQGTGTRLERLLKEARDPPGLVFEGSTAQIKHIR FT RRVEAGSLKYSRVTSTWHWISGKKVLKPSKMIVVFNNEKERSTFLNLFRVEGDGITVRL FT CSFNGL" FT gene 2751..3191 FT /gene="E4" FT CDS 2751..3191 FT /codon_start=1 FT /gene="E4" FT /product="putative E4 protein" FT /db_xref="UniProtKB/TrEMBL:B6RUQ2" FT /protein_id="ACD39816.1" FT /translation="MLTFMVTSTIMSFLIAESPKTPPGKGSGKSPRPRPTPEHPPTPHP FT GRRPKDTSSPSHRLSGTLGTPHTSPSPPILPPRSPPTPPLSPRDVETGEEGASGGGEEE FT GGEQEVPQPVGPSLKSKVQSLQKRLDELEKQCKALERDLNGY" FT gene 3451..4809 FT /gene="L2" FT CDS 3451..4809 FT /codon_start=1 FT /gene="L2" FT /product="L2 capsid protein" FT /db_xref="GOA:B6RUQ3" FT /db_xref="InterPro:IPR000784" FT /db_xref="UniProtKB/TrEMBL:B6RUQ3" FT /protein_id="ACD39817.1" FT /translation="MMSKRRRVTRASPDDIWRHCKQFGDCPDDIQKVYTGNTIADNILK FT WASSFLFFGGLGIGSAEGAVAAAASEHILPIGGGSLPKQPIDVPITRVPASNVTPGFSD FT ITVNPDVALDAGTVVHAAEPVDPVSGTPPIIHASPNSTEVIPPIRPVENPPWQNPFDSG FT LETPGVNVGVVDYSAGNEIELSVLSSTAPTLTNAVEETELFSRFELDPRTSTPNTTTRG FT GWMSHVAVGRFAKTAAREVPLPVLTSTGGVMQFENPAFEFSEAVSEVSRSISFNDPDSA FT PFARLSRPSLFQRAGRLGVQRVGNLLGMVTRAGKQLFVPRVYYNELSSIFESPDVLEME FT PIIIEDSGPPIEDEAIPGAPAGVFPQGNRPYAYNGYLFGPIPVDVSIKVSGTGFIPMPV FT TVSGNTIFPLYPSFDKSTPLYPPRHVFFSDLDDPIMFKRRKKCFADGCVDAFY" FT gene 4784..6268 FT /gene="L1" FT CDS 4784..6268 FT /codon_start=1 FT /gene="L1" FT /product="L1 capsid protein" FT /db_xref="GOA:B6RUQ4" FT /db_xref="InterPro:IPR002210" FT /db_xref="InterPro:IPR011222" FT /db_xref="UniProtKB/TrEMBL:B6RUQ4" FT /protein_id="ACD39818.1" FT /translation="MAVWTPSTKALFVPPVNVPTLYSTREYVRRTSYVFHGTTERLITI FT GNPYFALTDNATVTVPKVSAYQHRVFRIKLPDPNKFPIPESAVGDRDTTRLVWAVRGIQ FT VNKSQPLGVGASGNTMFNGLQDFAETHHPSMEKPDPPEDRRVNAAFDAKQSQALIVGCI FT PPVGQHWDAAKRCVEDNNKDMCPPLELQHTVIEDGDMIDMGMGTLNFKSLSLNWSTLPL FT ELINSVSKYPDWLTMNADPYGNHCFFMLKREQVYMKGVGLHLGNIGEDEPTTMFRKGTT FT GQKYQTPGRHSWFPLLSGSLSTSDNQLFNRPYWLENSTAPNDGICWHNQMFVTCVDTTR FT NTIFQISQFKKGVTATADYKEANYDMYARHVEEYEISFILQLCSIKMDLPVLNHLHNMD FT ASLLDDWGFGATPPQNLTVEDQYRFLNSKATKCPPPPATPADADPWGKYKFWDVDCTAQ FT ISSDLTPFPLGRRFQQLYPQAGKPAPSNPRKRRRGR" XX SQ Sequence 7020 BP; 1891 A; 1633 C; 1676 G; 1820 T; 0 other; atgccccctt gctctgcgtc tttgcatgct gccttttctc gatacgctaa attaagaaaa 60 ctatgtaagc gtctacggat agacatagga cagctacatg tgtcttgcgt ttactgtaag 120 aaacaactga gcgagctgga agtgctgcat ctgcttgagg atggtgggcc atttgccgtg 180 acaaagggga aaaaaaagaa ggaaaggcgt tgctttgctg cctgttgccg ttgtcgtctg 240 tttttggaca atgcataaca aaggccagcc gattaaaggg tacagtggga aatacatgtg 300 tgcttgttgt cgtaaagacg tgtcttttgc tcagcaagag atatgtatca tgctgagtga 360 gcagttggga atgctggtat gcatgaattg cgaggtggtg gtgccatcca atcaaatcca 420 ggacgccctc gggactgctg accttgctga ggtgacggtc agctgtggta ttggccttgt 480 agatgaggga agtgatcttg catcttacag tgaattagaa acagacacag actcagacaa 540 cgaaggagaa gcatttctag ctggaacgga ctcggaggca gtgtaaccat ggctgattat 600 cccgaatcac ccgactcgcc accaggtaca tattcagatt tatttgacaa cgaagccaca 660 gaagtatatg ccagcgggga tgatgaagtg caagaggaag aaagtgatga aacgttggca 720 gggggtcagg gatgtaagcg cagactacca agcacaccag gcagcgctac cgatgttgga 780 gaagcgctaa gtccccaatt cgacgccttg cagctaagca gaaaacccag atcaaagaag 840 tgtaggcgaa aactaaacat gacccttgac agcgagtgtg gaaacagcac gtgtgctgca 900 tcctctactc ccgctatgtc agctttagct ttcagtttcg aaaagcgcag ccagcgtata 960 aaacagcagg cgctattcaa agactctttt atggtgtcat tcagtgacat ctctcggcaa 1020 ttcaaaagta acaaaactca gagtaatggt tgggtatttg cgttgtttga ctattgcatg 1080 gacattacgc atttacacaa tgctttgaag gaaacatgta catccatcct tacggatcac 1140 aatcctacct ataggtcata tttctttctc tgtgatttta tatctagaaa gtctagggaa 1200 tcattgttac atcttattag gccgttcggt gtagcggcag aggacgtgca tttgtgtgaa 1260 cctccaaata caagaagtgt tcccgctgcg gtgtttttta caaaagtgac attgctacat 1320 ggtatgctac ccctatggat tcagcagcta actgccattg gggacagtaa tgccgatcag 1380 tttaagctag tagaaatgat ccagtgggct tatgacaaca actatacaga agaaagcaga 1440 atagcgtacg agtatgctaa attggctacc gaagatacaa atgctaaagc ttggctaaaa 1500 aacaatagtc aagctaaata cgtgaaggat tgtgcagtaa tggtcaggct ttataaaaag 1560 ggagaagtgc aggctatgtc gctcagcgat ttaatagtag atcgctgtga acattttact 1620 gtttatgacc cagacggttg gaaaaacatc cttttgctgc ttcgcttcca gcaaatcccg 1680 ctagctgaat ttctaaaagc tctaaaagat tgcctccatt gcgttccaaa gaaatgttgt 1740 ttagcatttg ttggcgtgcc agacagtggg aaaagtatgc tttgcatgag cttaatcgag 1800 ttcttagaag gacgagtact tagctttagc aacgctcgca gccacttttg gttgcaaccg 1860 ttaggtgaat gtaaaattgc cctaatagat gatgctacta agccatgttg gcagtatatt 1920 gaaacataca tgcgcaatgc gctagatggc aaccctgtgt gcttagacgc aaagtacaaa 1980 gcacctgttc aaatacgttg cccgccaatt cttattacta gcaatgttga tataaggcaa 2040 gcaacaactg acgctgatgg catggcacag gaaagtgact ttaaatattt gttaaatagg 2100 atttctatct ttccattctg caggccaatt ccagtaagag aagggagatt gaggtttact 2160 gtgcaaacct ctgactggaa gtcgtttttc cttacatact catcagcgct tgagttcgat 2220 ttaactgatt atcaggatgg atgcgaacac gagactgatg gaaatgcagg ataaacaaat 2280 gactattatt gaaaaggacg accaatcatt agatgatatt ctcgagtact attttgcagt 2340 gaaggacgag tatttgatac tggcagctgc gcgcaaagca ggtgtggccc atattgggtt 2400 acagcgcgtg ccatctttgc aggtgtcaga gtcgaaatac cgagaggccg tcactatgat 2460 catcatagtg cagtccctcc aaaactcaca gtttaaaaat gtcaacttta cgttgcgaga 2520 cttgcaatac caactggtaa tgcaggcacc agcgttcacg atcaaaaaag gccccaaaac 2580 ggtttacgtg acgtactctg ggccagataa actgactgtg acccatacaa agtggaaaga 2640 catttattat caaagggatg agcagtggcg aagcagcaaa gacctgcaac acacacatga 2700 cccgaattgg tttcgcgcgc ataccttgac agacagtaaa ggactgtttt atgttgacat 2760 ttatggtgac gtcgactatt atgtcctttt taatagcgga gagccccaaa acgccacccg 2820 gaaagggcag tgggaaatct ccacgtcccc gcccgacgcc agaacatcca ccgacacccc 2880 atcccggaag acgcccgaaa gacacatcgt cgccatcaca ccgcctatca ggaaccctag 2940 gaaccccgca taccagtcca agcccaccca tcctaccccc acgaagccca cccacacctc 3000 cactgtcacc acgagacgtg gaaacgggag aggaaggggc ctcgggggga ggggaggagg 3060 agggggggga gcaggaggtt ccacaacccg tcggcccttc gctgaaaagc aaagtccagt 3120 ctctgcagaa gaggttggac gaactagaga aacagtgcaa ggcactggaa cgcgacttga 3180 acggctactg aaagaagccc gagacccccc tggccttgtg tttgagggtt ccaccgctca 3240 aattaagcac attaggcggc gcgtagaagc agggtctctt aaatactcta gggtcacatc 3300 cacatggcat tggataagcg gcaagaaggt gttaaaacct tctaagatga ttgttgtttt 3360 taataatgaa aaggaacgaa gtacgttcct aaatctcttt cgcgtagagg gtgatgggat 3420 cacggtgcga ctttgctcat ttaacggcct atgatgtcta aacgtcgccg tgtaaccaga 3480 gcgagtccag atgacatttg gcgccactgt aaacagtttg gcgactgtcc tgacgacatc 3540 caaaaagtat atacaggcaa tactattgca gacaacatcc taaaatgggc ctcatctttt 3600 ctgtttttcg gtggcttggg tattgggtct gcggaagggg ctgttgctgc tgctgcttca 3660 gagcatattc ttcccattgg tggtgggtct ttaccaaagc aaccaattga cgtccccatt 3720 acccgagttc ctgccagcaa tgtcacccct gggttctctg atattacggt caatccagat 3780 gtggcattgg acgcaggcac cgtggtgcac gcagcggagc cagtggaccc cgtaagtgga 3840 accccaccaa ttattcatgc atctccgaat tcgaccgagg taatacctcc catccgcccc 3900 gtggagaacc cgccatggca gaaccctttt gacagcggcc tagaaactcc gggtgtcaat 3960 gttggcgtag ttgattatag tgctggcaat gaaatagaat tatcagtgct gtcatcgact 4020 gctccaacgt tgacaaatgc tgtggaggaa actgaacttt tcagcaggtt tgagctggac 4080 ccacgaacta gtaccccaaa tacaactacc aggggaggat ggatgtctca tgtcgctgtc 4140 ggtcgttttg ctaagacagc ggctagagag gtccctttac ccgtgttaac aagcaccggg 4200 ggagtgatgc agttcgaaaa tcccgccttt gagtttagcg aggcggtgtc cgaggtttcc 4260 cgctcgattt cttttaacga tcctgattct gcaccatttg cacgtctcag ccgtccttct 4320 ctgtttcaac gagcagggag gctaggcgta caaagggtgg gaaatctatt gggcatggtg 4380 actagagcgg gaaagcagct atttgttcca cgagtttatt acaatgaact atcttccata 4440 tttgaaagcc ctgacgtcct ggagatggag cccataataa tagaggatag tggcccacct 4500 attgaggatg aggcgatacc aggggcccca gcaggtgtct ttccccaggg aaacaggcca 4560 tatgcatata atgggtactt gtttggtcca atacctgttg atgtatcaat taaagtgtcc 4620 ggtaccggtt tcatacccat gcctgtaact gtttctggaa ataccatttt ccctttgtat 4680 ccttctttcg acaaaagtac ccccctgtac cctccccgcc atgttttttt ctccgatcta 4740 gacgatccca tcatgtttaa acgacgtaaa aaatgttttg cagatggctg tgtggacgcc 4800 ttctactaaa gcgttgtttg taccccctgt aaatgtcccc actttgtatt caacgcgtga 4860 atatgtacgg cgtacatcct atgtatttca tggtacaacc gagcggttaa tcaccatcgg 4920 caacccatat tttgcactta ctgacaatgc aactgtgacg gttcctaagg tttcagcata 4980 tcagcaccgt gtttttagaa taaagcttcc agatccaaac aagtttccaa ttcctgagtc 5040 cgcggtgggg gatcgagata ccacgcggtt ggtgtgggct gttcgtggca tccaggtcaa 5100 taaaagtcaa ccgttggggg tgggtgcctc tgggaacaca atgtttaacg gtcttcaaga 5160 ctttgctgaa acacatcacc cttctatgga aaagccggac cctccggagg acagacgggt 5220 caatgctgcc tttgatgcaa agcagagtca agcgttaatc gtcgggtgca tcccacctgt 5280 tggccaacat tgggacgctg caaagcggtg tgtggaggac aataataaag atatgtgtcc 5340 accactggaa ctccaacaca cagtaataga ggatggggat atgattgata tggggatggg 5400 gactttaaac tttaaatcat tgagccttaa ttggtcaacc cttccactgg agcttattaa 5460 ttctgtatca aaataccctg actggttgac aatgaatgcc gacccttatg gcaaccattg 5520 ctttttcatg cttaagcggg agcaggtgta tatgaagggt gtcggtttgc atttgggcaa 5580 tattggggaa gacgagccta ccacaatgtt tagaaaaggt accactgggc aaaaatatca 5640 aactcctggc cggcacagct ggtttccttt actgtcaggg tccctgtcca cttcggacaa 5700 tcaacttttc aaccgcccct attggttgga gaatagcaca gctccaaatg atggcatatg 5760 ctggcataac caaatgtttg tgacctgcgt tgacaccacc cgaaatacaa tctttcaaat 5820 ctctcaattt aagaagggtg ttactgcgac tgcagattat aaggaagcca attatgatat 5880 gtacgcccgt catgtggagg agtatgaaat ttcatttatt ctgcaactat gttccattaa 5940 aatggatctt cctgttttaa accacctaca caacatggat gcatcccttc tagatgactg 6000 gggttttggg gccacgccac ctcaaaacct tactgtggag gaccagtaca ggttccttaa 6060 ctccaaggcc actaaatgtc ccccccctcc ggccacgcct gctgacgcgg acccatgggg 6120 aaagtataaa ttctgggatg tagattgcac agcacagata tcatccgatc tcactccctt 6180 tcctttagga cgccggtttc agcaacttta tccgcaggca ggaaaaccgg cgccttctaa 6240 tccccgtaaa cgccgacgag ggcggtaatg ttttacctgc tgcaagtcct cctctgcctt 6300 tcagaacatg acctctatct attgattatc ctttttattt acctggctcg ctgttttctt 6360 aaaaagattc gagattggga atacgcgctg taacccccat tcggtagcgt caaagctctc 6420 caatcggtgg tgtcaacctg catggtatgt atgccaagcg tgtgccaagc ccggggggaa 6480 ctatcaacgg tttccactgt ctgcgccaga cagtatccat ttggataccc atcatgggct 6540 catgttacaa ccgtcattcc aacggtatcc aattggttac ccaaaatatt gccaaattat 6600 ccaaaccgac atctagtgca tccaaaaaag ctatcctttt ggtgctgaca ctgtgtaata 6660 cgtgccagag catctctgtt tgggcgggaa ttcttaatta attgtaaact gccaggtgtg 6720 ctttcattgt ttgactgcca actttaaatg cctgtagtgc atgtacaata aaaacctgat 6780 gttctgccaa cttctctatt tccatctgca cctcctacac tatcctaagg ctgtgtcttt 6840 tgggtgagcc tctccaaagg gcggttctaa atattctttt cggattcgtc catcatatcg 6900 acatcatcgt tcttttctag ttgagggtat ccaataggcg gtatatctaa ttatagggcg 6960 tgactggcat ccggcttctg tttatcactt cgtcgatcgt cggaccaaag gtaagtcgca 7020 //