ID FR692334; SV 1; circular; genomic DNA; STD; VRL; 5086 BP. XX AC FR692334; XX DT 01-DEC-2010 (Rel. 107, Created) DT 01-DEC-2010 (Rel. 107, Last updated, Version 1) XX DE Chimpanzee polyomavirus complete genome, isolate Bob XX KW complete genome. XX OS Chimpanzee polyomavirus OC Viruses; dsDNA viruses, no RNA stage; Polyomaviridae; Polyomavirus; OC unclassified Polyomavirus. XX RN [1] RP 1-5086 RA Verschoor E.J.; RT ; RL Submitted (21-SEP-2010) to the INSDC. RL Verschoor E.J., Virology, Biomedical Primate Research Centre, Lange Kleiweg RL 161 Rijswijk, 2288GJ, NETHERLANDS. XX RN [2] RX PUBMED; 21110837. RA Deuzing I., Fagrouch Z., Groenewoud M.J., Niphuis H., Kondova I., RA Bogers W., Verschoor E.J.; RT "Detection and characterization of two chimpanzee polyomavirus genotypes RT from different subspecies"; RL Virol J 7:347-347(2010). XX FH Key Location/Qualifiers FH FT source 1..5086 FT /organism="Chimpanzee polyomavirus" FT /host="Pan troglodytes verus" FT /isolate="Bob" FT /mol_type="genomic DNA" FT /isolation_source="host blood" FT /db_xref="taxon:305677" FT CDS 156..1085 FT /gene="VP2" FT /product="VP2 protein" FT /function="structural protein" FT /db_xref="GOA:E5AX28" FT /db_xref="InterPro:IPR001070" FT /db_xref="UniProtKB/TrEMBL:E5AX28" FT /protein_id="CBX23436.1" FT /translation="MFTCLGVKPRLRASSQVIISNRRRRTAACQRSFNWRKLTVCVRTV FT TCQAKQRSGDQAGEKSFTVSKLYFLIFSRMGGLLSSLVDMIVMASELSAASGLTIEALL FT TGEALAALEAEVFSLMTVEGLSGIEALAQLGWTAEQFSNMAFISTTFSNAIGYGVLFQT FT VSGISSLVSAGIRLGTSVSSVNRHQTEQELETLFGKIAHFLHVNLAFHLDPFDWCGSIG FT TTMPPEFSNLTLDQLSKLALIIENGRWVIQRSPTHDPLFESGDIIDMFGPPGGARQRVT FT PDWMLPLILRLNGASQEKSSLCVNSNQS" FT CDS 513..1085 FT /gene="VP3" FT /product="VP3 protein" FT /function="structural protein" FT /db_xref="GOA:E5AX29" FT /db_xref="InterPro:IPR001070" FT /db_xref="UniProtKB/TrEMBL:E5AX29" FT /protein_id="CBX23437.1" FT /translation="MTVEGLSGIEALAQLGWTAEQFSNMAFISTTFSNAIGYGVLFQTV FT SGISSLVSAGIRLGTSVSSVNRHQTEQELETLFGKIAHFLHVNLAFHLDPFDWCGSIGT FT TMPPEFSNLTLDQLSKLALIIENGRWVIQRSPTHDPLFESGDIIDMFGPPGGARQRVTP FT DWMLPLILRLNGASQEKSSLCVNSNQS" FT CDS 1033..2526 FT /gene="VP1" FT /product="VP1 protein" FT /function="structural protein" FT /db_xref="GOA:E5AX30" FT /db_xref="InterPro:IPR000662" FT /db_xref="InterPro:IPR011222" FT /db_xref="UniProtKB/TrEMBL:E5AX30" FT /protein_id="CBX23438.1" FT /translation="MAPPRKRARCVSTPTKVKCVPKKCPVPTPVPKLLVKGGVEVLNII FT TGPDSTTEIELYLEPRMGINSPTGDKKEWYGYSEVIHHADGYDNNLLSIQMPQYSCARV FT QLPMLNTDMTCDTLMMWEAVSCKTEIVGIGSLISVHLLEAKMAAKEGGDGPSQPIEGMN FT YHMFAVGGEPLDLQGIESNALTKYASAIPPKTIHPNDIAKLAEEEKPQLQGLVPKAKAR FT LDKDGFYPIEEWSPDPSRNENSRYFGSFVGGLNTPPNLQFTNAVTTVLLDENGVGPLCK FT GDGLFVSAADICGVMVKADNEAIRYRGLPRYFKVTLRKRAVKNPYPITSLLGSLFTGLM FT PKMDGQPMTGPDAQIEEVRIYQGKEGLPADPDMKRYIDQFGQEQTPTPTPAAPAAVAAL FT LEKWREKYSEEHKYDTIQHWGFSYPGHLFTEESQKIPKPPEAPSPKPQETPSQTIPAVT FT EHHVIEEDYTTTPTPARILTSFGGTTNLEKLPGKDSEEV" FT CDS complement(join(2569..4197,4868..5086)) FT /gene="large T" FT /product="large T antigen" FT /db_xref="GOA:E5AX31" FT /db_xref="InterPro:IPR001623" FT /db_xref="InterPro:IPR003133" FT /db_xref="InterPro:IPR003593" FT /db_xref="InterPro:IPR010932" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR016392" FT /db_xref="InterPro:IPR017910" FT /db_xref="UniProtKB/TrEMBL:E5AX31" FT /protein_id="CBX23439.1" FT /translation="MDKVLEKSDREMLIELLGIPSYAFGNFPIMKTAYKRASKIYHPDK FT GGSSEKMMLLNSLWQKFQEGLVDIRGSEGRNTETERESPPKRRRSPEDLDGSYTDSQAS FT FASTPPKQKRQNPDSPSDLPSCLFDFVSHAIFSNKTVNAFILYSNFEKASLLFEKIDKF FT KIEFKSLHKLIEGMAIGGGLVLVMTSGKHRLSAVKNYCQQFCTISFLIIKAVLKPLECY FT QCLCKPPFAQIKANKDGLFSYDFEDRKEENCNWNKVAEFAVAADLDDALLILAHYLDFA FT QPFPCVKCENQKTKAHDFHKAHHENAVLFEMCKSQRSICNQASDVVLAKRRLLLQESSR FT EELLAMCFQKQLKALKALDTLDIYDHMAGVAWYANLFENFDELLFQMLKLLTQNIPKQR FT NILFRGPVNSGKTTLAAALVDLLGGRSLNVNCPADKINFELGCAIDRFFVVFEDVKGQT FT MLNKKLQPGQGISNLDNMRDYLDGAVHVNLERKHMNKRSQIFPPCVMTMNEYILPQTLY FT VRFTLKLEFISRPNLQAAIEKTPGLTAKRILQKGLTLFLLLIWYTPVSKFSISLQEEIA FT NWKAIIEKTVSHSDFCKMLENIEVGESPLTDIIDEGDIA" FT exon complement(2569..4197) FT /gene="large T" FT /number=2 FT intron complement(4198..4867) FT /gene="large T" FT /number=1 FT CDS complement(4502..5086) FT /gene="small t" FT /product="small T antigen" FT /db_xref="GOA:E5AX32" FT /db_xref="InterPro:IPR001623" FT /db_xref="InterPro:IPR003354" FT /db_xref="UniProtKB/TrEMBL:E5AX32" FT /protein_id="CBX23440.1" FT /translation="MDKVLEKSDREMLIELLGIPSYAFGNFPIMKTAYKRASKIYHPDK FT GGSSEKMMLLNSLWQKFQEGLVDIRGSEVCQVSFSDCYDVKLLRNCGTVKHFHEIFLRS FT PQCLQKGSAVCNCITSTLFNQHRQIKLLCNKRCLTWGECFCFSCFLIWFGMDFKWESFD FT MWKYVIAEMPTGLLQLPPSKYKFSSFPLVLK" FT exon complement(4868..5086) FT /gene="large T" FT /number=1 XX SQ Sequence 5086 BP; 1603 A; 987 C; 958 G; 1538 T; 0 other; tttacctgcc tagaagacct gtcctgcgcg cctgctgaag caagtaagtg caagtgtccc 60 taattaggcc tctctccttt ttataagatg aggtggaggc aagaggcctc ctgcctcacc 120 acaattaata aaaaaaagca tccccttgtc tatgcatgtt tacatgtctg ggagtaaagc 180 caaggttaag ggcttctagc caagtaatta ttagcaaccg caggcggcgc acggcagcct 240 gccaaagaag ttttaattgg aggaagttga ctgtgtgtgt tcgcacggtg acctgccagg 300 caaaacaaag gagcggggac caggcaggcg aaaagagctt tactgtaagt aaactgtact 360 ttttaatttt ttctagaatg ggaggtcttt tatcatcttt ggtggatatg attgtgatgg 420 cttctgaact aagtgcagca tctggattga ctattgaagc cctcttaact ggagaagccc 480 tagctgcttt agaagcggaa gttttttctc tcatgacagt agaaggctta tcaggaatag 540 aagctttagc tcagttgggc tggactgcgg aacaattttc caacatggca ttcatttcca 600 ctacattttc taatgccata ggatatggag tattgtttca aacagtctca ggaattagtt 660 cacttgtttc cgccgggata aggttgggaa caagtgtttc atctgtaaat agacatcaaa 720 cagaacaaga attggagact ttatttggta aaattgccca ttttcttcat gtgaatttag 780 ctttccatct ggatccgttt gattggtgtg gttccatagg aacaacaatg cctcctgaat 840 tttcaaattt aactcttgat caactctcaa aattagcttt aataattgaa aatgggagat 900 gggtaattca aagatctccc acccatgatc ctctctttga aagtggggat attattgata 960 tgtttggacc tcctggggga gctagacaaa gagtaacacc tgactggatg ctccctttaa 1020 ttttaaggtt aaatggcgcc tcccaggaaa agagctcgtt gtgtgtcaac tccaaccaaa 1080 gttaaatgtg tgccaaagaa gtgccctgtg cctacaccag tgcccaaact tcttgtgaaa 1140 ggaggagtag aagttttaaa tataattact ggtccagatt ctactacaga aattgaactt 1200 tatttagaac cccgaatggg tattaatagt cctacaggtg ataaaaagga atggtatggc 1260 tacagtgaag ttattcatca tgcagatgga tatgataaca acttgttgag tattcaaatg 1320 cctcaatata gttgtgcaag agttcaatta cctatgttga acacagacat gacctgtgac 1380 acattaatga tgtgggaagc tgtgtcttgt aagactgaaa tagtaggaat tggatcttta 1440 ataagtgttc atctgctaga agcaaaaatg gctgcaaaag agggaggaga cggcccttcc 1500 caaccaatag agggaatgaa ttatcatatg tttgcagttg gaggtgaacc tctagacttg 1560 caaggcatag aaagtaatgc cttaactaaa tatgcttcag ctatacctcc taaaacaatc 1620 catccaaatg atattgctaa attagctgaa gaagaaaaac cacagctgca aggcctagtg 1680 cctaaagcta aagccagatt agataaagat ggcttttatc ctattgaaga atggagcccc 1740 gatccatcta gaaatgaaaa ttctagatat tttggatcct ttgttggggg cctcaatact 1800 ccacccaatt tgcaatttac caatgctgta actactgttt tgttggatga aaatggtgta 1860 ggtcccctgt gtaaaggaga tgggttgttt gtttcagctg ctgatatctg tggtgtcatg 1920 gtaaaggcag ataatgaggc aattagatat cgaggcctcc caagatattt taaagtaact 1980 ttaagaaaga gggcagttaa aaatccctac cctataacga gtctcttggg aagccttttc 2040 acaggcctta tgcctaaaat ggatggacaa cctatgacag gcccagatgc tcaaattgaa 2100 gaagtaagaa tttatcaagg aaaagaaggg ttaccagctg acccagacat gaaaagatac 2160 atagatcaat ttgggcaaga acaaactccc acacccacac cagctgcccc tgctgcagta 2220 gctgctttgt tggaaaagtg gagggaaaaa tattctgaag agcataagta tgacactatt 2280 cagcactggg gttttagtta tcccgggcat ctattcacag aggaatccca gaaaattcct 2340 aaacccccag aggctcccag ccctaaaccc caagagacac cctcccaaac tattccagct 2400 gtcactgagc atcatgtaat tgaagaagat tataccacca caccaacccc cgcccgcatc 2460 ttaactagtt ttggaggaac tactaacttg gaaaaattac caggcaaaga ctcagaagaa 2520 gtataaatgt ttattgtatg tatttgcatt caataaaaat ctttattctt aagcaatatc 2580 tccttcatca attatatcag ttaaaggact ttctcctact tcaatatttt ctaacatttt 2640 acaaaaatca gagtggctta cagttttttc aataatagct ttccaatttg caatttcttc 2700 ttgtaaagaa attgaaaatt tactgacagg agtataccaa attaataata gaaaaagagt 2760 taaacccttt tgtagtattc tttttgcagt taaacctggg gttttttcaa tagctgcttg 2820 gaggttgggc ctactaataa attctaactt taaagtgaat cttacatata aagtctgagg 2880 taatatatac tcattcattg tcataacaca aggaggaaaa atttgacttc ttttattcat 2940 atgttttctt tctaaattaa catgcactgc gccatctaag taatctctca tattatctaa 3000 attagaaatc ccctggcctg gttgcaattt tttatttaac atagtttgac ctttgacatc 3060 ctcaaacacc acaaagaatc tgtcaatggc acagccaagt tcaaagttta ttttatctgc 3120 tggacagtta acattcaaag atctccctcc taaaagatcc actaaagctg cagctaaagt 3180 agttttacca ctgtttacgg ggcctctaaa taatatattt ctttgtttag gaatattttg 3240 ggttaataat tttaacattt gaaagagtaa ctcatcaaaa ttttcaaaca aattagcata 3300 ccaagctact ccagccatat gatcatatat atctaatgta tctaaggcct tcaaggcctt 3360 tagttgtttc tgaaagcaca tagctaacaa ttcctctctt gaactttcct gtaataatag 3420 ccttcttttt gctaggacta catcactggc ttgattacag attgaccttt gacttttaca 3480 catttcaaac aatacagcat tctcatggtg ggctttatga aagtcatgcg ctttagtttt 3540 ttgattttca catttcacac aaggaaaagg ctgagcaaag tctaagtaat gagctaaaat 3600 taacaaagca tcatctaaat ctgctgctac tgcaaattca gcaactttat tccaattaca 3660 gttttcttct tttctatctt caaagtcata actaaataat ccatccttat tagctttaat 3720 ttgagcgaat ggaggtttgc ataagcattg ataacattcc aacggcttaa gtactgcttt 3780 tattatcaga aaactaatag tacaaaattg ctgacagtaa tttttaactg cagaaagcct 3840 atgtttacca ctagtcataa ccaaaacaag gcctcctcct attgccatac cttctattaa 3900 tttatgcaag cttttaaatt ctattttgaa cttgtcaatt ttctcaaaga gcaaggaggc 3960 cttttcaaaa ttactatata aaataaaggc atttactgtt ttattgctaa atatagcgtg 4020 actgacaaaa tcaaacaaac aagaaggaag gtcagaagga ctatcagggt tttgcctttt 4080 ttgttttggt ggtgtacttg caaaacttgc ttgcgagtca gtataagatc catccaaatc 4140 ttcaggactt cttcgtctct ttggaggaga ctctctctca gtctctgtat ttcttcctga 4200 agtagaactt ccagatgtgg aagacgtctc ggggaattga ggctcactga acgacggggg 4260 tacttcttgg gaggtagtgg aggtggaagg ggtggaagag aaggggaagg aattgtatcc 4320 agagctttgg gtgtcttctt catcagaaga ggagattggg gactcatcgc agtgtaaatc 4380 tcctctgttg tcagctcttt cattggtaaa tactgaggag caccaactgg catatcgtct 4440 cctgaaagtt gagctgccat aggagtcaga aaatatctga aaaataaaaa tcataagtaa 4500 ttcatttcaa aaccaaggga aaactagaaa acttatactt actaggaggc aactgcagca 4560 acccagtagg catctcagca atcacatatt tccacatatc aaagctctcc cacttgaagt 4620 ccatgccaaa ccatatcaga aagcaagaaa agcagaagca ttctccccaa gtaagacacc 4680 ttttgttaca caataatttg atttgtctat gctgattaaa aagagtacta gttatacagt 4740 tacaaactgc acttcccttt tgcaagcatt gagggcttct aagaaatatc tcatgaaaat 4800 gttttacagt tccacaattt cttaaaagtt ttacatcata gcaatcagaa aaagaaactt 4860 gacatacctc tgagcctctt atatcaacaa gaccctcttg aaatttttgc cacagtgaat 4920 ttaaaagcat cattttttca ctgctgcctc ctttgtcagg atgatagatt ttagaagccc 4980 ttttataagc tgttttcatt ataggaaaat ttccaaaagc atagcttgga attcctaaaa 5040 gttctataag catttctcta tcactttttt ctagcacctt gtccat 5086 //