ID K02562; SV 1; circular; genomic DNA; STD; VRL; 5270 BP. XX AC K02562; XX DT 02-JUL-1986 (Rel. 09, Created) DT 14-NOV-2006 (Rel. 89, Last updated, Version 10) XX DE African green monkey polyomavirus, complete genome. XX KW complete genome. XX OS African green monkey polyomavirus OC Viruses; dsDNA viruses, no RNA stage; Polyomaviridae; Polyomavirus. XX RN [1] RP 1-5270 RX DOI; 10.1016/0042-6822(85)90108-4. RX PUBMED; 2998001. RA Pawlita M., Clad A., zur Hausen H.; RT "Complete DNA sequence of lymphotropic papovavirus: prototype of a new RT species of the polyomavirus genus"; RL Virology 143(1):196-211(1985). XX FH Key Location/Qualifiers FH FT source 1..5270 FT /organism="African green monkey polyomavirus" FT /mol_type="genomic DNA" FT /db_xref="taxon:12480" FT CDS 703..1773 FT /codon_start=1 FT /product="VP2 capsid protein" FT /db_xref="GOA:P04011" FT /db_xref="InterPro:IPR001070" FT /db_xref="UniProtKB/Swiss-Prot:P04011" FT /protein_id="AAA47060.1" FT /translation="MGGVLSLLFNISEIAAELSLSTGFTLDAILTGEAFAAVSTEAAWL FT IEIEAVDLAGLSTLEALSLTGLTTEQFSLLSAIPTALNNAIGIGVFFQTVSGASAVVAA FT GLTTFGYSKQVPVVNMALVPWFPQVDYLFPGFTSFSYYLNAVLDWGESLFHAVGTELWR FT HLMRQATLQIGQATRAVAVRSTNELSHTLAQIAENARWALTSGPVHIYSTVQDYYRYLP FT ARNPIQLRQEYRNRGEPPPSTADFEYQENREGQTARRELGYDEPRSGQYVEHYTAPGGA FT HQRVTQDWMLPLILGLYGDITPTSEVELNKLEKEEDGPSKKKARRSMQKNMPYSRSRPQ FT APSKRRSRGARSKNRA" FT CDS 1060..1773 FT /codon_start=1 FT /product="VP3 capsid protein" FT /db_xref="GOA:P04011" FT /db_xref="InterPro:IPR001070" FT /db_xref="UniProtKB/Swiss-Prot:P04011" FT /protein_id="AAA47061.1" FT /translation="MALVPWFPQVDYLFPGFTSFSYYLNAVLDWGESLFHAVGTELWRH FT LMRQATLQIGQATRAVAVRSTNELSHTLAQIAENARWALTSGPVHIYSTVQDYYRYLPA FT RNPIQLRQEYRNRGEPPPSTADFEYQENREGQTARRELGYDEPRSGQYVEHYTAPGGAH FT QRVTQDWMLPLILGLYGDITPTSEVELNKLEKEEDGPSKKKARRSMQKNMPYSRSRPQA FT PSKRRSRGARSKNRA" FT CDS 1652..2758 FT /codon_start=1 FT /product="VP1 capsid protein" FT /db_xref="GOA:P04010" FT /db_xref="InterPro:IPR000662" FT /db_xref="InterPro:IPR011222" FT /db_xref="UniProtKB/Swiss-Prot:P04010" FT /protein_id="AAA47062.1" FT /translation="MAPQRKRQDGACKKTCPIPAPVPRLLVKGGVEVLEVRTGPDAITQ FT IEAYLNPRMGNNIPSEDLYGYSNSINTAFSKASDTPNKDTLPCYSVAVIKLPLLNEDMT FT CDTILMWEAVSVKTEVVGISSLVNLHQGGKYIYGSSSGCVPVQGTTYHMFAVGGHPLEL FT QGLVASSTATYPDDVVAIKNMKPGNQGLDPKAKPLLDKDGNYPVEVWCPDPSKNENTRY FT YRSFTGGATTPPVMQFTNSVTTVLLDENGVGPLCKGDKLFLSCADIAGVHTNYSETQVC FT TALPRYFNVTLRKRIVKNPYPVSSLLNTFFSGLMPQIQRQPIERVSGQVEEVRIFQGTE FT GLPGDPDLNRYVDKFCQHQTVLPVSNDM" FT CDS complement(join(2822..4678,5034..5270)) FT /codon_start=1 FT /product="large t-antigen" FT /db_xref="GOA:P04008" FT /db_xref="InterPro:IPR001623" FT /db_xref="InterPro:IPR003133" FT /db_xref="InterPro:IPR003593" FT /db_xref="InterPro:IPR010932" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR016392" FT /db_xref="InterPro:IPR017910" FT /db_xref="UniProtKB/Swiss-Prot:P04008" FT /protein_id="AAA47059.1" FT /translation="MDQTLSKEERNELMDLLQITRAAWGNLSMMKKAYKNVSKLYHPDK FT GGDSAKMQRLNELFQRVQVTLMEIRSQCGSSSSQGYFSEDFYFGPTTFQYSPMDRDAVR FT EDLPNPGEGSWGKWWREFVNRQCCDDLFCSETMSSSSDEDTPPAAQPPPPPAPSPEEED FT EIEFVEETPSSCDGSCSQSSYTCTPPKRKKTEEKKPDDFPVCLYSFLSHAIYSNKTMNS FT FLIYTSLEKARQLYKTVEKSKIVVDFKASFSYQDEEGEGCLLFLITLGKHRVSAVKHFC FT VSQCTFSFIHCKAVVKRLELYKTLSKPRFKLLEENKPGVSMFEFQEEKEQCVNWQEICN FT FANEANISDVLLLLGIYIDFAVEPGKCGKCEKKQHKFHYNYHKAHHANACLFLESRAQK FT NICQQAVDQVLAAKRLKLVECSRIELLEERFLQVFDEMDDFLHGEIEILRWMAGVAWYT FT ILLDNCWDVFQNMVQLITTSQRKKRNVLIKGPINSGKTTLASAFMDFFDGKALNINCPA FT DKLSFELGCAIDQLCVLLDDVKGQITLNKHLQPGQGVNNLDNLRDHLDGTIKVNLEKKH FT VNKRSQIFPPVIMTMNEYLLPRTIGVRFALHLHLKPKAYLKQSLEKSDVVAKRILNSGY FT TILLVLLWYNPVDSFTPKVQEKVVQWKETLEKYVSITQFGNIQQNIIDGKDPLHGIVIE FT EQM" FT exon complement(2822..4678) FT /number=2 FT /note="large t-antigen" FT CDS complement(4701..5270) FT /codon_start=1 FT /product="small t-antigen" FT /db_xref="GOA:P04009" FT /db_xref="InterPro:IPR001623" FT /db_xref="InterPro:IPR003354" FT /db_xref="UniProtKB/Swiss-Prot:P04009" FT /protein_id="AAA47063.1" FT /translation="MDQTLSKEERNELMDLLQITRAAWGNLSMMKKAYKNVSKLYHPDK FT GGDSAKMQRLNELFQRVQVTLMEIRSQCGSSSSQVAWFFWDENFRTLGAFLGEKFNEKI FT IGLYPTCTKFVRANCNCIVCLLKKQHAGTKKNLKKPCLVWGECWCYKCYLVWFGFPEDF FT TSFRYWTVLMANMDLSMLKLWTELGF" FT exon complement(5034..5270) FT /number=1 FT /note="large t-antigen" XX SQ Sequence 5270 BP; 1556 A; 1123 C; 1031 G; 1560 T; 0 other; ggctgtgaat aaaatggaat aaatttaatt acctaggtgg ccttaatact taattatatc 60 ggaaaatatc tcagggcagc ttacctaatg agtttttgga aaagcctcca aagcctctct 120 ctttgttgaa agaagaggag gaggctaggg gcccctggcc tcttatacca ggtagaaaaa 180 actagggttg ccatagtgat tttgcagact tgcagacttc aataacaggt tgtttttgca 240 gaatcaactg aaagaggaag ctgtggttag actcgcagac ttcaataaca ggttgttttt 300 gcagaatcaa ctgaaagagg aagctgtggt tagactcgct caccgcctcc aaagcctctc 360 tctttgttga aagaagagga ggaggctagg ggcccctggc ctcttatacc aggtagaaaa 420 aactagggtt gccatagtga ttttgcagac ttgcagactt caataacagg ttgtttttgc 480 agaatcaact gaaagaggaa gctgtggtta gactcgctca ccgcaccgcc cggggacttt 540 ttgtgacatt tcctgataag tgacaaacaa ctgttctgcg gtctagttgc taggtggcgc 600 ttagcaacct acttagcaac ctcaggtaga ggaagttact catttacatc agcgccgccg 660 agaagcccgc cttttttaaa ttagcccgcc atttggtaag aaatgggggg tgtattatct 720 cttttgttta atatttctga aattgctgct gaattaagct taagtactgg atttacactt 780 gatgctatcc ttactgggga ggcttttgct gctgtaagta ctgaggcagc ctggctcatt 840 gaaatagaag cagtggatct tgctggactt agtactctag aggccttgtc tcttactgga 900 cttacaacag agcagttttc cctcctaagt gctatcccaa cagctctcaa caatgccata 960 ggaataggag ttttttttca aactgtttca ggtgccagtg ctgtggttgc tgcaggactg 1020 acaacttttg gatactccaa acaagtacca gttgttaata tggctcttgt gccttggttt 1080 cctcaagttg attatttgtt cccgggattt acctctttta gctactacct gaatgctgta 1140 cttgactggg gtgaatcatt gtttcatgct gtgggcacag aactatggag gcatttgatg 1200 agacaggcca ctttgcaaat tggtcaagct acaagggctg tggctgtcag aagtaccaat 1260 gaactgagtc acaccttggc tcaaattgct gaaaatgcta ggtgggcttt gaccagtggg 1320 cctgtccata tttattccac tgtccaagat tattacaggt atctacctgc tagaaatccc 1380 atccagctaa gacaagaata tagaaacaga ggagagcctc ctccaagtac agctgatttt 1440 gaatatcaag aaaataggga aggtcaaacc gcaagaagag aactgggtta tgatgaacct 1500 aggtctggtc agtatgtaga acattatact gctccaggag gggcacacca aagagtaact 1560 caagactgga tgcttcctct aattctaggt ttatatggtg atataactcc cacctcggaa 1620 gtggagctta ataaattgga gaaagaagaa gatggcccct caaagaaaaa ggcaagacgg 1680 agcatgcaaa aaaacatgcc ctattcccgc tcccgtcccc aggctcctag taaaaggagg 1740 agtagaggtg ctagaagtaa gaacagggcc tgatgctatt acccaaattg aggcctatct 1800 taatcctaga atgggaaata atattccttc tgaggacttg tatggatata gtaattctat 1860 aaatactgct ttcagtaagg cctctgacac ccccaacaaa gacacccttc cttgttattc 1920 agtagctgtt attaaactcc ccctcctaaa tgaagacatg acctgtgaca ccattttgat 1980 gtgggaagca gtgtctgtaa agactgaagt tgttggaatt tcctcactag ttaatttgca 2040 ccagggagga aagtacatct atgggtcatc ctcagggtgt gtccccgtgc agggcactac 2100 ctatcacatg tttgctgttg gaggacaccc cctggaactc caaggcctag ttgctagctc 2160 tacagctacc tatcctgatg atgtagttgc tattaaaaat atgaaaccag gaaaccaagg 2220 cctagatcca aaggccaaac ccttgctgga taaagatgga aactacccag tggaggtgtg 2280 gtgccctgac ccctctaaaa atgaaaatac tagatattat cggagtttta cagggggagc 2340 caccacccca ccagttatgc agttcactaa ttctgtcaca actgtgctgc tggatgaaaa 2400 tggagttggg cctctttgta aaggggacaa actgtttctg tcttgtgctg atattgctgg 2460 agttcatacc aactattctg aaacccaagt ttgcacggcg cttcccagat atttcaatgt 2520 gaccctcagg aaaaggattg ttaaaaatcc ttatcctgtc agctctcttc taaatacctt 2580 cttctctggt cttatgcccc aaattcagcg acaaccaatc gaacgggtct caggacaagt 2640 ggaagaagtc agaatatttc agggaacaga aggactccca ggggaccctg accttaatag 2700 atatgttgat aaattttgtc agcaccagac tgttctccct gtatcaaatg atatgtgagg 2760 ctgaatgcag tgtaagactt tattgtacca gaaataaaac agaaaatgat gattacattg 2820 tttacatttg ttcttcaatt acaattccat gcaaggggtc ttttccatca atgatatttt 2880 gctgaatatt accaaactga gtaattgaca catatttttc aagggtttct ttccattgca 2940 ccactttttc ttgcactttt ggagtaaaag aatccacagg attgtaccat aacaaaacga 3000 gcaaaatagt atatcctgaa tttaatattc ttttggctac cacgtcactt ttttccaggc 3060 tttgtttaag ataagcctta ggttttaaat gcagatgaag agcaaatcta actcctatgg 3120 tacgaggcaa caagtactca ttcatagtca taataaccgg gggaaaaatt tgactccttt 3180 tgtttacatg tttcttttct aaattaactt taattgttcc atcaagatga tctctcaggt 3240 tatcaagatt atttacccct tgacctggtt gcaagtgctt atttaaggtt atttggccct 3300 tcacatcatc taacaaaaca cacaattgat caatagcaca gccaagttca aaggacagtt 3360 tatctgcagg acaatttata tttagagctt tgccatcaaa aaaatccatg aaagcagaag 3420 ccaaagtagt tttaccactg ttaattggtc cctttatcag gacattcctt tttttgcgtt 3480 ggctggtagt tattaattgt accatatttt gaaaaacatc ccaacaatta tctagtaaaa 3540 tggtgtacca ggccacaccc gccatccatc ttagaatttc tatctcacca tgcaggaagt 3600 catccatttc atcaaaaacc tgcaaaaatc tctcttctaa taattcaatt ctactgcatt 3660 ctactaattt taacctttta gctgctagga cctggtcaac tgcttgttgg caaatgtttt 3720 tttgggctct actctccaag aagaggcaag cattggcatg atgtgctttg tgataattat 3780 agtggaattt gtgctgcttt ttttcacact tgccacattt gccaggttcc actgcaaaat 3840 ctatgtagat gccaagcaac aataagacat cagaaatgtt ggcctcattt gcaaagttac 3900 atatttcttg ccaattaaca cactgttcct tctcctcttg gaactcaaac atggatacac 3960 ccggtttgtt ctcttccaac aacttaaaac gtggtttact taaggtctta tataactcta 4020 gacgtttaac aacagcttta caatgaataa aactaaaagt acattgggat acacaaaaat 4080 gcttaacagc agacactcta tgttttccta aagtaattaa aaacagcaaa cacccctccc 4140 cttcctcatc ctgataagaa aaactagcct taaaatcaac tacaatttta gatttttcca 4200 cagttttata cagttgcctg gctttctcca aactagtata tattaaaaaa ctattcatag 4260 tcttattact ataaattgca tgacttaaaa aggaatataa acatacagga aaatcatctg 4320 gcttcttttc ttcagttttc ttccttttag ggggggtgca ggtgtaggag ctttgagaac 4380 aagatccatc acaggaactt ggggtctctt ctacaaattc tatttcatcc tcttcttctg 4440 gggaaggggc aggaggagga ggaggttgcg ccgctggggg ggtgtcttca tcacttgaac 4500 tactcattgt ttctgagcaa aacaaatcat cacaacattg cctattaaca aactctctcc 4560 accatttccc ccaagaccct tcccctggat ttggaagatc ctcccgaact gcatctcgat 4620 ccatagggct atattgaaag gtggtaggcc caaagtagaa gtcctcactg aagtaaccct 4680 agaaaataaa aatacttaca ttagaatccc agttccgtcc aaagcttgag catagataaa 4740 tccatatttg ccataagaac ggtccagtag cgaaaagagg tgaaatcctc aggaaagcca 4800 aaccatacta aataacattt gtagcaccaa cattctcccc agactaaaca tggctttttt 4860 aaattttttt ttgtacctgc atgctgcttt tttagcagac atactataca attacaatta 4920 gctcttacaa atttagtgca agtagggtag agtccaataa ttttttcatt aaatttttct 4980 cctagaaaag ctcctagggt tctaaaattc tcatcccaaa aaaaccaagc tacctgggaa 5040 gaagaggatc cacattgact ccttatctcc atcaaggtaa cctggaccct ttgaaataat 5100 tcattgagcc gctgcatttt agctgaatct cctcctttat caggatggta gagcttggag 5160 acatttttat aggctttttt catcatagaa agatttcccc atgcagctct agttatttgc 5220 aataaatcca taagctcatt tctctcctcc ttagacagcg tttggtccat 5270 //