ID AM502327; SV 1; circular; genomic DNA; STD; VRL; 2799 BP. XX AC AM502327; XX DT 17-JUN-2007 (Rel. 92, Created) DT 17-JUN-2007 (Rel. 92, Last updated, Version 1) XX DE East African cassava mosaic virus DNA-A, clone 24-ug175-8 XX KW AC1 gene; AC2 gene; AC3 gene; AC4 gene; AV1 gene; AV2 gene; coat protein. XX OS East African cassava mosaic virus OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RP 1-2799 RA Stanley J.; RT ; RL Submitted (16-MAR-2007) to the INSDC. RL Stanley J., Department of Disease and Stress Biology, John Innes Centre, RL Colney Lane, Norwich NR74 7UH, UNITED KINGDOM. XX RN [2] RA Sserubombwe W.S., Briddon R.W., Baguma Y.K., Ssemakula G.N., Bull S., RA Bua A., Otim-Nape G.W., Stanley J.; RT "Diversity of begomoviruses associated with mosaic disease of cultivated RT cassava (Manihot esculenta Cranz) and its wild relative (Manihot glaziovii) RT in Uganda."; RL Unpublished. XX DR MD5; 9c216e0d4dad3467a13993c98fe3c4bd. XX FH Key Location/Qualifiers FH FT source 1..2799 FT /organism="East African cassava mosaic virus" FT /mol_type="genomic DNA" FT /country="Uganda:Arua" FT /clone="24-ug175-8" FT /db_xref="taxon:62079" FT CDS 172..528 FT /gene="AV2" FT /product="AV2 protein" FT /db_xref="GOA:A0A3S5ZP69" FT /db_xref="InterPro:IPR002511" FT /db_xref="InterPro:IPR005159" FT /db_xref="UniProtKB/TrEMBL:A0A3S5ZP69" FT /protein_id="CAM59357.1" FT /translation="MWDPLVNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL FT IGVLRCKNYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS FT ETQDVQNVSKPRCP" FT CDS 332..1105 FT /gene="AV1" FT /product="coat protein" FT /db_xref="GOA:A0A3S5ZPD2" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="UniProtKB/TrEMBL:A0A3S5ZPD2" FT /protein_id="CAM59358.1" FT /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA FT NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSFEQRDDVKHLGICKVISDVTRGPGLTHR FT VGKRFCIKSIYILGKIWMDENIKKQNHTNNVMFYLLRDRRPYGNAPQDFGQIFNMFDNE FT PSTATIKNDLRDRFQVLRKFHATVVGGPSGMKEQALVKRFYKLNHHVTYNHQEAGKYEN FT HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN" FT CDS complement(1102..1506) FT /gene="AC3" FT /product="AC3 protein" FT /db_xref="GOA:A6H2V4" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:A6H2V4" FT /protein_id="CAM59359.1" FT /translation="MDSRTGELITAPQARNGVFTWDITNPLYFEITDHDKRPGNMNHDI FT ITLQIRFNHNLRKALEIHKCFLNFKVWTTLRPQTGRFLRVFRYQVLKYLDMIGVISINT FT VLQAVDHVVYDVLLNTLQVTEQHAIKFNLY" FT CDS complement(1247..1654) FT /gene="AC2" FT /product="AC2 protein" FT /db_xref="GOA:Q2UZP4" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:Q2UZP4" FT /protein_id="CAM59360.1" FT /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI FT DCINHGFSHRGTHHCASSKEWRFYLGHNKSPLFRNHRPRQEAREHEPRHHHTPDTVQPQ FT PSEGIGDSQVFSQLQGLDDLTASDWSFLKSI" FT CDS complement(1563..2642) FT /gene="AC1" FT /product="AC1 protein" FT /db_xref="GOA:A6H2V6" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:A6H2V6" FT /protein_id="CAM59361.1" FT /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLQALSYPSNIKFIR FT VCRELHQDGVPHLHVLIQFEGKFQCTNPRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI FT EKGGEFLDAGLFQVDARSARGEGQHLAQVYADALNASSKSEALQIIKEKDPKSFFLQFH FT NISANADRIFQAPPQTYVSPFLSSSFTQVPEDIEVWISENICSPAARPWRPISIVLEGD FT SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW FT QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSPKAWALKNATFITLHEPL FT FSSAHQSPTPHSEDQGRQT" FT CDS complement(2252..2485) FT /gene="AC4" FT /product="AC4 protein" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:Q2UZP2" FT /protein_id="CAM59362.1" FT /translation="MGCLISMFSSNSKASSNVPTRDSSISFPHPDQHISIRTFRELNHR FT PMSKLTLKREGNFLTLDFSKSMPEVQGERASI" XX SQ Sequence 2799 BP; 731 A; 549 C; 734 G; 785 T; 0 other; accggatggc cgcgcccgaa aaagcaggtg gatcccacaa tgaccgcgcc cgtgaaagaa 60 agtggtccct gcgcacttgt tttggtcggc cagtcatatt cacgcgtgaa agtctagata 120 tttgttgttt gtctttatag acttcgtcgc gaagtagtag agcgcgtcaa catgtgggat 180 ccattggtga acgattttcc tgaaaccgtt cacggtttcc gttctatgct tgctgttaaa 240 tacctgttac atctggaaca ggaatacgat cgcggtactg tcggggctga gtatatacgg 300 gatctaatag gggttctacg gtgtaagaat tatgtcgaag cgaccaggag atataataat 360 ctcaacaccc gtatccaagg tgcggaggag gctgaacttc gacagcccat acacgaaccg 420 tgttgttgcc cccactgtcc gcgtcaccag aagcaaaata tgggccaaca ggcccatgta 480 tcggaaaccc aagatgtaca gaatgtatcg aagcccagat gtccctaagg gctgtgaagg 540 cccatgtaag gtccagtcgt ttgagcagag ggatgatgtg aagcaccttg gtatctgtaa 600 ggtgattagt gatgtgacgc gtgggcctgg gctgacacac agggtcggaa agaggttttg 660 tatcaagtcc atttacattc ttggtaagat ctggatggat gaaaatatta agaagcagaa 720 tcacactaat aatgtgatgt tttacctgct tagggataga aggccgtatg gcaatgcgcc 780 ccaagacttt gggcagatat ttaacatgtt tgataatgag cccagtacgg caacaattaa 840 gaacgatttg agggataggt ttcaggtgtt gaggaaattt catgccaccg ttgttggtgg 900 tccatctggc atgaaggagc aggcgttggt gaaaaggttt tacaagctga atcatcacgt 960 gacatataat catcaggagg cagggaagta tgagaatcac acagagaatg cgttgttatt 1020 gtatatggca tgtacacatg cctcgaatcc tgtgtatgct acgctgaaaa tacgcatcta 1080 tttttatgat gcagtgacaa attaataaag gttgaatttt attgcatgtt gctccgtaac 1140 ttggagtgtg tttagtaata catcgtacac aacatgatca acagcttgaa ggacagtgtt 1200 aatggaaata acgcctatca tatctaaata cttgagcact tgatatctaa atactcttaa 1260 gaaacgacca gtctgaggcc gtaaggtcgt ccagaccttg aagttgagaa aacacttgtg 1320 aatctccaat gccttccgaa ggttgtggtt gaaccgtatc tggagtgtga tgatgtcgtg 1380 gttcatgttc cctggcctct tgtcgtggtc ggtgatttcg aaatagaggg gatttgttat 1440 gtcccaggta aaaacgccat tccttgcttg aggcgcagtg atgagttccc ctgtgcgaga 1500 atccatggtt gatgcagtcg atatggagat agaacgagca gccgcattcg aggtctaccc 1560 gcctacgtct gacggccctg gtcttcgctg tgcggtgttg gactttgatg ggcactagag 1620 aacaatggct cgtggagggt gatgaaggtg gcattcttta aagcccaggc tttaggggat 1680 tggttctttt cctcgtccag aaactcttta tatgatgatg tcggtcctgg attgcagagg 1740 aagatagtgg gaatgccgcc tttaatttga attggcttcc cgtattttgt attgctttgc 1800 cagtctcttt gggcccccat gaattctttg aagtgtttga ggtagtgggg gtcgacgtca 1860 tcaatgacgt tgtaccaggc gtcgtttgaa tataccttgg gagacagatc caggtgtcca 1920 caaagataat tatggggtcc cagtgaacga gcccacattg ttttgccggt tcggctatca 1980 ccttctagaa caatactgat cggtctccat ggccgcgcag cgggactgca tatattttct 2040 gatatccata cctctatgtc ctctgggact tgtgtaaaag atgatgataa gaacggacta 2100 acgtaagttt gtggcggagc ctggaagatt ctatctgcgt tagcagatat gttatggaac 2160 tgtaaaaaaa aggactttgg atctttttct ttaataattt gaagagcctc ggatttagac 2220 gaagcattca acgcgtctgc atatacctga gctaaatgct ggccctctcc ccttgcactt 2280 ctggcatcga cttggaaaag tccagcgtca agaaattccc ctcccttttc aatgtaagct 2340 ttgacatcgg acgatgattt agctccctga atgttcggat ggaaatgtgt tgatcgggat 2400 ggggaaatga gatcgaagaa tctcgggttg gtacattgga acttgccttc gaattggatg 2460 agaacatgga gatgaggcac cccatcctga tgtagttctc tgcaaaccct aatgaatttg 2520 atattcgacg ggtacgaaag ggcttgtaat tgggaaaggg cctcttcttt tgttaacgag 2580 catcggggat aggttatgaa ataatttttg gcatttattt gaaaacgacc ggctcttggc 2640 atattggctg tcgttttgga tcgggggaca ctcaaaactc tgggggaacg gtggaatggg 2700 gggcaatata tatgatgtcc cccaatggca tatgtgtaaa taggtagatg tccattcaaa 2760 atttgaattc cgaataatgg cggccatccg attaatatt 2799 //