ID AM502328; SV 1; circular; genomic DNA; STD; VRL; 2799 BP. XX AC AM502328; XX DT 17-JUN-2007 (Rel. 92, Created) DT 17-JUN-2007 (Rel. 92, Last updated, Version 1) XX DE East African cassava mosaic virus DNA-A, clone 32-ug441 XX KW AC1 gene; AC2 gene; AC3 gene; AC4 gene; AV1 gene; AV2 gene; coat protein. XX OS East African cassava mosaic virus OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RP 1-2799 RA Stanley J.; RT ; RL Submitted (16-MAR-2007) to the INSDC. RL Stanley J., Department of Disease and Stress Biology, John Innes Centre, RL Colney Lane, Norwich NR74 7UH, UNITED KINGDOM. XX RN [2] RA Sserubombwe W.S., Briddon R.W., Baguma Y.K., Ssemakula G.N., Bull S., RA Bua A., Otim-Nape G.W., Stanley J.; RT "Diversity of begomoviruses associated with mosaic disease of cultivated RT cassava (Manihot esculenta Cranz) and its wild relative (Manihot glaziovii) RT in Uganda."; RL Unpublished. XX DR MD5; ad76e1164931f2fb6ebf40b3576d5268. XX FH Key Location/Qualifiers FH FT source 1..2799 FT /organism="East African cassava mosaic virus" FT /mol_type="genomic DNA" FT /country="Uganda:Iganga" FT /clone="32-ug441" FT /db_xref="taxon:62079" FT CDS 172..528 FT /gene="AV2" FT /product="AV2 protein" FT /db_xref="GOA:A0A3S5ZP69" FT /db_xref="InterPro:IPR002511" FT /db_xref="InterPro:IPR005159" FT /db_xref="UniProtKB/TrEMBL:A0A3S5ZP69" FT /protein_id="CAM59363.1" FT /translation="MWDPLVNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL FT IGVLRCKNYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS FT ETQDVQNVSKPRCP" FT CDS 332..1105 FT /gene="AV1" FT /product="coat protein" FT /db_xref="GOA:A0A3S5ZPD2" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="UniProtKB/TrEMBL:A0A3S5ZPD2" FT /protein_id="CAM59364.1" FT /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA FT NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSFEQRDDVKHLGICKVISDVTRGPGLTHR FT VGKRFCIKSIYILGKIWMDENIKKQNHTNNVMFYLLRDRRPYGNAPQDFGQIFNMFDNE FT PSTATIKNDLRDRFQVLRKFHATVVGGPSGMKEQALVKRFYKLNHHVTYNHQEAGKYEN FT HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN" FT CDS complement(1102..1506) FT /gene="AC3" FT /product="AC3 protein" FT /db_xref="GOA:A6H2W0" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:A6H2W0" FT /protein_id="CAM59365.1" FT /translation="MDSRTGELITAPQARNGVFTWDITNPLYFEITDHDKRPGNMNHDI FT ITLQIRFNHNLRKALGIHKCFLNFKIWTTLRPQTGRFLRVFRYQVLKYLDMIGVISINT FT VLQAVVHVVYDVLLNTLQVTEQHAIKFNLY" FT CDS complement(1247..1654) FT /gene="AC2" FT /product="AC2 protein" FT /db_xref="GOA:A6H2W1" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:A6H2W1" FT /protein_id="CAM59366.1" FT /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI FT DCINHGFSHRGTHHCASSKEWRFYLGHNKSPLFRNHRPRQEAREHEPRHHHTPDTVQPQ FT PSEGIGDSQVFSQLQDLDDLTASDWSFLKSI" FT CDS complement(1569..2642) FT /gene="AC1" FT /product="AC1 protein" FT /db_xref="GOA:A6H2W2" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:A6H2W2" FT /protein_id="CAM59367.1" FT /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLQALSYPTNIKFIR FT VCRELHQDGVPHLHVLIQFENKFQCTNPRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI FT EKGGDFLDAGLFQVDARSARGEGQHLAQVYADALNASSKSEALQIIKEKDPKSFFLQFH FT NISANADRIFQAPPQTYVSPFLSSSFTQVPEDIEVWVSDNICSPAARPWRPISIVLEGD FT SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW FT QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSLKAWALKNATFITLHEPL FT FSSAHQSPTPHSEDQGR" FT CDS complement(2252..2485) FT /gene="AC4" FT /product="AC4 protein" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:A6H2W3" FT /protein_id="CAM59368.1" FT /translation="MGCLISMFSSNSKTSSNVPTRDSSISFPHPDQHISIRTFRALNHR FT PMSKLTLKREGIFLTLDFSKSMPEVQEGRASI" XX SQ Sequence 2799 BP; 738 A; 545 C; 734 G; 782 T; 0 other; accggatggc cgcgcccgaa aaagcaggtt gaccccacaa tgaccgcgcc cgtgaaagaa 60 agtggtccct gcgcatttgt tttggtcggc cagtcatatt cacgcgtgaa agtctagata 120 tttgttgttt gtctttatag acttcgtcgc gaagtagtag agcgcgtcaa catgtgggat 180 ccattggtga acgattttcc cgaaaccgtt cacggtttcc gttctatgct tgctgttaaa 240 tacctgttac atctggaaca ggaatacgat cgcggtactg tcggggctga gtatatacgg 300 gatctaatag gggttctacg gtgtaagaat tatgtcgaag cgaccaggag atataataat 360 ctcaacaccc gtatccaagg tgcggaggag gctgaacttc gacagcccat acacgaaccg 420 tgttgttgcc cccactgtcc gcgtcaccag aagcaaaata tgggccaaca ggcccatgta 480 tcggaaaccc aagatgtaca gaatgtatcg aagcccagat gtccctaagg gctgtgaagg 540 cccatgtaag gtccagtcgt ttgagcagag ggatgatgtg aagcaccttg gtatctgtaa 600 ggtgataagt gatgtgacgc gtgggcctgg gctgacacac agggtcggaa agaggttttg 660 tatcaagtcc atttacattc ttggtaagat ctggatggat gaaaatatta agaagcagaa 720 tcacactaat aatgtgatgt tttacctgct tagggataga aggccgtatg gcaatgcgcc 780 ccaagacttt gggcagatat ttaacatgtt tgataatgag cccagtactg caacaattaa 840 gaacgatttg agggataggt ttcaggtgtt gaggaaattt catgccactg ttgttggggg 900 tccatctggc atgaaggagc aggcgttggt gaaaaggttt tacaagctga atcatcacgt 960 gacatataat catcaggagg cagggaagta tgagaatcac acagagaatg cgttgttatt 1020 gtatatggca tgtacacatg cctcgaatcc tgtgtatgct acgctgaaaa tacgcatcta 1080 tttttatgat gcagtgacaa attaataaag gttgaatttt attgcatgtt gctccgtaac 1140 ttggagtgtg tttagtaata catcgtacac aacatgaaca acagcttgaa ggacagtgtt 1200 aatggaaata acgcctatca tatctaaata cttgagcact tgatatctaa atactcttaa 1260 gaaacgacca gtctgaggcc gtaaggtcgt ccagatcttg aagttgagaa aacacttgtg 1320 aatccccaat gccttccgaa ggttgtggtt gaaccgtatc tggagtgtga tgatgtcgtg 1380 gttcatgttc cctggcctct tgtcgtggtc ggtgatttcg aaatagaggg gatttgttat 1440 gtcccaggta aaaacgccat tccttgcttg aggcgcagtg atgagttccc ctgtgcgaga 1500 atccatggtt gatgcagtcg atatggagat agaacgagca gccgcattcg aggtctaccc 1560 gcctacgtct aacggccctg gtcttcgctg tgcggtgttg gactttgatg ggcactagag 1620 aacaatggct cgtggagggt gatgaaggtg gcattcttta aagcccaggc tttaagggat 1680 tggttctttt cctcgtccag aaactcttta tatgatgatg ttggtcctgg attgcagagg 1740 aagatagtgg gaatgccgcc tttaatttga atgggcttcc cgtattttgt attgctttgc 1800 cagtctcgtt gggcccccat gaattctttg aagtgtttga ggtagtgggg gtcgacgtca 1860 tcaatgacgt tgtaccaggc gtcgtttgaa tataccttgg gagacagatc caggtgtcca 1920 caaagataat tatggggtcc cagtgaacga gcccacattg ttttgccggt tcggctatca 1980 ccttcgagaa caatactgat cggtctccat ggccgcgcag cgggactgca tatattatcg 2040 gatacccata cctctatgtc ttctgggact tgtgtaaaag aagatgataa gaacggacta 2100 acgtaagttt gtggcggagc ctggaagatt ctatctgcgt tagcagatat gttatggaac 2160 tgtaaaaaaa aggactttgg atctttttct ttaataattt gaagagcctc ggatttagac 2220 gaagcattca acgcgtctgc atatacctga gctaaatgct ggccctcccc tcttgcactt 2280 ctggcatcga cttggaaaag tccagcgtca agaaaatccc ctcccttttc aatgtaagct 2340 ttgacatcgg acgatgattt agcgccctga atgttcggat ggaaatgtgt tgatcgggat 2400 ggggaaatga gatcgaagaa tctcgggttg gtacattgga acttgttttc gaattggatg 2460 agaacatgga gatgaggcac cccatcctga tgtagttctc tgcaaaccct aatgaatttg 2520 atattcgtcg ggtacgaaag ggcttgtaat tgggaaaggg cctcttcttt tgttaatgag 2580 catcggggat aggttatgaa ataatttttg gcatttattt gaaaacgacc ggctcttggc 2640 atattggctg tcgttttgga atgggggaca ctcaaaactc caggggaacg gtggaatggg 2700 gggcaatata tatgatgtcc cccaatggca tatgtgtaaa taggtagact tccatttaaa 2760 atttgaattc cgaatattgg cggccatccg attaatatt 2799 //