ID AM502337; SV 1; circular; genomic DNA; STD; VRL; 2798 BP. XX AC AM502337; XX DT 17-JUN-2007 (Rel. 92, Created) DT 17-JUN-2007 (Rel. 92, Last updated, Version 1) XX DE East African cassava mosaic virus DNA-A, clone 28-ug289-5 XX KW AC1 gene; AC2 gene; AC3 gene; AC4 gene; AV1 gene; AV2 gene; coat protein. XX OS East African cassava mosaic virus OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RP 1-2798 RA Stanley J.; RT ; RL Submitted (16-MAR-2007) to the INSDC. RL Stanley J., Department of Disease and Stress Biology, John Innes Centre, RL Colney Lane, Norwich NR74 7UH, UNITED KINGDOM. XX RN [2] RA Sserubombwe W.S., Briddon R.W., Baguma Y.K., Ssemakula G.N., Bull S., RA Bua A., Otim-Nape G.W., Stanley J.; RT "Diversity of begomoviruses associated with mosaic disease of cultivated RT cassava (Manihot esculenta Cranz) and its wild relative (Manihot glaziovii) RT in Uganda."; RL Unpublished. XX DR MD5; 3470bd919ece2114c6e2e159b96cd1ee. XX FH Key Location/Qualifiers FH FT source 1..2798 FT /organism="East African cassava mosaic virus" FT /mol_type="genomic DNA" FT /country="Uganda:Bushenyi" FT /clone="28-ug289-5" FT /db_xref="taxon:62079" FT CDS 172..528 FT /gene="AV2" FT /product="AV2 protein" FT /db_xref="GOA:A0A3S5ZP69" FT /db_xref="InterPro:IPR002511" FT /db_xref="InterPro:IPR005159" FT /db_xref="UniProtKB/TrEMBL:A0A3S5ZP69" FT /protein_id="CAM59411.1" FT /translation="MWDPLVNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL FT IGVLRCKNYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS FT ETQDVQNVSKPRCP" FT CDS 332..1105 FT /gene="AV1" FT /product="coat protein" FT /db_xref="GOA:A0A3S5ZPD2" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="UniProtKB/TrEMBL:A0A3S5ZPD2" FT /protein_id="CAM59412.1" FT /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA FT NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSFEQRDDVKHLGICKVISDVTRGPGLTHR FT VGKRFCIKSIYILGKIWMDENIKKQNHTNNVMFYLLRDRRPYGNAPQDFGQIFNMFDNE FT PSTATIKNDLRDRFQVLRKFHATVVGGPSGMKEQALVKRFYKLNHHVTYNHQEAGKYEN FT HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN" FT CDS complement(1102..1506) FT /gene="AC3" FT /product="AC3 protein" FT /db_xref="GOA:Q2UZP5" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:Q2UZP5" FT /protein_id="CAM59413.1" FT /translation="MDSRTGELITAPQARNGVFTWDITNPLYFEITDHDKRPGNMNHDI FT ITLQIRFNHNLRKALGIHKCFLNFKVWTTLRPQTGRFLRVFRYQVLKYLDMIGVISINT FT VLQAVDHVVYDVLLNTLQVTEQHAIKFNLY" FT CDS complement(1247..1654) FT /gene="AC2" FT /product="AC2 protein" FT /db_xref="GOA:A6H315" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:A6H315" FT /protein_id="CAM59414.1" FT /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAVRRRRVDLECGCFLYLHI FT DCINHGFSHRGTHHCASSKEWRFYLGHNKSPLFRNHRPRQEAREHEPRHHHTPDTVQPQ FT PSEGIGDSQVFSQLQGLDDLTASDWSFLKSI" FT CDS complement(1658..2641) FT /gene="AC1" FT /product="AC1 protein" FT /db_xref="GOA:A6H316" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:A6H316" FT /protein_id="CAM59415.1" FT /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLQALSCPTNIKFIR FT VCRELHQDAVPHLHVLIQFEGKFQCTNPRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI FT EKGGEFLDAGLFQVDARSARGEGQHLAQVYADALNASSKSEALQIIKEKDPKSFFLQFH FT NISANADRIFQAPPQTYVSPFLSSSFTQVPEDIEVWVSENICSPAARPWRPISIVLEGD FT SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHTSNTSKNSWGPKETG FT KAIQNTGSQFKLKAAFPLSSSAIQDQHHHIKSFLTRKRTNPLKPGL" FT CDS complement(2251..2484) FT /gene="AC4" FT /product="AC4 protein" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:A6H317" FT /protein_id="CAM59416.1" FT /translation="MRYLISMFSSNSKASSNVPTRDSSISFPHPDQHISIRTFRELNHR FT PMSKLTLKREGNFLTLDFSKSMPEVQGGRDST" XX SQ Sequence 2798 BP; 732 A; 551 C; 734 G; 781 T; 0 other; accggatggc cgcgcccgaa aaagcaggtg gaccccacaa tgaccgcgcc cgtgaaagaa 60 agtggtccct gcgcacttgg tttggtcggc cagtcatatt cacgcgtgaa agtctagata 120 tttgttgttt gtctttatag acttcgtcgc gaagtagtag agcgcgtcaa catgtgggat 180 ccattggtga acgattttcc cgaaaccgtt cacggtttcc gttctatgct tgctgttaaa 240 tacctgttac atctggaaca ggaatacgat cgcggtactg tcggggctga gtatatacgg 300 gatctaatag gggttctacg ttgtaagaat tatgtcgaag cgaccaggag atataataat 360 ctcaacaccc gtatccaagg tgcggaggag gctgaacttc gacagcccat acacgaaccg 420 tgttgttgcc cccactgtcc gcgtcaccag aagcaaaata tgggccaaca ggcccatgta 480 tcggaaaccc aagatgtaca gaatgtatcg aagcccagat gtccctaagg gctgtgaagg 540 cccatgtaag gtccagtcgt ttgagcaaag ggatgatgtg aagcaccttg gtatctgtaa 600 ggtgattagt gatgtgacgc gtgggcctgg gctgacacac agggtcggaa agaggttttg 660 tatcaagtcc atttacattc ttggtaagat ctggatggat gaaaatatta agaagcagaa 720 tcacactaat aatgtgatgt tttacctgct tagggataga aggccgtatg gcaatgcgcc 780 ccaagacttt gggcagatat ttaacatgtt tgataatgag cccagtactg caacaattaa 840 gaacgatttg agggataggt ttcaggtgtt gaggaaattt catgccactg ttgttggtgg 900 tccatctggc atgaaggagc aggcgttggt gaaaaggttt tacaagctga atcatcacgt 960 gacatataat catcaggagg cagggaagta tgagaatcac acagagaatg cgttgttatt 1020 gtatatggca tgtacacatg cctcgaatcc tgtgtatgct acgctgaaaa tacgcatcta 1080 tttttatgat gcagtgacaa attaataaag gttgaatttt attgcatgtt gctccgtaac 1140 ttggagtgtg tttagtaata catcgtacac aacatgatca acagcctgaa ggacagtgtt 1200 aatggaaata acgcctatca tatctaaata cttgagcact tgatatctaa atactcttaa 1260 gaaacgacca gtctgaggcc gtaaggtcgt ccagaccttg aagttgagaa aacacttgtg 1320 aatccccaat gccttccgaa ggttgtggtt gaaccgtatc tggagtgtga tgatgtcgtg 1380 gttcatgttc cctggcctct tgtcgtggtc ggtgatttcg aaatagaggg gatttgttat 1440 gtcccaggta aaaacgccat tccttgcttg aggcgcagtg atgagttccc cggtgcgaga 1500 atccatggtt gatgcagtcg atatggagat agagaaagca gccgcattcg aggtctaccc 1560 gcctacgtct gacggccctg gtcttcgctg tgcggtgttg gactttgatg ggcactagag 1620 aacaatggct cgtggagggt gatgaaggtg gcattcttta aagcccaggc tttaagggat 1680 tggttctttt cctcgtcaag aaactcttta tatgatgatg ttggtcctgg attgcagagg 1740 aagatagtgg gaatgccgcc tttaatttga attggcttcc cgtattttgt attgctttgc 1800 cagtctcttt gggcccccat gaattctttg aagtgtttga ggtgtggggg tcgacgtcat 1860 caatgacgtt gtaccaggcg tcgtttgaat ataccttggg agacagatcc aggtgtccac 1920 aaagataatt atggggtccc agtgaacgag cccacattgt tttgccggtt cggctatcac 1980 cttctagaac aatactgatc ggtctccatg gccgcgcagc gggactgcat atattttcgg 2040 atacccatac ctctatgtcc tctgggactt gtgtaaaaga ggatgataag aacggactaa 2100 cgtaagtttg tggcggagcc tggaagattc tatctgcgtt agcagatatg ttatggaact 2160 gtaaaaaaaa ggactttgga tctttttctt taataatttg aagagcctcg gatttagacg 2220 aagcattcaa cgcgtctgca tatacctgag ctaagtgctg tccctccccc cttgcacttc 2280 tggcatcgac ttggaaaagt ccagcgtcaa gaaattcccc tcccttttca atgtaagctt 2340 tgacatcgga cgatgattta gctccctgaa tgttcggatg gaaatgtgtt gatcgggatg 2400 gggaaatgag atcgaagaat ctcgggttgg tacattggaa cttgccttcg aattggatga 2460 gaacatggag atgaggtacc gcatcctgat gtagttctct gcaaacccta atgaatttga 2520 tattcgtcgg gcacgaaagg gcttgtaatt gggaaagggc ctcttctttg gttaatgagc 2580 atcggggata ggttatgaaa taatttttgg catttatttg aaaacgaccg gctcttggca 2640 taatggctgt cgttttggat cgggggacac tcaaaactcc aggggaacgg tggaatgggg 2700 ggcattatat atgatgtccc ccaatggcat atgtgtaaat aggtagactt ccattcaaaa 2760 tttgaattcc gaatattggc ggccatccga ttaatatt 2798 //