ID   JF909133; SV 1; circular; genomic DNA; STD; VRL; 2800 BP.
XX
AC   JF909133;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic virus-Kenya isolate Comoros:Moheli:MO10BN3:2009
DE   segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus-Kenya
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2800
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2800
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; 20ece8830a2c4ca325e62dea7f8db5b5.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2800
FT                   /organism="East African cassava mosaic virus-Kenya"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Moheli:MO10BN3:2009"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Moheli"
FT                   /lat_lon="12.27 S 43.7 E"
FT                   /collection_date="2009"
FT                   /db_xref="taxon:1229189"
FT   gene            173..529
FT                   /gene="AV2"
FT   CDS             173..529
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /db_xref="GOA:I6LXD1"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXD1"
FT                   /protein_id="AEG90132.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQQYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAQDVQNVSKPRCP"
FT   gene            333..1106
FT                   /gene="AV1"
FT   CDS             333..1106
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:I6LY74"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:I6LY74"
FT                   /protein_id="AEG90131.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQALVKRFFRINNHVVYNHQEQSKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1103..1507)
FT                   /gene="AC3"
FT   CDS             complement(1103..1507)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:I6LY78"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:I6LY78"
FT                   /protein_id="AEG90135.1"
FT                   /translation="MDSRTGELITAPQATNGVFIWAITNPLYFEITNHDKRPGNMNHDI
FT                   ITLQIRFNHNLRKALEIHKCFLNFKVWTTLRPQTGRFLKVFRYQVLKYLDMIGVISINT
FT                   VLQAVDHVLYAVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1248..1655)
FT                   /gene="AC2"
FT   CDS             complement(1248..1655)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:I6LXD3"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXD3"
FT                   /protein_id="AEG90134.1"
FT                   /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSNEWRFYLGNNKSPLFRNHQPRQAAREHEPRHHHTPDTVQPQ
FT                   PPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1564..2643)
FT                   /gene="AC1"
FT   CDS             complement(1564..2643)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:I6LY76"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:I6LY76"
FT                   /protein_id="AEG90133.1"
FT                   /translation="MPRAGRFQINARNYFITYPRCSLTKEEALSQLKALSYPTNIKFIR
FT                   VCRELHQDGVPHLHVLIQFEGKFQCTNPRFFDLISTSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKTEALQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPPQIYVSPFLSSSFTQVPEDIEVWVSENICSPAARPWRPISIVLEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSEDQGRQT"
FT   gene            complement(2253..2486)
FT                   /gene="AC4"
FT   CDS             complement(2253..2486)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:I6LY79"
FT                   /protein_id="AEG90136.1"
FT                   /translation="MGCLISMFSSSSKGSSNVPTHDSSISFPHPDPHISIRTFRELNHR
FT                   PMSKLILKREGNFLTMEFSRSMPEVQGGRVSI"
XX
SQ   Sequence 2800 BP; 729 A; 560 C; 708 G; 803 T; 0 other;
     accggatggc cgcgcccgaa aagcaggtgg accccacagg atggccgcgc ccgtgaaaga        60
     aagtggtccc cgcgcacttg ttttggtcgg ccagtcatat tcacgcgtga aagtctagat       120
     atttgttgtt ggtctttata gacttcgtcg cgaagtagtg gagcgcgtca acatgtggga       180
     tccattgttg aacgatttcc ccgaaactgt tcacggtttc cgttctatgc ttgccgttaa       240
     atatctgtta catctggaac agcaatacga tcgcggtact gtcggggctg agtatatacg       300
     ggatctaata ggggttctac ggtgtaagag ttatgtcgaa gcgaccagga gatataataa       360
     tctcaacacc cgtatccaag gtgcggagga ggctgaactt cgacagccca tacacgaacc       420
     gtgttgttgc ccccactgtc cgcgtcacca gaagcaaaat atgggccaac aggcccatgt       480
     atcggaagcc caagatgtac agaatgtatc gaagcccaga tgtccctaag ggctgtgaag       540
     gcccatgtaa ggttcagtcc tatgaacaga gggatgatgt taagcacacg ggtatggttc       600
     gatgtgtcag tgatgttact cgtgggtcag gtattacgca tagagtcggg aagaggtttt       660
     gtgttaagtc catatatata ttgggcaaga tctggatgga tgagaatatc aagaagcaaa       720
     atcatacgaa ccatgttatg ttcttccttg ttcgagatag aaggccttat ggtccgagtc       780
     cgcaagattt tggacaagtg ttcaacatgt ttgataatga acctactact gcgactgtga       840
     aaaatgatct tagggaccgg tatcaggtgt tacgtaaatt ctatgcgact gttgttggtg       900
     gaccctctgg gatgaaggaa caagctctgg ttaagaggtt ttttaggatc aataatcatg       960
     tagtgtataa tcatcaggaa cagtccaagt atgagaatca tactgagaat gcgttgttat      1020
     tgtatatggc atgtacacat gcctcgaatc ctgtgtacgc gacgctgaaa atacgcatct      1080
     atttctatga tgcagtgaca aattaataaa ggttgaattt tattgcatgt tgctccgtaa      1140
     cttggagcgt gtttagtaat acagcgtaca gaacatgatc aacagcctga agtacagtgt      1200
     taatggaaat aacgcctatc atatctaaat acttgagcac ttgatatcta aatactttta      1260
     agaaacgacc agtctgaggc cgtaaggtcg tccagacctt gaagttgaga aaacacttgt      1320
     gaatctccaa tgccttccgg aggttgtggt tgaaccgtat ctggagtgtg atgatgtcgt      1380
     ggttcatgtt ccctggccgc ttgtcgtggt tggtgatttc gaaatagagg ggatttgtta      1440
     ttgcccagat aaaaacgcca ttcgttgctt gaggcgcagt gatgagttcc cctgtgcgag      1500
     aatccatgat tgatgcagtc gatatggaga tagaacgagc atccgcattc gaggtctacc      1560
     cgcctacgtc tgacggccct ggtcttcgct gtgcggtgtt ggactttgat gggcactaga      1620
     gaacaatggc tcgtggaggg tgatgaaggt ggcattcttt aaagcccagg ctttaaggga      1680
     ctggttcttt tcctcgtcca gaaactcttt atatgatgat gttggtccag gattgcagag      1740
     gaagatagtg ggaatgccgc ctttaatttg aattggcttc ccgtactttg tattgctttg      1800
     ccagtctctt tgggccccca tgaattcttt gaaatgcttt agatagtgcg ggtctacgtc      1860
     gtcaatgacg ttgtaccatg cgtcgtttga atataccttt ggagacagat ccaagtgtcc      1920
     acatagataa ttatggggtc ccagtgaacg agcccacatt gttttccctg ttcggctatc      1980
     accttcgaga acaatactga tcggtctcca tggccgcgca gcgggactgc atatattttc      2040
     tgatacccat acttctatgt cttcggggac ttgtgtaaat gatgaggata agaacggact      2100
     aacataaatt tggggcggag cctggaagat tctatccgcg ttagcagata tgttatggaa      2160
     ctgtaaaaaa aaggactttg gatctttttc tttaataatc tgaagagctt ctgttttaga      2220
     agaagcattc aacgcgtctg catatacctg agctaaatgc tgaccctccc cccttgcact      2280
     tctggcatcg acctggaaaa ttccatcgtc aagaaattcc cctccctttt caatataagc      2340
     tttgacatcg gacgatgatt tagctccctg aatgttcgga tggaaatgtg tggatctgga      2400
     tgtggaaatg agatcgaaga atcgtgggtt ggtacattgg aacttccctt cgaactggat      2460
     gagaacatgg agatgaggca ccccatcctg atgtagttct cggcaaaccc taatgaattt      2520
     gatattcgtc gggtaagaca gtgcttttaa ttgggaaagt gcctcttcct ttgttaatga      2580
     gcatctggga taggttatga aataatttct ggcatttatt tgaaaacgac cggctctcgg      2640
     catatttgct gtcgttttgt atcggtggac actcaaaact ccaggggaac ggtggaatgg      2700
     tggacattat ataggatgtc ccccaatggc attcgtgtaa ataggtagac ttccatttca      2760
     aatttgaatg tcgaatattg gcggccatcc gattaatatt                            2800
//