ID   JF909139; SV 1; circular; genomic DNA; STD; VRL; 2800 BP.
XX
AC   JF909139;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic virus-Kenya isolate Comoros:Moheli:MO15AZ1:2009
DE   segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus-Kenya
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2800
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2800
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; de4f2e6ac2f374bb06f86214058e86d6.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2800
FT                   /organism="East African cassava mosaic virus-Kenya"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Moheli:MO15AZ1:2009"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Moheli"
FT                   /lat_lon="12.3 S 43.64 E"
FT                   /collection_date="2009"
FT                   /db_xref="taxon:1229189"
FT   gene            173..538
FT                   /gene="AV2"
FT   CDS             173..538
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /db_xref="GOA:I6LYB1"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYB1"
FT                   /protein_id="AEG90168.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAQYVQNVSKPRCSEGL"
FT   gene            333..1106
FT                   /gene="AV1"
FT   CDS             333..1106
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:I6LYB0"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYB0"
FT                   /protein_id="AEG90167.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPNMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGQSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYTTVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1103..1507)
FT                   /gene="AC3"
FT   CDS             complement(1103..1507)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:I6LYB4"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYB4"
FT                   /protein_id="AEG90171.1"
FT                   /translation="MDSRTGELITAPQATNGVFTWEITNPLYFEITNHDKRPGNMNHDI
FT                   ITLQIRFNHNLRKALGIHKCFLNFKVWTTLRPQTGRFLRVFRYQVLKYLDMIGVISVNT
FT                   VLQTVDHVLYDVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1248..1655)
FT                   /gene="AC2"
FT   CDS             complement(1248..1655)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:I6LYB3"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYB3"
FT                   /protein_id="AEG90170.1"
FT                   /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSNEWRFYLGNNKSPLFRNHQPRQAAREHEPRHHHTPDTVQPQ
FT                   PTEGTGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1564..2643)
FT                   /gene="AC1"
FT   CDS             complement(1564..2643)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:I6LYB2"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYB2"
FT                   /protein_id="AEG90169.1"
FT                   /translation="MPRAGRFQINARNYFITYPRCSLTKEEALSQLKALSYPTNIKFIR
FT                   VCRELHQDGVPHLHVLIQFEGKFQCTNPRFFDLISTSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKTEALQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPAQTYVSPFLSSSFTQVPEDIEVWVSENICSPAARPWRPISIVLEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSEDQGRQT"
FT   gene            complement(2253..2486)
FT                   /gene="AC4"
FT   CDS             complement(2253..2486)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYH5"
FT                   /protein_id="AEG90172.1"
FT                   /translation="MGCLISMFSSSSKGSSNVPTQDSSISFPHPDPHISIRTFRELNHR
FT                   PMSKLILKREGNFLTMEFSRSMPEVQGGRASI"
XX
SQ   Sequence 2800 BP; 731 A; 561 C; 708 G; 800 T; 0 other;
     accggatggc cgcgcccgaa aaagcaggtg gatcccacat gttgacgcgc ccgttaaaga        60
     aagtggtccc cgcgcacttg tgttggtcgg ccagtcatat tcacgcgtga aagtctagat       120
     atttgttgtt tgtctttata gacttcgtcg cgaagtagtg gagcgcgtca acatgtggga       180
     tccattgttg aacgatttcc ccgaaaccgt tcacggtttc cgttctatgc ttgctgttaa       240
     atacctgtta catctggaac aggaatacga tcgcggtact gtcggggcgg agtatatacg       300
     tgatttaata ggggttctac ggtgtaagag ttatgtcgaa gcgaccagga gatataataa       360
     tctcaacacc cgtatccaag gtgcggagga ggctgaactt cgacagccca tacacgaacc       420
     gtgttgttgc ccccactgtc cgcgtcacca gaagcaaaat atgggccaac aggcccatgt       480
     atcggaagcc caatatgtac agaatgtatc gaagcccaga tgttccgaag ggctgtgaag       540
     gcccatgtaa ggttcagtcc tatgaacaga gggatgatgt gaagcacact ggtatggtcc       600
     gatgtgtcag tgatgttact cgtggatcag gcattaccca tagagtcggg aagaggtttt       660
     gtgtgaagtc catatatata ttgggcaaga tttggatgga tgagaatatc aagaagcaaa       720
     atcatacgaa ccatgttatg ttcttccttg ttcgagatag aaggccttat ggtcagagtc       780
     ctcaagattt tggacaagtg ttcaacatgt ttgataatga acctactacg gcaactgtga       840
     agaatgatct tagggaccga tatcaggtgt tacgtaaatt ctatacgact gttgttggtg       900
     gaccctctgg gatgaaggaa caagctctgg ttaagaggtt ttttaggatc aataatcatg       960
     tagtgtataa tcatcaggaa caggccaagt atgagaatca tactgagaat gcgttgttat      1020
     tgtatatggc atgtacacat gcctcgaatc ctgtgtacgc tacgctgaaa atacgcatct      1080
     atttctatga tgcagtgaca aattaataaa ggttgaattt tattgcatgt tgctccgtaa      1140
     cttggagcgt gtttagtaat acatcgtaca gaacatgatc aacagtctga agtacagtgt      1200
     taacggaaat aacgcctatc atatctaaat acttgagcac ttgatatcta aatactctta      1260
     agaaacgacc agtctgaggc cgtaaggtcg tccagacctt gaagttgaga aaacacttgt      1320
     gaatccccag tgccttccgt aggttgtggt tgaaccgtat ctggagtgtg atgatgtcgt      1380
     ggttcatgtt ccctggccgc ttgtcgtggt tggtgatttc gaaatagagg ggatttgtta      1440
     tttcccaggt aaaaacgcca ttcgttgctt gaggcgcagt gatgagttcc cctgtgcgag      1500
     aatccatggt tgatgcagtc gatatggaga tagaacgagc aaccgcattc gaggtctacc      1560
     cgcctacgtc tgacggccct ggtcttcgct gtgcggtgtt ggactttgat gggcactaga      1620
     gaacaatggc tcgtggaggg tgatgaaggt ggcattcttt aaagcccagg ctttaaggga      1680
     ctggttcttt tcctcgtcca gaaactcttt atatgatgat gttggtccag gattgcagag      1740
     gaagatagtg ggaatgccgc ctttaatctg aattggcttc ccgtactttg tattgctttg      1800
     ccagtccctt tgggccccca tgaattcttt gaaatgcttt agatagtgcg ggtctacgtc      1860
     gtcaatgacg ttgtaccatg cgtcgtttga atataccttt ggagacagat ccaggtgtcc      1920
     acatagataa ttatggggtc ccagtgaacg agcccacatg gttttacctg ttcggctatc      1980
     accttcgaga acaatactga tcggtctcca tggccgcgca gcgggactgc atatattttc      2040
     tgatacccat acttctatgt cttcggggac ttgtgtaaat gatgatgata agaacggact      2100
     aacataagtt tgggccggag cctggaagat tctatccgcg ttagcagata tgttatggaa      2160
     ctgtaaaaaa aaggactttg gatctttttc tttaataatc tgaagagctt ctgttttaga      2220
     agaagcattc aacgcgtcgg catatacctg agctaaatgc tggccctccc cccttgcact      2280
     tctggcatcg acctggaaaa ttccatcgtc aagaaattcc cctccctttt caatataagc      2340
     tttgacatcg gacgatgatt tagctccctg aatgttcgga tggaaatgtg tggatctgga      2400
     tgtggaaatg agatcgaaga atcttgggtt ggtacattgg aacttccctt cgaactggat      2460
     gagaacatgg agatgaggca ccccatcctg atgtagttct cggcaaaccc tgatgaattt      2520
     gatattcgtc gggtaagaaa gggcttttaa ttgggaaagt gcctcttcct ttgttaatga      2580
     gcatcgggga taggtaatga aataatttct ggcatttatc tgaaaacgac cggctctcgg      2640
     catatttgct gtcgttttgt atcggtggac actcaaaact ccaggggaac ggtggaatgg      2700
     tggacattat ataggatgtc ccccaatggc attcgtgtaa ataggtagac ttccatttca      2760
     aatttgaatg tcgaatattg gcggccatcc gattaatatt                            2800
//