ID   JF909127; SV 1; circular; genomic DNA; STD; VRL; 2798 BP.
XX
AC   JF909127;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic Kenya virus isolate Comoros:Moheli:MO02B00:2005
DE   segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic Kenya virus
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2798
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2798
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; 5e3d44ca7e7ef87c4c4b716b961bc9e9.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2798
FT                   /organism="East African cassava mosaic Kenya virus"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Moheli:MO02B00:2005"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Moheli"
FT                   /lat_lon="12.35 S 43.68 E"
FT                   /collection_date="2005"
FT                   /db_xref="taxon:393599"
FT   gene            173..529
FT                   /gene="AV2"
FT   CDS             173..529
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /db_xref="GOA:I6LY39"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:I6LY39"
FT                   /protein_id="AEG90096.1"
FT                   /translation="MWDPLVNDFPETVHGFPSKLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAEDVQNVSKPRCP"
FT   gene            333..995
FT                   /gene="AV1"
FT   CDS             333..995
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:I6LY38"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:I6LY38"
FT                   /protein_id="AEG90095.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGPGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQSLVKRFFRINNHVVYNHQEQAKV"
FT   gene            complement(1104..1508)
FT                   /gene="AC3"
FT   CDS             complement(1104..1508)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:I6LY42"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:I6LY42"
FT                   /protein_id="AEG90099.1"
FT                   /translation="MDSRTGELITAPQAKNGVFTWEITNPLYFDITNHDTRPGNMNHDI
FT                   ITLQIRFNHNLRKALAIHKCFLNFKVWTTLRPPTGRFLRVFRYQVLKYLDMIGVISINT
FT                   VLQAVDHVLYDVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1249..1656)
FT                   /gene="AC2"
FT   CDS             complement(1249..1656)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:I6LY41"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:I6LY41"
FT                   /protein_id="AEG90098.1"
FT                   /translation="MPPSSPSTSHCSQVPIKVQHRTAKTRALRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRHHQPRHEAREHEPRHHHTPDTFQPQ
FT                   PPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1565..2644)
FT                   /gene="AC1"
FT   CDS             complement(1565..2644)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:I6LY40"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:I6LY40"
FT                   /protein_id="AEG90097.1"
FT                   /translation="MPRAGRFSIKAKNYFLTYPKCSLSKEEALDQIRKLQTPTNKLFIK
FT                   ICRELHENGEPHLHALIQFEGKYNCTNQRFFDLISPSRSAHFHPNIQGAKSSSDVKSYL
FT                   DKDGDTIQWGEFQIDGRSARGGQQSANDAYAKALNSANKSEALNVIRELAPKDFVLQFH
FT                   NLNSNLDRIFQEPLPPYVSPFLSSSFTNVPEELEAWVSENVMGSAARPWRPSSIVIEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLEEEKNQSLKAWALKNATFVTLHEPL
FT                   FSSADQSPTPHSEDQGPQT"
FT   gene            complement(2197..2493)
FT                   /gene="AC4"
FT   CDS             complement(2197..2493)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:I6LY43"
FT                   /protein_id="AEG90100.1"
FT                   /translation="MKMGNLICMRSFSSRASTIVPTNDSSTSYPLPGPPISTQTFRELN
FT                   QAPTSSPIWTRTETPSNGASFRSTDDLLAEDNNPPMTLTPRLLTQQISRRLLM"
XX
SQ   Sequence 2798 BP; 719 A; 575 C; 736 G; 768 T; 0 other;
     accggatggc cgcgcccgaa aaacaggtgg accccaccga atggccgcgc ccgtgaaaga        60
     aagtggtccc cgcgcacgtg tttcggtcgg ccagtcatat ttacgcgtga aagtctagat       120
     atttgttgtt tgtctttata gacttcgtcg cgaagtagtg aagcgcgtca acatgtggga       180
     tccactagtg aacgatttcc ctgaaaccgt gcacggtttc ccctctaagc ttgctgttaa       240
     atacctgtta catctggaac aggaatacga tcgcggtact gtcggggctg agtatatccg       300
     ggatctaata ggggttctac ggtgtaagag ttatgtcgaa gcgaccagga gatataataa       360
     tctcaacacc cgtatccaag gtgcggagga ggctgaactt cgacagccca tacacgaacc       420
     gtgttgttgc ccccactgtc cgcgtcacca gaagcaaaat atgggccaac aggcccatgt       480
     atcggaagcc gaagatgtac agaatgtatc gaagcccaga tgtccctaag ggctgtgaag       540
     gcccatgtaa ggttcagtcg tatgaacaga gggatgatgt taagcatact ggtatggtcc       600
     gatgtgtcag tgatgttact cgtgggccag gcatcaccca tagagttggg aagaggtttt       660
     gtgtgaagtc catatatata ttgggcaaga tctggatgga tgagaatatc aagaagcaaa       720
     atcatacgaa ccatgttatg ttcttcctcg ttcgagatag aaggccttat ggtccgagcc       780
     cgcaagattt tggacaagtg ttcaacatgt ttgataatga acctactacg gcaacggtga       840
     agaatgatct gagggaccgg tatcaggtgt tacgaaaatt ctatgccacc gttgttggtg       900
     gaccctccgg gatgaaggaa caatcgctgg ttaagaggtt ttttaggatc aataatcatg       960
     tagtgtataa tcatcaggaa caggccaaag tatgagaatc atacggagaa tgcgttgtta      1020
     ttgtatatgg catgtacaca tgcctcaaat ccagtgtacg ctactctgaa aatacgcatc      1080
     tatttctatg atgcagtgac aaattaataa aggttgaatt ttattgcatg ttgctccgta      1140
     acttggagtg tgtttagtaa tacatcgtac agaacatgat caacagcttg tagtacagtg      1200
     ttaatggaaa taacgcctat catatctaaa tacttgagca cttgatatct aaatactctt      1260
     aagaaacgac cagtcggagg ccgtaaggtc gtccagacct tgaagttgag aaaacacttg      1320
     tgaatcgcca atgccttccg gaggttgtgg ttgaaacgta tctggagtgt gatgatgtcg      1380
     tggttcatgt tccctggcct cgtgtcgtgg ttggtgatgt cgaaatagag gggatttgtt      1440
     atttcccagg taaaaacgcc attctttgct tgaggcgcag tgatgagttc ccctgtgcga      1500
     gaatccatgg ttgatgcagt cgatatggag atagaacgag cagccacatt cgaggtctac      1560
     tcgcctacgt ctgagggccc tggtcttcgc tgtgcggtgt tggactttga tcggcacttg      1620
     agaacaatgg ctcgtggagg gtgacgaagg tggcattctt taaagcccag gctttaaggg      1680
     actgattctt ttcctcttcg agaaactctt tatatgatga tgttggtcct ggattgcaga      1740
     ggaagatagt gggaatgccg cctttaattt gaattggctt tccgtacttt gtattgcttt      1800
     gccagtccct ttgggccccc atgaattctt taaagtgttt gaggtagtgg gggtcgacgt      1860
     catcaatgac gttgtaccag gcgtcgttgc tgtagacctt tggactgaga tccaggtgtc      1920
     cacacaagta gttgtgtgga cccagagagc gggcccacat cgtcttcccc gtcctactat      1980
     cgccctcgat gacgatgcta ctcggtctcc atggccgcgc agcggaaccc atcacgttct      2040
     cggaaaccca agcttcaagt tcctcaggaa cgttagtgaa agaagaagaa agaaagggag      2100
     aaacataagg aggcagaggc tcttgaaaaa tcctatctaa attgctattt aaattatgaa      2160
     actgtaaaac aaaatctttt ggggcaagtt cccgtattac attaagagcc tccgacttat      2220
     ttgctgagtt aagagccttg gcgtaagcgt cattggcgga ttgttgtcct ccgcgagcag      2280
     atcgtccgtc gatctgaaac tcgccccatt ggatggtgtc tccgtccttg tccagatagg      2340
     acttgacgtc ggagcttgat ttagctccct gaatgtttgg gtggaaatgg gcggaccggg      2400
     aaggggatat gaggtcgaag aatcgttggt tggtacaatt gtacttgccc tcgaactgaa      2460
     tgagcgcatg cagatgaggt tccccatttt catggagttc tctgcagatc ttgatgaaca      2520
     atttatttgt tggggtttgg agtttccgga tctgatccaa tgcctcctct ttggacagag      2580
     agcatttggg atatgttaag aaatagtttt tcgctttgat gctaaaacga ccagcccttg      2640
     gcatttttgc tgtcgtatag caatcggggg gcactcaaag tctgtagcaa tcgggggaat      2700
     gggggggcaa tttatatgat gccccctaaa tggcatttat gtaatatcct cattgaattt      2760
     gaatttcaaa cgtggaaagc ggccatccgt ataatatt                              2798
//