ID   JF909193; SV 1; circular; genomic DNA; STD; VRL; 2799 BP.
XX
AC   JF909193;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic virus-Kenya isolate
DE   Comoros:Mayotte:YT55B06:2008 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus-Kenya
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2799
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2799
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; dc52d374c78b9c853447ffda4ae9ad28.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2799
FT                   /organism="East African cassava mosaic virus-Kenya"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Mayotte:YT55B06:2008"
FT                   /mol_type="genomic DNA"
FT                   /country="Mayotte"
FT                   /lat_lon="12.84 S 45.17 E"
FT                   /collection_date="2008"
FT                   /db_xref="taxon:1229189"
FT   gene            172..528
FT                   /gene="AV2"
FT   CDS             172..528
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /db_xref="GOA:I6LX83"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX83"
FT                   /protein_id="AEG90492.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAEDVQNVSKPRCP"
FT   gene            332..1105
FT                   /gene="AV1"
FT   CDS             332..1105
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:I6LZ84"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:I6LZ84"
FT                   /protein_id="AEG90491.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDNPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHIGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1102..1506)
FT                   /gene="AC3"
FT   CDS             complement(1102..1506)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:I6LZ88"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:I6LZ88"
FT                   /protein_id="AEG90495.1"
FT                   /translation="MDSRTGELITAPQAKNGVFTWEITNPLYFEITNHDKRPGNMNHDI
FT                   ITLQIRFNHNLRKALAIHKCFLNFKVWTTLRPQTGLFLRVFRYQVLKYLDMIGVISINT
FT                   VITAVDHVLYAVLLNTLQVTEHHAIKFNLY"
FT   gene            complement(1247..1654)
FT                   /gene="AC2"
FT   CDS             complement(1247..1654)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:I6LYT7"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYT7"
FT                   /protein_id="AEG90494.1"
FT                   /translation="MPPSSPSTSHCSQVPIKVQHRTAKNRALRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRNHQPRQEAREHEPRHHHTPDTFQPQ
FT                   PPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1563..2642)
FT                   /gene="AC1"
FT   CDS             complement(1563..2642)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:I6LZ86"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:I6LZ86"
FT                   /protein_id="AEG90493.1"
FT                   /translation="MPRAGRFQINAKNYFITYPRCSLTKEEAISQLKALSYPTNIKFIR
FT                   VCRELHRDGVPHLHVLIQFEGKFQCTNQRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKSEALQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPPQTYASPFLSSSFTQVPEELEVWVSENICSPAARPWRPISIVLEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSEEPGPQT"
FT   gene            complement(2252..2485)
FT                   /gene="AC4"
FT   CDS             complement(2252..2485)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:I6LZ89"
FT                   /protein_id="AEG90496.1"
FT                   /translation="MGCLISMFSSNSKASSNVQTRDSSISFPPPDQHISIRTFRVLNHR
FT                   PMSKPTLKREGNFLTMEFSKSMPEVQGGRASI"
XX
SQ   Sequence 2799 BP; 728 A; 560 C; 722 G; 789 T; 0 other;
     accggatggc cgcgcccgaa aaagcaggtg gaccccacca gatggccgcg cccgtgaaag        60
     caagtggtcc ccgcgcacgt gttggtcggc cagtcatatt tacgcgtgaa agtctagata       120
     tttgttgttt gtctttatag acttcgtcgc gaagtagtgg agcgtgtcaa catgtgggat       180
     ccattgttga atgattttcc cgaaaccgtt cacggtttcc gttctatgct tgctgttaaa       240
     tacctgttac atctggaaca ggaatacgat cgcggtactg tcggggctga gtatatacgg       300
     gatctaatag gggttctacg gtgtaagagt tatgtcgaag cgaccaggag atataataat       360
     ctcaacaccc gtatccaagg tgcggaggag gctgaacttc gacaacccat acacgaaccg       420
     tgttgttgcc cccactgtcc gcgtcaccag aagcaaaata tgggccaaca ggcccatgta       480
     tcggaagccg aagatgtaca gaatgtatcg aagcccagat gtccctaagg gctgtgaagg       540
     cccatgtaag gttcagtcgt atgaacagag ggatgatgtt aagcacattg gtatggtccg       600
     atgtgtcagt gatgttactc gtgggtcagg catcacccat agagtcggga agaggttttg       660
     tgtgaagtcc atatatatat tgggcaagat ctggatggat gagaatatca agaagcaaaa       720
     tcacacgaac catgttatgt tcttcctcgt tcgagacaga aggccgtatg gtccgagccc       780
     gcaagatttt ggacaagtgt tcaacatgtt tgataatgaa cctactactg caactgtgaa       840
     gaatgatctt agggaccggt atcaggtgtt acgtaaattc tatgcgactg ttgtcggtgg       900
     accctccggg atgaaggaac aagcgctggt taagaggttt tttaggatca ataatcatgt       960
     agtgtataat catcaggaac aggccaagta tgagaatcat actgagaatg cgttgttatt      1020
     gtatatggca tgtacacatg cctcaaatcc tgtgtacgct actctgaaaa tacgcatcta      1080
     tttctatgat gcagtgacaa attaataaag gttgaatttt attgcatggt gctccgtaac      1140
     ttggagtgtg tttagtaata cagcgtacag aacatgatca acagctgtaa ttacagtgtt      1200
     aatggaaata acgcctatca tatctaaata cttgagcact tgatatctaa atactcttaa      1260
     gaaaagacca gtctgaggcc gtaaggtcgt ccagaccttg aagttgagaa aacacttgtg      1320
     aatcgccaat gccttccgga ggttgtggtt gaaacgtatc tggagtgtga tgatgtcgtg      1380
     gttcatgttc cctggcctct tgtcgtggtt ggtgatttcg aaatagaggg gatttgttat      1440
     ttcccaggta aaaacgccat tctttgcttg aggcgcagtg atgagttccc ctgtgcgaga      1500
     atccatggtt gatgcagtcg atatggagat agaacgagca gccacattcg aggtctaccc      1560
     gcctacgtct gagggcccgg ttcttcgctg tgcggtgttg gactttgatg ggcacttgag      1620
     aacaatggct cgtggagggt gatgaaggtg gcattcttta aagcccaggc tttaagggac      1680
     tgattctttt cctcgtccag aaactcttta tatgatgatg ttggtcctgg attgcagagg      1740
     aagatagtgg gaatgccgcc tttaatttga attggcttcc cgtactttgt attgctttgc      1800
     cagtcccttt gggcccccat gaattctttg aagtgcttta gataatgcgg gtctacgtcg      1860
     tcaatgacgt tgtaccatgc gtcgtttgaa tagacctttg gagacagatc caggtgtcca      1920
     catagataat tatggggtcc cagtgaacga gcccacatgg ttttcccggt ccggctatca      1980
     ccttcgagaa caatactgat cggtctccat ggccgcgcag cgggactgca tatattttct      2040
     gatacccata cctcaagttc ttcgggaact tgtgtaaatg atgatgataa gaacggacta      2100
     gcgtaagttt gaggcggagc ctggaagatc ctatctgcgt tagcagatat gttatggaac      2160
     tgtaaaaaaa aggacttggg atctttttct ttgatgattt gaagagcttc tgatttagaa      2220
     gaagcattca acgcgtcggc atatacctga gctaaatgct ggccctcccc ccttgcactt      2280
     ctggcatcga cttggaaaat tccatcgtca agaaattccc ctcccttttc aatgtaggct      2340
     ttgacatcgg acgatgattt agcaccctga atgttcggat ggaaatgtgt tgatctggag      2400
     ggggaaatga gatcgaagaa tctctggttt gtacattgga acttgccttc gaattggatg      2460
     agaacatgga gatgaggcac cccatcccga tgtagttctc tgcaaaccct aatgaatttg      2520
     atattcgtcg ggtaagaaag ggcttttaat tgggaaatgg cctcttcctt ggttaatgag      2580
     catcggggat aggttatgaa ataatttttg gcatttattt gaaaacgacc ggctcttggc      2640
     atattggctg tcgttttgga tcgggggaca ctcaaagtct atagcaatcg gtggaacggg      2700
     gggcaattta tatgatgtcc cccaatggca tatgtgtaaa taggtcgacc tccattcaaa      2760
     atttgaattg cgaatattgg cggccatccg attaatatt                             2799
//