ID   JF909078; SV 1; circular; genomic DNA; STD; VRL; 2797 BP.
XX
AC   JF909078;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic Kenya virus isolate
DE   Comoros:Grande-Comore:GC01B01:2005 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic Kenya virus
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2797
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2797
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; 58fd04de4e632f87fda42151d8a7f930.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2797
FT                   /organism="East African cassava mosaic Kenya virus"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Grande-Comore:GC01B01:2005"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Grande-Comore"
FT                   /lat_lon="11.82 S 43.29 E"
FT                   /collection_date="2005"
FT                   /db_xref="taxon:393599"
FT   gene            173..538
FT                   /gene="AV2"
FT   CDS             173..538
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /db_xref="GOA:I6LX95"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX95"
FT                   /protein_id="AEG89802.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNKLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNRGQQAHVS
FT                   EAQYVQNVSKPRCSEGL"
FT   gene            333..1106
FT                   /gene="AV1"
FT   CDS             333..1106
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:I6LX94"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX94"
FT                   /protein_id="AEG89801.1"
FT                   /translation="MSKRPGDIISSTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIGA
FT                   NRPMYRKPNMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGQSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYTTVVGGPSGMKEQSLVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1103..1507)
FT                   /gene="AC3"
FT   CDS             complement(1103..1507)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:I6LX98"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX98"
FT                   /protein_id="AEG89805.1"
FT                   /translation="MDSRTGELITAPQAKNGVFTWEITNPLYFDITNHDKRPGNMNHDI
FT                   ITFQIRFNHNIRKALGIHKCFLNFKVWTTLRPPTGLFLKVFRYQVLKYLDMIGVISINT
FT                   VIQAVDHVLYNVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1248..1655)
FT                   /gene="AC2"
FT   CDS             complement(1248..1655)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:I6LX97"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX97"
FT                   /protein_id="AEG89804.1"
FT                   /translation="MPPSSPSTSHCSQVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRHHQPRQETREHEPRHHHIPDTVQPQ
FT                   HPEGIGDSQMFSQLQGLDDLTASDWSFLKSL"
FT   gene            complement(1579..2643)
FT                   /gene="AC1"
FT   CDS             complement(1579..2643)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:I6LX96"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX96"
FT                   /protein_id="AEG89803.1"
FT                   /translation="MPRAGRFSIKAKNYFLTYPKCSLSKEEALGQLRQLQTPTNKLFIK
FT                   ICRELHENGEPHLHALIQFEGKYNCTNQRFFDLISPSRSAHFHPNIQGAKSSSDVKSYL
FT                   DKDGDTIQWGEFQIDGRSARGGQQSANDAYAKALNSANKSEALNVIRELAPKDFVLQFH
FT                   NLNSNLERIFQEPLTPYISPFLSSSFTNVPEELEAWVSENVMGSAARPWRPSSIVIEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEDKNPSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSED"
FT   gene            complement(2196..2486)
FT                   /gene="AC4"
FT   CDS             complement(2196..2486)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX99"
FT                   /protein_id="AEG89806.1"
FT                   /translation="MGNLICMPSFSSKASTIVPTNDSSISYPLPGPPISTQIFRALNQA
FT                   PTSSPIWIRTETPSNGASFKSTDDLLEADNNPPMTLTPRLLTQQISQRLLM"
XX
SQ   Sequence 2797 BP; 727 A; 558 C; 731 G; 781 T; 0 other;
     accggatggc cgcgcccgaa aaaagcaggt ggccccacaa gatggccgcg cccgttaaag        60
     aaagtggtcc cgcgcacttg tgttggtcgg ccagtcataa tcacgcgtga aagtctagat       120
     atgtgttgtt tgtctttata gacttcgtcg cgaagtagtg gagcgcgtca acatgtggga       180
     tccattgttg aacgattttc ccgaaaccgt tcacggtttt cgttctatgc ttgctgttaa       240
     atacctgtta catctggaac aggaatacga tcgcggtact gtcggggcgg agtatatacg       300
     tgatttaata ggggttctac ggtgtaagag ttatgtcgaa gcgaccagga gatataataa       360
     gctcaacacc cgtatccaag gtgcggagga ggctgaactt cgacagccca tacacgaacc       420
     gtgttgttgc ccccactgtc cgcgtcacca gaagcaaaat aggggccaac aggcccatgt       480
     atcggaagcc caatatgtac agaatgtatc gaagcccaga tgttccgaag ggctgtgaag       540
     gcccatgtaa ggttcagtcc tatgaacaga gggatgatgt gaagcacact ggtatggtcc       600
     gatgtgtcag tgatgttact cgtggatcag gcattaccca tagagtcggg aagaggtttt       660
     gtgtgaagtc catatatata ttgggcaaga tttggatgga tgagaatatc aagaagcaaa       720
     atcatacgaa ccatgtgatg ttcttccttg ttcgagatag aaggccttat ggtcagagtc       780
     ctcaagattt tggacaagtg ttcaacatgt ttgataatga acctactacg gcaactgtga       840
     agaatgatct tagggaccga tatcaggtgt tacgtaaatt ttatacgact gttgttggtg       900
     gaccctctgg aatgaaggaa caatctctgg ttaagaggtt ttttaggatc aataatcatg       960
     tagtgtataa tcatcaggaa caggccaagt atgagaacca tacggagaat gcgctgctat      1020
     tgtatatggc atgtacacat gcctcgaatc ctgtgtacgc tacgctgaaa atacgcatct      1080
     atttctatga tgcagtgaca aattaataaa ggttgaattt tattgcatgt tgctccgtaa      1140
     cttggagtgt gtttagtaat acattgtaca gaacatgatc aacagcttga attacagtgt      1200
     taatggaaat aacgcctatc atatctaaat acttgagcac ttgatatcta aagactttta      1260
     agaaaagacc agtcggaggc cgtaaggtcg tccagacctt gaagttgaga aaacatttgt      1320
     gaatccccaa tgccttccgg atgttgtggt tgaaccgtat ctggaatgtg atgatgtcgt      1380
     ggttcatgtt ccctggtctc ttgtcgtggt tggtgatgtc gaaatagagg ggatttgtta      1440
     tttcccaggt aaaaacgcca ttctttgctt gaggcgcagt gatgagttcc cctgtgcgag      1500
     aatccatgat tgatgcagtc gatatggaga tagaacgagc agccgcattc gaggtctacc      1560
     cgcctacgtc tgacggccct agtcttcgct gtgcggtgtt ggactttgat gggcacttga      1620
     gaacaatggc tcgtggaggg tgatgaaggt ggcattcttt aaagcccagg ctttaaggga      1680
     cgggttcttg tcctcgtcca gaaactcttt atatgatgat gttggtcctg gattgcatag      1740
     gaagatagtg ggaatgccgc ctttaatttg aattggcttc ccgtacttgg tattgctttg      1800
     ccagtccctt tgggccccca tgaattcttt gaaatgcttg aggtagtggg ggtcgacgtc      1860
     atcaatgacg ttgtaccatg cgtcgttact gtataccttt ggactgagat ccaggtgtcc      1920
     acacaagtag ttatgtggtc ccaaagagcg agcccacatt gtcttccctg tcctactatc      1980
     tccctcgatg acgatactac tcggtctcca tggccgcgca gcggaaccca tcacgttctc      2040
     ggaaacccag gcttcaagtt cctcaggaac gttagtgaaa gaagaagaaa gaaagggaga      2100
     aatataagga gtgagaggct cttgaaaaat cctctctaaa ttgctattta aattatgaaa      2160
     ctgtaaaaca aaatctttgg gggctagttc ccgtattaca ttaagagcct ctgacttatt      2220
     tgctgagtta agagccttgg cgtaagcgtc attggcggat tgttgtccgc ctcgagcaga      2280
     tcgtccgtcg atttgaaact cgccccattg gatggtgtct ccgtccttat ccagatagga      2340
     cttgacgtcg gagcttgatt tagcgccctg aatatttggg tggaaatggg cggaccggga      2400
     aggggatatg agatcgaaga atcgttggtt ggtacaattg tacttgcctt cgaactgaat      2460
     gagggcatgc agatgaggtt ccccattttc atggagctct ctgcagatct tgatgaacaa      2520
     tttatttgtt ggggtttgga gttgtcggag ctgacccaag gcctcctctt tcgatagaga      2580
     acatttggga tatgtgagga aatagttttt ggctttgatg ctaaaacgac cagcccttgg      2640
     catttgcgct gtcgtatagc aatcgggggg cactcaaagt ctgtagcaat cgggggaatg      2700
     ggggggcaat ttatatgatg ccccccaaat ggcatttatg taatatcctc atgaaatttg      2760
     aatttcaaac gtggaaagcg gccatccgta taatatt                               2797
//