ID   JF909087; SV 1; circular; genomic DNA; STD; VRL; 2800 BP.
XX
AC   JF909087;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic Kenya virus isolate
DE   Comoros:Grande-Comore:GC11AA6:2008 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic Kenya virus
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2800
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2800
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; 4eeda2315ab0a810c3a330640d88c6b4.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2800
FT                   /organism="East African cassava mosaic Kenya virus"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Grande-Comore:GC11AA6:2008"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Grande-Comore"
FT                   /lat_lon="11.5 S 43.45 E"
FT                   /collection_date="2008"
FT                   /db_xref="taxon:393599"
FT   gene            175..531
FT                   /gene="AV2"
FT   CDS             175..531
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /db_xref="GOA:I6LXA7"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXA7"
FT                   /protein_id="AEG89856.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAQDVQNVSKPRCS"
FT   gene            335..997
FT                   /gene="AV1"
FT   CDS             335..997
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:I6LXE8"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXE8"
FT                   /protein_id="AEG89855.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGQSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYTTVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAQV"
FT   gene            complement(1106..1510)
FT                   /gene="AC3"
FT   CDS             complement(1106..1510)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:I6LXE6"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXE6"
FT                   /protein_id="AEG89859.1"
FT                   /translation="MDSRTGELITAPQAKNGVFTWEITNPLYFDITNHDRRPGNMNHDI
FT                   ITFQIRFNHNIRKALGIHKCFLNFKVWTTLRPPTGLFLKVFRYQVLKYLDMIGVISLNT
FT                   VIQAVDHVLYNVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1251..1658)
FT                   /gene="AC2"
FT   CDS             complement(1251..1658)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:I6LYF5"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:I6LYF5"
FT                   /protein_id="AEG89858.1"
FT                   /translation="MPPSSPSTSHCSQVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRHHQPRQETREHEPRHHHIPDTVQPQ
FT                   HPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1582..2646)
FT                   /gene="AC1"
FT   CDS             complement(1582..2646)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:I6LXF0"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXF0"
FT                   /protein_id="AEG89857.1"
FT                   /translation="MPRAGRFSIKAKNYFLTYPKCSLSKEEALDQLRQLQTPTNKLFIK
FT                   ICRELHENGEPHLHALIQFEGKYNCTNQRFFDLISLSRSAHFHPNIQGAKSSSDVKSYL
FT                   DKDGDTIQWGEFQIDGRSARGGQQSANDAYAKALNSANKSEALNVIRELAPKDFVLQFH
FT                   NLNSNLERIFQEPLTPYISPFLSSSFTNVPEELEAWVSENVMGSAARPWRPSSIVIEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKHQSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSED"
FT   gene            complement(2199..2489)
FT                   /gene="AC4"
FT   CDS             complement(2199..2489)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXF3"
FT                   /protein_id="AEG89860.1"
FT                   /translation="MGNLICMPSFSSKASTIVPTNDSSTSYPFPGPPISTQIFRELNQA
FT                   PTSSPIWIRTETPSNGASFRSTDDLLEADNNPPMTLTPKLLTQQISQRLLM"
XX
SQ   Sequence 2800 BP; 727 A; 557 C; 727 G; 789 T; 0 other;
     accggatggc cgcgcccgaa aaaagcaggt ggaccccaca agatggccgc gcccgttaaa        60
     gaaagtggtc cccgcgcact tgtgttggtc ggccagtcat attcacgtgt gaaaggctag       120
     atatttgttg tttgtcttta tagacttcgt cgcgaagtag tggagcgcgt caacatgtgg       180
     gatccattgt tgaacgattt tcccgaaacc gttcacggtt tccgttctat gcttgctgtt       240
     aaatacctgt tacatctgga acaggaatac gatcgcggta ctgtcggggc ggagtatata       300
     cgtgatttaa taggggttct acggtgtaag agttatgtcg aagcgaccag gagatataat       360
     aatctcaaca cccgtatcca aggtgcggag gaggctgaac ttcgacagcc catacacgaa       420
     ccgtgttgtt gcccccactg tccgcgtcac cagaagcaaa atatgggcca acaggcccat       480
     gtatcggaag cccaagatgt acagaatgta tcgaagccca gatgttccta agggctgtga       540
     aggcccatgt aaggttcagt cctatgaaca gagggatgat gtgaagcaca ctggtatggt       600
     ccgatgtgtc agtgatgtta ctcgtggatc aggcattacc catagagtcg ggaagaggtt       660
     ttgtgtgaag tccatatata tattgggcaa gatttggatg gatgagaata tcaagaagca       720
     aaatcatacg aaccatgtta tgttcttcct tgttcgagat agaaggcctt atggtcagag       780
     tcctcaagat tttggacaag tgttcaacat gtttgataat gaacctacta cggcaactgt       840
     gaagaatgat cttagggacc gatatcaggt gttacgtaaa ttctatacga ctgttgtggg       900
     tggaccctct gggatgaagg aacaagctct ggttaagagg ttttttagga tcaataatca       960
     tgtagtgtat aatcatcagg aacaggccca agtatgagaa tcatactgag aatgcgttgt      1020
     tattgtatat ggcatgtaca catgcctcga atcctgtgta cgctacgctg aaaatacgca      1080
     tctatttcta tgatgcagtg acaaattaat aaaggttgaa ttttattgca tgttgctccg      1140
     taacttggag tgtgttgagt aatacattgt acagaacatg atcaacagct tgaattacag      1200
     tgttaaggga aataacgcct atcatatcta aatacttgag cacttgatat ctaaatactt      1260
     ttaagaaaag accagtcgga ggccgtaagg tcgtccagac cttgaagttg agaaaacact      1320
     tgtgaatccc caatgccttc cggatgttgt ggttgaaccg tatctggaat gtgatgatgt      1380
     cgtggttcat gttccctggt ctcctgtcgt ggttggtgat gtcgaaatag aggggatttg      1440
     ttatttccca ggtaaaaacg ccattctttg cttgaggcgc agtgatgagt tcccctgtgc      1500
     gagaatccat gattgatgca gtcgatatgg agatagaacg agcagccgca ttcgaggtct      1560
     acccgcctac gtctgacggc cctagtcttc gctgtgcggt gttggacttt gatgggcact      1620
     tgagaacaat ggctcgtgga gggtgatgaa ggtggcattc tttaaagccc aggctttaag      1680
     ggactggtgc ttttcctcgt ccagaaactc tttatatgat gatgttggtc ctggattgca      1740
     taggaagata gtgggaatgc cgcctttaat ttgaattggc ttcccgtact ttgtattgct      1800
     ttgccagtcc ctttgggccc ccatgaattc tttgaaatgc ttgagatagt gggggtcgac      1860
     gtcatcaatg acgttgtacc atgcgtcgtt gctgtatacc tttggactga gatccaggtg      1920
     tccacacaag tagttatgtg gtcccaaaga gcgagcccac attgtcttcc ctgtcctact      1980
     atctccctcg atgacgatac tactaggtct ccatggccgc gcagcggaac ccatcacgtt      2040
     ctcggaaacc caggcttcaa gttcctcagg aacgttagtg aaagaagaag aaagaaaggg      2100
     agaaatataa ggagtgagag gctcttgaaa aatcctctct aaattgctat ttaaattatg      2160
     aaactgtaaa acaaaatctt ttggggctag ttcccgtatt acattaagag cctctgactt      2220
     atttgctgag ttaagagctt tggcgtaagc gtcattggcg gattgttgtc cgcctcgagc      2280
     agatcgtccg tcgatctgaa actcgcccca ttggatggtg tctccgtcct tatccagata      2340
     ggacttgacg tcggagcttg atttagctcc ctgaatattt gggtggaaat gggcggaccg      2400
     ggaaagggat atgaggtcga agaatcgttg gttggtacaa ttgtacttgc cttcgaactg      2460
     aatgagggca tgcagatgag gttccccatt ttcatggagc tctctgcaga tcttgatgaa      2520
     caatttattt gttggggttt ggagttgtcg gagctgatcc aaggcctctt ctttcgatag      2580
     agaacatttg ggatatgtga ggaaatagtt tttggctttg atgctaaaac gaccagccct      2640
     tggcatttgc gctgtcgtat agcaatcggg gggcactcaa agtctgtagc aatcggggga      2700
     atgggggggc aatttatatg atgcccccca aatggcattt atgtaatatc ctcatgaaat      2760
     ttgaatttca aacgtggaaa gcggccatcc gtataatatt                            2800
//