ID   JF909107; SV 1; circular; genomic DNA; STD; VRL; 2799 BP.
XX
AC   JF909107;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic Kenya virus isolate
DE   Comoros:Grande-Comore:GC31AB2:2009 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic Kenya virus
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2799
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2799
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; 82142c93f4a7efe07cfb610deed40158.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2799
FT                   /organism="East African cassava mosaic Kenya virus"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Grande-Comore:GC31AB2:2009"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Grande-Comore"
FT                   /lat_lon="11.48 S 43.31 E"
FT                   /collection_date="2009"
FT                   /db_xref="taxon:393599"
FT   gene            175..531
FT                   /gene="AV2"
FT   CDS             175..531
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /db_xref="GOA:I6LXT1"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXT1"
FT                   /protein_id="AEG89976.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKTYVEATRRYNNLNTRIQGAEEAELRQPIHESCCCPHCPRHQKQNMGQQAHVS
FT                   EAQDVQNVSKPRCS"
FT   gene            335..1108
FT                   /gene="AV1"
FT   CDS             335..1108
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:I6LXD6"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXD6"
FT                   /protein_id="AEG89975.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGQSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYTTVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1105..1509)
FT                   /gene="AC3"
FT   CDS             complement(1105..1509)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:I6LXS2"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXS2"
FT                   /protein_id="AEG89979.1"
FT                   /translation="MDSRTGELITAPQAKNGVFTWEITNPLYFDITNHDRRPGNMNHDI
FT                   ITFQIRFNHNIRKALGIHKCFLNFKVWTTLQPPTGLFLRVFRYQVLKYLDMIGVISINT
FT                   VIQAVDHVLYNVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1250..1657)
FT                   /gene="AC2"
FT   CDS             complement(1250..1657)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:I6LXS1"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXS1"
FT                   /protein_id="AEG89978.1"
FT                   /translation="MPPSSPSTSHCSQVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRHHQPRQETREHEPRHNHIPDTVQPQ
FT                   HPEGVGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1581..2645)
FT                   /gene="AC1"
FT   CDS             complement(1581..2645)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:I6LXS6"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXS6"
FT                   /protein_id="AEG89977.1"
FT                   /translation="MPRAGRFSIKAKNYFLTYPKCSLSKEEALDQLRQLQTPTNKLFIK
FT                   ICRELHENGEPHLHALIQFEGKYNCTNQRFFDLISPSRSAHFHPNIQGAKSSSDVKSYL
FT                   DKDGDTIQWGEFQIDGRSARGGQQSANDAYAKALNSANKSEALNVIRELAPKDFVLQFH
FT                   NLNSNLERIFQEPLTPYISPFLSSSFTNVPEELEAWVSENVMGSAARPWRPSSIVIEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSED"
FT   gene            complement(2198..2737)
FT                   /gene="AC4"
FT   CDS             complement(2198..2737)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXS3"
FT                   /protein_id="AEG89980.1"
FT                   /translation="MPFGGHHINCPPIPPIATDFECPPIAIRQRKCQGLVVLASKPKTI
FT                   SSHIPNVLYRKKRPWISFDNYKPQQINCSSKSAESSMKMGNLICMPSFSSKASTIVPTN
FT                   DSSTSYPLPGPPISTQIFRELNQAPTSSPIWIRTETPSNGASFRSTDDLLEADNNPPMT
FT                   LTPRLLTQQISQRLLM"
XX
SQ   Sequence 2799 BP; 725 A; 557 C; 723 G; 794 T; 0 other;
     accggatggc cgcgcccgaa aaaagcaggt ggaccctaca agatggccgc gcccgttaaa        60
     gaaagtggtc cccgcgcact tgtgttggtc ggccagtcat atttacgcgt gaaagtctag       120
     atatttgttg tttgtcttta tagacttcgt cgcgaagtag tggagcgcgt caacatgtgg       180
     gatccattgt tgaacgattt tcccgaaacc gttcacggtt tccgttctat gcttgctgtt       240
     aaatacctgt tacatctgga acaggaatac gatcgcggta ctgtcggggc ggagtatata       300
     cgtgatttaa taggggttct acggtgtaag acctatgtcg aagcgaccag gagatataat       360
     aatctcaaca cccgtatcca aggtgcggag gaggctgaac ttcgacagcc catacacgaa       420
     tcgtgttgtt gcccccactg tccgcgtcac cagaagcaaa atatgggcca acaggcccat       480
     gtatcggaag cccaagatgt acagaatgta tcgaagccca gatgttccta agggctgtga       540
     aggcccatgt aaggttcagt cctatgaaca gagggatgat gtgaagcaca ctggtatggt       600
     ccgatgtgtc agtgatgtta ctcgtggatc aggcattacc catagagtcg ggaagaggtt       660
     ttgtgtgaag tccatatata tattgggcaa gatttggatg gatgagaata tcaagaagca       720
     aaatcatacg aaccatgtta tgttcttcct tgttcgagat agaaggcctt atggtcagag       780
     tcctcaagat tttggacaag tgttcaacat gtttgataat gaacctacta cggcaactgt       840
     gaagaatgat cttagggacc gatatcaggt gttacgtaaa ttctatacga ctgttgttgg       900
     tggaccctct gggatgaagg aacaagctct ggttaagagg ttttttagga tcaataatca       960
     tgtagtgtat aatcatcagg aacaggccaa gtatgagaat catactgaga atgcgttgtt      1020
     attgtatatg gcatgtacac atgcctcgaa tccggtgtac gctacgctga aaatacgcat      1080
     ctatttctat gatgcagtga caaattaata aaggttgaat tttattgcat gttgctccgt      1140
     aacttggagt gtgtttagta atacattgta caggacatga tcaacagctt gaattacagt      1200
     gttaatggaa ataacgccta tcatatctaa atacttgagc acttgatatc taaatactct      1260
     taagaaaaga ccagtcggag gctgtaaggt cgtccagacc ttgaagttga gaaaacactt      1320
     gtgaatcccc aacgccttcc ggatgttgtg gttgaaccgt atctggaatg tgattatgtc      1380
     gtggttcatg ttccctggtc tcctgtcgtg gttggtgatg tcgaaataga ggggatttgt      1440
     tatttcccag gtaaaaacgc cattctttgc ttgaggcgca gtgatgagtt cccctgtgcg      1500
     agaatccatg attgatgcag tcgatatgga gatagaacga gcagccgcat tcgaggtcta      1560
     cccgcctacg tctgacggcc ctagtcttcg ctgtgcggtg ttggactttg atgggcactt      1620
     gagaacaatg gctcgtggag ggtgatgaag gtggcattct ttaaagccca ggctttaagg      1680
     gactggttct tttcctcgtc cagaaactct ttatatgatg atgttggtcc tggattgcat      1740
     aggaagatag tgggaatgcc gcctttaatt tgaattggct tcccgtactt tgtattgctt      1800
     tgccagtccc tttgggcccc catgaattct ttgaaatgct tgaggtagtg ggggtcgacg      1860
     tcatcaatga cgttgtacca tgcgtcgttg ctgtatacct ttggactgag atccaggtgt      1920
     ccacacaagt agttatgtgg tcccaaggag cgagcccaca ttgtcttccc tgtcctacta      1980
     tctccctcga tgacgatact actaggtctc catggccgcg cagcggaacc catcacgttc      2040
     tcggaaaccc aggcttcaag ttcctcagga acgttagtga aagaagaaga aagaaaggga      2100
     gaaatataag gagtgagagg ctcttgaaaa atcctctcta aattgctatt taaattatga      2160
     aactgtaaaa caaaatcttt tggggctagt tcccgtatta cattaagagc ctctgactta      2220
     tttgctgagt taagagcctt ggcgtaagcg tcattggcgg attgttgtcc gcctcgagca      2280
     gatcgtccgt cgatctgaaa ctcgccccat tggatggtgt ctccgtcctt atccagatag      2340
     gacttgacgt cggagcttga tttagctccc tgaatatttg ggtggaaatg ggcggaccgg      2400
     gaaggggata tgaggtcgaa gaatcgttgg ttggtacaat tgtacttgcc ttcgaactga      2460
     atgagggcat gcagatgagg ttccccattt tcatggagct ctctgcagat tttgatgaac      2520
     aatttatttg ttggggtttg tagttgtcga agctgatcca aggcctcttc tttcgataga      2580
     gaacatttgg gatatgtgag gaaatagttt ttggctttga tgctaaaacg accagccctt      2640
     ggcatttgcg ctgtcgtata gcaatcgggg ggcactcaaa gtctgtagca atcgggggaa      2700
     tgggggggca atttatatga tgccccccaa atggcattta tgtaaaatcc tcaatgaatt      2760
     tgaatttcaa acgtggaaag cggccatccg tataatatt                             2799
//