ID   JF909097; SV 1; circular; genomic DNA; STD; VRL; 2802 BP.
XX
AC   JF909097;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic virus-Kenya isolate
DE   Comoros:Grande-Comore:GC20B00:2004 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus-Kenya
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2802
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2802
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; e8a1f5fcbd53590eadf0f24cf6e94d10.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2802
FT                   /organism="East African cassava mosaic virus-Kenya"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Grande-Comore:GC20B00:2004"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Grande-Comore"
FT                   /lat_lon="11.69 S 43.41 E"
FT                   /collection_date="2004"
FT                   /db_xref="taxon:1229189"
FT   gene            175..531
FT                   /gene="AV2"
FT   CDS             175..531
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /db_xref="GOA:I6LXK9"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXK9"
FT                   /protein_id="AEG89916.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQQYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMDQQAHVS
FT                   EAQDVQNVSKPRCP"
FT   gene            335..1108
FT                   /gene="AV1"
FT   CDS             335..1108
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:I6LXK8"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXK8"
FT                   /protein_id="AEG89915.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWT
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1105..1509)
FT                   /gene="AC3"
FT   CDS             complement(1105..1509)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:I6LXL2"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXL2"
FT                   /protein_id="AEG89919.1"
FT                   /translation="MDSRTGELITAPQATNGVFTWEITNPLYFEITNHDKRPGNMHHDI
FT                   ITLRIRFNHNLRKALGIHKCFLNFKVWTTLRPQTGRFLKVFRYQVLKYLDMIGVISINI
FT                   VLQAVDHVLYDVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1250..1657)
FT                   /gene="AC2"
FT   CDS             complement(1250..1657)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:I6LXL1"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXL1"
FT                   /protein_id="AEG89918.1"
FT                   /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSNEWRFYLGNNKSPLFRNHQPRQAAREHAPRHHHTPDTVQPQ
FT                   PPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1566..2645)
FT                   /gene="AC1"
FT   CDS             complement(1566..2645)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:I6LXL0"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXL0"
FT                   /protein_id="AEG89917.1"
FT                   /translation="MPRAGRFQINARNYFITYPRCSLTKEEALSQLKALSYPTNIKFIR
FT                   VCRELHQDGVPHLHVLIQFEGKFQCTNPRFFDLISTSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKTEALQIIKEKDPKSFFLQFH
FT                   NITANADRIFQAPPQTYVSPFLSSSFTQVPEDIEVWVSENICSPAARPWRPISIVLEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLEEEKNQSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSEDQGRQT"
FT   gene            complement(2255..2488)
FT                   /gene="AC4"
FT   CDS             complement(2255..2488)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXL3"
FT                   /protein_id="AEG89920.1"
FT                   /translation="MGCLISMFSSSSKGSSNVPTQDSSISFPHPDQHISIRTFRELNHR
FT                   PMSKLILKREGNFLTMAFSRSMPEVQGGRASI"
XX
SQ   Sequence 2802 BP; 730 A; 562 C; 711 G; 799 T; 0 other;
     accggatggc cgcgcccgaa aaagcaggtg gaccccacag aatggccgcg cccgttaaag        60
     aaagtggtcc ccgcgcacgt gttttggtcg gccagtcata ttcacgcgtg aaagtctaga       120
     tatttgttgt tggtctttat agacttcgtc gcgaagtagg tggagcgcgt caacatgtgg       180
     gatccattgt tgaacgattt ccccgaaacc gttcacggtt tccgttctat gcttgctgtt       240
     aaatacctgt tacatctgga acagcaatac gatcgcggta ctgtcggggc tgagtatata       300
     cgggatctaa taggggttct acggtgtaag agttatgtcg aagcgaccag gagatataat       360
     aatctcaaca cccgtatcca aggtgcggag gaggctgaac ttcgacagcc catacacgaa       420
     ccgtgttgtt gcccccactg tccgcgtcac cagaagcaaa atatggacca acaggcccat       480
     gtatcggaag cccaagatgt acagaatgta tcgaagccca gatgtcccta agggctgtga       540
     aggcccatgt aaggttcagt cctatgaaca gagggatgat gttaagcaca cgggtatggt       600
     tcgatgtgtc agtgatgtta ctcgtgggtc aggtattacg catagagtcg ggaagaggtt       660
     ttgtgttaag tccatatata tattgggcaa gatctggatg gatgagaata tcaagaagca       720
     aaatcatacg aaccatgtta tgttcttcct tgttcgagat agaaggcctt atggtccgag       780
     tcctcaagat tttggacaag tgttcaacat gtttgataat gaacctacta ctgcaactgt       840
     gaaaaatgat cttagggacc ggtatcaggt gttacgtaaa ttctatgcga ctgttgttgg       900
     tggaccctct gggatgaagg aacaagctct ggttaagagg ttttttagga tcaataatca       960
     tgtagtgtat aatcatcagg aacaggccaa gtatgagaat catactgaga atgcgttgtt      1020
     attgtatatg gcatgtacac atgcctcgaa tcctgtgtac gcgacgctga aaatacgcat      1080
     ctatttctat gatgcagtga caaattaata aaggttgaat tttattgcat gttgctctgt      1140
     aacttggagc gtgtttagta atacatcgta cagaacatga tcaacagcct gaagtacaat      1200
     gttaatggaa ataacgccta tcatatctaa atacttgagc acttgatatc taaatacttt      1260
     taagaaacga ccagtctgag gccgtaaggt cgtccagacc ttgaagttga gaaaacactt      1320
     gtgaatcccc aatgccttcc ggaggttgtg gttgaaccgt atccggagtg tgatgatgtc      1380
     gtggtgcatg ttccctggcc gcttgtcgtg gttggtgatt tcgaaataga ggggatttgt      1440
     tatttcccag gtaaaaacgc cattcgttgc ttgaggcgca gtgatgagtt cccctgtgcg      1500
     agaatccatg gttgatgcag tcgatatgga gatagaacga gcagccgcat tcgaggtcta      1560
     cccgcctacg tctgacggcc ctggtcttcg ctgtgcggtg ttggactttg atgggcacta      1620
     gagaacaatg gctcgtggag ggtgatgaag gtggcattct ttaaagccca ggctttaagg      1680
     gactggttct tttcctcctc cagaaactct ttatatgatg atgttggtcc aggattgcag      1740
     aggaagatag tgggaatgcc gcctttaatt tgaattggct tcccgtactt tgtattgctt      1800
     tgccagtctc tttgggcccc catgaattct ttgaaatgct ttagatagtg cgggtctacg      1860
     tcgtcaatga cgttgtacca tgcgtcgttt gaatatacct ttggagacag atccaggtgt      1920
     ccacatagat aattatgggg tcccagtgaa cgagcccaca tggttttccc tgttcggcta      1980
     tcaccttcga gaacaatact gatcggtctc catggccgcg cagcgggact gcatatattt      2040
     tctgataccc atacttctat gtcttcgggg acttgtgtaa atgatgatga taagaacgga      2100
     ctaacataag tttggggcgg agcctggaag attctatccg cgttagcagt tatgttatgg      2160
     aactgtaaaa aaaaggactt tggatctttt tctttaataa tctgaagagc ttctgtttta      2220
     gaagaagcat tcaacgcgtc tgcatatacc tgagctaaat gctggccctc cccccttgca      2280
     cttctggcat cgacctggaa aatgccatcg tcaagaaatt cccctccctt ttcaatataa      2340
     gctttgacat cggacgatga tttagctccc tgaatgttcg gatggaaatg tgttgatctg      2400
     gatgtggaaa tgagatcgaa gaatcttggg ttggtacatt ggaacttccc ttcgaactgg      2460
     atgagaacat ggagatgagg caccccatcc tgatgtagtt ctcggcaaac cctaatgaat      2520
     ttgatattcg tcgggtaaga aagggctttt aattgggaaa gtgcctcttc ctttgttaat      2580
     gagcatcggg gataggtaat gaaataattt ctggcattta tttgaaaacg accggctctc      2640
     ggcatatttg ctgtcgtttt ggatcggtgg acactcaaaa ctccagggga acggtggaat      2700
     ggtggacatt atataggatg tcccccaatg gcattcgtgt aaataggtag acttccattt      2760
     caaatttgaa tgtcgaatat tggcggccat ccgattaata tt                         2802
//