ID   JF909092; SV 1; circular; genomic DNA; STD; VRL; 2801 BP.
XX
AC   JF909092;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic virus-Kenya isolate
DE   Comoros:Grande-Comore:GC14E06:2008 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus-Kenya
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2801
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2801
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; 881d3bb47ffc0d9d63ef9868fda9ec97.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2801
FT                   /organism="East African cassava mosaic virus-Kenya"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Grande-Comore:GC14E06:2008"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Grande-Comore"
FT                   /lat_lon="11.82 S 43.29 E"
FT                   /collection_date="2008"
FT                   /db_xref="taxon:1229189"
FT   gene            174..539
FT                   /gene="AV2"
FT   CDS             174..539
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /db_xref="GOA:I6LXH9"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXH9"
FT                   /protein_id="AEG89886.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKNYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAQDVQNVSKPRCSEGL"
FT   gene            334..1107
FT                   /gene="AV1"
FT   CDS             334..1107
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:I6LXH8"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXH8"
FT                   /protein_id="AEG89885.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGQSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYTTVVGGPSGMKEQSLVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1104..1508)
FT                   /gene="AC3"
FT   CDS             complement(1104..1508)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:I6LXI2"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXI2"
FT                   /protein_id="AEG89889.1"
FT                   /translation="MDSRTGALITAPQAKNGVFTWAITNPLYFDITNHDTRPGNMNHDI
FT                   ITLQIRFNHNIRKALGIHKCFLHFKVWTTLRPPTGLFLRVFRYQVLKYLDMLGVISINT
FT                   VIQAVDHVLYDVLLNTLQVTEHHAIKFNLY"
FT   gene            complement(1249..1656)
FT                   /gene="AC2"
FT   CDS             complement(1249..1656)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:I6LXI1"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXI1"
FT                   /protein_id="AEG89888.1"
FT                   /translation="MPPSSPSTSHCSQVPIKVQHRTTKTRALRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRHHQPRHEAREHEPRHHHTPDTVQPQ
FT                   HPEGIGDSQVFSPLQGLDDLTASDWSFLKSI"
FT   gene            complement(1565..2644)
FT                   /gene="AC1"
FT   CDS             complement(1565..2644)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:I6LXI0"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXI0"
FT                   /protein_id="AEG89887.1"
FT                   /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLKTLSYPTNIKFIR
FT                   VCRELHQDGVPHLHVLIQFEGKFQCTNPRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDDGIFQVDARSARGEGQHLAQVYAEALNASSKSEALQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPPQTYVSPFLSSSFTQVPEDIEVWVSENICSPAARPWRPISIVLEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYTEFLAEEKNQSLKAWALKNATFVTLHEPL
FT                   FSSTHQSPTPHNEDQGPQT"
FT   gene            complement(2254..2487)
FT                   /gene="AC4"
FT   CDS             complement(2254..2487)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:I6LXI3"
FT                   /protein_id="AEG89890.1"
FT                   /translation="MGCLISMFSSNSKASSNVPTRDFSISFPHPDQHISIRTFRELNHR
FT                   PMSKLTLKREGNFLTMAFSKSMPEVRGGRANI"
XX
SQ   Sequence 2801 BP; 736 A; 549 C; 728 G; 788 T; 0 other;
     accggatggc cgcgcccgaa aaaagcaggt ggccccacaa gatggccgcg cccgttaaag        60
     aaagtggtcc ccgcgcactt gggtttgtcg gccagtcata ttcacgcgtg aaagtctaga       120
     tatttgttgt tggtctttat agacttcgtc gcgaagtagt ggagcgcgtc aacatgtggg       180
     atccattgtt gaacgatttt cccgaaaccg ttcacggttt tcgttctatg cttgctgtta       240
     aatacctgtt acatctggaa caggaatacg atcgcggtac tgtcggggcg gagtatatac       300
     gtgatttaat aggggttcta cggtgtaaga attatgtcga agcgaccagg agatataata       360
     atctcaacac ccgtatccaa ggtgcggagg aggctgaact tcgacagccc atacacgaac       420
     cgtgttgttg cccccactgt ccgcgtcacc agaagcaaaa tatgggccaa caggcccatg       480
     tatcggaagc ccaagatgta cagaatgtat cgaagcccag atgttccgaa gggctgtgaa       540
     ggcccatgta aggttcagtc ctatgaacag agggatgatg tgaagcacac tggtatggtc       600
     cgatgtgtca gtgatgttac tcgtggatca ggcataaccc atagagttgg gaagaggttt       660
     tgtgtgaagt ccatatatat attgggcaag atttggatgg atgagaatat caagaagcaa       720
     aatcatacga accatgttat gttcttcctg gttcgagata gaaggcctta tggtcagagt       780
     cctcaagatt tcggacaagt gttcaacatg tttgataatg aacctactac ggcaactgtg       840
     aagaatgatc ttagggatcg atatcaggtg ttacgtaaat tttatacgac tgttgttggt       900
     ggaccctctg ggatgaagga acaatctctg gttaagaggt tttttaggat caataatcat       960
     gtagtgtata atcatcagga acaggccaag tatgagaacc atactgagaa tgcgttgtta      1020
     ttgtatatgg catgtacaca tgcctcaaat cctgtgtacg ctactctgaa aatacgcatc      1080
     tatttctatg atgcagtgac aaattaataa aggttgaatt ttattgcatg gtgctccgta      1140
     acttggagtg tgtttagtaa tacatcgtac agaacatgat caacagcttg aatgacagtg      1200
     ttaatggaaa taacgccaag catatctaaa tacttgagca cctgatatct aaatactctt      1260
     aagaaaagac cagtcggagg ccgtaaggtc gtccagacct tgaagtggag aaaacacttg      1320
     tgaatcccca atgccttccg gatgttgtgg ttgaaccgta tctggagtgt gatgatgtcg      1380
     tggttcatgt tccctggcct cgtgtcgtgg ttggtgatgt cgaaatagag gggatttgtt      1440
     attgcccagg taaaaacgcc attctttgct tgaggcgcag tgatgagtgc ccctgtgcga      1500
     gaatccatgg ttgatgcagt cgatatggag atagaacgag cagccgcatt cgaggtctac      1560
     ccgcctacgt ctgagggccc tggtcttcgt tgtgcggtgt tggactttga tgggtacttg      1620
     agaacaatgg ctcgtggagg gtgacgaagg tggcattctt taaagcccag gctttaaggg      1680
     actggttctt ttcttcggcc agaaactctg tatatgatga tgttggtcct ggattgcaga      1740
     ggaagatagt gggaatgccg cctttaattt gaattggctt cccgtacttg gtattgcttt      1800
     gccagtccct ttgggccccc atgaattctt tgaagtgctt tagataatgc gggtctacgt      1860
     cgtcaatgac gttgtaccat gcgtcgtttg aatatacctt tggagacaga tctaggtgtc      1920
     cacatagata attatggggt cccagtgaac gagcccacat ggttttcccg gttcggctat      1980
     caccttctag aacaatactg atcggtctcc atggccgcgc agcgggactg catatatttt      2040
     cggataccca tacctctatg tcctccggga cttgtgtaaa tgaggatgat aagaacggac      2100
     taacgtaagt ttgtggcgga gcctggaaga tgcgatctgc gttagcagat atgttatgga      2160
     actgtaaaaa aaaggacttt ggatcctttt ctttaataat ttgaagagct tctgatttag      2220
     aagaagcatt caacgcttct gcatatacct gagctaaatg ttggccctcc cccctcgcac      2280
     ttcgggcatc gacttggaaa atgccatcgt caagaaattc ccctcccttt tcaatgtaag      2340
     ctttgacatc ggacgatgat ttagctccct gaatgttcgg atggaaatgt gttgatcggg      2400
     atggggaaat gagatcgaaa aatcgcgggt tggtacattg gaacttgcct tcgaattgga      2460
     tgagaacatg gagatgaggc accccatcct gatgtagttc tctgcaaacc ctaatgaatt      2520
     tgatattcgt cgggtaagaa agggttttta attgggaaag ggcctcttcc ttggttaatg      2580
     aacatcgggg ataagttatg aaataatttt tggcatttat ttgaaaacga ccggctcttg      2640
     gcatatttgc tgtcgtttag gatcggggga cactcaaaac tccaggggaa cggtggaacg      2700
     gggggcatta tataggatgt cccccaatgg catatgtgta aataggtaga cttccatttg      2760
     aatttcgaat gtcgaatatt ggcggccatc cgattaatat t                          2801
//