ID JF909121; SV 1; circular; genomic DNA; STD; VRL; 2801 BP. XX AC JF909121; XX DT 21-JUN-2012 (Rel. 113, Created) DT 05-DEC-2012 (Rel. 115, Last updated, Version 3) XX DE East African cassava mosaic virus-Kenya isolate DE Comoros:Grande-Comore:GC47BZ2:2009 segment DNA-A, complete sequence. XX KW . XX OS East African cassava mosaic virus-Kenya OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RC Publication Status: Online-Only RP 1-2801 RX DOI; 10.1186/1471-2148-12-228. RX PUBMED; 23186303. RA De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M., RA Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., RA Harkins G.W., Varsani A., Martin D.P., Lett J.M.; RT "East African cassava mosaic-like viruses from Africa to Indian ocean RT islands: molecular diversity, evolutionary history and geographical RT dissemination of a bipartite begomovirus"; RL BMC Evol. Biol. 12(1):228-228(2012). XX RN [2] RP 1-2801 RA Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M., RA Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P., RA Lett J.-M.; RT ; RL Submitted (24-MAR-2011) to the INSDC. RL UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France XX DR MD5; d7a59eaaebbbd695c8c4ad978b85ea5e. XX FH Key Location/Qualifiers FH FT source 1..2801 FT /organism="East African cassava mosaic virus-Kenya" FT /segment="DNA-A" FT /host="Manihot esculenta (cassava)" FT /isolate="Comoros:Grande-Comore:GC47BZ2:2009" FT /mol_type="genomic DNA" FT /country="Comoros:Grande-Comore" FT /lat_lon="11.76 S 43.25 E" FT /collection_date="2009" FT /db_xref="taxon:1229189" FT gene 174..539 FT /gene="AV2" FT CDS 174..539 FT /codon_start=1 FT /gene="AV2" FT /product="movement protein" FT /db_xref="GOA:I6LXZ7" FT /db_xref="InterPro:IPR002511" FT /db_xref="InterPro:IPR005159" FT /db_xref="UniProtKB/TrEMBL:I6LXZ7" FT /protein_id="AEG90060.1" FT /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL FT IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS FT EAQDVQNVSKPRCSEGL" FT gene 334..1107 FT /gene="AV1" FT CDS 334..1107 FT /codon_start=1 FT /gene="AV1" FT /product="coat protein" FT /db_xref="GOA:I6LXZ6" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="UniProtKB/TrEMBL:I6LXZ6" FT /protein_id="AEG90059.1" FT /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA FT NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR FT VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGQSPQDFGQVFNMFDNE FT PTTATVKNDLRDRYQVLRKFYTTVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN FT HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN" FT gene complement(1104..1508) FT /gene="AC3" FT CDS complement(1104..1508) FT /codon_start=1 FT /gene="AC3" FT /product="replication enhancer" FT /db_xref="GOA:I6LY06" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:I6LY06" FT /protein_id="AEG90063.1" FT /translation="MDSRTGELITAPQATNGVFTWEITNPLYFEITNHDKRPGNMNHDI FT ITLQIRFNHTLRKALGIHKCFLNFKVWTTLRPQTGRFLKVFRYQVLKYLDMIGVISINT FT VLQAVDHVLYDVLLNTLQVTEQHAIKFNLY" FT gene complement(1249..1656) FT /gene="AC2" FT CDS complement(1249..1656) FT /codon_start=1 FT /gene="AC2" FT /product="transcription activator protein" FT /db_xref="GOA:I6LY05" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:I6LY05" FT /protein_id="AEG90062.1" FT /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAVRRRRVDLACGCSFYLHI FT DCINHGFSHRGTHHCASSNEWRFYLGNNKSPLFRNHQPRQAAREHEPRHHHTPDTVQPH FT PPEGTGDSQVFSQLQGLDDLTASDWSFLKSI" FT gene complement(1565..2644) FT /gene="AC1" FT CDS complement(1565..2644) FT /codon_start=1 FT /gene="AC1" FT /product="replication associated protein" FT /db_xref="GOA:I6LY04" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:I6LY04" FT /protein_id="AEG90061.1" FT /translation="MPRAGRFQINARNYFITYPRCSLTKEEALSQLKALSYPTNIKFIR FT ICRELHQDGVPHLHVLIQFEGKFQCTNPRFFDLISTSRSTHFHPNIQGAKSSSDVKAYI FT EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKTEALQIIKEKDPKSFFLQFH FT NISANADRIFQAPAQTYVSPFLSSSFTQVPEDIEVWVSENICRPAARPWRPISIVLEGD FT SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDAHPHYLKHFTEFMGAQRDW FT QSNTKYGKPIQIKGGIPTIFLCYPGPTSSYKEFLDEEKNQSLKAWALKNATFITLHEPL FT FSSAHQSPTPHSEDQGRQT" FT gene complement(2254..2487) FT /gene="AC4" FT CDS complement(2254..2487) FT /codon_start=1 FT /gene="AC4" FT /product="C4 protein" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:I6LYH5" FT /protein_id="AEG90064.1" FT /translation="MGCLISMFSSSSKGSSNVPTQDSSISFPHPDPHISIRTFRELNHR FT PMSKLILKREGNFLTMEFSRSMPEVQGGRASI" XX SQ Sequence 2801 BP; 729 A; 559 C; 722 G; 791 T; 0 other; accggatggc cgcgcccgaa aaagcaggtg gaccccacaa gatggccgcg cccgtgaaag 60 aaagtggtcc ccgcgcactt gtgttggtcg gccagtcata ttcacgcgtg aaagtctaga 120 tatttgttgt ttgtctttat agacttcgtc gcgaagtagt ggagcgcgtc aacatgtggg 180 atccattgtt gaacgatttc cccgaaaccg ttcacggttt ccgttctatg cttgctgtta 240 aatacctgtt acatctggaa caggaatacg atcgcggtac tgtcggggcg gagtatatac 300 gtgatttaat aggggttcta cggtgtaaga gttatgtcga agcgaccagg agatataata 360 atctcaacac ccgtatccaa ggtgcggagg aggctgaact tcgacagccc atacacgaac 420 cgtgttgttg cccccactgt ccgcgtcacc agaagcaaaa tatgggccaa caggcccatg 480 tatcggaagc ccaagatgta cagaatgtat cgaagcccag atgttccgaa gggctgtgaa 540 ggcccatgta aggttcagtc ctatgaacag agggatgatg tgaagcacac tggtatggtc 600 cgatgtgtca gtgatgttac tcgtggatca ggcattaccc atagagtcgg gaagaggttt 660 tgtgtgaagt ccatatatat attgggcaag atttggatgg atgagaatat caagaagcaa 720 aatcatacga accatgttat gttcttcctt gttcgagata gaaggcctta tggtcagagt 780 cctcaagatt ttggacaagt gttcaacatg tttgataatg aacctactac ggcaactgtg 840 aagaatgatc ttcgggaccg atatcaggtg ttacgtaaat tctatacgac tgttgtaggt 900 ggaccctctg ggatgaagga acaagctctg gttaagaggt tttttaggat caataatcat 960 gtagtgtata atcatcagga acaggccaag tatgagaatc atactgagaa tgcgttgtta 1020 ttgtatatgg catgtacaca tgcctcgaat cctgtgtacg ctacgctgaa aatacgcatc 1080 tatttctatg atgcagtgac aaattaataa aggttgaatt ttattgcatg ttgctccgta 1140 acttggagcg tgtttagtaa tacatcgtac agaacatgat caacagcctg aagtacagtg 1200 ttaatggaaa taacgcctat catatctaaa tacttgagca cttgatatct aaatactttt 1260 aagaaacgac cagtctgagg ccgtaaggtc gtccagacct tgaagttgag aaaacacttg 1320 tgaatcccca gtgccttccg gagggtgtgg ttgaaccgta tctggagtgt gatgatgtcg 1380 tggttcatgt tccctggccg cttgtcgtgg ttggtgattt cgaaatagag gggatttgtt 1440 atttcccagg taaaaacgcc attcgttgct tgaggcgcag tgatgagttc ccctgtgcga 1500 gaatccatgg ttgatgcagt cgatatggag atagaacgag cagccgcatg cgaggtctac 1560 ccgcctacgt ctgacggccc tggtcttcgc tgtgcggtgt tggactttga tgggcactag 1620 agaacaatgg ctcgtggagg gtgatgaagg tggcattctt taaagcccag gctttaaggg 1680 actggttctt ttcctcgtcc agaaactctt tatatgatga tgttggtcca ggatagcaga 1740 ggaagatagt gggaataccg cctttaattt gaattggctt cccgtacttt gtattgcttt 1800 gccagtctct ttgggccccc atgaattctg tgaaatgctt tagatagtgc gggtgtgcgt 1860 cgtcaatgac gttgtaccat gcgtcgtttg aatatacctt tggagacaga tccaggtgtc 1920 cacatagata attatggggt cccagtgaac gagcccacat ggttttccct gttcggctat 1980 caccttcgag aacaatactg atcggtctcc atggccgcgc agcgggacgg catatatttt 2040 ctgagaccca tacttctatg tcttcgggga cttgtgtaaa tgatgatgat aagaacggac 2100 taacataagt ttgggccgga gcctggaaga ttctatccgc gttagcagat atgttgtgga 2160 actgtaaaaa aaaggacttt ggatcttttt ctttaataat ctgaagagct tctgttttag 2220 aagaagcatt caacgcgtcg gcatatacct gagctaaatg ctggccctcc ccccttgcac 2280 ttctggcatc gacctggaaa attccatcgt caagaaattc ccctcccttt tcaatataag 2340 ctttgacatc ggacgatgat ttagctccct gaatgttcgg atggaaatgt gtggatctgg 2400 atgtggaaat gagatcgaag aatcttgggt tggtacattg gaacttccct tcgaactgga 2460 tgagaacatg gagatgaggc accccatcct gatgtagttc tcggcaaatc ctgatgaatt 2520 tgatattcgt cgggtaagaa agggctttta attgggaaag tgcctcttcc tttgttaatg 2580 agcatcgggg ataggtaatg aaataatttc tggcgtttat ttgaaaacga ccggctctcg 2640 gcatatttgc tgtcgttttg tatcggggga cactcaaaac tccaggggaa cggtggaatg 2700 gtggacatta tataggatgt cccccaatgg cattcgtgta aataggtaga cttccatttc 2760 aaatttgaat gtcgaatatt ggcggccatc cgattaatat t 2801 //