ID   JF909073; SV 1; circular; genomic DNA; STD; VRL; 2801 BP.
XX
AC   JF909073;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic virus-Kenya isolate
DE   Comoros:Anjouan:AJ18BW3:2009 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus-Kenya
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2801
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2801
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; 6da3002c39adb1ca35a455d00c0b6945.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2801
FT                   /organism="East African cassava mosaic virus-Kenya"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Anjouan:AJ18BW3:2009"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Anjouan"
FT                   /lat_lon="12.28 S 44.41 E"
FT                   /collection_date="2009"
FT                   /db_xref="taxon:1229189"
FT   gene            174..530
FT                   /gene="AV2"
FT   CDS             174..530
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /db_xref="GOA:I6LX65"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX65"
FT                   /protein_id="AEG89772.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYYNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAQDVQNVSKPRCP"
FT   gene            334..1107
FT                   /gene="AV1"
FT   CDS             334..1107
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:I6LX04"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX04"
FT                   /protein_id="AEG89771.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYATVIGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1104..1508)
FT                   /gene="AC3"
FT   CDS             complement(1104..1508)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:I6LX68"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX68"
FT                   /protein_id="AEG89775.1"
FT                   /translation="MDSRTGELITAPQAKNGVFTWVITNPLYFEITNHDKRPGNMNHDI
FT                   ITLQIRFNHNLRKALAIHKCFLNFKVWTTLRPPTGLFLRVFRYQVLKYLDMIGVISINT
FT                   VIRAVDHVLYAVLLNTLQVTEQHAIKFNIY"
FT   gene            complement(1249..1656)
FT                   /gene="AC2"
FT   CDS             complement(1249..1656)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:I6LX67"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX67"
FT                   /protein_id="AEG89774.1"
FT                   /translation="MPPSSPSTSHCSQVPIKVQHRTAKTRALRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRNHQPRQEAREHEPRHHHTPDTFQPQ
FT                   PPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1565..2644)
FT                   /gene="AC1"
FT   CDS             complement(1565..2644)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:I6LX66"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX66"
FT                   /protein_id="AEG89773.1"
FT                   /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLKALSYPTNIKFIR
FT                   VCRELHQDGVPHLHVLIQFEGKFQCTNQRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKSEALQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPPQTYVSPFLSSSFTQVFDALEVWVSENICSPAARPWRPIRIVLEGD
FT                   SLTGKTMWARLLGPHNYLCGHLDLSPKVYSNDPWYNDIDDVDPHYLKHFKDLMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLEEEKNQSLKAWALKNATFVTLHEPL
FT                   FSSAHQSPTPHSEDQGPQT"
FT   gene            complement(2254..2487)
FT                   /gene="AC4"
FT   CDS             complement(2254..2487)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX69"
FT                   /protein_id="AEG89776.1"
FT                   /translation="MGCLISMFSSNSKASSNVPTSDSSISFPHPDQHISIRTFRELNRR
FT                   PMSKLTLKREGNFLTMEFSKSMPEVRGARASI"
XX
SQ   Sequence 2801 BP; 732 A; 554 C; 731 G; 784 T; 0 other;
     accggatggc cgcgcccgaa aaagcaggtg gaccccacca gatggccgcg cccgtgaaag        60
     aaagtggtcc ccgcgcacgt gtttcggtcg gccagtcata tttacgcgtg aaagtctaga       120
     tatttgtggt ttgtctttat agacttcgtc gcgaagtagt gaagcgcgtc aacatgtggg       180
     atccattatt gaacgatttc cctgaaaccg ttcacggttt ccgttctatg cttgctgtta       240
     aatacctgtt acatctggaa caggaatacg atcgcggtac tgtcggggct gagtatatac       300
     gggatctaat aggggtgcta cggtgtaaga gttatgtcga agcgaccagg agatattata       360
     atctcaacac ccgtatccaa ggtgcggagg aggctgaact tcgacagccc atacacgaac       420
     cgtgttgttg cccccactgt ccgcgtcacc agaagcaaaa tatgggccaa caggcccatg       480
     tatcggaagc ccaagatgta cagaatgtat cgaagcccag atgtccctaa gggctgtgaa       540
     ggcccatgta aggttcagtc gtatgaacag agggatgatg ttaagcacac tggtatggtc       600
     cgatgtgtca gtgatgttac tcgtgggtca ggcattaccc atagagtcgg gaagaggttt       660
     tgtgtgaagt ccatatatat attgggcaag atctggatgg atgagaatat caagaagcaa       720
     aatcatacga accatgttat gttcttcctc gttcgagata gaaggccgta tggtccgagc       780
     ccgcaagatt ttggacaagt gttcaacatg tttgataatg aacctactac tgcaacggtt       840
     aagaatgatc ttagggaccg gtatcaggtg ttacgtaaat tctatgcgac tgtgattggt       900
     ggaccctccg ggatgaagga acaagcgctg gttaagaggt tttttaggat caataatcat       960
     gtagtgtata atcatcagga acaggccaag tatgagaatc ataccgagaa tgcgttgtta      1020
     ttgtatatgg catgtacaca tgcctcaaat ccggtgtacg ctactttgaa aatacgcatc      1080
     tatttctatg atgcagtgac aaattaataa atgttgaatt ttattgcatg ttgctccgta      1140
     acttggagtg tgtttagtaa tacagcgtac agaacatgat caacagctct aattacagtg      1200
     ttaatggaaa taacgcctat catatctaaa tatttgagca cttgatatct aaatactctt      1260
     aagaaaagac cagtcggagg ccgtaaggtc gtccagacct tgaagttgag aaaacacttg      1320
     tgaatcgcca atgccttccg gaggttgtgg ttgaaacgta tctggagtgt gatgatgtcg      1380
     tggttcatgt tccctggcct cttgtcgtgg ttggtgattt cgaaatagag gggatttgtt      1440
     attacccagg taaaaacgcc attctttgct tgaggcgcag tgatgagttc ccctgtgcga      1500
     gaatccatgg ttgatgcagt cgatatggag atagaacgag cagccgcatt cgaggtctac      1560
     ccgcctacgt ctgagggccc tggtcttcgc tgtgcggtgt tggactttga tgggcacttg      1620
     agaacaatgg ctcgtggagg gtgacgaagg tggcattctt taaagcccag gctttaaggg      1680
     attgattctt ttcctcttcc agaaactctt tatatgatga tgttggtcct ggattgcaga      1740
     ggaagatagt gggaatgccg cctttaattt gaattggctt cccgtacttt gtattgcttt      1800
     gccagtccct ttgggccccc atgagatctt tgaagtgttt gaggtagtgg gggtcgacgt      1860
     catcaatgtc gttgtaccat gggtcgtttg aatatacctt tggagacaga tccaggtgtc      1920
     cacatagata attatggggt cccagtaaac gagcccacat ggttttcccg gtcaggctat      1980
     cgccttcgag aacaattctg atcggtctcc atggccgcgc agcgggactg catatatttt      2040
     cggataccca tacctctagt gcgtcgaaga cttgtgtaaa tgatgatgat aagaacggac      2100
     taacgtaagt ttgtggcgga gcctggaaga ttctatctgc gttagcagat atgttatgga      2160
     actgtaaaaa aaaggacttg ggatcttttt ctttgataat ttgaagagct tctgatttag      2220
     aagaagcatt taacgcgtcg gcatatacct gagctaaatg ctggccctcg cccctcgcac      2280
     ttcgggcatc gacttggaaa attccatcgt caagaaattc ccctcccttt tcaatgtaag      2340
     ctttgacatc ggacgacgat ttagctccct gaatgttcgg atggaaatgt gttgatcggg      2400
     atggggaaat gagatcgaag aatcgctggt tggtacattg gaacttgcct tcgaattgga      2460
     tgagaacatg gagatgaggc accccatcct gatgtagttc tctgcaaacc ctaatgaatt      2520
     tgatattcgt cgggtaagaa agggctttta attgggaaag ggcctcttcc ttggttaatg      2580
     agcatcgggg ataggtgatg aaataatttt tggcatttat ttgaaaacga ccggctcttg      2640
     gcatatttgc tgtcgtttag gatcggggga cactcaaaac tccaggggaa tggtggaacg      2700
     gggggcaata tatatgatgt cccccaatgg catatgtgta aataagtcga cctccattca      2760
     aaatttgaat tgcgaatatt ggcggccatc cgattaatat t                          2801
//