ID JF909116; SV 1; circular; genomic DNA; STD; VRL; 2798 BP. XX AC JF909116; XX DT 21-JUN-2012 (Rel. 113, Created) DT 05-DEC-2012 (Rel. 115, Last updated, Version 3) XX DE East African cassava mosaic Kenya virus isolate DE Comoros:Grande-Comore:GC40BR2:2009 segment DNA-A, complete sequence. XX KW . XX OS East African cassava mosaic Kenya virus OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RC Publication Status: Online-Only RP 1-2798 RX DOI; 10.1186/1471-2148-12-228. RX PUBMED; 23186303. RA De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M., RA Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., RA Harkins G.W., Varsani A., Martin D.P., Lett J.M.; RT "East African cassava mosaic-like viruses from Africa to Indian ocean RT islands: molecular diversity, evolutionary history and geographical RT dissemination of a bipartite begomovirus"; RL BMC Evol. Biol. 12(1):228-228(2012). XX RN [2] RP 1-2798 RA Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M., RA Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P., RA Lett J.-M.; RT ; RL Submitted (24-MAR-2011) to the INSDC. RL UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France XX DR MD5; 3a95333980589c6cd937b7f8c76d56e0. XX FH Key Location/Qualifiers FH FT source 1..2798 FT /organism="East African cassava mosaic Kenya virus" FT /segment="DNA-A" FT /host="Manihot esculenta (cassava)" FT /isolate="Comoros:Grande-Comore:GC40BR2:2009" FT /mol_type="genomic DNA" FT /country="Comoros:Grande-Comore" FT /lat_lon="11.82 S 43.29 E" FT /collection_date="2009" FT /db_xref="taxon:393599" FT gene 174..539 FT /gene="AV2" FT CDS 174..539 FT /codon_start=1 FT /gene="AV2" FT /product="movement protein" FT /db_xref="GOA:I6LXU9" FT /db_xref="InterPro:IPR002511" FT /db_xref="InterPro:IPR005159" FT /db_xref="UniProtKB/TrEMBL:I6LXU9" FT /protein_id="AEG90030.1" FT /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL FT IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS FT EAQDVQNVSKPRCSEGL" FT gene 334..1107 FT /gene="AV1" FT CDS 334..1107 FT /codon_start=1 FT /gene="AV1" FT /product="coat protein" FT /db_xref="GOA:I6LXX2" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="UniProtKB/TrEMBL:I6LXX2" FT /protein_id="AEG90029.1" FT /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA FT NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHR FT VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGQSPQDFGQVFNMFDNE FT PTTATVKNDLRDRYQVLRKFYTTVVGGPSGMKEQSLVKRFFRINNHVVYNHQEQAKYEN FT HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN" FT gene complement(1104..1508) FT /gene="AC3" FT CDS complement(1104..1508) FT /codon_start=1 FT /gene="AC3" FT /product="replication enhancer" FT /db_xref="GOA:I6LXX6" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:I6LXX6" FT /protein_id="AEG90033.1" FT /translation="MDSRTGELITAPQAKNGVFTWEITNPLYFDITNHDRRPGNMNHDI FT ITFQIRFNHNIRKALGIHKCFLNFKVWTTLRPPTGLFLKVFRYQVLKYLDMIGVISINT FT VIQAVDHVLYNVLLNTLQVTEQHAIKFNLY" FT gene complement(1249..1656) FT /gene="AC2" FT CDS complement(1249..1656) FT /codon_start=1 FT /gene="AC2" FT /product="transcription activator protein" FT /db_xref="GOA:I6LY29" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:I6LY29" FT /protein_id="AEG90032.1" FT /translation="MPPSSPSTSHCSQVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI FT DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRHHQPRQETREHEPRHHHIPDTVQPQ FT HPEGIGDSQMFSQLQGLDDLTASDWSFLKSI" FT gene complement(1580..2644) FT /gene="AC1" FT CDS complement(1580..2644) FT /codon_start=1 FT /gene="AC1" FT /product="replication associated protein" FT /db_xref="GOA:I6LXX4" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:I6LXX4" FT /protein_id="AEG90031.1" FT /translation="MPRAGRFSIKAKNYFLTYPKCSLSKEEALDQLRQLQTPTNKLFIK FT ICRELHENGEPHLHALIQFEGKYNCTNQRFFDLISPSRSAHFHPNIQGAKSSSDVKSYL FT DKDGDTIQWGEFQIDGRSARGGQQSANDAYAKALNSANKSEALNVIRELAPKDFVLQFH FT NLNSNLERIFQEPLTPYISPFLSSSFTNVPEELEAWVSENVMGSAARPWRPSSIVIEGD FT SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNIIDDVDPHYLKHFKEFMGAQRDW FT QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSLKAWAIKNATFITLHEPL FT FSSAHQSPTPHSED" FT gene complement(2197..2736) FT /gene="AC4" FT CDS complement(2197..2736) FT /codon_start=1 FT /gene="AC4" FT /product="C4 protein" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:I6LXX7" FT /protein_id="AEG90034.1" FT /translation="MPFGGHHINCPPIPPIATDFECPPIAIRQRKCQGLVVLASKPKTI FT SSHIPNVLYRKKRPWISSDSSKPQQINCSSRSAESSMKMGNLICMRSFSSKASTIVPTN FT DSSTSYHLPGQPISTQIFRELNQAPTSSPIWIRTETPSNGASFRSTDDLLEADNNPPMT FT LTPRLLTQQISQRLLM" XX SQ Sequence 2798 BP; 726 A; 557 C; 718 G; 797 T; 0 other; accggatggc cgcgcccgaa aaaagcaggt ggccccacaa gatggccgcg cccgttaaag 60 aaagtggtcc ccgcgcactt gtgttggtcg gccagtcata ttcacgcgtg aaagtctaga 120 tatttgttgt ttgtctttat agacttcgtc gcgaagtagt ggagcgcgtc aacatgtggg 180 atccattgtt gaacgatttt cccgaaaccg ttcacggttt tcgttctatg cttgctgtta 240 aatacctgtt acatctggaa caggaatacg atcgcggtac tgtcggggcg gagtatatac 300 gtgatttaat aggggttcta cggtgtaaga gttatgtcga agcgaccagg agatataata 360 atctcaacac ccgtatccaa ggtgcggagg aggctgaact tcgacagccc atacacgaac 420 cgtgttgttg cccccactgt ccgcgtcacc agaagcaaaa tatgggccaa caggcccatg 480 tatcggaagc ccaagatgta cagaatgtat cgaagcccag atgttccgaa gggctgtgaa 540 ggcccatgta aggttcagtc ctatgaacag agggatgatg tgaagcacac tggtatggtc 600 cgatgtgtta gtgatgttac tcgtggatca ggcattaccc atagagtcgg gaagaggttt 660 tgtgtgaagt ccatatatat attgggcaag atttggatgg atgagaatat caagaagcaa 720 aatcatacga accatgttat gttcttcctt gttcgagata gaaggcctta tggtcagagt 780 ccgcaagatt ttggacaagt gttcaacatg tttgataatg aacctactac ggcaactgtg 840 aagaatgatc ttagggaccg atatcaggtg ttacgtaaat tttatacgac tgttgttggt 900 ggaccctctg ggatgaagga acaatctctg gttaagaggt tttttaggat caataatcat 960 gtagtgtata atcatcagga acaggccaag tatgagaacc atactgagaa tgcgttgtta 1020 ttgtatatgg catgtacaca tgcctcgaat cctgtgtacg ctacgctgaa aatacgcatc 1080 tatttctatg atgcagtgac aaattaataa aggttgaatt ttattgcatg ttgctccgta 1140 acttggagtg tgtttagtaa tacattgtac aaaacatgat caacagcttg aattacagtg 1200 ttaatggaaa taacgcctat catatctaaa tacttgagca cttgatatct aaatactttt 1260 aagaaaagac cagtcggagg ccgtaaggtc gtccagacct tgaagttgag aaaacatttg 1320 tgaatcccca atgccttccg gatgttgtgg ttgaaccgta tctggaatgt gatgatgtcg 1380 tggttcatgt tccctggtct cctgtcgtgg ttggtgatgt cgaaatagag gggatttgtt 1440 atttcccagg taaaaacgcc attctttgct tgaggcgcag tgatgagttc ccctgtgcga 1500 gaatccatga ttgatgcagt cgatatggag atagaacgag cagccgcatt cgaggtctac 1560 ccgcctacgt ctgacggccc tagtcttcgc tgtgcggtgt tggactttga tgggcacttg 1620 agaacaatgg ctcgtggagg gtgatgaagg tggcattctt tatagcccag gctttaaggg 1680 actggttctt ttcctcgtcc agaaactctt tatatgatga tgttggtcct ggattgcata 1740 ggaagatagt gggaatgccg cctttaattt gaattggctt cccgtacttt gtattgcttt 1800 gccagtccct ttgggccccc atgaattctt tgaaatgctt gaggtagtgg gggtcgacgt 1860 catcaatgat gttgtaccat gcgtcgttac tgtatacctt tggactgaga tccaggtgtc 1920 cacacaagta attatgtggt cccaaagagc gagcccacat tgtcttccct gtcctactat 1980 ctccctcgat gacgatacta ctcggtctcc atggccgcgc agcggaaccc atcacgttct 2040 cggaaaccca ggcttcaagt tcctcaggaa cgttagtgaa agaagaagaa agaaagggag 2100 aaatataagg agtgagaggc tcttgaaaaa tcctctctaa attgctattt aaattatgaa 2160 actgtaaaac aaaatctttt ggggctagtt cccgtattac attaagagcc tctgacttat 2220 ttgctgagtt aagagccttg gcgtaagcgt cattggcgga ttgttgtccg cctcgagcag 2280 atcgtccgtc gatctgaaac tcgccccatt ggatggtgtc tccgtcctta tccagatagg 2340 acttgacgtc ggagcttgat ttagctccct gaatatttgg gtggaaatgg gctgaccggg 2400 aaggtgatat gaggtcgaag aatcgttggt tggtacaatt gtacttgcct tcgaactgaa 2460 tgagcgcatg cagatgaggt tccccatttt catggagctc tctgcagatc ttgatgaaca 2520 atttatttgt tggggtttgg agctgtcgga gctgatccaa ggcctcttct ttcgatagag 2580 aacatttggg atatgtgagg aaatagtttt tggctttgat gctaaaacga ccagcccttg 2640 gcatttgcgc tgtcgtatag caatcggggg gcactcaaaa tctgtagcaa tcgggggaat 2700 gggggggcaa tttatatgat gccccccaaa tggcatttat gtaatatcct catgaaattt 2760 gaatttcaaa cgtggaaagc ggccatccgt ataatatt 2798 //