ID   JF909064; SV 1; circular; genomic DNA; STD; VRL; 2802 BP.
XX
AC   JF909064;
XX
DT   21-JUN-2012 (Rel. 113, Created)
DT   05-DEC-2012 (Rel. 115, Last updated, Version 3)
XX
DE   East African cassava mosaic virus-Kenya isolate
DE   Comoros:Anjouan:AJ02B00:2004 segment DNA-A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus-Kenya
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2802
RX   DOI; 10.1186/1471-2148-12-228.
RX   PUBMED; 23186303.
RA   De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M.,
RA   Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B.,
RA   Harkins G.W., Varsani A., Martin D.P., Lett J.M.;
RT   "East African cassava mosaic-like viruses from Africa to Indian ocean
RT   islands: molecular diversity, evolutionary history and geographical
RT   dissemination of a bipartite begomovirus";
RL   BMC Evol. Biol. 12(1):228-228(2012).
XX
RN   [2]
RP   1-2802
RA   Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M.,
RA   Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P.,
RA   Lett J.-M.;
RT   ;
RL   Submitted (24-MAR-2011) to the INSDC.
RL   UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France
XX
DR   MD5; f13a0dba1602bcf2da1403db00a9f651.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2802
FT                   /organism="East African cassava mosaic virus-Kenya"
FT                   /segment="DNA-A"
FT                   /host="Manihot esculenta (cassava)"
FT                   /isolate="Comoros:Anjouan:AJ02B00:2004"
FT                   /mol_type="genomic DNA"
FT                   /country="Comoros:Anjouan"
FT                   /lat_lon="12.13 S 44.42 E"
FT                   /collection_date="2004"
FT                   /db_xref="taxon:1229189"
FT   gene            175..531
FT                   /gene="AV2"
FT   CDS             175..531
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="movement protein"
FT                   /db_xref="GOA:I6LZ01"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:I6LZ01"
FT                   /protein_id="AEG89718.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   EAEAVQNVSKPRCP"
FT   gene            335..1108
FT                   /gene="AV1"
FT   CDS             335..1108
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:I6LZ78"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:I6LZ78"
FT                   /protein_id="AEG89717.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKLYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHIGMVRCVSDVTRGSGITHR
FT                   VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE
FT                   PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1105..1509)
FT                   /gene="AC3"
FT   CDS             complement(1105..1509)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:I6LX26"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX26"
FT                   /protein_id="AEG89721.1"
FT                   /translation="MDSRTGELITAPQAKNGVFTWAITNPLYFEITNHDKRPGNMNHDI
FT                   ITLQIRFNHNLRKALAIHKCFLNFKVWTTLRPQTGLFLKVFRYQVLKYLDMIGVISINT
FT                   VIRAVDHVLYAVLLNTLQVTEHHAIKFNLY"
FT   gene            complement(1250..1657)
FT                   /gene="AC2"
FT   CDS             complement(1250..1657)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:I6LX25"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX25"
FT                   /protein_id="AEG89720.1"
FT                   /translation="MPPSSPSTSHCSQVPIKVQHRTAKNRALRRRRVDLACGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRNHQPRQEAREHEPRHHHTPDTFQPQ
FT                   PPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1566..2645)
FT                   /gene="AC1"
FT   CDS             complement(1566..2645)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication associated protein"
FT                   /db_xref="GOA:I6LX12"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX12"
FT                   /protein_id="AEG89719.1"
FT                   /translation="MPRAGRFQVNAKNYFITYPRCSLSKEEALSQLKALSYPTNIKFIR
FT                   VCRELHQDGVPHLHVLIQFEGKFQCTNQRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKSEALQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPPQTYVSPFLSSSFTHVPEDIEVWVSENICSPAARPWRPISIVLEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLEEDKNQSLKAWALKNATFVTLHEPL
FT                   FSSTHQSPTPHSEEQGPQT"
FT   gene            complement(2255..2488)
FT                   /gene="AC4"
FT   CDS             complement(2255..2488)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:I6LX27"
FT                   /protein_id="AEG89722.1"
FT                   /translation="MGCLISMFSSNSKASSNVQTSDSSISFPRPDQHISIQTFRELNHR
FT                   PMSKLTLKREGNFLTMAFSRSMPEVQGERASI"
XX
SQ   Sequence 2802 BP; 735 A; 556 C; 718 G; 793 T; 0 other;
     accggatggc cgcgcccgaa aaagcaggtg gaccccacca gatggctacg cccgtgaaag        60
     atagtggtcc ccgcgcactc gtttcggtcg gccagtcata tttacgcgtg aaagtctaga       120
     tatttgttgt ttgtctttat agacttcgtc gcgaagtagt taagcgcgtc aaacatgtgg       180
     gatccattgt tgaacgattt cccagaaacc gtgcacggtt tccgttctat gcttgctgtt       240
     aaatacctgt tacatctgga acaggaatac gatcgcggta ctgtcggggc tgagtacata       300
     cgggatctaa taggggttct acggtgtaag agttatgtcg aagcgaccag gagatataat       360
     aatctcaaca cccgtatcca aggtgcggag gaggctgaac ttcgacagcc catacacgaa       420
     ccgtgttgtt gcccccactg tccgcgtcac cagaagcaaa atatgggcca acaggcccat       480
     gtatcggaag ccgaagctgt acagaatgta tcgaagccca gatgtcccta agggctgtga       540
     aggcccatgt aaggttcagt cgtatgaaca gagggatgat gttaagcaca ttggtatggt       600
     ccgatgtgtc agtgatgtta ctcgtgggtc aggcattacc catagagtcg ggaagaggtt       660
     ttgtgtgaag tccatatata tattgggcaa gatctggatg gatgagaata ttaagaagca       720
     aaatcatacg aaccatgtta tgttcttcct cgttcgagac agaaggcctt atgggccgag       780
     cccgcaagat tttggacaag tgttcaacat gtttgataat gaacctacta ctgcaactgt       840
     gaagaatgat ctgagggacc ggtatcaggt gttacgtaaa ttctatgcga ctgttgttgg       900
     tggaccctcc gggatgaagg aacaagcgct ggttaagagg ttttttagga tcaataatca       960
     tgtagtgtat aatcatcagg aacaggccaa gtatgagaat catactgaga atgcgttgtt      1020
     attgtatatg gcatgtacac atgcctcaaa tcctgtgtac gctacgttga aaatacgcat      1080
     ctatttctat gatgcagtaa caaattaata aaggttgaat tttattgcat ggtgctccgt      1140
     aacttggagt gtgtttagta atacagcgta cagaacatga tcaacagctc taattacagt      1200
     gttaatggaa ataacgccta tcatatctaa atacttgagc acttgatatc taaatacttt      1260
     taagaaaaga ccagtctgag gccgtaaggt cgtccagacc ttgaagttga gaaaacactt      1320
     gtgaatcgcc aatgccttcc ggaggttgtg gttgaaacgt atctggagtg tgatgatgtc      1380
     gtggttcatg ttccctggcc tcttgtcgtg gttggtgatt tcgaaataga ggggatttgt      1440
     tattgcccag gtaaaaacgc cattctttgc ttgaggcgca gtgatgagtt cccctgtgcg      1500
     agaatccatg gttgatgcag tcgatatgga gatagaacga gcagccacat gcgaggtcta      1560
     cccgcctacg tctgagggcc ctgttcttcg ctgtgcggtg ttggactttg atgggtactt      1620
     gagaacaatg gctcgtggag ggtgacgaag gtggcattct ttaaagccca ggctttaagg      1680
     gactgattct tgtcctcctc cagaaactct ttatatgatg atgttggtcc tggattgcag      1740
     aggaagatag tgggaatgcc gcctttaatt tgaattggct tcccgtactt tgtattgctt      1800
     tgccagtccc tttgggcccc catgaattct ttgaagtgct ttagataatg cgggtctacg      1860
     tcgtcaatga cgttgtacca tgcgtcgttt gaatatacct ttggagacag atccaggtgt      1920
     ccacatagat aattatgggg tcccagtgaa cgagcccaca tggtttttcc ggttcggcta      1980
     tcaccttcga gaacaatact gatcggtctc catggccgcg cagcgggact gcatatattt      2040
     tctgataccc atacctctat gtcttcgggg acgtgtgtaa atgatgatga taagaacgga      2100
     ctaacgtaag tttgtggcgg agcctggaag attctatctg cgttagcaga tatgttatgg      2160
     aactgtaaaa aaaatgactt gggatctttt tctttaataa tttgaagagc ttcggattta      2220
     gaagaagcat tcaacgcgtc tgcatatacc tgagctaaat gctggccctc tccccttgca      2280
     cttctggcat cgacctggaa aatgccatcg tcaagaaatt cccctccctt ttcaatgtaa      2340
     gctttgacat cggacgatga tttagctccc tgaatgtttg gatggaaatg tgttgatctg      2400
     gacggggaaa tgagatcgaa gaatcgctgg tttgtacatt ggaacttgcc ttcgaattgg      2460
     atgagaacat ggagatgagg caccccatcc tgatgaagtt ctctgcaaac cctaatgaat      2520
     ttgatatttg tcgggtaaga aagggctttt aattgggaaa gggcctcttc cttggataat      2580
     gagcatcggg gataggtgat gaaataattt ttggcattta cttgaaaacg accggctctt      2640
     ggcatatttg ctgtcgtttt ggatcggtgg acactcaaaa ctccagggga atggtggaac      2700
     ggtgggcatt atatatgatg tcccccaatg gcaaatgtgt aaataggtca acctccattc      2760
     aaaatttgaa ttgcgaatat tggcggccat ccgattaata tt                         2802
//