ID JF909094; SV 1; circular; genomic DNA; STD; VRL; 2799 BP. XX AC JF909094; XX DT 21-JUN-2012 (Rel. 113, Created) DT 05-DEC-2012 (Rel. 115, Last updated, Version 3) XX DE East African cassava mosaic Cameroon virus-Cameroon isolate DE Comoros:Grande-Comore:GC15E11:2008 segment DNA-A, complete sequence. XX KW . XX OS East African cassava mosaic Cameroon virus-Cameroon OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RC Publication Status: Online-Only RP 1-2799 RX DOI; 10.1186/1471-2148-12-228. RX PUBMED; 23186303. RA De Bruyn A., Villemot J., Lefeuvre P., Villar E., Hoareau M., RA Harimalala M., Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., RA Harkins G.W., Varsani A., Martin D.P., Lett J.M.; RT "East African cassava mosaic-like viruses from Africa to Indian ocean RT islands: molecular diversity, evolutionary history and geographical RT dissemination of a bipartite begomovirus"; RL BMC Evol. Biol. 12(1):228-228(2012). XX RN [2] RP 1-2799 RA Villemot J., Lefeuvre P., Villar E., Hoareau M., Harimalala M., RA Abdoul-Karime A.L., Abdou-Chakour C., Reynaud B., Varsani A., Martin D.P., RA Lett J.-M.; RT ; RL Submitted (24-MAR-2011) to the INSDC. RL UMR PVBMT, CIRAD, 7, chemin de l'IRAT, Saint-Pierre, Reunion 97410, France XX DR MD5; 274d1525f3282dbdf555c40ea735554c. XX FH Key Location/Qualifiers FH FT source 1..2799 FT /organism="East African cassava mosaic Cameroon FT virus-Cameroon" FT /segment="DNA-A" FT /host="Manihot esculenta (cassava)" FT /isolate="Comoros:Grande-Comore:GC15E11:2008" FT /mol_type="genomic DNA" FT /country="Comoros:Grande-Comore" FT /lat_lon="11.82 S 43.29 E" FT /collection_date="2008" FT /db_xref="taxon:1229190" FT gene 172..528 FT /gene="AV2" FT CDS 172..528 FT /codon_start=1 FT /gene="AV2" FT /product="movement protein" FT /db_xref="GOA:I6LXJ1" FT /db_xref="InterPro:IPR002511" FT /db_xref="InterPro:IPR005159" FT /db_xref="UniProtKB/TrEMBL:I6LXJ1" FT /protein_id="AEG89898.1" FT /translation="MWNPLVNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL FT IGVLRCKNYGEATRRYNNLNTRIQGAEEAELRQPIHEPCGCPYCPRHQKQNMGQQAHVS FT EAQDVQDVSKPRCP" FT gene 332..1105 FT /gene="AV1" FT CDS 332..1105 FT /codon_start=1 FT /gene="AV1" FT /product="coat protein" FT /db_xref="GOA:I6LXJ0" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="UniProtKB/TrEMBL:I6LXJ0" FT /protein_id="AEG89897.1" FT /translation="MAKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSRIWA FT NRPMYRKPKMYRMYRSPDVPNGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGPGITHR FT VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE FT PTTATVKNDLRDRYQVLRKFYATVIGGPSGMKEQALIKRFFRINNHVVYNHQEQAKYEN FT HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN" FT gene complement(1102..1506) FT /gene="AC3" FT CDS complement(1102..1506) FT /codon_start=1 FT /gene="AC3" FT /product="replication enhancer" FT /db_xref="GOA:I6LXJ4" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:I6LXJ4" FT /protein_id="AEG89901.1" FT /translation="MDSRTGEDIGALQWKNGAFIWEIPNPLYFKILNHDSMPFNMNHDI FT IDVQIRFNYNLRKALGMHKCFLNFRIWTRLHPQTWRFFRTFKTQVMKYLNNLGVISIST FT VIDAVHHVLNIVFVGTLSVSQDHEIKFNIY" FT gene complement(1247..1654) FT /gene="AC2" FT CDS complement(1247..1654) FT /codon_start=1 FT /gene="AC2" FT /product="transcription activator protein" FT /db_xref="GOA:I6LXJ3" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:I6LXJ3" FT /protein_id="AEG89900.1" FT /translation="MRSSSPSQSHSSPPPIKARHRQAKIRAPRRRRIDLPCGCSIYRSI FT NCHNHGFTHRGRHWCSSMEEWRIYMGDTKSPIFQNPQPRQHAVQHEPRHHRCPDSVQLQ FT PEESTGDAQVFSQLPDLDSFTSSDLAFLQNL" FT gene complement(1584..2642) FT /gene="AC1" FT CDS complement(1584..2642) FT /codon_start=1 FT /gene="AC1" FT /product="replication associated protein" FT /db_xref="GOA:I6LXJ2" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:I6LXJ2" FT /protein_id="AEG89899.1" FT /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLKALSYPTNIKFIR FT VCRELHQDGVPHLHILIQFEGKFQCTNPRFFDLISTSRSTHFHPNIQGAKSSSDVKAYI FT EKGGEFLDDGIFQVDARSARGEGQHLAQVYADALNASSKTEALQIIKEKDPKSFFLQFH FT NISANADRIFQAPPQTYVSPFLSSSFTRVPEDIEVWVSENICSPAARPWRPISIVLEGD FT SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW FT QSNTKYGKPIQIKGGIPTIFLCNPGPNSSYKAYLDEDKNSNLKNWAIKNALFISLTEPL FT FSSTDQSQAQAS" FT gene complement(2252..2485) FT /gene="AC4" FT CDS complement(2252..2485) FT /codon_start=1 FT /gene="AC4" FT /product="C4 protein" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:I6LXJ5" FT /protein_id="AEG89902.1" FT /translation="MGCLISIFSSSSKASSNVPTRDSSISFPHPDPHISIRTFRELNHR FT PMSKLILKREGNFLTMEFSKSMPEVQGGRASI" XX SQ Sequence 2799 BP; 751 A; 541 C; 699 G; 808 T; 0 other; accggatggc cgcgcccgaa aaagcatatg gtccccacaa tggtccgcgt ctgtgaaaga 60 aagtggtccc cgcgcacgta tgtttgtcgg ccaatcatat ttgcgcctga aaggctatat 120 attgatttgt gacgttatat acttcgtcac gaagtagtgg agcgcgtcaa catgtggaat 180 ccgttagtta acgattttcc ggaaaccgta cacggtttcc gttcgatgct tgctgttaaa 240 tacctgctac atctggaaca ggaatacgat cgcggtactg tcggggcgga gtatatacgt 300 gatttaatag gtgttctacg gtgtaagaat tatggcgaag cgaccaggag atataataat 360 ctcaacaccc gtatccaagg tgcggaggag gctgaacttc gacagcccat acacgaaccg 420 tgtggttgcc cctactgtcc gcgtcaccag aagcagaata tgggccaaca ggcccatgta 480 tcggaagccc aagatgtaca ggatgtatcg aagcccagat gtccctaatg gctgtgaagg 540 cccatgtaag gttcagtcgt atgaacagag agatgatgtt aagcacactg gtatggttcg 600 atgtgtcagt gatgttacgc gtgggccagg tattacccat agagttggga agaggttttg 660 tgtgaagtcc atatatattt tgggcaagat ttggatggat gagaatatca aaaagcaaaa 720 tcatacgaac catgttatgt tcttccttgt tcgagataga aggccttatg ggccgagtcc 780 tcaagatttt ggacaagtgt tcaacatgtt tgataatgaa cctactacgg caactgtgaa 840 gaatgatctt agggaccggt atcaggtgtt acgtaaattc tatgcgactg ttattggtgg 900 accctctggg atgaaggagc aagcgttgat taagaggttt tttaggatca ataatcatgt 960 agtgtataat catcaggaac aggccaagta tgagaatcat accgagaatg cgttgttatt 1020 gtatatggca tgtacacatg cctcaaatcc tgtgtacgct acgctgaaaa tacgcatcta 1080 tttctatgat gcagtgacaa attaataaat attaaatttt atttcatgat cttgtgatac 1140 agatagggtt cccacaaaaa cgatattcaa tacatgatgc actgcatcta ttacagttga 1200 aattgaaata acacctaaat tattgagata tttcatgact tgtgtcttaa aggttctgaa 1260 gaaacgccaa gtctgaggat gtaaacgagt ccagatccgg aagttgagaa aacacttgtg 1320 catccccagt gctttcctca ggttgtagtt gaaccgaatc tggacatcta tgatgtcgtg 1380 gttcatgttg aacggcatgc tgtcgtggtt gaggattttg aaatataggg gatttggtat 1440 ctcccatata aatgcgccat tcttccattg aagagcacca atgtcttccc ctgtgcgtga 1500 atccatggtt atgacaattg atgctgcggt agatggaaca gccacaaggt aggtcgattc 1560 gtcgacgtct gggtgctctg atcttagctt gcctgtgcct ggctttgatc ggtggaggag 1620 aagagtggct ctgtgaggga gatgaagagc gcattcttga tcgcccaatt ctttagattg 1680 gaattcttgt cctcgtctag gtatgcttta taggacgaat tggggcccgg attgcataag 1740 aagatggtgg gaatcccacc tttaatttga atgggctttc cgtatttggt gtttgattgc 1800 caatctctct gggcccccat gaactctttg aaatgcttta gatagtgcgg gtctacgtcg 1860 tcaatgacgt tgtaccatgc gtcgttggaa tatacttttg gagacagatc caggtgtcca 1920 caaagataat tgtggggtcc cagtgaacga gcccacatgg ttttccctgt tcggctatca 1980 ccttctagaa caatactgat cggtctccat ggccgcgcag cgggactgca tatattttct 2040 gatacccata cttctatgtc ttcggggact cgtgtaaagg acgatgataa gaatggacta 2100 acgtaagttt ggggcggagc ctggaagatt ctatctgcgt tagcagatat gttatgaaac 2160 tgtaaaaaaa aagactttgg atctttttct ttaataattt gaagagcttc tgttttagaa 2220 gaagcattca acgcgtctgc atatacctga gctaaatgct ggccctcccc ccttgcactt 2280 ctggcatcga cttggaaaat tccatcgtca agaaattccc ctcccttttc aatataagct 2340 ttgacatcgg acgatgattt agctccctga atgttcggat ggaaatgtgt ggatctggat 2400 gtggaaatga gatcgaagaa tctcgggttg gtacattgga acttgccttc gaactggatg 2460 agaatatgga gatgaggcac cccatcttga tgtagctctc tgcaaaccct aatgaatttg 2520 atattcgtcg ggtaagaaag ggcttttaat tgggaaagtg cctcttcctt tgttaatgaa 2580 catcggggat aggttatgaa ataatttttg gcatttattt gaaaacgacc ggctctcggc 2640 atattggctg tcgttttgga tcgggggaca ctcaaaactc caggggaacg gtggaacggg 2700 gggcattata taggatgtcc cccaatggca tatgtgtaaa taggtagagt tccattcaaa 2760 atttgaattt cgaatattgg cggccatccg attaatatt 2799 //