ID   JN941177; SV 1; circular; genomic DNA; STD; VRL; 2799 BP.
XX
AC   JN941177;
XX
DT   01-NOV-2012 (Rel. 114, Created)
DT   23-DEC-2012 (Rel. 115, Last updated, Version 2)
XX
DE   East African cassava mosaic virus isolate AO7 segment A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RP   1-2799
RA   Matic S., Pais da Cunha A.T., Thompson J.R., Tepfer M.;
RT   "An analysis of viruses associated with cassava mosaic disease in three
RT   Angolan provinces";
RL   J. Plant Pathol. 94(2):443-450(2012).
XX
RN   [2]
RP   1-2799
RA   Matic S., Thompson J.R., da Cunha A.T.P., Tepfer M.;
RT   ;
RL   Submitted (28-OCT-2011) to the INSDC.
RL   Institute of Plant Virology, National Research Council, Strada delle Cacce
RL   73, Turin 10135, Italy
XX
DR   MD5; 12727ab8a750cb769c91ed80f8bf7c09.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2799
FT                   /organism="East African cassava mosaic virus"
FT                   /segment="A"
FT                   /host="Manihot esculenta"
FT                   /isolate="AO7"
FT                   /mol_type="genomic DNA"
FT                   /country="Angola"
FT                   /collection_date="08-Jan-2009"
FT                   /db_xref="taxon:62079"
FT   gene            172..528
FT                   /gene="AV2"
FT   CDS             172..528
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="AV2 protein"
FT                   /db_xref="GOA:A0A3S5ZP69"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:A0A3S5ZP69"
FT                   /protein_id="AFR24301.1"
FT                   /translation="MWDPLVNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKNYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS
FT                   ETQDVQNVSKPRCP"
FT   gene            332..1105
FT                   /gene="AV1"
FT   CDS             332..1105
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="AV1 protein"
FT                   /db_xref="GOA:K4HKI9"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:K4HKI9"
FT                   /protein_id="AFR24302.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA
FT                   NRPMYRKPKMYRMYRSPDVPKGCEGPCKVQSFEQRDDVKHLGICKVISDVTRGPGLTHR
FT                   VGKRFCIKSVYILGKIWMDENIKKQNHTNNVMFYLLRDRRPYGNAPQDFGQIFNMFDNE
FT                   PSTATIKNDLRDRFQVLRKFHATVVGGPSGMKEQALVKRFYKLNHHVTYNHQEAGKYEN
FT                   HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(362..973)
FT                   /gene="AC5"
FT   CDS             complement(362..973)
FT                   /codon_start=1
FT                   /gene="AC5"
FT                   /product="AC5 protein"
FT                   /db_xref="InterPro:IPR006892"
FT                   /db_xref="InterPro:IPR013671"
FT                   /db_xref="UniProtKB/TrEMBL:F1AFY3"
FT                   /protein_id="AFR24303.1"
FT                   /translation="MIICHMMIQLVKPFHQRLLLHARWTTHNSGMKFPQHLKPIPQIVL
FT                   NCCSTGLIIKHVKYLPKVLGRIAIRPSIPKQVKHHIISVILLLNIFIHPDLTKNVNGLD
FT                   TKPLSDPVCQPRPTRHITNHLTDTKVLHIIPLLKRLDLTWAFTALRDIWASIHSVHLGF
FT                   PIHGPVGPYFASGDADSGGNNTVRVWAVEVQPPPHLGYGC"
FT   gene            complement(1102..1506)
FT                   /gene="AC3"
FT   CDS             complement(1102..1506)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="AC3 protein"
FT                   /db_xref="GOA:F1AFY4"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:F1AFY4"
FT                   /protein_id="AFR24304.1"
FT                   /translation="MDSRTGELITAPQARNGVFTWDITNPLYFEITDHDKRPGNMNHDI
FT                   ITLQIRFNHNLRKALGIHKCFLNFKVWTTLRPQTGRFLRVFRYQVLKYLNMIGVISINT
FT                   VLQAVDHVVYDVLLNTLQVTEQHEIKFNLY"
FT   gene            complement(1247..1654)
FT                   /gene="AC2"
FT   CDS             complement(1247..1654)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="AC2 protein"
FT                   /db_xref="GOA:Q58WG6"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WG6"
FT                   /protein_id="AFR24305.1"
FT                   /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAIRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGHNKSPLFRNHRPRQEAREHEPRHHHTPDTVQPQ
FT                   PSEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1563..2642)
FT                   /gene="AC1"
FT   CDS             complement(1563..2642)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="AC1 protein"
FT                   /db_xref="GOA:K4HHP4"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:K4HHP4"
FT                   /protein_id="AFR24306.1"
FT                   /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLQALSYPTNIKFIR
FT                   VCRELHQDGVPHLHVLIQFEGKFQCTNPRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDAGLFQVDARSARGEGQHLAQVYADALNASSKSEALQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPPQTYVSPFLSSSFTQVPDDIEVWVSENICSPAARPWRPISIVLEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNRSLKAWALKNATFITLHEPL
FT                   FSSAHQSPTPHSEDQGHQT"
FT   gene            complement(2252..2485)
FT                   /gene="AC4"
FT   CDS             complement(2252..2485)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="AC4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:K4HKJ0"
FT                   /protein_id="AFR24307.1"
FT                   /translation="MGCLISMFSSNSKASSNVPTRDSSISFPHPDQHISIRTFRELNHR
FT                   PMSKLTLKREGNFLTLDFSKSMPEVQGGKASI"
XX
SQ   Sequence 2799 BP; 730 A; 546 C; 731 G; 792 T; 0 other;
     accggatggc cgcgcccgaa aaagcagggt gaccccacaa tgaccgcgcc cgtgaaagaa        60
     agtggtccct gcgcacttgt tttggtcggc cagtcatatt cacgcgtgaa agtctatata       120
     tttgttgttt gtctttatag acttcatcgc gaagtagtag agcgtgtcaa catgtgggat       180
     ccattggtga acgattttcc cgaaaccgtt cacggtttcc gttctatgct tgctgttaaa       240
     tacctgttac atctggaaca ggaatacgat cgcggtactg tcggggctga gtatatacgg       300
     gatctaatag gggttctacg gtgtaagaat tatgtcgaag cgaccaggag atataataat       360
     ctcaacaccc gtatccaagg tgcggaggag gctgaacttc gacagcccat acacgaaccg       420
     tgttgttgcc cccactgtcc gcgtcaccag aagcaaaata tgggccaaca ggcccatgta       480
     tcggaaaccc aagatgtaca gaatgtatcg aagcccagat gtccctaagg gctgtgaagg       540
     cccatgtaag gtccagtcgt ttgagcagag ggatgatgtg aagcaccttg gtatctgtaa       600
     ggtgattagt gatgtgacgc gtgggcctgg gctgacacac agggtcggaa agaggttttg       660
     tatcaagtcc gtttacattc ttggtaagat ctggatggat gaaaatatta agaagcagaa       720
     tcacactaat aatgtgatgt tttacctgct tagggataga aggccgtatg gcaatgcgcc       780
     ccaagacttt gggcagatat ttaacatgtt tgataatgag cccagtactg caacaattaa       840
     gaacgatttg agggataggt ttcaggtgtt gaggaaattt catgccactg ttgtgggtgg       900
     tccatctggc atgaaggagc aggcgttggt gaaaaggttt tacaagctga atcatcatgt       960
     gacatataat catcaggagg cagggaagta tgagaatcac acagagaatg cgttgttatt      1020
     gtatatggca tgtacacatg cctcgaatcc tgtgtatgct acgctgaaaa tacgcatcta      1080
     tttttatgat gcagtgacaa attaataaag gttgaatttt atttcatgtt gctccgtaac      1140
     ttggagtgtg tttagtaata catcgtacac aacatgatca acagcttgaa ggacagtgtt      1200
     aatggaaata acgcctatca tatttaaata tttgagcact tgatatctaa atactcttaa      1260
     gaaacgacca gtctgaggcc gtaaggtcgt ccagaccttg aagttgagaa aacacttgtg      1320
     aatccccaat gccttccgaa ggttgtggtt gaaccgtatc tggagtgtga tgatgtcgtg      1380
     gttcatgttc cctggcctct tgtcgtggtc ggtgatttcg aaatagaggg gatttgttat      1440
     gtcccaggta aaaacgccat tccttgcttg aggcgcagtg atgagttccc ctgtgcgaga      1500
     atccatggtt gatgcagtcg atatggagat agaatgagca gccgcattcg aggtctaccc      1560
     gcctacgtct gatggccctg gtcttcgctg tgcggtgttg gactttgatg ggcactagag      1620
     aacaatggct cgtggagggt gatgaaggtg gcattcttta aagcccaggc tttaagggat      1680
     cggttctttt cctcgtccag aaactcttta tatgaggatg ttggtcctgg attgcagagg      1740
     aagatagtgg gaatgccgcc tttaatttga attggcttcc cgtattttgt attgctttgc      1800
     cagtctcttt gggcccccat gaattctttg aagtgtttga ggtagtgggg gtcgacgtca      1860
     tcaatgacgt tgtaccatgc gtcgtttgaa tataccttgg gagacagatc caggtgtcca      1920
     caaagataat tatggggtcc cagtgaacga gcccacattg ttttgccggt tcggctatca      1980
     ccttctagaa caatactgat cggtctccat ggccgcgcag cgggactgca tatattttcg      2040
     gatacccata cctctatgtc gtctgggact tgtgtaaaag atgatgataa gaacggacta      2100
     acgtaagttt gtggcggagc ctggaagatt ctatctgcgt tagcagatat attatggaac      2160
     tgtaaaaaaa aggactttgg atctttttct ttgataattt gaagagcctc ggatttagac      2220
     gaagcattca acgcgtctgc atatacctga gctaaatgct ggccttcccc ccttgcactt      2280
     ctggcatcga cttggaaaag tccagcgtca agaaattccc ctcccttttc aatgtaagct      2340
     ttgacatcgg acgatgattt agctccctga atgttcggat ggaaatgtgt tgatcgggat      2400
     ggggaaatga gatcgaagaa tctcgggttg gtacattgga acttgccttc gaattggatg      2460
     agaacatgga gatgaggcac cccatcctga tgtagttctc tgcaaaccct aatgaatttg      2520
     atattcgtcg ggtacgaaag ggcttgtaat tgggaaaggg cctcttcttt tgttaatgag      2580
     catcggggat aggttatgaa ataatttttg gcattgattt gaaaacgacc ggctcttggc      2640
     atattggctg tcgttttgga tcgggggaca ctcaaaactc cagggatacg gtggaatggg      2700
     gggcattata tatgatgtcc cccaatggca tatgtgtaaa taggtagact tccattcaaa      2760
     atttgaattc cgaatattgg cggccatccg attaatatt                             2799
//