ID   AY795982; SV 1; linear; genomic DNA; STD; VRL; 2781 BP.
XX
AC   AY795982;
XX
DT   24-MAR-2005 (Rel. 83, Created)
DT   22-APR-2005 (Rel. 83, Last updated, Version 2)
XX
DE   African cassava mosaic virus isolate TZ segment A, complete sequence.
XX
KW   .
XX
OS   African cassava mosaic virus
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2781
RX   PUBMED; 15784145.
RA   Ndunguru J., Legg J., Aveling T., Thompson G., Fauquet C.;
RT   "Molecular biodiversity of cassava begomoviruses in Tanzania: evolution of
RT   cassava geminiviruses in Africa and evidence for East Africa being a center
RT   of diversity of cassava geminiviruses";
RL   Virol J 2:21-21(2005).
XX
RN   [2]
RP   1-2781
RA   Ndunguru J., Fauquet C.M.;
RT   ;
RL   Submitted (21-OCT-2004) to the INSDC.
RL   ILTAB, Donald Danforth Plant Science Center, 975 North Warson Road, St.
RL   Louis, MO 63132, USA
XX
DR   MD5; faa9be7642cf6dae40b12ba3dbf19242.
DR   EuropePMC; PMC1079959; 15784145.
DR   EuropePMC; PMC3163225; 21812981.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2781
FT                   /organism="African cassava mosaic virus"
FT                   /segment="A"
FT                   /isolate="TZ"
FT                   /mol_type="genomic DNA"
FT                   /country="Tanzania"
FT                   /db_xref="taxon:10817"
FT   gene            135..476
FT                   /gene="AV2"
FT   CDS             135..476
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="AV2"
FT                   /db_xref="GOA:Q58WK5"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WK5"
FT                   /protein_id="AAX39334.1"
FT                   /translation="MWDPLVNEFPDSVHGLRCMLAIKYLQALEDTYEPSTLGHDLVRDL
FT                   VSVIRARNYVEATRRYHHFHSRLQGASKTELRQPIQEPCYCPHCPRHKSKTGLDEQAHV
FT                   QKAHDVQDV"
FT   gene            295..1071
FT                   /gene="AV1"
FT   CDS             295..1071
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:Q58WK4"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WK4"
FT                   /protein_id="AAX39335.1"
FT                   /translation="MSKRPGDIIISTPGSKVRRRLNFDSPYRNRATAPTAHVTNRKRAW
FT                   MNRPMYRKPMMYRMYRSPDIPRGCEGPCKVQSFEQRDDVKHLGICKVISDVTRGPGLTH
FT                   RVGKRFCIKSIYILGKIWMDENIKKQNHTNNVIFYLLRDRRPYGNAPQDFGQIFNMFDN
FT                   EPSTATIKNDLRDRFQVLRKFHATVVGGPSGMKEQALVKRFYRLNHHVTYNHQEAGKYE
FT                   NHTENALLLYMACTHASNPVYATLKIRIYFYDSIGN"
FT   gene            complement(1068..1472)
FT                   /gene="AC3"
FT   CDS             complement(1068..1472)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:Q58WK3"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WK3"
FT                   /protein_id="AAX39339.1"
FT                   /translation="MDLRTGDLITAPQAMNGVYTWEINNPLYFTITRHHQRPFLLNQDI
FT                   ITVQVRFNHNLRKELGIHKCFLNFKIWTTLQPQTGLFLRVFRYQVIRYLDNIGVISINT
FT                   VIRAAYHVLFNVIAKTIDCQLTHEIKFNVY"
FT   gene            complement(1213..1620)
FT                   /gene="AC2"
FT   CDS             complement(1213..1620)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transactivator protein"
FT                   /db_xref="GOA:Q58WK2"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WK2"
FT                   /protein_id="AAX39338.1"
FT                   /translation="MQSSSPSQNRSTQVPIKVNHRQFKKRAIRRRRVDLVCGCSYYLHI
FT                   NCSNHGFTHRGSHHCSSSNEWRVYLGNKQSPVFHNNQAPSTTIPAEPGHHNSPGSIQSQ
FT                   PEEGAGDSQMFSQLQDLDDLTASDWSFLKGL"
FT   gene            complement(1529..2629)
FT                   /gene="AC1"
FT   CDS             complement(1529..2629)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication-associated protein"
FT                   /db_xref="GOA:Q58WK1"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WK1"
FT                   /protein_id="AAX39336.1"
FT                   /translation="MSPIDLVNMRTPRFRIQAKNVFLTYPKCSISKEHLLPFIQTLSLP
FT                   SNPKFIKICRELHQNGEPHLHALIQFEGKITLTNNRLFDCVHPSCSTSFHPNIQGAKSS
FT                   SDVKSYLDKDGDTVEWGQFQIDGRSARGGQQSANDAYAKALNSGSKSEALNVIRELVPK
FT                   DFVLQFHNLNSNLDRIFQEPPAPYVSPFLCSSFDQVPVEIEEWVADNVIDSAARPWRPN
FT                   SIVIEGDSRTGKTIWARSLGPHNYLCGHLDLSPKVYNNAAWYNVIDDVDPHYLKHFKEF
FT                   IGAQRDWQSNTKYGKPVQIKGGIPTIFLCNPGPTSSYKEFLDEEKQEALKAWALKNAIF
FT                   VTLTEPLYSGSHQSQSQTIQEASHPA"
FT   gene            complement(2149..2571)
FT                   /gene="AC4"
FT   CDS             complement(2149..2571)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="AC4"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WK0"
FT                   /protein_id="AAX39337.1"
FT                   /translation="MSFSHTQSVLYPKNTCCHSFKHSLSHQTLSSLKSVESCIRMGNLT
FT                   CMPSSNSRVKSRLRTIVSSIVYTQAVAPVSTPTFKVPNQAPMSSPIWIRTEIPSNGDNF
FT                   RSMDDLLEAVNNPRMMLTPKRLTAAVSQKLLMSFGN"
XX
SQ   Sequence 2781 BP; 764 A; 551 C; 683 G; 783 T; 0 other;
     accggttggc cccgccccct ttaatgtggt ccccgcgcac atacgtatgt cggccaatca        60
     tgtggaagct gtaaaggtta tgtattaatg gtgggccatt atatacttgc aggcgaagtt       120
     gtggctagtg cgcaatgtgg gatccactgg tgaatgagtt tccagactcg gtgcatgggc       180
     ttaggtgtat gcttgcaatt aaatatttgc aggccttaga ggatacatac gagcccagta       240
     cgttgggcca cgatctggtt agggatctag tctcagttat cagggctcgt aattatgtcg       300
     aagcgaccag gagatatcat catttccact ccaggctcca aggtgcgtcg aagactgaac       360
     ttcgacagcc catacaggaa ccgtgctact gcccccactg cccacgtcac aaatcgaaaa       420
     cgggcctgga tgaacaggcc catgtacaga aagcccatga tgtacaggat gtatagaagc       480
     ccagacatac ctaggggctg tgaaggccca tgtaaggtcc agtcatttga gcagagggat       540
     gatgtgaagc accttggtat ctgtaaggtg attagtgatg tgactcgtgg gcctgggctg       600
     acacacaggg tcggaaagag gttttgtatc aagtccattt acatccttgg taagatctgg       660
     atggatgaaa atattaagaa gcagaatcac actaataatg tgatattcta tctgcttagg       720
     gatagaaggc cttatggcaa tgcgccccaa gactttgggc aaatattcaa catgtttgat       780
     aacgagccca gtactgcaac aattaagaac gatttgaggg acaggtttca ggtgttgagg       840
     aaatttcatg ccactgttgt tgggggtcca tctggcatga aggagcaggc attggtgaaa       900
     aggttttaca ggttgaatca tcacgtgaca tataatcatc aggaggcagg gaagtatgaa       960
     aatcacacag agaatgcttt gcttctgtat atggcatgta ctcatgcctc taatcctgta      1020
     tatgcgacgt tgaaaatacg tatatacttc tatgacagta ttggcaatta ataaacattg      1080
     aattttattt catgagtcaa ctgacaatca atagttttgg caattacatt gaacaaaaca      1140
     tgataagcag cgcgaattac agtattaatt gagataacac ctatattatc caagtatcta      1200
     attacttggt atctaaagac ccttaagaaa agaccagtct gaggctgtaa ggtcgtccag      1260
     atcttgaagt tgagaaaaca tttgtgaatc cccagctcct tcctcaggtt gtgattgaat      1320
     cgaacctgga ctgttatgat gtcctggttc agcaggaatg gtcgttgatg gtgcctggtt      1380
     attgtgaaat acaggggatt gtttatttcc caggtataca cgccattcat tgcttgagga      1440
     gcagtgatga gatcccctgt gcgtaaatcc atgattggag cagttgatat gaaggtaata      1500
     ggaacagcca cagacaagat ccactcgcct acgccggatg gctcgcttct tgaattgtct      1560
     gtgattgact ttgatgggaa cctgagtaga gcggttctgt gagggtgacg aagattgcat      1620
     tctttaatgc ccaggccttt agcgcttctt gcttttcctc gtctagaaac tctttatagg      1680
     acgaggtagg tcctggattg cagaggaaga tagtgggaat cccgccttta atttgaacgg      1740
     gtttcccgta tttcgtgttt gattgccagt ccctctgggc cccaatgaat tccttaaagt      1800
     gctttaggta gtgggggtcg acgtcatcaa tgacgttgta ccaggcagca ttattgtaga      1860
     cctttggact aaggtccaag tgtccacaaa ggtaattatg tgggcctaat gatctggccc      1920
     atatcgtctt ccctgttctg ctatcacctt ctatgacaat actattgggt ctccatggcc      1980
     gcgcagcgga atcgataaca ttatcagcga cccattcttc aatttcaaca ggaacttgat      2040
     caaaggaaga acataggaag ggagaaacat aaggagctgg tggctcctga aaaatcctat      2100
     ctaaattact atttagatta tgaaattgaa gtacaaagtc ttttgggact aattcccgaa      2160
     tgacattaag agcttctgac ttactgccgc tgttaagcgc tttggcgtaa gcatcattcg      2220
     cggattgttg accgcctcta gcagatcgtc catcgatctg aaattgtccc cattcgacgg      2280
     tatctccgtc cttatccaga taggacttga catcggagct tgatttggca ccttgaatgt      2340
     tggggtggaa actggtgcta cagcttgggt gtacacaatc gaagagacga ttgttcgtaa      2400
     gcgtgatttt accctcgaat tggatgaggg catgcaagtg aggttcccca ttctgatgca      2460
     actctctaca gattttaatg aacttagggt ttgatgggag agagagtgtt tgaatgaatg      2520
     gcagcaggtg ttctttggat atagaacact ttgggtatgt gagaaagaca ttcttggctt      2580
     gaattctaaa acgaggcgtt ctcatgttga ccaagtcaat tggagacatt caactagaga      2640
     cactcttgag catctcctcc tgttaattgg agagtttata taggtgtctc taaatggcat      2700
     tcttgtaata agttaaactt taatttgaat taaaaggctc aaaaggcgca gaacaccaaa      2760
     gggccaaccg tataatatta c                                                2781
//