ID AY795985; SV 1; linear; genomic DNA; STD; VRL; 2801 BP. XX AC AY795985; XX DT 24-MAR-2005 (Rel. 83, Created) DT 22-APR-2005 (Rel. 83, Last updated, Version 2) XX DE East African cassava mosaic virus isolate TZT segment A, complete sequence. XX KW . XX OS East African cassava mosaic virus OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RC Publication Status: Online-Only RP 1-2801 RX PUBMED; 15784145. RA Ndunguru J., Legg J., Aveling T., Thompson G., Fauquet C.; RT "Molecular biodiversity of cassava begomoviruses in Tanzania: evolution of RT cassava geminiviruses in Africa and evidence for East Africa being a center RT of diversity of cassava geminiviruses"; RL Virol J 2:21-21(2005). XX RN [2] RP 1-2801 RA Ndunguru J., Fauquet C.M.; RT ; RL Submitted (27-OCT-2004) to the INSDC. RL ILTAB, Donald Danforth Plant Science Center, 975 North Warson Road, St. RL Louis, MO 63132, USA XX DR MD5; cb1324e1330111d2d7ce839a9f5bc8aa. DR EuropePMC; PMC1079959; 15784145. XX FH Key Location/Qualifiers FH FT source 1..2801 FT /organism="East African cassava mosaic virus" FT /segment="A" FT /strain="Kenya" FT /isolate="TZT" FT /mol_type="genomic DNA" FT /country="Tanzania" FT /db_xref="taxon:62079" FT gene 174..530 FT /gene="AV2" FT CDS 174..530 FT /codon_start=1 FT /gene="AV2" FT /product="AV2" FT /db_xref="GOA:Q58WI7" FT /db_xref="InterPro:IPR002511" FT /db_xref="InterPro:IPR005159" FT /db_xref="UniProtKB/TrEMBL:Q58WI7" FT /protein_id="AAX39352.1" FT /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL FT IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS FT EAQDVQNVSKPICP" FT gene 334..1107 FT /gene="AV1" FT CDS 334..1107 FT /codon_start=1 FT /gene="AV1" FT /product="coat protein" FT /db_xref="GOA:Q58WI6" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="UniProtKB/TrEMBL:Q58WI6" FT /protein_id="AAX39353.1" FT /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA FT NRPMYRKPKMYKMYRSPYVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGPGITHR FT VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE FT PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQALVKRFFRINNHVVYNHQEQAKYEN FT HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN" FT gene complement(1104..1508) FT /gene="AC3" FT CDS complement(1104..1508) FT /codon_start=1 FT /gene="AC3" FT /product="replication enhancer" FT /db_xref="GOA:Q58WI5" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:Q58WI5" FT /protein_id="AAX39357.1" FT /translation="MASRTGELITAPQAKNGVFTWAITNPLYFEITNHDKRPGHMNHDI FT ITLQIRFNHNLRKALGIHKCFLNFKVWTTLRPQTGLFLKVFRYQVLKYLDMIGVISINT FT VLRAVDHVLYDVLLNTLQVTEQHAIKFNIY" FT gene complement(1249..1656) FT /gene="AC2" FT CDS complement(1249..1656) FT /codon_start=1 FT /gene="AC2" FT /product="transactivator protein" FT /db_xref="GOA:Q58WI4" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:Q58WI4" FT /protein_id="AAX39356.1" FT /translation="MPPSSPSSSHCSQVPIKIQHRTAKTRALRRRRVDLECGCSFYLIS FT TVINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRNHQPRQEARAHEPRHHHTPDTVQPQ FT PPEGIGDSQVFSQLQGLDDLTASDWSFLKSI" FT gene complement(1565..2644) FT /gene="AC1" FT CDS complement(1565..2644) FT /codon_start=1 FT /gene="AC1" FT /product="replication-associated protein" FT /db_xref="GOA:Q58WI3" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:Q58WI3" FT /protein_id="AAX39354.1" FT /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLKAFSYPTNIKFIR FT VCRELHQDGVPHLHVLIQFEGKFQCTNPRFFDLISSSRSTHFHPNIQGVKSSSDVKAYI FT EKGGEFLDAGVFQVDARSARGEGQHLAQVYADALNASCKSEALQIIKEKDPKSFFLQFH FT NISAKADRIFQAPPQTYVSPFLSSSFTQVPEELEVWVSENVCSPAARPWRPISIVLEGD FT SRTGKTTWARSLGPHNYLCGHLDLSPKIYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW FT QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLEEEKNQSLKAWALKNATFITLFEPL FT FSSAHQNPTPHSEDQGTQT" FT gene complement(2254..2487) FT /gene="AC4" FT CDS complement(2254..2487) FT /codon_start=1 FT /gene="AC4" FT /product="AC4" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:Q58WI2" FT /protein_id="AAX39355.1" FT /translation="MGCLISMFSSNSKANSNVPTPDSSISFPHPDQHISIRTFRALNHR FT PMSRLTLKREGNFLTLEFSKSMPEVPGGRASI" XX SQ Sequence 2801 BP; 740 A; 554 C; 720 G; 787 T; 0 other; accggatggc cgcgcccgaa aaagcaggtg gaccccacca gatggccgca ctcgtgaaag 60 aaagtggtcc ccgcgcactt gtgttggtcg gccagtcata ttcacgcgtg aaagtctaga 120 tatttgtggt ttgactttat atacttcgtc gcgaagtagt ggagcgcgtc aacatgtggg 180 atccattgtt gaacgatttt cccgaaaccg ttcacggttt ccgttctatg cttgctgtta 240 aatacctgtt acatctggaa caggaatacg atcgcggtac tgtcggggct gagtatatac 300 gggatctaat aggggttcta cggtgtaaga gttatgtcga agcgaccagg agatataata 360 atctcaacac ccgtatccaa ggtgcggagg aggctgaact tcgacagccc atacacgaac 420 cgtgttgttg cccccactgt ccgcgtcacc agaagcaaaa tatgggccaa caggcccatg 480 tatcggaagc ccaagatgta caaaatgtat cgaagcccat atgtccctaa gggctgtgaa 540 ggcccatgta aggttcagtc gtatgaacag agggatgatg tgaagcatac tggtatggtc 600 cgatgtgtca gtgatgttac gcgtgggcca ggcattaccc atagagtcgg gaagaggttt 660 tgtgtgaagt ccatatatat attgggcaaa atctggatgg atgagaatat caagaagcaa 720 aatcatacga accatgttat gttctttctc gtgcgagata gaaggcctta tgggccgagc 780 ccacaagatt ttggacaagt gttcaacatg tttgataatg aacccactac ggcaactgtg 840 aagaatgatc ttagggaccg gtatcaggtg ttacgtaaat tctatgcgac tgttgtcggt 900 ggaccctctg ggatgaagga acaagctctg gttaagaggt tttttaggat taataatcat 960 gtagtgtata atcatcagga acaggccaag tatgagaatc atactgagaa tgcgttgtta 1020 ttgtatatgg catgtacaca tgcctcaaat cctgtgtacg ctacgcttaa aatacgcatc 1080 tatttttatg atgcagtgac aaattaataa atgttgaatt ttattgcatg ttgctccgta 1140 acttggagtg tatttagtaa tacatcgtac agaacatgat caacagctcg aagtacagtg 1200 ttaattgaaa taacgcctat catatctaaa tacttgagca cttgatatct aaatactttt 1260 aagaaaagac cagtctgagg ccgtaaggtc gtccagacct tgaagttgag aaaacacttg 1320 tgaatcccca atgccttccg gaggttgtgg ttgaaccgta tctggagtgt gatgatgtcg 1380 tggttcatgt gccctggcct cttgtcgtgg ttggtgattt cgaaatagag gggatttgtt 1440 attgcccagg taaaaacgcc attctttgct tgaggcgcag tgatgagttc ccctgtgcga 1500 gaagccatgg ttgataacag tcgatatgag atagaacgag cagccgcatt cgaggtctac 1560 ccgcctacgt ctgagtgccc tggtcttcgc tgtgcggtgt tggattttga tgggcacttg 1620 agaacaatgg ctcgaagagg gtgatgaagg tggcattctt taaagcccag gctttaagag 1680 actgattctt ttcctcctcc agaaactctt tatatgatga tgttggtcct ggattgcaga 1740 ggaagatagt gggaatgccg cctttaattt gaattggctt cccgtacttt gtattgcttt 1800 gccagtccct ttgggccccc atgaattctt tgaagtgttt gaggtagtgg gggtcgacgt 1860 catcaatgac gttgtaccag gcgtcgtttg aatatatctt tggagacaga tccaggtgtc 1920 cacataaata attatgtggg cccagtgaac gagcccacgt ggtcttcccg gttcggctat 1980 caccttctag aacaatactg atcggtctcc atggccgcgc agcgggactg catacatttt 2040 cggataccca tacttctagt tcctctggga cttgtgtaaa tgaggatgat aagaacggac 2100 taacgtaagt ttggggcgga gcctggaaga ttctatctgc cttagcagat atgttatgga 2160 actgtaaaaa aaaagacttt ggatcttttt ctttgataat ttgaagagct tcggatttac 2220 aagaagcatt caacgcgtct gcatatacct gagctaaatg ctggccctcc cccctggcac 2280 ttcgggcatc gacttggaaa actccagcgt caagaaattc ccctcccttt tcaatgtaag 2340 ccttgacatc ggacgatgat ttaacgccct gaatgttcgg atggaaatgt gttgatctgg 2400 atgaggaaat gagatcgaag aatcgggggt tggtacattg gaatttgcct tcgaattgga 2460 tgagaacatg gagatgaggc accccatcct gatgtagttc tctgcaaacc ctaatgaatt 2520 tgatattcgt cgggtaagaa aaggctttta attgggaaag ggcctcttcc tttgttaatg 2580 agcatcgggg ataggttatg aagtaatttt tggcatttat ttgaaaacga ccggctcttg 2640 gcatatttgc tgtcgtattg gatcggggga cactcaaaac tccaggggaa cggtggaatg 2700 gggggcatta tataggatgt cccccaatgg catatgtgta aataggtaga tgtccattca 2760 aaatttgaat tgcgaatatt ggcggccatc cgattaatat t 2801 //