ID   AY795986; SV 1; linear; genomic DNA; STD; VRL; 2804 BP.
XX
AC   AY795986;
XX
DT   24-MAR-2005 (Rel. 83, Created)
DT   22-APR-2005 (Rel. 83, Last updated, Version 2)
XX
DE   East African cassava mosaic virus isolate TZM segment A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2804
RX   PUBMED; 15784145.
RA   Ndunguru J., Legg J., Aveling T., Thompson G., Fauquet C.;
RT   "Molecular biodiversity of cassava begomoviruses in Tanzania: evolution of
RT   cassava geminiviruses in Africa and evidence for East Africa being a center
RT   of diversity of cassava geminiviruses";
RL   Virol J 2:21-21(2005).
XX
RN   [2]
RP   1-2804
RA   Ndunguru J., Fauquet C.M.;
RT   ;
RL   Submitted (27-OCT-2004) to the INSDC.
RL   ILTAB, Donald Danforth Plant Science Center, 975 North Warson Road, St.
RL   Louis, MO 63132, USA
XX
DR   MD5; 078cd427b40d51201749724cb698cf34.
DR   EuropePMC; PMC1079959; 15784145.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2804
FT                   /organism="East African cassava mosaic virus"
FT                   /segment="A"
FT                   /strain="Kenya"
FT                   /isolate="TZM"
FT                   /mol_type="genomic DNA"
FT                   /country="Tanzania"
FT                   /db_xref="taxon:62079"
FT   gene            174..533
FT                   /gene="AV2"
FT   CDS             174..533
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="AV2"
FT                   /db_xref="GOA:Q58WI1"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WI1"
FT                   /protein_id="AAX39358.1"
FT                   /translation="MWDPLLNEFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IGVLRCKSYVEATRRYHNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQKYGANRPHV
FT                   SEAQDVQNVSKPKCP"
FT   gene            334..1110
FT                   /gene="AV1"
FT   CDS             334..1110
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:Q58WI0"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WI0"
FT                   /protein_id="AAX39359.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKNMG
FT                   PTGPMYRKPKMYRMYRSPNVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITH
FT                   RVGKRFCVKSIYILGKIWMDENVKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDN
FT                   EPTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQSLVKRFFRINNHVVYNHQEQAKYE
FT                   NHTENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1107..1511)
FT                   /gene="AC3"
FT   CDS             complement(1107..1511)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:Q58WH9"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WH9"
FT                   /protein_id="AAX39363.1"
FT                   /translation="MDSRTGELITAPQARNGVFTWEITNPLYFEITNHDKRPGNMNHDI
FT                   ITLQIRFNHNLRKALGIHKCFLNFKVWTTLRPQTGLFLRVFRYQVLKYLDMIGVISINT
FT                   VLHAVDHVLYDVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1252..1659)
FT                   /gene="AC2"
FT   CDS             complement(1252..1659)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transactivator protein"
FT                   /db_xref="GOA:Q58WH8"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WH8"
FT                   /protein_id="AAX39362.1"
FT                   /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAVRRRRVDLGCGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGNNKSPLFRNHQPRQEAREHEPRHHHTPDTVQPQ
FT                   PPEGIGDSQVFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1568..2647)
FT                   /gene="AC1"
FT   CDS             complement(1568..2647)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication-associated protein"
FT                   /db_xref="GOA:Q58WH7"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WH7"
FT                   /protein_id="AAX39360.1"
FT                   /translation="MPRAGRFQINAKNYFITYPRCSLTKEEALSQLQALSYPTNIKFIR
FT                   VCRELHQDGVPHIHVLIQFEGKFQCTNPRFFDLISPSRSSHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDDGIFQVDARSARGEGQHLAQVYAEALNASSKSEALQIIKEKDPKSFFLQFH
FT                   NISANADRIFQAPPQTYVSPFLSSSFTQVPEDIEVWVSENICSPAARPWRPISIVLEGD
FT                   SRTGKTLWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPPLPQTLKEFMGAQKDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSLKAWALKNATFITIHEPL
FT                   FSSAHQSPTPHSEDQGRQT"
FT   gene            complement(2257..2490)
FT                   /gene="AC4"
FT   CDS             complement(2257..2490)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="AC4"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WH6"
FT                   /protein_id="AAX39361.1"
FT                   /translation="MGCLISMFSSNSKASSNVPTRDSSISFPHPDHHISIRTFRALNRR
FT                   PMSRLTLKREGNFLTMEFSKSMPEVQGERASI"
XX
SQ   Sequence 2804 BP; 724 A; 554 C; 734 G; 792 T; 0 other;
     accggatggc cgcgcctgaa aaagtaggtg gaccccagag gatggccgcg cccgtgaaag        60
     aaagtggtcc ctgcgcactt gttttggtcg gccagtcata tgcacgcgtg aaagtctaga       120
     tatttgttgt tcgtctttat agacttcgtc gcgaagtagt ggagcgcgtc aacatgtggg       180
     atccattgtt gaacgagttt cccgaaaccg ttcacggttt ccgttctatg cttgctgtta       240
     aatacctgtt acatcttgaa caggaatacg atcgcggtac tgtcggggct gagtatatac       300
     gggatctaat aggggttcta cggtgtaaga gttatgtcga agcgaccagg agatatcata       360
     atctcaacac ccgtatccaa ggtgcggagg aggctgaact tcgacagccc atacacgaac       420
     cgtgttgttg cccccactgt ccgcgtcacc agaagcaaaa atatggggcc aacaggcccc       480
     atgtatcgga agcccaagat gtacagaatg tatcgaagcc caaatgtccc taagggctgt       540
     gaaggcccat gtaaggttca gtcctatgaa cagagggatg atgttaagca cactggtatg       600
     gtccgatgtg tcagtgatgt tactcgtggg tcaggcatta cccatagagt cgggaagagg       660
     ttttgtgtga agtccatata tatattgggc aagatctgga tggatgagaa tgtcaagaag       720
     caaaatcata cgaaccatgt tatgttcttc cttgttcgag atagaaggcc ttatgggccg       780
     agtccgcaag attttggaca agtgttcaac atgtttgata atgaacctac tactgcaact       840
     gtgaagaatg atcttaggga ccggtatcag gtgttacgta aattctatgc gactgttgtt       900
     ggtggaccct ctgggatgaa ggaacaatcg ttggttaaga ggttttttag gatcaataat       960
     catgtagtgt ataatcatca ggaacaggcc aagtatgaga atcatactga gaatgcgttg      1020
     ttattgtata tggcatgtac acatgcctcg aatcctgtgt acgctacgct gaaaatacgc      1080
     atttatttct atgatgcagt gacaaattaa taaaggttga attttattgc atgttgctcc      1140
     gtaacttgga gtgtgtttag taatacatcg tacagaacat gatcaacagc gtgaagtaca      1200
     gtgttaatgg aaataacgcc tatcatatct aaatacttga gcacttgata tctaaatact      1260
     cttaagaaaa gaccagtctg aggccgtaag gtcgtccaga ccttgaagtt gagaaaacac      1320
     ttgtgaatcc ccaatgcctt ccggaggttg tggttgaacc gtatctggag tgtgatgatg      1380
     tcgtggttca tgttccctgg cctcttgtcg tggttggtga tttcgaaata gaggggattt      1440
     gttatttccc aggtaaaaac gccattcctt gcttgaggcg cagtgatgag ttcccctgtg      1500
     cgagaatcca tggttgatgc agtcgatatg gagatagaac gagcagccgc atccgaggtc      1560
     tacgcgccta cgtctgacgg ccctggtctt cgctgtgcgg tgttggactt tgatgggcac      1620
     tagagaacaa tggctcgtgg atggtgatga aggtggcatt ctttaaagcc caggctttaa      1680
     gggactggtt cttttcctcg tccagaaact ctttatatga tgatgttggt cctggattgc      1740
     agaggaagat cgtgggaatg ccgcctttaa tttgaattgg cttcccgtac tttgtattgc      1800
     tttgccagtc cttttgggcc cccatgaatt ccttaagtgt ttgaggtagt ggggggtcga      1860
     cgtcatcaat gacgttgtac caggcgtcgt tgctgtagac ctttggacta agatccaggt      1920
     gtccacacaa gtagttgtgt ggtcccagag agcgggccca cagtgttttc ccggttcggc      1980
     tatcaccttc tagaacaata ctgatcggtc tccatggccg cgcagcggga ctgcatatat      2040
     tttcggagac ccatacttca atatcttctg ggacttgtgt aaaagaggag gataagaacg      2100
     gactaacgta agtttgtggc ggagcctgga agattctatc tgcgttagca gatatgttat      2160
     ggaactgtaa aaaaaaggat tttggatctt tttctttaat aatttgaaga gcttctgatt      2220
     tagaagaagc attcaacgct tctgcatata cctgggctaa atgctggccc tctccccttg      2280
     cacttctggc atcgacttgg aaaattccat cgtcaagaaa ttcccctccc ttttcaatgt      2340
     aagccttgac atcggacgac gatttagcgc cctgaatgtt cggatggaaa tgtgatgatc      2400
     gggatgggga aatgagatcg aagaatctcg ggttggtaca ttggaacttg ccttcgaatt      2460
     ggatgagaac atggatatga ggcaccccat cttgatgtag ttctctgcaa accctaatga      2520
     atttgatatt cgtcgggtac gaaagggctt gtaattggga aagggcctct tcctttgtta      2580
     atgagcatcg gggataggtt atgaaataat ttttggcatt gatttgaaaa cgaccggctc      2640
     tcggcatatt tgctgtcgtt ttggaacggg ggacactcaa aactccaggg gaacggtgga      2700
     atggggggca ttatatagga tgtcccccaa tggcatatgt gtaaataggt aaacatccat      2760
     tcaaaatttg aatgtcgaat attggcggcc atccgattaa tatt                       2804
//