ID   AY795987; SV 1; linear; genomic DNA; STD; VRL; 2800 BP.
XX
AC   AY795987;
XX
DT   24-MAR-2005 (Rel. 83, Created)
DT   22-APR-2005 (Rel. 83, Last updated, Version 2)
XX
DE   East African cassava mosaic virus isolate YV segment A, complete sequence.
XX
KW   .
XX
OS   East African cassava mosaic virus
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-2800
RX   PUBMED; 15784145.
RA   Ndunguru J., Legg J., Aveling T., Thompson G., Fauquet C.;
RT   "Molecular biodiversity of cassava begomoviruses in Tanzania: evolution of
RT   cassava geminiviruses in Africa and evidence for East Africa being a center
RT   of diversity of cassava geminiviruses";
RL   Virol J 2:21-21(2005).
XX
RN   [2]
RP   1-2800
RA   Ndunguru J., Fauquet C.M.;
RT   ;
RL   Submitted (27-OCT-2004) to the INSDC.
RL   ILTAB, Donald Danforth Plant Science Center, 975 North Warson Road, St.
RL   Louis, MO 63132, USA
XX
DR   MD5; fd3d4c5e7784daddc47bc5831ccaed37.
DR   EuropePMC; PMC1079959; 15784145.
DR   EuropePMC; PMC3163225; 21812981.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2800
FT                   /organism="East African cassava mosaic virus"
FT                   /segment="A"
FT                   /isolate="YV"
FT                   /mol_type="genomic DNA"
FT                   /country="Tanzania"
FT                   /db_xref="taxon:62079"
FT   gene            174..527
FT                   /gene="AV2"
FT   CDS             174..527
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="AV2"
FT                   /db_xref="GOA:Q58WH5"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WH5"
FT                   /protein_id="AAX39364.1"
FT                   /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL
FT                   IVFYGVRDYVEATRRYNNLNTRIQGAEEAELRQSIRTVLLPPLSASPEAKYGQQAHVSE
FT                   AQDVQNVSKPRCP"
FT   gene            334..1104
FT                   /gene="AV1"
FT   CDS             334..1104
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:Q58WH4"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WH4"
FT                   /protein_id="AAX39365.1"
FT                   /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYEPCCCPHCPRHQKQNMAN
FT                   RPMYRKPKMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGSGITHRV
FT                   GKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGSSPQDFGQVFNMFDNEP
FT                   TTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQSLVKRFYKINNHVVYNHQEQAKYENH
FT                   TENALLLYMACTHASNPVYATLKIRIYFYDAVTN"
FT   gene            complement(1101..1505)
FT                   /gene="AC3"
FT   CDS             complement(1101..1505)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer"
FT                   /db_xref="GOA:Q58WH3"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WH3"
FT                   /protein_id="AAX39369.1"
FT                   /translation="MDSRTGELITAPQAKNGVFTWEITNPLYFAITHHDTRPGNMNHDI
FT                   ITLQIRFNHNLRKALGIHKCFLNFKVWTTLRPQTGLFLRVFRYQVLKYLDMIGVISINT
FT                   VLQAVDHVLYDVLLNTLQVTEQHAIKFNLY"
FT   gene            complement(1246..1653)
FT                   /gene="AC2"
FT   CDS             complement(1246..1653)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transactivator protein"
FT                   /db_xref="GOA:Q58WH2"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WH2"
FT                   /protein_id="AAX39368.1"
FT                   /translation="MPPSSPSTSHCSQVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI
FT                   DCINHGFSHRGTHHCASSKEWRFYLGDNKPPLFRNHPPRHETREHEPRHHHTPDTVQPQ
FT                   PSEGVGDSQMFSQLQGLDDLTASDWSFLKSI"
FT   gene            complement(1562..2641)
FT                   /gene="AC1"
FT   CDS             complement(1562..2641)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="replication-associated protein"
FT                   /db_xref="GOA:Q58WH1"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WH1"
FT                   /protein_id="AAX39366.1"
FT                   /translation="MPDARSFQINAKNYFITYPRCSLTKEEALSQLKALSFPTNIKFIR
FT                   VCRELHQDGVPHIHVLIQFEGKFQCTNPRFFDLISPSRSTHFHPNIQGAKSSSDVKAYI
FT                   EKGGEFLDDGIFQVDARSARGEGQHLAQVYAEALSASSKSEALQIIKEKDPKSFFLQFY
FT                   NISANADRIFQAPPQTYVSPFLSSSFTQAPEDIEVWVSENICSPAARPWRPISLVLEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPIQIKGGIPTIFLCNPGPTSSFKEFLEEEQNQSLKAWAIKNATFITLHEPL
FT                   FSSAHQSPTPHSEDQGRQT"
FT   gene            complement(2251..2484)
FT                   /gene="AC4"
FT   CDS             complement(2251..2484)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="AC4"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:Q58WH0"
FT                   /protein_id="AAX39367.1"
FT                   /translation="MGCLISMFSYNSKASSNVPTRDSSISYPHPDQHISIRTFRALNHR
FT                   PMSKLTLKREGNFLTMEFSKSMPEVQGERASI"
XX
SQ   Sequence 2800 BP; 743 A; 555 C; 713 G; 789 T; 0 other;
     accggatggc cgcgcccgaa aaagcagggg gaccccacaa tatggccgcg cccatgaaag        60
     aaagtggtcc ccgcgcacat gtgttggtcg gccagtcata ttcacgcgtg aaagtctaga       120
     tatttgttgt gtgtctttat agacttcgtc gcgaagtagt ggagcgcgtc aacatgtggg       180
     atccattgtt aaacgatttt cccgaaaccg ttcacggttt ccgttctatg cttgctgtta       240
     aatacctgtt gcatttggaa caggaatacg atcgcgggac tgtcggggct gaatatatac       300
     gggatctaat agtgttctac ggtgtaagag attatgtcga agcgaccagg agatataata       360
     atctcaacac ccgtatccaa ggtgcggagg aggctgaact tcgacagtcc atacgaaccg       420
     tgttgttgcc cccactgtcc gcgtcaccag aagcaaaata tggccaacag gcccatgtat       480
     cggaagccca agatgtacag aatgtatcga agcccagatg tccctaaggg ctgtgaaggc       540
     ccatgtaagg ttcagtccta tgaacagaga gatgatgtta agcacactgg tatggtccga       600
     tgtgtcagtg atgttactcg tggatcaggt attacccata gagtcgggaa gagattttgt       660
     gtgaagtcta tatatatatt gggcaagatc tggatggatg agaatatcaa gaagcaaaat       720
     catacgaacc atgttatgtt cttcctcgtg cgagatagaa ggccttatgg gtcgagtcca       780
     caagattttg gacaagtgtt caacatgttt gataatgaac ctactactgc aactgtgaag       840
     aatgatctta gggaccggta tcaagtatta cgtaaattct atgcgactgt tgttggcgga       900
     ccctctggga tgaaggagca atcgctggtc aagaggtttt ataagatcaa taatcatgta       960
     gtgtataatc atcaggaaca ggccaagtat gaaaatcata ctgagaacgc gttgttatta      1020
     tatatggcat gtacacatgc ctctaatcct gtgtacgcta cgctgaaaat acgcatctat      1080
     ttctatgatg cagtgacaaa ttaataaagg ttgaatttta ttgcatgttg ctccgtaact      1140
     tggagagtgt ttagtaatac atcgtacaaa acatgatcaa cagcttgcag tacagtatta      1200
     atggaaataa cgcctatcat atctaaatac tttagcactt ggtatctaaa tactcttaag      1260
     aaaagaccag tctgaggccg taaggtcgtc cagaccttga agttgagaaa acatttgtga      1320
     atccccaacg ccttccgaag gttgtggttg aaccgtatct ggagtgtgat gatgtcgtgg      1380
     ttcatgttcc ctggtctcgt gtcgtggtgg gtgattgcga aatagagggg gtttgttatc      1440
     tcccaggtaa aaacgccatt ctttgcttga ggcgcagtga tgagttcccc tgtgcgagaa      1500
     tccatggttg atacagtcga tatggagata gaacgagcag ccgcattcga ggtctacccg      1560
     cctacgtctg acggccctgg tcttcgctgt gcggtgttgg actttgatgg gcacttgaga      1620
     acaatggctc gtggagggtg atgaaggtgg cattctttat agcccaggct ttaagggact      1680
     ggttctgttc ttcttccaga aactctttaa atgatgatgt tggtcctgga ttgcaaagga      1740
     agatagtggg aatgccgcct ttaatttgaa tcggcttccc gtactttgta ttgctttgcc      1800
     agtccctttg ggcccccatg aactctttga aatgcttgag gtagtgaggg tcgacgtcat      1860
     caatgacgtt gtaccatgcg tcgttactgt atacctttgg agacagatcc aggtgtccac      1920
     ataaataatt atggggtccc aaagaacgag cccacatggt tttcccggtt cggctatcac      1980
     cttctagaac aagactgatc ggtctccatg gccgcgcagc gggactgcaa atattttcag      2040
     atacccatac ttcgatgtct tcaggggctt gtgtaaatga agatgataaa aatggactaa      2100
     cgtaagtttg tggcggagcc tggaagattc tatcggcgtt agcagatatg ttatagaact      2160
     gtaaaaaaaa tgacttggga tctttttcct taataatttg aagagcttcg gatttagaag      2220
     aggcactcaa cgcttctgca tatacttgag ctaaatgctg gccctctccc cttgcacttc      2280
     gggcatcgac ttggaaaatt ccatcgtcaa gaaattcccc tcccttttca atgtaagctt      2340
     tgacatcgga cgatgattta gcgccctgaa tgttcggatg gaaatgtgtt gatcgggatg      2400
     gggatatgag atcgaagaat ctcgggttgg tacattggaa cttgccttcg aattgtatga      2460
     gaacatggat atgaggcacc ccatcctgat gtagttctct gcaaacccta atgaatttga      2520
     tgttcgtcgg gaaagaaagg gcttttaatt gggaaagggc ctcttccttg gttaatgagc      2580
     atcggggata agtgatgaaa taatttttgg catttatttg aaacgaccgg gcgtcgggca      2640
     tatttgttgg tcgcttttgg ggtcggtgga cacctcatgc ctatggcaat cgggggaacg      2700
     ggggacaata tatagtatgt ccccaatggc atatgtgtaa ataggtagac ttccattcga      2760
     aatttgaatt tcgcatattg gcggccatcc gattaatatt                            2800
//