ID AF422174; SV 1; circular; genomic DNA; STD; VRL; 2785 BP. XX AC AF422174; XX DT 16-JAN-2002 (Rel. 70, Created) DT 15-APR-2005 (Rel. 83, Last updated, Version 6) XX DE East African cassava mosaic Zanzibar virus segment DNA-A, complete DE sequence. XX KW . XX OS East African cassava mosaic Zanzibar virus OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RP 1-2785 RX DOI; 10.1007/s00705-004-0380-1. RX PUBMED; 15375675. RA Maruthi M.N., Seal S., Colvin J., Briddon R.W., Bull S.E.; RT "East African cassava mosaic Zanzibar virus - a recombinant begomovirus RT species with a mild phenotype"; RL Arch. Virol. 149(12):2365-2377(2004). XX RN [2] RP 1-2785 RA Maruthi M.N., Colvin J., Seal S.E.; RT ; RL Submitted (20-SEP-2001) to the INSDC. RL Plant, Animal and Human Health Group, Natural Resources Institute, RL University of Greenwich, Central Avenue, Chatham Maritime, Kent ME4 4TB, UK XX DR MD5; bf4c9e9e7ce8bbe4605471cffd7a2088. DR EuropePMC; PMC1079959; 15784145. XX FH Key Location/Qualifiers FH FT source 1..2785 FT /organism="East African cassava mosaic Zanzibar virus" FT /segment="DNA-A" FT /host="cassava plant" FT /mol_type="genomic DNA" FT /country="Tanzania:Zanzibar, Uguja Island" FT /note="collected in 1998" FT /db_xref="taxon:223275" FT gene 174..530 FT /gene="AV2" FT CDS 174..530 FT /codon_start=1 FT /gene="AV2" FT /product="precoat protein" FT /db_xref="GOA:Q8V384" FT /db_xref="InterPro:IPR002511" FT /db_xref="InterPro:IPR005159" FT /db_xref="UniProtKB/TrEMBL:Q8V384" FT /protein_id="AAL60506.1" FT /translation="MWDPLLNDFPETVHGFRSMLAVKYLLHLEQEYDRGTVGAEYIRDL FT IGVLRCKSYVEATRRYNNLNTRIQGAEEAELRQPIHEPCCCPHCPRHQKQNMGQQAHVS FT EAQDVQTVSKPRCS" FT gene 334..1107 FT /gene="AV1" FT CDS 334..1107 FT /codon_start=1 FT /gene="AV1" FT /product="coat protein" FT /db_xref="GOA:Q8V383" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="UniProtKB/TrEMBL:Q8V383" FT /protein_id="AAL60507.1" FT /translation="MSKRPGDIIISTPVSKVRRRLNFDSPYTNRVVAPTVRVTRSKIWA FT NRPMYRKPKMYRLYRSPDVPKGCEGPCKVQSYEQRDDVKHTGMVRCVSDVTRGPGITHR FT VGKRFCVKSIYILGKIWMDENIKKQNHTNHVMFFLVRDRRPYGPSPQDFGQVFNMFDNE FT PTTATVKNDLRDRYQVLRKFYATVVGGPSGMKEQALVKRFFKINNHVVYNHQEQAKYEN FT HTENALLLYMACTHASNPVYATLKIRIYFYDAVTN" FT gene complement(1104..1508) FT /gene="AC3" FT CDS complement(1104..1508) FT /codon_start=1 FT /gene="AC3" FT /product="unknown" FT /db_xref="GOA:Q8V382" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:Q8V382" FT /protein_id="AAL60508.1" FT /translation="MDSRTGELITAPQAKNGVFTWEITNPLYFEITNHDRRPGNMNHDI FT ITLQIRFNHNLRKALGIHKCFLNFKVWTTLRPQTGLFLRVFKYQVLKYLDMIGVISINT FT VLQAVAHVLYNVLLNTLQVTEQHAIKFNLY" FT gene complement(1249..1656) FT /gene="AC2" FT CDS complement(1249..1656) FT /codon_start=1 FT /gene="AC2" FT /product="transcriptional activator protein" FT /db_xref="GOA:Q8V381" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:Q8V381" FT /protein_id="AAL60509.1" FT /translation="MPPSSPSTSHCSLVPIKVQHRTAKTRAVRRRRVDLECGCSFYLHI FT DCINHGFSHRGTHHCASSQEWRFYLGNNKSPLFRNHQPRQEAREHEPRHHHTPDTVQPQ FT PEEGAGDSQMFSQLQGLDDLTASDWSFLKSI" FT gene complement(1565..2644) FT /gene="AC1" FT CDS complement(1565..2644) FT /codon_start=1 FT /gene="AC1" FT /product="replication protein" FT /db_xref="GOA:Q8V380" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:Q8V380" FT /protein_id="AAL60510.1" FT /translation="MTPPKRFKIQAKNYFLTYPKCSLSKHEALSQILNIPTPTNKKYIK FT VCRELHDDGQPHLHMLIQFEGKFSCTNKRFFDLVSPTRSTHFHPNIQGAKSSSDVKSYI FT DKDGDTTEWGEFQIDARSARGGCHNANDACAEALNSGSKAAALLIIKEKLPKEFIFQYH FT NLSSNLDRIFQEPPAPYVSPFLSSSFTNVPEELEVWVSENVMGSAARPWRPNSIVIEGD FT SRTGKTMWARSLGPHNYLCGHLDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW FT QSNTKYGKPIQIKGGIPTIFLCNPGPTSSYKEFLDEEKNQSLKAWALKNATFITLHEPL FT FSSAHQSPTPHSEDQGRQT" FT gene complement(2230..2487) FT /gene="AC4" FT CDS complement(2230..2487) FT /codon_start=1 FT /gene="AC4" FT /product="unknown" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:Q8V379" FT /protein_id="AAL60511.1" FT /translation="MGNLISTCLFSSKANSHAQISDSSTWYPQHDQHISIRTFKELNPA FT PTSSPTSTKMGIPLSGANSRSTPDRLEAAATMLMTHVPKH" XX SQ Sequence 2785 BP; 722 A; 548 C; 720 G; 795 T; 0 other; accggttggc cgcgcccgaa aaagcaggtg gaccccacag gatggccgcg cccgtgaaag 60 aaagtggtcc ccgcgcactt gtttcggtca gccagtcata ttcacgcgtg gaagtctaga 120 tatttgttgt ttgtctttat agacttcgtc gcgaagtagt ggagcgcgtc aacatgtggg 180 atccattgtt aaacgatttc cctgaaaccg ttcacggttt ccgttccatg cttgctgtta 240 aatacctgtt acatcttgaa caggaatacg atcgcggtac tgtcggggct gagtatatac 300 gggatctaat aggggttcta cggtgtaaga gttatgtcga agcgaccagg agatataata 360 atctcaacac ccgtatccaa ggtgcggagg aggctgaact tcgacagccc atacacgaac 420 cgtgttgttg cccccactgt ccgcgtcacc agaagcaaaa tatgggccaa caggcccatg 480 tatcggaagc ccaagatgta cagactgtat cgaagcccag atgttcctaa gggctgtgaa 540 ggcccatgta aggttcagtc gtatgaacag agggatgatg ttaagcacac tggtatggtt 600 cgatgtgtca gtgatgttac gcgtgggcca ggcattaccc atagagtcgg gaagaggttc 660 tgtgtgaagt ccatatatat attgggcaag atctggatgg atgagaatat caagaagcaa 720 aatcatacga accatgttat gttcttcctc gttcgagata gaaggcctta tgggccgagt 780 cctcaagatt ttggacaagt gttcaacatg tttgataatg aaccgactac ggcaaccgtg 840 aagaatgatc ttagggaccg gtatcaggtg ttacgtaaat tctatgcaac tgttgttggt 900 ggaccctctg ggatgaagga acaagcgctg gttaagaggt tttttaagat caataatcat 960 gtagtgtata atcatcagga acaggccaag tatgagaatc atactgagaa tgcgttgtta 1020 ttgtatatgg catgtacaca tgcctcgaat cctgtgtacg ctacgctgaa aatacgcatc 1080 tatttctatg atgcagtgac aaattaataa aggttgaatt ttattgcatg ttgctccgta 1140 acttggagtg tgtttagtaa tacattgtac agaacatgag caacagcttg aagtacagtg 1200 ttaatggaaa taacgcctat catatctaaa tacttgagca cctgatattt aaatactctt 1260 aagaaaagac cagtctgagg ccgtaaggtc gtccagacct tgaagttgag aaaacatttg 1320 tgaatcccca gcgccttcct caggttgtgg ttgaaccgta tctggagtgt gatgatgtcg 1380 tggttcatgt tccctggcct cctgtcgtgg ttggtgattt cgaaatagag gggatttgtt 1440 atttcccagg taaaaacgcc attcttggct tgaggcgcag tgatgagttc ccctgtgcga 1500 gaatccatgg ttgatgcagt cgatatggag atagaacgag cagccgcatt cgaggtctac 1560 ccgcctacgt ctgacggccc tggtcttcgc tgtgcggtgt tggactttga tgggcactag 1620 agaacaatgg ctcgtggagg gtgatgaagg tggcattctt taaagcccag gctttaaggg 1680 actggttctt ttcctcgtcc agaaactctt tatatgatga tgttggtcct ggattgcaga 1740 ggaagatagt gggaatgcca cctttaattt gaattggctt cccgtacttt gtattgcttt 1800 gccagtctct ttgggccccc atgaattctt tgaagtgctt tagatagtgg ggatcgacgt 1860 catcaatgac gttgtaccag gcgtcgttgc tgtagacctt tggactgaga tccaggtgtc 1920 cacataagta gttgtgtgga cccagagagc gggcccacat tgtctttccg gtcctactat 1980 cgccctcgat gacgatacta ttaggtctcc atggccgcgc agcggaaccc atcacgttct 2040 cggaaaccca gacttcaagt tcctcaggaa cgttagtaaa agaagaagac agaaaaggag 2100 aaacataagg agctggtggc tcttgaaaaa tcctatctaa attactactt aagttatgat 2160 attgaaaaat aaattctttt gggagtttct ccttaataat tagaagtgct gctgccttgg 2220 aacctgagtt taatgcttcg gcacatgcgt cattagcatt gtggcagccg cctctagccg 2280 atctggcgtc gatctggaat tcgccccact cagtggtatc cccatctttg tcgatgtagg 2340 acttgacgtc ggagctggat ttagctcctt gaatgttcgg atggaaatgt gttgatcgtg 2400 ttggggatac caggtcgaag aatcgcttat ttgtgcatga gaatttgcct tcgaactgaa 2460 taagcatgtg gagatgaggt tgcccatcat cgtgaagttc tcggcaaact ttgatgtatt 2520 tcttgtttgt gggagttggg atgtttaata tttgggataa tgcttcgtgt ttagatagag 2580 aacatttggg atatgtgaga aaatagtttt tggcctgtat tttaaaacgc ttggggggag 2640 tcatttatgc gagagcaatt ggagacacct cggttgatgt ctctactgaa ttggagacaa 2700 tatatagtgt ctccaaatgg cataatggta attaggtaga tctattttca aaatttgaac 2760 caaaagcggc catccgatta atatt 2785 //