ID   U59751; SV 1; linear; genomic DNA; STD; VRL; 8159 BP.
XX
AC   U59751;
XX
DT   02-JUL-1996 (Rel. 48, Created)
DT   18-APR-2005 (Rel. 83, Last updated, Version 6)
XX
DE   Cassava vein mosaic virus, complete genome.
XX
KW   .
XX
OS   Cassava vein mosaic virus
OC   Viruses; Ortervirales; Caulimoviridae; Cavemovirus.
XX
RN   [1]
RP   1-8159
RX   DOI; 10.1007/s007050050344.
RX   PUBMED; 9645200.
RA   de Kochko A., Verdaguer B., Taylor N., Carcamo R., Beachy R.N., Fauquet C.;
RT   "Cassava vein mosaic virus (CsVMV), type species for a new genus of plant
RT   double stranded DNA viruses?";
RL   Arch. Virol. 143(5):945-962(1998).
XX
RN   [2]
RP   1-8159
RA   Kochko de A., Verdaguer B., Beachy R.N., Fauquet C.;
RT   ;
RL   Submitted (02-JUN-1996) to the INSDC.
RL   Cell Biology, ORSTOM/The Scripps Research Institute, 10666 N. Torrey Pines
RL   Rd., La Jolla, CA 92037, USA
XX
DR   MD5; 39d1b5741130bd68e1900914a47d389e.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..8159
FT                   /organism="Cassava vein mosaic virus"
FT                   /mol_type="genomic DNA"
FT                   /note="double stranded DNA plant pararetrovirus that was
FT                   originally considered to be a caulimovirus"
FT                   /db_xref="taxon:38062"
FT   CDS             30..4148
FT                   /codon_start=1
FT                   /product="ORF 1"
FT                   /note="polyprotein; MW 163745.30 Daltons; contains the coat
FT                   protein and the movement protein consensus sequences at the
FT                   C terminus. Unknown function(s) at the N terminus."
FT /db_xref="GOA:Q66283" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR028919" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/Swiss-Prot:Q66283" FT /protein_id="AAB03325.1" FT /translation="MDSKDFTQLNLEEHSYKVNREKLPIDSYIQYGGFTYANFTPYIIH FT GDDGFGEHKQLNWTNKLLWNKLGKLNIKDTQILMQNNISEEQHNELISLEAQKIARENL FT ADRINYLQNINTSIDFKLWKMNKENLERQELLLRQINELKEEIKSLKNIPSTVAIIPTN FT TYTINMIRTETEDWKYFKYIEKELVQNKTEAIAKILDNSYIINDNLGLLYERYEEINPT FT PKPYKRPETIFDTPQYAKYIRNQKRQEEYEKQQELKKENENKEYQEFLEWKEKQQKDKG FT KGIQTVYPTLIIPDIKPEKQKKEDMMLEMIKNLQNELEQLKIQRHKEHEKQAELTKIQM FT LEEELEEELDPDNLEKEVLNNIQNIQISSDISESSEINEISDNETEQISGSDSDYNNEQ FT INVKIEGEEYEYKDNYRYYKPQPPYYKKDIRRERQYKGQSSQRADYIKNRREQFESTYQ FT ANMNTTINDSGEILNLDCTTPEEAEDRIQKWTQSMSIALVKQQLSNEQAKQFIRRTFIG FT NVKEWYKNLTNEAKQKLEGNAPLLSLTHMELGLRAEFGKLGIESDVEKHEKKTSIARHK FT ILQLQICSMDHQNLNAYLCEFQEYYYSANYTEAESENILNMFYSKLPEPWGQQVLNGYL FT SEIKGKNLLDSIGARMTYLQEFISDKCKENWTQKQARKIQLSKNLDCSYYEVGKYGCKQ FT IRPHKRKRYYKKYIPIKRKYFNKKRYKKYYRPKKFLKRKNPHKACKCYNCGEEGHISPN FT CKKPKKKTRINNLEALEFKNTEMENLEFETNKNDIIWVEEIEVIQPLHYEEEEKYKGNY FT SDRILQNPYYINSISIEELDNLDWEFEYQEDIEDDLEYQNFVYQESNDNWYSDQENWYS FT DEQYLGIYMFIGETSGENNQDNMEGIIKEYNKTEPEKINKIIFTSEKFKQIMENDLNMT FT KDKIFHNNKLKKLFGKKEIEYYIVTDIEHPIDVKYVQNQDKIINLPLYNQEIFENEIQK FT IPDKDQNKIRNIHLAAVEIVVKAYFREGIDTPFEIILCDDRITYPQEGSLVEVLIGNLI FT YQKVKFTKIINYSISIEDKNLDKSLVMYWNLEGIKMIKDSKIFSIRLRNLYVLSNKHIV FT KNKKQYNGNIIIEPIFQDVIQNNNRNYIEYGKPGKFDRTKLKSYSRRFNEPLRLDDRTN FT IQREKDQIEKADHNLELQKELNNLNYYSQQGQSSNVLDIPKILKIENTKNNYKQFHIIG FT KITEGRLNKFYPILIDTGAADSYISSKILEDEKLVSNKLSKVVTSYNADNEKHIYDRNT FT EVIIELIDKNNEKYKINFIGLVDQLRLLEGGKAEILLGMNILQNLKPYCITDDYLEINL FT GFRCIKINRIKKDIFEIREDLQQMNVLNE" FT CDS 4132..4347 FT /codon_start=1 FT /product="ORF 2" FT /function="unknown" FT /note="MW 8797.30 Daltons" FT /db_xref="UniProtKB/Swiss-Prot:Q89635" FT /protein_id="AAB03326.1" FT /translation="MSLMNSTNPEFIEYYHTSWKPHKIELVDNIYHSYGYYVYTRSVIK FT RFNKHLIKTTYKRIFSHPENIVLHFR" FT CDS 
4344..6302 FT /codon_start=1 FT /product="ORF 3" FT /note="MW 77063.30 Daltons; polyprotein, contains an FT aspartic proteinase and reverse transcriptase (with FT ribonuclease H domain) consensus" FT /db_xref="GOA:Q89703" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR000588" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR041577" FT /db_xref="UniProtKB/Swiss-Prot:Q89703" FT /protein_id="AAB03327.1" FT /translation="MNKITYMTIKISIPKYMSRIYHGLFDTGANICICKKKVLPDELWH FT KTENLVLRGFNDEKHVAEYRADNITIMIAKEKFIIPYIYAMDEMSPDIIIGATFYNKYS FT PIELDIGKGIIKFTKNNEKYPNYLVKYPKKRKLVPWTKGNPSVTETMENIGINQIESRN FT PIEEEINQILGTDIYGENPLEKWEKHKTLAKIELKNETDNIYKPPMLYQETDLPEFKMH FT IEEMIKEGFIEEKTNFEDKKYSSPAFIVNKHSEQKRGKTRMVIDYKDLNKKAKVVKYPI FT PNKDTLIHRSIQARYYSKFDCKSGFYHIKLEEDSKKYTAFTVPQGYYQWKVLPFGYHNS FT PSIFQQFMDRIFRPYYDFIIVYIDDILVFSKTIEEHKIHIAKFRDITLANGLIISKKKT FT ELCKEKIDFLGVQIEQGGIELQPHIINKILEKHTKIKNKTELQSILGLLNQIRHFIPHL FT AQILLPIQKKLKIKDEEIWTWTKEDEEKIKLIQDYSKNLVIKMKYPINKEDMNWIIEVD FT ASNNAYGSCLKYKPKNSKIEYLCRYNSGTFKENEQKYDINRKELIAVYQGLQSYSLFTC FT EGNKLVRTDNSQVYYWIKNDTNKKSIEFRNIKYLLAKIAVYNFEIQLIDGKTNIIADYL FT SRYNSSDTDGRYDEANT" FT CDS 6274..7452 FT /codon_start=1 FT /product="ORF 4" FT /note="MW 46297.90 Daltons; putative transactivator factor FT (TAV)" FT /db_xref="GOA:Q66284" FT /db_xref="UniProtKB/Swiss-Prot:Q66284" FT /protein_id="AAB03328.1" FT /translation="MEDMMKQILEKLNTIEKNISETNIRIEKIEKEQELKRKVELYGKE FT PEKKLHKENIEKLSSSIEDKIIQNIDKKLKKIENVEEQYQWKNIVKINKPLSVGEKYME FT NFKKILVYLGEKHPKLEELYSLTDYNKLVADIYTDRNLVISAYNYGLLQVLYIEHPSQL FT ELFDENIKLAYMKFRNVTKAQLIYMRIYSAMAEPYDKGVIPKIEIIKFGITYSKLKYDE FT VYEHQPIEKLDLKKFISQKRALGILVIQKEIENLNGNVWLYSNLDGRLILSNHNKAENV FT EVKNILSDWSKKLSIPEGNYPRCSIKNPMFTGKTMEVLCELSKKQINMRHICNLCSKMK FT NVQIQDPILPEYEEEYVEIEKEEPGEEKNLEDVSTDDNNEKKKIRSVIVKET" FT CDS 7973..8137 FT /codon_start=1 FT /product="ORF 5" FT /function="unknown" FT /note="putative ORF; MW 6293.50 Daltons; putative role in FT transcription regulation" FT /db_xref="GOA:Q66285" FT 
/db_xref="UniProtKB/Swiss-Prot:Q66285" FT /protein_id="AAB03329.1" FT /translation="MKGIKCLSITCFLSDNSIIRVVLIRRMLKILLFSLLVLIILCFID FT PILFYFICL" XX SQ Sequence 8159 BP; 3804 A; 826 C; 1208 G; 2321 T; 0 other; tggtatcaga gcttagttta aaaataataa tggatagtaa agattttaca caacttaatc 60 tagaagaaca tagttataaa gttaatagag aaaaattacc tatagactca tatattcagt 120 atggaggatt tacttatgca aattttacac catatattat acatggagat gatggatttg 180 gagaacataa acaactgaat tggactaata aattattatg gaataaatta ggtaaattaa 240 atatcaaaga tacacaaatt ctaatgcaga ataatattag tgaagaacaa cataatgaat 300 taataagctt agaagcacaa aaaatagcta gagaaaattt agctgataga attaattatt 360 tacaaaatat taatacttca atagatttta agttatggaa aatgaataaa gaaaatttag 420 aaaggcaaga attattatta agacaaataa atgaattaaa agaagaaata aaatcattaa 480 aaaatatacc aagtacagta gcaataatac caactaatac ttatactatt aatatgatta 540 ggacagaaac agaagattgg aaatatttta aatatattga aaaagaatta gttcagaata 600 aaactgaagc aatagcaaaa atattagata atagttatat aataaatgat aatttaggat 660 tattatatga aagatatgaa gaaattaatc ccacacctaa accatataaa agaccagaaa 720 ctatatttga tacacctcaa tatgcaaaat atattaggaa tcaaaaaaga caagaagagt 780 atgaaaaaca acaagaatta aaaaaggaaa atgaaaataa agaataccaa gaatttttag 840 aatggaaaga aaaacaacaa aaagataaag gtaaaggaat acaaactgta tacccgacat 900 taataatacc agatataaaa cctgaaaaac aaaagaaaga agatatgatg ttagaaatga 960 ttaaaaattt acaaaatgaa ttagaacaat taaaaattca aagacataaa gaacacgaaa 1020 aacaagcaga gttaactaaa atacagatgt tagaagagga attagaagaa gaattagatc 1080 ctgataatct agaaaaagaa gttttaaata acattcaaaa tatacaaatc agtagtgata 1140 taagtgaatc atcagagata aatgaaattt cagataacga aacagaacaa atttcaggaa 1200 gtgatagtga ttataataat gaacaaatta atgttaagat agaaggagaa gaatatgaat 1260 acaaagacaa ttatagatat tataaaccac aaccaccata ttacaaaaaa gatataagaa 1320 gagaaaggca atataaaggt caaagttcac aaagagcaga ttacataaaa aacagaagag 1380 aacaatttga atctacatat caagcaaata tgaatacaac cataaacgat agtggagaaa 1440 ttcttaattt agattgcaca acaccagagg aagcagaaga tagaatacaa aaatggactc 1500 aatctatgtc aatagcatta gttaaacaac 
aattatctaa tgaacaagca aaacaattca 1560 taagaagaac ttttatagga aatgtaaaag aatggtataa aaatttaaca aatgaagcta 1620 aacaaaaatt agaagggaat gcaccattac taagtttaac tcatatggaa ttaggactta 1680 gagcagaatt tggaaaatta ggaatagaat cagatgtaga aaaacatgaa aagaaaacat 1740 ctattgcaag acataaaata ttacaattac aaatatgtag tatggatcat caaaacttaa 1800 atgcatattt atgtgaattt caagaatatt attatagtgc aaattataca gaagcagaat 1860 cagaaaatat attaaatatg ttttatagta aactcccaga accatgggga caacaagtat 1920 taaatggata tttatcagaa ataaaaggta aaaatttact agatagtatt ggagctcgaa 1980 tgacatattt acaagaattt atatcagata aatgtaaaga aaactggaca caaaaacaag 2040 ctagaaaaat acaattatca aagaatctag attgtagcta ttatgaagta ggaaaatatg 2100 gatgtaaaca aataagacct cataagagaa aaagatatta taaaaaatat attcctataa 2160 aaaggaaata ttttaataag aaaagataca aaaaatatta tagacctaaa aaattcttaa 2220 aaaggaaaaa tccgcataaa gcatgtaaat gttataattg tggagaagaa ggacacataa 2280 gtccaaattg taaaaaacca aaaaagaaaa ctagaattaa taatttagaa gctttagaat 2340 ttaaaaatac tgagatggag aatctagaat ttgaaacaaa taaaaatgat ataatatggg 2400 tagaagaaat agaagttata caacctttac attatgaaga agaagaaaaa tacaaaggaa 2460 attactcaga tagaatatta caaaatcctt attatataaa ttcaataagt attgaagaat 2520 tagataattt agattgggaa tttgaatatc aagaagatat tgaagatgat ttagaatacc 2580 aaaattttgt atatcaagaa tcaaatgata attggtatag tgatcaagaa aattggtata 2640 gtgatgaaca atatttagga atatatatgt ttataggaga aacatcagga gaaaataatc 2700 aagacaatat ggaaggaata ataaaagaat ataataaaac agaaccagaa aaaataaata 2760 aaataatatt tacctcagaa aaatttaaac aaattatgga aaatgattta aatatgacga 2820 aagataaaat atttcataat aataaactta aaaaattatt tggaaagaaa gaaatagaat 2880 actatatagt tacagatatc gaacatccta ttgatgtaaa atatgtacaa aatcaagata 2940 aaataataaa cctaccatta tataatcaag aaatttttga aaatgaaata caaaaaatac 3000 ctgacaagga tcaaaataaa ataagaaata tacatttagc agcagtagaa attgtagtaa 3060 aagcatattt tagagaagga atagacacac catttgaaat tatattatgc gatgatagaa 3120 taacatatcc acaagaggga agtttagtag aagttttaat aggaaattta atatatcaga 3180 aagtaaaatt tacaaaaata attaattatt ctataagcat 
agaagataaa aatttagata 3240 aaagtttagt aatgtattgg aacttagaag gaattaaaat gataaaagat agcaaaatat 3300 ttagtataag attaagaaat ctttatgttt taagtaataa acatatagtg aaaaataaaa 3360 aacaatataa tggaaatata attattgaac ctatatttca agatgtaata caaaataata 3420 atagaaatta tatagaatat ggtaaaccag gaaaatttga tagaacaaaa ttaaaatcat 3480 atagtagaag atttaatgaa ccgttaagac ttgatgatag aacaaatatc cagagagaaa 3540 aggaccaaat cgaaaaggca gaccataatt tagaattaca aaaagaacta aataatttaa 3600 attattattc tcaacaagga caaagttcta atgttttaga tataccaaaa atattaaaga 3660 tagagaacac taagaataat tataagcaat ttcatattat tggaaaaatt actgaaggaa 3720 gattaaataa attttatccg atattaatag atacaggagc agcagattca tatataagtt 3780 caaagatctt agaagatgag aaattagttt caaataaatt atctaaagta gttacatcat 3840 ataacgcaga taatgaaaaa catatttatg acagaaacac agaagtaata attgaattaa 3900 tagataaaaa taatgaaaaa tataaaatca attttatcgg attagtagat caattaagac 3960 ttttagaagg aggaaaagct gaaatattat taggaatgaa catattacaa aatttaaaac 4020 cctattgtat aacagatgat tacctagaaa ttaatttagg ctttagatgt ataaaaatta 4080 atagaataaa gaaagatatt ttcgaaatta gagaagattt acaacaaatg aatgtcctta 4140 atgaatagta caaatccaga atttatagaa tattatcata cttcatggaa accacataaa 4200 atagaattag ttgataatat atatcattca tatggatatt atgtgtatac aagaagtgta 4260 ataaaaagat ttaataaaca tttaattaaa actacatata aaagaatatt tagtcatcca 4320 gaaaatatag tattgcattt tagatgaata agattaccta tatgacaatc aaaatatcaa 4380 tacctaaata tatgtcaaga atttatcatg gattgtttga tactggagca aatatatgta 4440 tttgtaaaaa gaaagtttta ccagatgaat tatggcataa gacagaaaat ttagtactaa 4500 gaggatttaa tgatgaaaag catgttgctg aatatagagc agataatata acaataatga 4560 tagcaaaaga aaaatttata atcccatata tatatgctat ggatgaaatg tcaccagata 4620 ttattattgg agcaactttt tataataaat atagtcccat agaactagac ataggaaaag 4680 gaattataaa atttacaaaa aataatgaaa aatacccaaa ttatttagta aaatatccta 4740 agaagagaaa attagtacca tggactaaag gaaacccgag tgttactgaa acaatggaaa 4800 atataggaat aaatcagatt gaaagtagaa atcctataga agaagaaata aatcaaatat 4860 taggaacaga tatatatggc gaaaatccat tagaaaaatg ggaaaaacat 
aaaacattag 4920 caaaaataga attaaaaaat gaaacagata atatttataa accaccaatg ttatatcaag 4980 aaacagattt accagaattt aaaatgcata tagaagaaat gattaaagaa ggttttatag 5040 aagaaaaaac taattttgaa gataaaaaat attctagccc agcttttata gtaaataaac 5100 attcagaaca aaaaagagga aaaactagaa tggtaattga ttataaagat ttaaataaaa 5160 aagctaaagt agtaaaatat cctatcccaa ataaagatac attaatacat agaagtatac 5220 aggcaagata ctatagtaaa tttgattgta aatcaggatt ttatcatata aagttagaag 5280 aagatagtaa aaaatataca gcttttacag ttccgcaggg atattatcag tggaaagtat 5340 taccatttgg atatcataac tcaccaagta ttttccaaca atttatggat agaatattta 5400 gaccatatta tgattttatt attgtgtata ttgatgatat attagtcttt tctaagacta 5460 ttgaagagca taaaatacat attgcaaaat ttagagatat aacacttgca aatggattaa 5520 ttattagtaa gaaaaaaaca gaattatgta aagaaaaaat agatttcctt ggagtccaaa 5580 tagaacaagg aggtatagaa ttacagcctc atattattaa taaaatatta gaaaaacata 5640 cgaaaataaa aaataaaact gaattacaaa gtatattagg attattaaac cagattagac 5700 attttatacc acatttagca cagattttat tacctataca gaaaaaatta aaaataaaag 5760 atgaagagat ttggacttgg actaaagaag atgaagaaaa aataaaactt atacaagatt 5820 atagtaaaaa tttagtaata aaaatgaaat atcctattaa taaagaagat atgaattgga 5880 ttattgaagt agacgcaagt aataatgctt atggtagttg tttaaaatat aaaccgaaaa 5940 actcaaaaat agaatattta tgtagatata actcaggaac ttttaaagaa aatgagcaaa 6000 aatatgatat taatagaaaa gaactgattg cagtatacca aggattgcag agttattctt 6060 tatttacatg tgaaggaaat aaattagtta gaacagataa ttcacaagta tattattgga 6120 taaaaaatga tactaataaa aaatcaattg aatttagaaa tataaaatat ttattagcta 6180 aaattgcagt ctataatttc gaaattcaac taatagatgg aaaaacaaat attattgctg 6240 attatttgag caggtacaat tcatctgata ctgatggaag atatgatgaa gcaaatactt 6300 gagaaactca atacaattga aaaaaatatt tctgaaacaa atatacgaat tgaaaaaata 6360 gaaaaagaac aagagcttaa aagaaaagta gagctctatg gaaaagaacc agagaaaaaa 6420 ttacataaag aaaatataga aaaattaagc tcttctattg aagacaaaat catacaaaat 6480 attgataaga aattaaagaa aattgaaaat gtcgaagaac aatatcagtg gaaaaatata 6540 gtcaaaataa ataaaccact ttctgtggga gaaaaatata tggagaattt caaaaagata 
6600 cttgtttatc ttggagaaaa acatcctaaa ttggaagaat tatattcttt gacagattac 6660 aacaaattag tagcagatat ttatactgat agaaatttag taatttcagc atataattat 6720 ggattacttc aagtgcttta tattgagcat ccgtctcaat tagagttatt tgatgaaaat 6780 ataaaacttg cttatatgaa atttagaaat gttaccaagg cacaacttat atatatgcga 6840 atttacagtg caatggctga accctatgat aaaggggtga ttccaaaaat tgaaataatt 6900 aaatttggaa ttacttattc aaaacttaaa tatgatgaag tatatgaaca tcaaccaata 6960 gaaaaacttg atttgaaaaa atttatatca caaaaaagag cattagggat acttgttatc 7020 caaaaagaaa tagaaaactt gaatggaaac gtatggttgt actccaacct agatggaaga 7080 cttatattat caaatcacaa caaagcagaa aatgtagaag tcaaaaatat attatcagat 7140 tggtcaaaaa aattatccat tccagaaggt aattatccaa gatgtagcat caagaatcca 7200 atgtttacgg gaaaaactat ggaagtatta tgtgagctca gcaagaagca gatcaatatg 7260 cggcacatat gcaacctatg ttcaaaaatg aagaatgtac agatacaaga tcctatactg 7320 ccagaatacg aagaagaata cgtagaaatt gaaaaagaag aaccaggcga agaaaagaat 7380 cttgaagacg taagcactga cgacaacaat gaaaagaaga agataaggtc ggtgattgtg 7440 aaagagacat agaggacaca tgtaaggtgg aaaatgtaag ggcggaaagt aaccttatca 7500 caaaggaatc ttatccccca ctacttatcc ttttatattt ttccgtgtca tttttgccct 7560 tgagttttcc tatataagga accaagttcg gcatttgtga aaacaagaaa aaatttggtg 7620 taagctattt tctttgaagt actgaggata caacttcaga gaaatttgta agtttgtaat 7680 gtttttagtt tttataataa tatgtttatg tttgttttaa taatgagtga gtagtttctt 7740 aattcttcta attaatatga aataattatg ctgtctagaa acaagtttaa atctgtttga 7800 tattcccaga agctatgaac tgttacgatc tagataatct ttcaattatc cccatgatat 7860 cctgcgtgtg atattactga gccctttata gaccatgtgt gtcgtttgta tgggcatgaa 7920 catctgtgta tgttagggag aaagagagat tgaatgatca ttgaaagttt taatgaaggg 7980 aatcaaatgt ttatctatta cttgttttct atcagacaat agcataatta gagtagtatt 8040 aattaggaga atgttaaaaa tattattgtt ttcactctta gttttaatta tcctttgttt 8100 tatcgatcct atattatttt actttatttg tttataattc cgctttcttt tcacccctt 8159 //