ID DQ822074; SV 1; circular; genomic DNA; STD; VRL; 7276 BP. XX AC DQ822074; XX DT 01-FEB-2007 (Rel. 90, Created) DT 24-MAR-2007 (Rel. 91, Last updated, Version 2) XX DE Dioscorea bacilliform virus clone B39-6, complete genome. XX KW . XX OS Dioscorea bacilliform virus OC Viruses; Ortervirales; Caulimoviridae; Badnavirus. XX RN [1] RP 1-7276 RX DOI; 10.1007/s00705-006-0888-7. RX PUBMED; 17195956. RA Seal S., Muller E.; RT "Molecular analysis of a full-length sequence of a new yam badnavirus from RT Dioscorea sansibarensis"; RL Arch. Virol. 152(4):819-825(2007). XX RN [2] RP 1-7276 RA Seal S., Muller E.; RT ; RL Submitted (26-JUN-2006) to the INSDC. RL UMR BGPI, CIRAD, Campus International de Baillarguet, Montpellier 34398, RL France XX DR MD5; 9120f8b0be95861cad54caca6d91809a. XX FH Key Location/Qualifiers FH FT source 1..7276 FT /organism="Dioscorea bacilliform virus" FT /host="Dioscorea sansibarensis" FT /isolate="B39" FT /mol_type="genomic DNA" FT /country="Benin" FT /clone="B39-6" FT /db_xref="taxon:52996" FT CDS 441..872 FT /codon_start=1 FT /product="ORF1 protein" FT /db_xref="InterPro:IPR010746" FT /db_xref="UniProtKB/TrEMBL:A2T398" FT /protein_id="ABI47984.1" FT /translation="MNRSIDLEGSLKDYLDRSTEPEYLDLYIVEKPTNKQLANNLCYLN FT HQAKLLSRVSLKHFFKLQEEIQELKTENQVLRKHLVNLTKEVVENRPLTEKKVQELVLR FT IIEQPKEIEQQAVRLTLDLQKKLDRVEAILSRLEGAVLL" FT CDS 869..1249 FT /codon_start=1 FT /product="ORF2 protein" FT /db_xref="UniProtKB/TrEMBL:A2T399" FT /protein_id="ABI47985.1" FT /translation="MSLTSESKEYQLAIKATEAIEPPAVGYTRPADYKGLATAIASIQK FT QNNTLIQLFTQLFTQISQLQTEVRSLRTEQGSSAQTDQLLDQVITKLGQLSIQDKLPEK FT KGKLLVHKDPVIIYNEEKAKIQ" FT CDS 1246..5133 FT /codon_start=1 FT /product="ORF3 protein" FT /note="contains motifs for putative movement protein, FT capsid protein and aspartyl proteinase" FT /db_xref="GOA:A2T3A0" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR008181" FT /db_xref="InterPro:IPR018061" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR029054" FT /db_xref="InterPro:IPR033704" FT /db_xref="InterPro:IPR036157" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/TrEMBL:A2T3A0" FT /protein_id="ABI47986.1" FT /translation="MSRTATQEVSTSQRQFEDQIRDYRLGARRRYNLRQGLNKFKNVLT FT LGTYYAPTLEQQVDPERTLQLSAIKRASMIPAEVLYHARPDTINHKVYMHWSEEADLIA FT NNQHDSTFIRPESYLELRKANMQFIHIGLMQVRIQVLHRHEAGTMALIVFRDCSWKGRR FT STIARMEVDLTKGSQLVYIAPNITKKLTDFYNNIQVSILTKGYDDYQNSEANLLITKGM FT IGRLSNTSNVGFAYTVDGVAEYFITHGVQALKGMPMTTDDLQGTLWNIKPSRIIVPSQP FT TQAEDRELADGSVSMRFGAYKAAPALMSSEEFDSDENQVINVFTTTVSSLLIKLMHRDA FT IIPQRITDGSVGYDLAACEDVVIQPRRPTLVKTGIAIEVPAGTYAQLYLKSSFALKTGL FT QITGGVVDPDYRGELQIILINNTFTCVPILKGTFVAQVVLHQVATPAVQETTTLTPTAR FT GTRGFGTLTAEASNMITKVRVEGRPVNPVPTPTDHMEEDDEDAEAYGDYLSYLSSRSFS FT REGGEKSREVEAEADEDEDLYFRPSPARVTFSDDPFSEDPTIATIRESGDASQPIFDES FT DDNEDHELIQELVSLEADFDYPQLRQLEDCVIAASSAVSNYTPPEDTTMNPPTYPPARI FT DQAGPSYSGPSYSVPSGTRFKPMNYSANLQLPSAQQTTGAMFYMPLELDKFDEVFSRWE FT SITKNITSQHSFISGREKADFIENLLGETEKLTWIQWRTTFNAEFDSLIAAADGREGTQ FT NIISQIRTIFSLEDPFRGSAKIQEDAYRDLERISCKDVQDIIPFMNEYMRIAAKTGRLF FT ISSELSEKFWMKLPGDLGKTIKDSFDKTYVGNTVGVHPRILFTYKYLQEECKKAALSRS FT LKNLQFCSKIPIPGYYKGREKKYGVRKSTTYKGKPHDSHVRIERRKHLIRSKKCKCFLC FT GAEGHFARECPNDKRNVKRIAIFEGMEIPEDCEIVSVQEGDELSDSIFSVSEGEEVLQS FT LNPFIQQNLLYIREEDGKHFLGGGGWRAAVQVTPQEHDCHHQWEYYILNTGLCKFCKRD FT ITQRWHAKCMHCNLVTCGFCSQHYLDLEIPCQREEVQVYNPQRLLQEQQNYIQVCEVEI FT EKLKESVKAANKIAEDKIMAASIRHAVELEELRKQLQAIKLDKERLELENHDLKKILEE FT SEQVNVLICDGREKSERIASIKERRNGLYNLMVQLEIPGAAPIQVNAILDTGATTCCVN FT EEGLPKAILEDNTFEVKFTGANSEMVARKKLKGGTMKIGDNQFRIPYTFAFPLKLGGGE FT QLIIGCNLSEL" FT CDS 5130..6908 FT /codon_start=1 FT /product="ORF4 protein" FT /note="contains motifs for reverse transcriptase and FT ribonuclease H" FT /db_xref="GOA:A2T3A1" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR002156" FT /db_xref="InterPro:IPR041373" FT /db_xref="UniProtKB/TrEMBL:A2T3A1" FT /protein_id="ABI47987.1" FT /translation="MNGGVRIEGDTVTFYKNVTTIQTQQQVPKLLALEELEMDEDEYIE FT LQQSISSIEPNNYFSNKLSQVFQELKAAGYIGDDPLKFWSKNQVVCELNIINPDLTIQN FT KPLKHVTPAMEETFRKHIDALLKLKVIRASKSRHRTTAFIVYSGTTVDPVTGKENKGKE FT RMVFNYKRLNDNTEKDQYSLPGINTILKRVGQSKIYSKFDLKSGFHQVAMAPQSVEWTA FT FLAPGGLYEWLVMPFGLKNAPAVFQRKMDNVFRGTEDFIAVYIDDILVFSETEEEHLKH FT LRILLQICQQHGLVLSPTKMKIGTKTIEFLGAVIGNRKIKLQEHIIKKIADFSDDELKT FT TKGLRSWLGILNYARAYMPNMGRILGPLYSKVSPNGEKRLNAEDWRLIREVRTTIRALP FT DLELPPADSFIVIEADGCMEGWGGVCKWKKLKKDFRRNERVCAYASGKFDPPKGTIDAE FT LFAVMNSMDKFKIHYLDRKELLIRSDCQALISFYNKSSENKPSRVRWIAFNDFVTGLGI FT DVTFEHIDGKDSQLADSLSRLVTSLVRHEGKWQQQRMALVIIEEMMQKALSKEIRSQVA FT HQISALLKIIDKGKSI" XX SQ Sequence 7276 BP; 2337 A; 1492 C; 1649 G; 1798 T; 0 other; tggtatcaga gcttggtttg cttattcttg tttaatggct aactaaatat cgaaggtatt 60 tcaggtgaag gaaactgtct aaggtgtcaa attaagtttc ctgctgatct ttgaaattca 120 taaattcatg attattataa cgactgcata agtttatcac tgagttttgg tatgggtgtt 180 gatttttaga ttattactgt atgtgttaat gggtgtttgc tcagcgaact gttagatcct 240 aggtgcaaaa gacaagggca ctatatcccg tcaggcgagt ggccgtttag ggtgaacccc 300 caaggtgagc actgaaggat cgaagcagta agtgaaagct tgcggtacct taatacatat 360 cagtaagatc gaaaaagttg gcggtaccta ccatcttagt agggaattta tttggttttt 420 ataatttcct atgagttttc atgaatagat caatagatct agaagggagc ttaaaggatt 480 atcttgacag gagtactgaa cctgaatatc ttgaccttta cattgttgaa aagcctacaa 540 acaagcagtt agcaaataac ttgtgttatt tgaaccacca agctaagtta ctctctcgtg 600 tttctctaaa acatttcttc aagttacagg aagaaatcca ggagttgaag actgagaatc 660 aagttctcag aaagcatttg gtgaacctca ccaaagaagt tgttgaaaac agacctctta 720 ctgagaagaa ggtccaagag ctcgtgctca gaattattga acaaccgaag gaaatcgagc 780 aacaggctgt gcgcctaacg cttgatttac agaagaagct tgatagagtt gaagccatcc 840 tatcaagact tgaaggagca gtgcttttat gagcctgacc agcgagtcaa aagaatatca 900 gttggccatc aaagcaactg aggcgatcga accaccagct gttggttaca caaggccagc 960 agactacaaa gggcttgcaa cagcaattgc aagtatccag aagcagaaca acacactcat 1020 ccagttgttc acacagctat tcactcagat atcacagctc cagacagaag ttcggtctct 1080 aaggactgaa cagggctcat ctgcccagac tgatcaatta cttgaccagg taatcactaa 1140 acttgggcag ttatctatac aggacaagct accggaaaag aagggtaagc ttttggtgca 1200 taaagatccc gtcatcatct ataacgaaga aaaggccaag atacaatgag tcgaacagca 1260 actcaagaag tctcgacctc tcaacgacaa ttcgaagatc aaatcagaga ctaccgactt 1320 ggcgcaagac ggcggtataa tctaagacag ggattaaaca agttcaaaaa tgttctaact 1380 cttggaacat actatgcacc tactcttgaa caacaggtcg acccagagag gactctgcaa 1440 ctctctgcta tcaaaagagc ttccatgata cctgctgagg tattatatca tgctagacca 1500 gatactatca atcacaaggt ctacatgcac tggtcggagg aagctgacct gatagcgaac 1560 aatcaacatg actcaacatt catcagaccg gagtcatatc tagaattgcg caaggccaac 1620 atgcagttca tccatatcgg ccttatgcaa gtcagaatac aagtgctaca ccgacacgaa 1680 gcaggcacca tggcacttat tgtattcaga gattgttcct ggaaaggaag gcgatctaca 1740 atcgcccgaa tggaggtaga tcttacaaag gggagtcaat tggtctatat tgctcccaat 1800 attacaaaga agctcactga tttctacaac aacatccaag tatctatcct caccaaagga 1860 tatgacgact atcaaaacag tgaagcgaac ctcctcatca ccaaaggtat gattggacga 1920 ttgtccaata cttctaatgt tggttttgcc tatacagtgg atggagtagc agaatacttt 1980 atcacccatg gagttcaggc attgaaagga atgccaatga caactgacga cctccagggc 2040 acactgtgga acatcaaacc ttccagaatc attgttccat cccaaccaac acaggcagaa 2100 gatagggaac tggctgatgg ttccgtatcg atgcgattcg gtgcatacaa agctgcacca 2160 gctctcatgt catctgaaga gtttgattca gatgaaaatc aagtaatcaa cgtcttcact 2220 actactgttt ctagcttact gattaaattg atgcacaggg atgcaattat cccacaaaga 2280 ataaccgatg gatcggtcgg ttatgatctg gcggcctgcg aagatgtagt gatccaacca 2340 agaagaccaa ccttggtcaa gacaggcatc gcaattgaag ttcctgctgg aacttacgcc 2400 cagctttatc tcaaatccag ttttgctctc aagactggcc tgcaaattac tggcggcgtt 2460 gtagacccag actaccgagg tgaattgcaa atcatactaa tcaacaatac attcacctgt 2520 gttccaatcc ttaaggggac gtttgtggct caagttgttc tacatcaagt agccacacct 2580 gctgtacaag agactaccac tctcacacca actgcacgag gcacaagggg atttggaacc 2640 cttactgctg aagcctccaa catgattaca aaagtgcggg tagagggaag gcctgtcaat 2700 cctgttccta cacccactga tcacatggaa gaagacgatg aagatgctga agcatacggt 2760 gattatttat cttacttatc atcacgctct ttttccaggg aggggggaga gaaatcccga 2820 gaagtcgaag ccgaagccga cgaagacgaa gatctttatt ttcggccttc tccagcaaga 2880 gtgactttct ctgatgatcc tttttctgag gatcctacaa ttgccacaat tagagaaagt 2940 ggtgatgcct cacaaccaat ttttgatgag agtgatgata atgaagatca tgagctcatt 3000 caagaactgg tttcactaga agctgacttt gactatccac agcttcgaca gcttgaagac 3060 tgcgtcatag cagcatcatc ggcggtgtct aactacacac caccagagga tacgacaatg 3120 aatccaccta cgtatccacc agcaaggatc gatcaagccg gtccttcata ctctggtccc 3180 tcatattctg taccatccgg gacaaggttc aagccgatga actactctgc aaatctacag 3240 ttgccatctg cgcaacaaac aacaggggcg atgttctata tgcccttaga actcgacaaa 3300 ttcgacgagg tattctctag atgggagagc attacaaaga atattacatc acaacattct 3360 ttcatctcag gtcgagagaa ggcagatttc attgaaaatc tgttaggaga aactgaaaaa 3420 ctcacgtgga tacaatggcg aacaacattc aatgctgagt tcgacagttt aattgcagca 3480 gctgatggtc gagaaggtac acagaacatt atttctcaaa tcaggaccat cttctctctt 3540 gaagacccat tccgaggatc tgcaaaaatt caagaggatg catacagaga tttggaaaga 3600 atctcatgca aggatgtcca agacattata ccgtttatga acgagtatat gcgaattgct 3660 gcaaagactg gacgtctgtt catctcatcg gagctttctg aaaagttctg gatgaagctt 3720 cccggagatc taggcaagac tatcaaggat agttttgaca aaacttatgt gggtaatact 3780 gttggagtac acccacgaat tctttttaca tacaagtatt tacaagaaga atgcaagaag 3840 gcggccctca gcagatcact gaagaatctg caattctgta gcaaaatacc tattccaggg 3900 tattacaagg gcagagagaa gaaatatggc gtgaggaagt ctactaccta caaaggtaaa 3960 ccacatgact ctcatgttcg tattgaacgc aggaagcacc tgataagatc caagaagtgc 4020 aaatgcttct tgtgtggagc tgaaggacat tttgcaagag aatgtccaaa tgataaaaga 4080 aacgtcaaaa gaatcgcaat attcgaaggc atggagatcc cagaagactg cgaaattgtt 4140 tcagttcaag aaggggatga gcttagcgat tcaatcttct cagtttctga aggagaagag 4200 gtactgcaat ctctaaatcc ttttattcag cagaacctac tttacatcag agaagaagac 4260 ggcaaacact ttcttggagg aggaggatgg cgagcagcag tgcaagtcac gccacaggag 4320 catgactgtc atcatcaatg ggagtattat atcctgaaca ctggcttatg caaattctgc 4380 aaaagggata ttacccaaag atggcacgcc aaatgcatgc actgcaacct ggtgacatgt 4440 ggtttctgtt cacaacacta cctggacttg gaaattcctt gtcaaagaga agaagtgcag 4500 gtatataatc ctcaacgact attacaggaa caacagaact acatccaggt ctgtgaggtg 4560 gagatagaga agctgaagga atcagtgaaa gcagctaaca agatagctga agataaaatc 4620 atggcagcct ctatcagaca tgcagtggag ctcgaggagc tcaggaaaca acttcaagcc 4680 atcaaacttg acaaagaacg actggaattg gaaaaccatg acctgaagaa aattcttgaa 4740 gaatcagagc aggtaaatgt tcttatctgt gatggcaggg agaagagtga acggattgct 4800 tcgatcaagg agaggcgtaa tgggttgtac aatctcatgg tacaattgga gatacccggc 4860 gccgcaccta tccaagtcaa tgcaatcctt gatacaggtg caacaacatg ttgcgtaaat 4920 gaagaagggc taccaaaggc gatcctagaa gacaatactt ttgaggtgaa attcacagga 4980 gcaaactcgg agatggtggc caggaagaag cttaagggag gaaccatgaa gataggggat 5040 aaccaattca gaattcccta cacatttgca ttccctttga agcttggagg tggtgagcaa 5100 ctaatcattg gttgcaattt atcagagcta tgaatggagg agttcggatt gaaggtgaca 5160 ctgtaacctt ctacaagaat gtgacaacaa tccaaactca acagcaagtt ccaaaattac 5220 tagcacttga agaactggag atggacgaag atgaatatat cgagctacaa cagagcatca 5280 gttccatcga gccgaacaac tatttctcta acaaactgtc tcaagtgttt caggaactaa 5340 aggctgctgg ctatattggg gatgatcctt tgaaattctg gtcaaagaat caagtggttt 5400 gtgaactcaa cattataaat cctgatctca caatacagaa taaaccactg aagcatgtaa 5460 ccccagccat ggaggagacc tttcggaagc acattgatgc tttattgaag ctcaaggtga 5520 tccgagcaag caaaagtaga catcgcacaa ctgcgttcat tgtctactct ggtacgactg 5580 ttgatccagt tacaggaaag gaaaacaagg ggaaggagag gatggtcttc aactacaaaa 5640 gactgaatga caacacggag aaagaccaat acagccttcc agggatcaac acaattctga 5700 agagagtcgg tcaaagcaaa atatattcaa aatttgactt aaaaagcggc tttcatcaag 5760 tggcaatggc gccacaatca gttgaatgga cggcgtttct agcgccagga ggactttacg 5820 aatggttggt aatgccattc gggctgaaga atgcaccagc agtattccaa agaaagatgg 5880 acaacgtctt tcgaggtacc gaagatttta ttgctgtata cattgatgac atactggtgt 5940 tttctgaaac agaagaggag catttgaagc atctgaggat actgcttcaa atttgtcaac 6000 agcatgggct tgttctgagc ccaactaaaa tgaagatagg aactaaaaca atcgagttcc 6060 taggtgcagt gataggcaac aggaagataa agcttcagga gcacatcatc aagaaaattg 6120 cggattttag cgatgatgaa ctgaagacaa ctaaaggcct acgatcatgg ttaggaattc 6180 ttaattatgc tcgtgcatac atgcctaaca tgggtcgaat tctaggtcca ctgtactcaa 6240 aggtgagtcc aaatggggag aagcgactga acgctgaaga ctggagactc atcagggaag 6300 tcagaaccac aatcagggct cttcctgatc tcgaactacc accagcggac agtttcatcg 6360 ttattgaagc tgacggatgc atggaaggat ggggtggtgt gtgcaagtgg aagaagctca 6420 agaaagactt ccgaaggaat gaaagggttt gtgcatatgc gagtggaaag tttgatccac 6480 caaagggcac aattgatgca gagctcttcg cggtaatgaa tagtatggat aaatttaaaa 6540 ttcattatct cgatcgcaag gaattattga tcagatcaga ctgtcaagcg ctgatctcgt 6600 tctataacaa gagttctgaa aacaaaccct caagggttcg atggattgcc ttcaacgact 6660 tcgttacggg tcttgggata gacgtaacgt tcgagcatat tgacggcaaa gatagccagt 6720 tggcggattc tctatcaagg ctggttactt cactcgtaag gcatgaaggc aagtggcagc 6780 aacagaggat ggccttagtc ataattgagg agatgatgca gaaggcactc agcaaagaga 6840 taaggtcaca ggtggcgcat caaatctctg cactgctgaa gattattgac aaaggcaaaa 6900 gcatttgatt cctgacagca cgtggcgtgg tggctgtcat tatcttactt tattatcttt 6960 tctttatgag ttgtaaagag gaattgagcc tcggggagcc gttcctttag tttaaagaga 7020 caagaaggaa tatcattgag tgtttgatgg ggcccaatga gcacccagct caattattct 7080 tacttagctc ttaagtttta ggagtcttga gcctctatat aaggagctca tatgtatgaa 7140 gtaagggaca tcgagaaata atatcggaaa acatcggaaa ataacatctg atagccattt 7200 cttctaaata aaaatttgtg tgcgataagc tttcggggct taagcaagat ccatttccct 7260 atcccgcttc cctcct 7276 //