ID DQ875594; SV 2; linear; genomic RNA; STD; VRL; 4132 BP. XX AC DQ875594; AH013158; AY340586-AY340587; XX DT 14-SEP-2006 (Rel. 89, Created) DT 19-AUG-2009 (Rel. 101, Last updated, Version 3) XX DE Southern bean mosaic virus isolate Sao Paulo, complete genome. XX KW . XX OS Southern bean mosaic virus OC Viruses; Riboviria; Solemoviridae; Sobemovirus. XX RN [1] RP 1-4132 RA Ozato T.Jr., Gaspar J.O., Belintani P.; RT "Completion of the Nucleotide Sequence of a Brazilian Isolate of Southern RT bean mosaic virus"; RL J. Phytopathol. 157(9):573-575(2009). XX RN [2] RP 1-4132 RA Espinha L.M., Moreira A.E., Gaspar J.O., Camargo L.E.A.; RT "Characterization of the genome 3'-terminal region of Brazilian isolate of RT Southern bean mosaic virus"; RL Unpublished. XX RN [3] RP 1-4132 RA Ozato T.Jr., Belintani P., Gaspar J.O.; RT ; RL Submitted (01-AUG-2006) to the INSDC. RL Zoologia e Botanica, Instituto de Biociencias, Letras e Ciencias Exatas, RL Cristovao Colombo, Sao Jose do Rio Preto, SP 15054000, Brazil XX RN [4] RC Sequence update by submitter RP 1-4132 RA Ozato T.Jr., Belintani P., Gaspar J.O.; RT ; RL Submitted (02-APR-2008) to the INSDC. RL Zoologia e Botanica, Instituto de Biociencias, Letras e Ciencias Exatas, RL Cristovao Colombo, Sao Jose do Rio Preto, SP 15054000, Brazil XX DR MD5; bcd3a4145f1a2e1f08e4f0e7cc6abe74. DR EuropePMC; PMC3791421; 23830075. DR RFAM; RF01838; sobemo_FSE. XX CC On or before Apr 2, 2008 this sequence version replaced CC gi:33517428, gi:33517427, gi:33517426, gi:114153760. XX FH Key Location/Qualifiers FH FT source 1..4132 FT /organism="Southern bean mosaic virus" FT /isolate="Sao Paulo" FT /mol_type="genomic RNA" FT /country="Brazil" FT /db_xref="taxon:12139" FT 5'UTR 1..92 FT CDS 93..536 FT /codon_start=1 FT /product="putative movement protein" FT /db_xref="UniProtKB/TrEMBL:Q7T5V7" FT /protein_id="AAQ19970.1" FT /translation="MSYRFLVVKAVGFLGFHSDATRILSETEIVDVPSSIDFVGETELR FT LENAWPQGGERYTILPRFNVQIDFTYHPVRVEIICRVCATSLTVVFSKWNFHCERKGHF FT VPVDQNGNLFRVGTLRETGEKYFYFCEKSICRQCIIQAAHHHS" FT CDS join(503..1729,1729..3390) FT /codon_start=1 FT /ribosomal_slippage FT /product="polyprotein P2a2b" FT /note="contains protease, VPg, and RNA dependent RNA FT polymerase; due to ribosomal frameshift in ORF2a RdRp is FT fused to protease and VPg; ORF2a/ORF2b" FT /db_xref="GOA:Q09KR4" FT /db_xref="InterPro:IPR000382" FT /db_xref="InterPro:IPR001795" FT /db_xref="InterPro:IPR009003" FT /db_xref="UniProtKB/TrEMBL:Q09KR4" FT /protein_id="ABI53037.2" FT /translation="MYHPGRSPSFLITLANVICAAILYDIRMGGYQPGSLVPIVAWMTP FT FVTLLWLSASFVTYLYRYARTRLLPEEKVARVYYTAQSAPYFDPALGVMMQFAPSHGGA FT SIEVQVNPSWISLLGGSLKINGDDASNESAVLGSFYSSVKPGDEPASLVAIKSGPQTIG FT FGCRTKIDGDDCLFTDNHVWNNSMRPTALAKAGKQVAIEDWDTPLSCDHKMLDFVVVRV FT PKHVWSKLGVKATQLVCPSDKDAVTCYGGSSSDSLLSGTGVCSKVDFSWKLTHSCPTAA FT GWSGTPIYSSRGVVGMHVGFEDIGKLNRGVNAFYVSNYLLRSQETLPPDLSVIEIPFED FT VETRSYEFIEVEIKGRGKAKLGKREFAWIPESGKYWADDDDDSLPPPPKVVDGKMVWTS FT AQETVAEPLNLPEGGRVKALAALSQLAGYNFKEGEAASTRGMPLRFVGQSACKFRELCR FT KDTPDEVLRATRVFPELSDFSWPERGSKAELHSLLLQAGKFNPTAIPRNLEGACQNLLE FT RYPASKSCYCLRGEAWSFDAVYEEVCKKAQSAEINEKASPGVPLSRLASTNKDLLKRHL FT ELVALCVTERLFLLSEAEDLHNKSPVDLVQMGLCDPVRLFVKQEPHASRKVKEGRFRLI FT SSVSLVDQLVERMLFGPQNQLEIAEWEHIPSKPGMGLSLQRQAKSLFDDLRVKHSRCPA FT AEADISGFDWSVQDWELWADVEMRIVLGGFGQKLSIAARNRFSCFMNSVFQLSDGTLIE FT QQLPGIMKSGSYCTSSTNSRIRCLMAELIGSPWCIAMGDDSVEGWVDGAKDKYMRLGHT FT CKDYKPCATSISGRLYEVEFCSHVIREDRCWLASWPKTLYKYLSEGKWFFEDLERELGS FT SPHWPRIRHYVVGNTPSPDKTRLENSSPSYGEEADKTTVSQGYSEHSGSPGHSIEEAQE FT PETAPFCCKAASVYPGWGIHGPYCSGGYGSLT" FT CDS 503..2218 FT /codon_start=1 FT /product="polyprotein P2a" FT /note="contains serine protease and VPg; ORF2a" FT /db_xref="GOA:B2BA78" FT /db_xref="InterPro:IPR000382" FT /db_xref="InterPro:IPR009003" FT /db_xref="UniProtKB/TrEMBL:B2BA78" FT /protein_id="ACB55724.1" FT /translation="MYHPGRSPSFLITLANVICAAILYDIRMGGYQPGSLVPIVAWMTP FT FVTLLWLSASFVTYLYRYARTRLLPEEKVARVYYTAQSAPYFDPALGVMMQFAPSHGGA FT SIEVQVNPSWISLLGGSLKINGDDASNESAVLGSFYSSVKPGDEPASLVAIKSGPQTIG FT FGCRTKIDGDDCLFTDNHVWNNSMRPTALAKAGKQVAIEDWDTPLSCDHKMLDFVVVRV FT PKHVWSKLGVKATQLVCPSDKDAVTCYGGSSSDSLLSGTGVCSKVDFSWKLTHSCPTAA FT GWSGTPIYSSRGVVGMHVGFEDIGKLNRGVNAFYVSNYLLRSQETLPPDLSVIEIPFED FT VETRSYEFIEVEIKGRGKAKLGKREFAWIPESGKYWADDDDDSLPPPPKVVDGKMVWTS FT AQETVAEPLNYQRAAGSRPLPPFLNLQATTSKKEKQPLQEECPLDLLGSRLASLESCVE FT KILQMKSLELLESSRNCQTSPGLSEAPKQSFTPCYSKQESLIPPQSQGILKELVKTSLS FT ATPPPNPVTAFVEKPGPSTQSTKKSARRRNRRKSTRKPVQGSPSPVSPPPTKTS" FT CDS 3203..4003 FT /codon_start=1 FT /product="capsid protein" FT /db_xref="GOA:Q7T5V6" FT /db_xref="InterPro:IPR000937" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:Q7T5V6" FT /protein_id="AAQ19971.1" FT /translation="MAKRLTKQQLAKAIANTLEAPATQSRRPRNRRRRRSAARQPQSIQ FT AGASMAPIAQGAMVRLREPSLRTAGGVTVLTHSELSTELAVTNAIVVSSELVMPFTMGT FT WLRGVAANWSKYSLFSVRYTYLPSCPSTTSGSIHMGFQYDMADTLPVSVNQLSNLRGYV FT SGQVWSGSSGLCYINGTRCSDTANAITTTLDVAKLGKKWYPFKTSTDFTAAVGVIVNIA FT TPLVPARLVIAMLDGSSSTAVSTGRLYVSYTVQLIEPTALALNN" FT 3'UTR 4004..4132 XX SQ Sequence 4132 BP; 988 A; 980 C; 1070 G; 1094 T; 0 other; cacaaaatat aagaaggaaa gctggatttc ctacctttgt gtttccattg tcgaagcatt 60 ggtcaatact tatcaattgg tgcattgttc gcatgagcta ccgattctta gtagtcaaag 120 ccgttggttt tcttggtttc cattcagacg ctactcgcat tctgtcagag actgagatcg 180 tagacgttcc ttcgtccatt gatttcgtcg gtgaaaccga gttacgccta gaaaacgctt 240 ggccccaagg tggtgagaga tacactatcc tacctaggtt caacgttcag attgacttca 300 cgtaccatcc agtgcgtgtc gagatcatct gtagggtttg tgctacttcc cttactgttg 360 tctttagcaa gtggaacttc cattgcgaaa ggaagggcca ttttgtgcca gtagaccaga 420 acgggaatct gtttagggtt ggaacgctcc gggagacggg agagaaatac ttctacttct 480 gtgagaaatc tatctgcaga caatgtatca tccaggccgc tcaccatcat tcctgataac 540 gttagcaaat gttatctgcg cggcaatctt gtacgacatc cgtatggggg ggtaccaacc 600 aggatcacta gttccaatag tggcctggat gaccccgttc gttacgctgc tctggttgag 660 cgcgtcattc gtgacatacc tttataggta tgctcgaact cgactgcttc cagaggagaa 720 agtggccaga gtttattata cggcgcaatc tgcgccttac tttgacccgg ccctgggtgt 780 catgatgcaa tttgccccta gccatggcgg cgccagcata gaagtgcaag ttaacccttc 840 gtggattagt ctcttaggtg gctctctcaa gataaacgga gatgacgcct ctaatgagtc 900 tgctgtgttg ggaagctttt attcttctgt gaaacctggt gatgaaccag ctagtttggt 960 agctattaag agtggtcctc agaccatcgg ttttggttgt agaactaaga tcgacggtga 1020 tgactgcctc ttcacagaca atcacgtttg gaataattcc atgcgcccta ccgctttagc 1080 gaaagctggc aagcaggttg cgattgaaga ttgggacacc cctctctctt gtgaccataa 1140 gatgcttgat ttcgtagtgg tgcgtgtgcc aaaacacgtg tggtccaaac taggagtgaa 1200 agcgactcaa ctggtttgtc catctgataa ggacgctgta acctgttatg gtggatctag 1260 ttctgatagc ttgttgtcgg ggacgggcgt ttgtagtaag gttgatttct cttggaagtt 1320 aacccactca tgccccacgg cagctggctg gagcggaact ccaatttact ctagcagagg 1380 tgtggtagga atgcacgttg ggtttgaaga tatcggaaaa ctcaaccgtg gcgtgaacgc 1440 tttctacgtg tctaactact tgttgaggtc tcaagagact ctacctcctg atctgtccgt 1500 catcgaaata cctttcgagg acgtagaaac ccggagttat gagttcattg aggtcgagat 1560 taaaggaaga ggtaaggcta aacttggtaa gcgtgagttc gcttggattc cggaatcagg 1620 aaaatactgg gccgatgatg atgacgactc tttgccccca ccgccgaagg tggtagacgg 1680 caagatggtg tggacttcag ctcaggaaac cgtcgcagag cctttaaact accagagggc 1740 ggcagggtca aggcccttgc cgccctttct caacttgcag gctacaactt caaagaagga 1800 gaagcagcct ctacaagagg aatgcccctt agatttgttg ggcagtcggc ttgcaagttt 1860 agagagctgt gtagaaaaga tactccagat gaagtcctta gagctactag agtcttcccg 1920 gaattgtcag acttctcctg gcctgagcga ggctccaaag cagagcttca ctccctgcta 1980 ctccaagcag gaaagtttaa tcccaccgca atcccaagga atcttgaagg agcttgtcaa 2040 aacctccttg agcgctaccc cgcctccaaa tcctgttact gccttcgtgg agaagcctgg 2100 tccttcgacg cagtctacga agaagtctgc aagaaggcgc aatcggcgga aatcaacgag 2160 aaagccagtc caggggtccc cctctcccgt ctcgcctcca ccaacaaaga cctcctgaag 2220 aggcacttag aattagttgc tttgtgtgtt accgagagat tgttcttact cagtgaagct 2280 gaggacctgc acaataaatc ccctgtggac ctagttcaga tggggttgtg cgacccagtt 2340 cggctgtttg tcaagcagga gccccatgct tcccgaaagg tgaaggaggg tagatttcgc 2400 ttgatttcat ccgtttcgct ggtggatcag cttgtagagc gaatgctttt cgggccccaa 2460 aaccagcttg agatcgctga gtgggaacat attccctcaa aacctggtat gggcctttcg 2520 ctgcaacgac aagccaaaag cttgttcgac gatttgagag tcaaacattc tcgttgtcct 2580 gcggctgaag ccgacatatc gggttttgac tggtctgttc aagactggga gttgtgggct 2640 gatgtagaga tgagaatagt tttaggaggt tttggacaga agttgtccat agccgctaga 2700 aacaggtttt cgtgtttcat gaactcagtc ttccagctct cggatggcac actcatagaa 2760 cagcaactgc ctggtattat gaagtctggt tcttactgca cttcctcaac aaactccaga 2820 atacgttgcc ttatggctga acttattggt tccccatggt gtatcgccat gggtgatgat 2880 tctgttgagg gttgggttga tggtgcgaaa gacaagtaca tgagattagg ccacacgtgc 2940 aaggattata aaccctgtgc aacatccatt tccggtcgct tatacgaggt agagttttgc 3000 tctcacgtta taagggaaga tcgatgttgg ttggcgtcgt ggcccaaaac tctgtataaa 3060 tacttgtctg agggcaagtg gttctttgag gatcttgagc gagagcttgg gtcttctccc 3120 cactggccca gaatcagaca ctacgtagtc gggaatactc catcgcccga caaaactaga 3180 ttagaaaatt caagtccgag ctatggcgaa gaggctgaca aaacaacagt tagccaaggc 3240 tatagcgaac actctggaag ccccggccac tcaatcgagg aggcccagga accggagacg 3300 gcgccgttct gctgcaaggc agcctcagtc tatccaggct ggggcatcca tggcccctat 3360 tgctcagggg gctatggttc gcttacgtga accatcgctt agaacggctg gaggagtgac 3420 tgtcctgacg cactctgagc tctcaacaga gcttgctgtg acgaatgcga tagttgtctc 3480 ttctgagctt gtcatgccct tcacaatggg cacttggctt cgaggcgtgg cggccaattg 3540 gtcgaagtac agtctgtttt cagtgaggta tacttacctc ccctcttgtc cttcaacgac 3600 atctgggtcc attcatatgg gtttccaata tgatatggct gacactcttc ccgtatccgt 3660 taaccagtta tccaacctta gaggctatgt gtcagggcag gtctggtctg gatcctctgg 3720 attgtgctat ataaatggca cgaggtgttc tgacaccgcc aacgctatca cgaccacttt 3780 ggacgttgca aagcttggta agaagtggta tcctttcaag accagtacgg acttcactgc 3840 cgctgttggc gtaatagtca acattgctac tcccctggtc ccggctaggc tagtgatagc 3900 catgctggat gggtcgagtt ctacggctgt gagtactgga cgcctatacg tgtcgtacac 3960 tgtacagcta attgagccga ctgccttggc cttaaacaac tgaggagttg tataataata 4020 cctgcaccct tctctttggc agggagggtg tttcgttttc acaatgccac gcgcttgagg 4080 gagaatgcac gttaatcatc cctccgctag tgatggagcg taatccaaaa gt 4132 //