ID   U71440; SV 1; linear; genomic RNA; STD; VRL; 4546 BP.
XX
AC   U71440;
XX
DT   09-OCT-1996 (Rel. 49, Created)
DT   04-MAR-2000 (Rel. 63, Last updated, Version 3)
XX
DE   Rice tungro spherical virus, strain Phil.1 polyprotein gene, partial cds.
XX
KW   .
XX
OS   Rice tungro spherical virus
OC   Viruses; Riboviria; Picornavirales; Secoviridae; Waikavirus.
XX
RN   [1]
RP   1-4546
RX   DOI; 10.1007/BF01702392.
RX   PUBMED; 8367940.
RA   Zhang S., Jones M.C., Barker P., Davies J.W., Hull R.;
RT   "Molecular cloning and sequencing of coat protein-encoding cDNA of rice
RT   tungro spherical virus--a plant picornavirus";
RL   Virus Genes 7(2):121-132(1993).
XX
RN   [2]
RC   Department of Virus Research, John Innes Centre, Norwich Research Park,
RC   Colney, Norwich NR4 7UH, UK.
RP   1-4546
RA   Zhang S., Jones M.C., Davies J.W., Hull R.;
RT   "Coat protein genes and the upstream region of rice tungro spherical virus,
RT   strain Phil.1";
RL   Unpublished.
XX
RN   [3]
RP   1-4546
RA   Zhang S.;
RT   ;
RL   Submitted (19-SEP-1996) to the INSDC.
RL   MPPL, PSI, USDA/BARC-West, Beltsville, MD 20705, USA
XX
DR   MD5; 264ff58888ad7b5d2daf369d005c43ff.
DR   EuropePMC; PMC4152289; 25180860.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4546
FT                   /organism="Rice tungro spherical virus"
FT                   /strain="Phil.1"
FT                   /mol_type="genomic RNA"
FT                   /db_xref="taxon:35287"
FT   mat_peptide     515..2437
FT                   /note="p72 protein"
FT   CDS             515..>4546
FT                   /codon_start=1
FT                   /product="polyprotein"
FT                   /note="p72 protein and coat proteins 1,2 and 3"
FT                   /db_xref="GOA:Q98651"
FT                   /db_xref="InterPro:IPR001676"
FT                   /db_xref="InterPro:IPR024379"
FT                   /db_xref="InterPro:IPR029053"
FT                   /db_xref="InterPro:IPR033703"
FT                   /db_xref="UniProtKB/TrEMBL:Q98651"
FT                   /protein_id="AAB17090.1"
FT                   /translation="MQSFLLSSKNQAKLLHAGLEFVGGVRCAHQGWVSGKSVVFCNYCN
FT                   FAHRLYRFYTKNHCVLNKETIENLCGRSFVSLYRAGLLLDDFTIDDLLGKGKYAKSSID
FT                   NMSIPFDDCALCPNAGTRLSQTGVSHDHFVCNYVEHLFECASFSRETGGKFFRACSEGW
FT                   HWNATCTTCGASCRFANPRENVVIAIFMNFLRVMYDGNKYYVSLHCDTEWIPVHPLFAR
FT                   LVLMVRGFAPLDNSHVIEEDEMDICGHPSEVTYDDPIKLCLLASTRDARSGMGHLAFCR
FT                   DANGVDRGEHKFYLHGPFDLKMTHAMFRVFMILLNCHGYVQSEFREEHPAVKDRSLCAL
FT                   LSVAGLRGVNIACNEEFIHLHSQFHNGSFRSQRPIPMVYAEPEMYPPLEYVRLTESWVP
FT                   RGRVMIDDLPSLLSRVYAESSQPHAGEIYEEIFDEDDLFELGDDEGTSTRGLLDLGRRL
FT                   GGLLLGATKCVKGLHAVIEWPVDVLTKEAEDLGTWLADNKKYVSESTWSCQVCPEVQDA
FT                   LEKSMRDRAKLNAQMIGGIKKLATTMDSATSKLRDSLKELERRISVLEQGVDETQQARI
FT                   ANLENFCEDAAKAFEALRADIDALKKKPAQSVTPLPLPSGNSGTAGEQRPPPRRRPPVV
FT                   EMSEAQAGETVIVGGDEEQEAHQDSSVAAAGPTDEHNAMLQKIYLGSFKWKVSDGGGSI
FT                   LKTFSLPSDIWAANDRMKNFLSYFQYYTCEGMTFTLTITSIGLHGGTLLVAWDALSSAT
FT                   RRGIVSMIQLSNLPSMTLHASGSSIGTLTVTSPAIQHQICTSGSEGSIANLGSLVISVA
FT                   NVLCADSASAQELNVNAWVQFDKPKLSYWTAQHTIGQSGGFEESQDLGDLQAIIATGKW
FT                   STTSDKNLMEIIVHPTACYVSEKLIYQTNLSVVAHMFAKWSGSMKYTFVFGASMFDRGK
FT                   IMVSAVPVQFRNSKLTLSQMAAFPSMVCDLSVETREFTFEVPYISIGKMSLVCKDYLFD
FT                   ISSYNADLVVSRLHVMVLDPLVKTGNSSNSIGFYVVAGPGKDFKLHQMCGVKSQFAHDV
FT                   LTAQDFGRSLSCSRLLGNGFKEWCSRESLLMRIPLKNGKKRAFKYAVTPRMRTLPPEAT
FT                   SLSWLSQIFVEWRGSLTYTIHVQSGSAIQHSYMRIWYDPNGKTDEKEIKFLDSAHPPAG
FT                   IKVYHWDLKIGDSFRFTVPYCARTEKLQIPKAYASTPYEWLTMYNGAVTIDLRSGTDME
FT                   LFVSIAGGDDFEMFEQTVPPKCGSVSDSYTVLSYADDIKSVTEVPNKTTYLADEQPTTS
FT                   APRTSTVDTEEDPPTEGEIARTTNGTLVQYRGGAWKPMVERTPTMSKKQVGPEFTVSDP
FT                   Q"
FT   mat_peptide     2438..3061
FT                   /note="coat protein 1"
FT   mat_peptide     3062..3670
FT                   /note="coat protein 2"
FT   mat_peptide     3671..>4546
FT                   /note="coat protein 3"
XX
SQ   Sequence 4546 BP; 1155 A; 929 C; 1199 G; 1263 T; 0 other;
     agaaaattgg ggtatagaga ttccccaaat caattgcctt atggccgctt gcgagtagtc        60
     gccatatagg tgttgccgaa acatgtgaaa ggattcacaa actactcatc cattgaggat       120
     tgatgaacat ctcatgattt tccgattagt gtctgacgta gcacagaaca cttagtatgc       180
     acacagggct tccctgtccc tctactaagc tagtaatgtg aaatcagttt gccgtgcaca       240
     ctgggcttcc tggtccctcc ggctattcaa acactttgac gaaatagtga gtttgccatg       300
     cacactgggc ttcctggtcc ctctggcttt aaaattgttc agatatctga tttggttgtg       360
     gtgtattcca gttttctggt ctcccacttg gctgagtgag tagttgagct agcaatacat       420
     tccggtcgcg cgcatactat tgttagtgtg tctccgagta gcgtgtgtag gcgcacctag       480
     tttcttacac ataagaattc cgtttgcatc agcaatgcag agctttcttc tttcgtctaa       540
     aaatcaagcg aaattgttgc acgctggttt agagtttgtt ggaggtgtcc ggtgtgctca       600
     ccagggctgg gttagtggta agtcggttgt gttttgtaac tattgtaatt ttgcacatag       660
     attgtatagg ttttatacta agaatcattg tgtattaaat aaggaaacta ttgagaatct       720
     ttgtggaagg tcttttgtgt ctttgtatag agcgggcctt ctattagacg attttacgat       780
     tgacgatttg cttggcaaag ggaaatatgc aaagagttct attgataata tgtctatccc       840
     tttcgatgat tgcgctcttt gtcctaatgc tggcactcgt ctttcgcaaa ccggtgttag       900
     tcatgatcac tttgtgtgta attatgttga gcatcttttt gaatgtgcta gctttagtcg       960
     tgaaaccgga ggaaaatttt tccgagcttg tagcgagggt tggcactgga acgccacttg      1020
     cacgacttgc ggagcgtcgt gtagatttgc caacccgcgg gagaatgtag taatagcgat      1080
     ttttatgaat tttcttcgtg taatgtatga tggtaataag tattatgtgt ctttgcattg      1140
     tgatactgag tggattcccg tgcatcctct ttttgcacga ttagttttga tggttcgtgg      1200
     atttgcgcca ctggacaata gccatgttat tgaggaggat gaaatggata tatgtggtca      1260
     tccgtctgaa gttacatatg atgacccgat caaattatgc cttctcgcat caacacgtga      1320
     cgcgcggagt ggcatgggcc atttggcatt ttgcagagac gccaatggag tggacagagg      1380
     cgagcacaag ttttatctgc atgggccctt tgatcttaag atgacgcatg ctatgttcag      1440
     agtgttcatg atcttgctta attgccatgg atacgtccaa tccgagttca gagaggagca      1500
     ccctgcagtt aaggaccgct ccttgtgcgc attgctgtca gtcgctggtc tgcgaggagt      1560
     taatatagcc tgcaatgagg aattcatcca tttgcactcg cagtttcaca atggctcttt      1620
     tagatcccag cgaccaattc ctatggtata tgctgaacca gagatgtatc caccactgga      1680
     gtatgtgcgt cttacagaaa gttgggtgcc acggggccgc gtaatgatag atgatctacc      1740
     atcacttctg agtagagtgt atgctgagag ttcacaacca catgctggtg aaatatatga      1800
     ggagatattt gatgaggatg atttatttga actcggagat gatgagggta caagtacgcg      1860
     tggcctattg gatcttggca ggcggctcgg aggtttgctt ttgggagcta cgaaatgtgt      1920
     gaaaggtttg cacgctgtga ttgagtggcc agtggatgtg cttacgaagg aggctgaaga      1980
     tcttggaacg tggcttgctg ataacaagaa atatgtcagt gagtcgactt ggagctgcca      2040
     ggtgtgtcct gaggtgcaag atgctttgga gaaatcaatg cgggacaggg cgaaactgaa      2100
     tgctcaaatg attggcggca ttaagaagtt ggccacgacc atggattcag ccacgtcaaa      2160
     attgagagat agtctcaaag aactggaacg gcgaattagc gtgttggaac agggcgtaga      2220
     tgagacacaa caggcgcgca ttgccaattt ggagaacttc tgtgaggacg cagctaaggc      2280
     ttttgaggca ttgcgtgctg atatcgatgc cctaaagaag aaaccagcgc agagtgtgac      2340
     gccattgcca ttaccttccg gaaattcagg tacagctggc gaacaacgtc cacctccaag      2400
     acggaggcct cctgtggtcg agatgtctga ggcacaagct ggtgagacag ttattgtggg      2460
     tggtgatgaa gagcaagagg cccatcagga tagtagtgtt gcagcagctg gacccactga      2520
     tgaacacaac gccatgctgc aaaagatcta ccttggctct tttaagtgga aagtatctga      2580
     tggggggggt tcaattctca aaaccttctc tctcccatct gacatatggg ctgccaatga      2640
     taggatgaag aatttcctca gctatttcca gtattatact tgtgagggaa tgacatttac      2700
     gctcacgatc accagcattg gattgcatgg aggtacgctg cttgtagcct gggatgcttt      2760
     gagtagtgca acacggagag gaatcgtttc aatgatacag ctgagcaatc tcccctcaat      2820
     gacgctgcat gcaagtggaa gctctattgg tacgcttacc gtgacttctc ctgcgattca      2880
     acaccagatt tgtacgtcag gaagtgaagg ctccatagct aatttgggct ctcttgtgat      2940
     ttctgttgca aatgtgctgt gcgctgattc tgcatcagct caggaactaa acgtcaatgc      3000
     ttgggtacaa tttgacaagc ccaagctcag ctactggaca gcacaacata cgattggcca      3060
     gagcgggggt ttcgaagagt cacaagactt gggtgatttg caggccataa tagcaactgg      3120
     aaaatggtcc actacgagtg acaaaaacct gatggagatc atcgttcacc ccactgcgtg      3180
     ttacgtgtcg gagaaattaa tataccagac caacttgagt gttgtggctc acatgtttgc      3240
     caaatggtct ggatctatga agtacacgtt tgtctttgga gcctcaatgt tcgacagggg      3300
     gaagataatg gtgagtgctg ttccagtgca atttagaaat tcaaagctca ccctatcaca      3360
     aatggcggcg ttcccttcaa tggtatgcga tctgagtgtg gaaacgaggg agttcacttt      3420
     tgaggtgcca tacatttcca tcggcaaaat gagcctggtg tgcaaggatt atctttttga      3480
     catttcttca tacaatgccg atttggttgt gagccgactg catgtgatgg ttctggatcc      3540
     cctggtgaag actggaaact cgtctaattc gataggcttt tatgttgttg ctggacctgg      3600
     caaggatttc aaactgcatc aaatgtgcgg ggtcaagagc caatttgctc atgatgtgct      3660
     gaccgcacag gactttggaa gaagcctatc atgttcgcgt cttcttggca atgggtttaa      3720
     ggaatggtgc tctcgagagt cgttgctgat gaggatccct ctgaaaaatg ggaagaaacg      3780
     agccttcaag tatgctgtga ccccccggat gcgaacactg cctcctgaag ccacaagtct      3840
     tagctggtta agccaaattt ttgttgaatg gcgtggatcc ctgacttata ctattcacgt      3900
     tcaatctggg tccgctatcc agcattcata catgcgcatc tggtacgatc ccaatggaaa      3960
     aactgatgag aaggagatca aatttcttga cagtgcgcat ccaccagcag ggatcaaggt      4020
     gtatcattgg gacctcaaaa taggagatag ctttcgcttc actgtcccat actgtgcgag      4080
     aacagagaaa ttgcagatcc caaaggctta tgcgtcaaca ccatatgagt ggctcacgat      4140
     gtacaatggg gcggtgacta ttgatttgcg tagtggtacc gacatggaac tattcgtctc      4200
     gattgctggg ggagatgatt ttgagatgtt cgaacagacc gtgcctccaa aatgtggctc      4260
     agtgagtgac tcatacaccg tcctgtcgta tgcggatgat attaagagtg tgacggaggt      4320
     gccaaacaag accacgtacc tggcagatga gcaaccaacg acttcagcac cccgtacatc      4380
     tactgtggac actgaggagg acccgccgac tgaaggggag attgcgagaa ctacgaatgg      4440
     aactcttgtg cagtatcgcg gaggagcttg gaaaccaatg gtggagcgta cgccaacgat      4500
     gtcgaagaag caagtgggtc cggagtttac ggtgtcagat cctcaa                     4546
//