ID   JQ975060; SV 1; linear; genomic RNA; STD; VRL; 2280 BP.
XX
AC   JQ975060;
XX
DT   08-OCT-2012 (Rel. 114, Created)
DT   08-OCT-2012 (Rel. 114, Last updated, Version 1)
XX
DE   Sugarcane streak mosaic virus isolate W4 polyprotein gene, partial cds.
XX
KW   .
XX
OS   Sugarcane streak mosaic virus
OC   Viruses; Riboviria; Potyviridae; Poacevirus.
XX
RN   [1]
RP   1-2280
RA   He Z., Li W., Ma W., Ge B., Huang Y., Wang X., Li S.;
RT   "Genetic variability and population structure of Sugarcane streak mosaic
RT   virus (SCSMV)";
RL   Unpublished.
XX
RN   [2]
RP   1-2280
RA   He Z., Li W., Ma W., Ge B., Huang Y., Wang X., Li S.;
RT   ;
RL   Submitted (20-APR-2012) to the INSDC.
RL   State Key Laboratory for Biology of Plant Diseases and Insect Pests,
RL   Institute of Plant Protection, Chinese Academy of Agricultural Sciences,
RL   West 2 Yuanmingyuan Road, Haidian, Beijing 100193, China
XX
DR   MD5; e70001cdbcae763bcfd989646c6f97c3.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2280
FT                   /organism="Sugarcane streak mosaic virus"
FT                   /host="sugarcane"
FT                   /isolate="W4"
FT                   /mol_type="genomic RNA"
FT                   /country="China"
FT                   /collected_by="Wenfeng Li and Zhen He"
FT                   /collection_date="17-May-2010"
FT                   /db_xref="taxon:53954"
FT   5'UTR           1..199
FT   CDS             200..>2280
FT                   /codon_start=1
FT                   /product="polyprotein"
FT                   /db_xref="InterPro:IPR025910"
FT                   /db_xref="UniProtKB/TrEMBL:K0I9G8"
FT                   /protein_id="AFU50431.1"
FT                   /translation="MATITKKQVWKPKERVVSEPPKAEIQESRTTLIFNDYAEVEDFIQ
FT                   RFPAGSVFWTVKGKPKTIVNNLFKATQYGLAYDIAAEVYVCPICMTCARNKVYFTTNHQ
FT                   NCGELFKNKQAYISTSLRLEVVDTFDVFPRYATIEQEKLVGDWMADMEAYAHAEDDSID
FT                   IPYQIFNSDTGEVEERIKQVDLSVHGEIEEVERTYKVKITRSNATMLPHQRRANRVIMR
FT                   TNEIKELIDSTLEICHNRNIKVSFVDHERKRKLFPRIPLKHTIEPQALCDPHHDIIPAT
FT                   EKFISQWKDVGEPTMHINEQWVQKGWSGVVLHKDDLEAHPSLQEKCVDNLFVVLGRCKH
FT                   GDLQNALKPDCCEDLVFYTDAHKARSHILWDAMMKCHPDDHKPIINVWTDEAYENMGYW
FT                   LMATYPFKAICKECVNVKSVRDWVQNMRASKAYNLLRGGTSKHSRDLFRWLAVIQSELM
FT                   TFNIRDAQNTQEDLNRNFLGTIPIGPLFEIANQMNQAVVDIQRGLQQMHKLVTDVEITH
FT                   QARDEQILNEIARLRGLEFMQTEKLITNMKHVAMTYRNLINTASQPLSIHTMRQLLLDA
FT                   RSDEAYEFDIMRGKGAIAIVAPGVFRKFDKIYSEPGVYNVEWTHLTPGGELRTDFDYLR
FT                   TDLKISQLHDKIHKWPENPLIDETCIVSEGEMSYHLCERVYECFVPIPHIMRVGNPQNP
FT                   "
FT   mat_peptide     200..1273
FT                   /product="P1 protein"
FT   mat_peptide     1274..>2280
FT                   /product="Hc-Pro protein"
XX
SQ   Sequence 2280 BP; 743 A; 465 C; 510 G; 562 T; 0 other;
     aaatgtaatt tcaaattgac tacaatcaac tctcttccaa tcgctcaagc tctcacaagc        60
     cttcaaaagc gaccaaaaga gcccagtagc cgaactcggg tggagacacg ccgagtgcta       120
     ctgttccaag cgattaggag agattttagc tttggccaga gacagtttaa cgataagttc       180
     acgagtcgtc tgggaagcta tggcaaccat cactaagaag caagtgtgga agccgaagga       240
     gcgggtggtt agcgaaccac ctaaggctga aattcaagag tcgcgtacga ctcttatttt       300
     caacgactat gcagaagttg aggatttcat tcaacgcttc ccagctggaa gcgtcttctg       360
     gacggttaaa ggcaagccaa aaacgattgt aaacaatttg tttaaggcta cacaatacgg       420
     attggcatac gatattgcag cagaagtata tgtgtgccct atctgtatga cctgcgcacg       480
     caacaaggtt tatttcacca ccaaccatca aaactgtggt gagcttttta aaaacaagca       540
     ggcatacatt tcaacttctc taagactcga agttgtagac acttttgacg tcttcccacg       600
     ctacgcaacc atcgagcagg agaagttagt tggagattgg atggctgata tggaagctta       660
     tgcccacgcc gaggacgatt caattgatat tccatatcaa atcttcaaca gtgacactgg       720
     cgaagttgaa gaacgcatca agcaagttga cctatcagta catggcgaga ttgaggaagt       780
     cgagcgcacg tacaaggtaa agataacgcg ttctaatgcc acaatgttac cacatcagcg       840
     tcgggcaaac cgtgtcatca tgcgcaccaa cgaaattaag gagctgatcg actcgacact       900
     cgaaatatgc cacaacagga acatcaaagt aagcttcgtg gatcatgaac gaaagcgtaa       960
     attatttcca cgaatcccgt tgaaacacac catagaacct caagctttat gtgacccaca      1020
     ccacgacatt atcccagcaa ctgaaaaatt tattagtcag tggaaggatg tgggagaacc      1080
     aacgatgcac attaatgagc aatgggttca gaaaggatgg agcggtgtag ttctacacaa      1140
     agatgatctt gaagcgcacc cgagtctcca agagaaatgt gttgacaact tgtttgtggt      1200
     acttggaagg tgcaaacatg gggatttgca aaatgcactg aagccagatt gctgtgaaga      1260
     tttagtgttt tatactgacg ctcataaggc aaggtcacac attttgtggg acgcgatgat      1320
     gaagtgtcat ccagatgatc acaaacccat tattaacgtt tggacagacg aagcttacga      1380
     gaacatgggt tattggctaa tggctacata tccctttaaa gcaatatgca aagagtgtgt      1440
     aaatgtaaaa tcagttagag attgggttca aaatatgaga gcatctaaag catacaatct      1500
     tttaagagga ggaacatcaa aacactcgcg ggatctcttc aggtggctcg cagttataca      1560
     atctgagctg atgactttca acatacgaga tgctcagaac acgcaagaag atctcaatag      1620
     aaatttcctt ggaacaatac caattggacc tctgttcgaa attgctaatc aaatgaatca      1680
     ggcagttgtt gatattcaac gtgggttaca gcaaatgcat aaactggtga cagacgttga      1740
     gattacacat caagcccgtg atgagcagat cctgaacgaa attgcacggc ttcgtggttt      1800
     agaattcatg caaactgaaa aacttataac caacatgaag cacgttgcaa tgacatatag      1860
     gaatttaatc aacacggcaa gccaaccatt gtcgattcac acaatgaggc aactcttatt      1920
     agatgcaaga agtgatgagg catatgagtt tgatataatg cgcggtaaag gagcaattgc      1980
     tatcgtagca cctggagtgt tccggaaatt tgataagatt tattcagaac caggtgtgta      2040
     caatgtggag tggacgcatc taacaccagg tggagaacta agaactgatt ttgattactt      2100
     gagaactgat ctcaaaatct cccagttaca tgataaaatt cataaatggc cagaaaatcc      2160
     actaattgac gaaacgtgta ttgtctctga aggcgaaatg tcatatcact tgtgtgaacg      2220
     agtctatgaa tgctttgtgc ctattccaca tatcatgcga gttggcaatc cacagaatcc      2280
//