ID JQ975070; SV 1; linear; genomic RNA; STD; VRL; 2280 BP. XX AC JQ975070; XX DT 08-OCT-2012 (Rel. 114, Created) DT 08-OCT-2012 (Rel. 114, Last updated, Version 1) XX DE Sugarcane streak mosaic virus isolate M111 polyprotein gene, partial cds. XX KW . XX OS Sugarcane streak mosaic virus OC Viruses; Riboviria; Potyviridae; Poacevirus. XX RN [1] RP 1-2280 RA He Z., Li W., Ma W., Ge B., Huang Y., Wang X., Li S.; RT "Genetic variability and population structure of Sugarcane streak mosaic RT virus (SCSMV)"; RL Unpublished. XX RN [2] RP 1-2280 RA He Z., Li W., Ma W., Ge B., Huang Y., Wang X., Li S.; RT ; RL Submitted (20-APR-2012) to the INSDC. RL State Key Laboratory for Biology of Plant Diseases and Insect Pests, RL Institute of Plant Protection, Chinese Academy of Agricultural Sciences, RL West 2 Yuanmingyuan Road, Haidian, Beijing 100193, China XX DR MD5; 891d1acc7b07c059a49d2b8e53ebedd1. XX FH Key Location/Qualifiers FH FT source 1..2280 FT /organism="Sugarcane streak mosaic virus" FT /host="sugarcane" FT /isolate="M111" FT /mol_type="genomic RNA" FT /country="China" FT /collected_by="Wenfeng Li and Zhen He" FT /collection_date="03-Jun-2011" FT /db_xref="taxon:53954" FT 5'UTR 1..199 FT CDS 200..>2280 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="InterPro:IPR025910" FT /db_xref="UniProtKB/TrEMBL:K0I9H9" FT /protein_id="AFU50441.1" FT /translation="MATITKKQVWKPKERVVSEPPKAEIQESRTTLLFNDYAEVEDFIQ FT RFPAGSVFWTVKGKPKTIVNNLFKATQYGLAYDIAAEVYVCPICMTCARNKVYFTTNHQ FT NCGELFRNKQAYISTSLRLEVVDTFDVFPRYATVEQEKLVGDWMADMEAYAHAEDDSID FT IPYQIFNSDTGEVEERIKQVDLSVHGEIEEVERTYKVKIARSNATMLPHQRRANRVIMR FT TNEIKELIDSTLEICHNRNIKVSFVDHERKRKLFPRIPLKHTIEPQALCDPHHDIIPAT FT EKFISQWKDVGEPTMHINEQWVQKGWSGVVLHKDDLEAHPSLQEKCVDNLFVVLGRCKH FT GDLQNALKPDCCEDLVFYTDAHKAKSHILWDAMMKCHPDDHKPIINVWTDEAYENMGYW FT LMATYPFKAICKECVNVKSIRDWVQNMRASKAYQFLRGGTSKHSRDLFRWLAVIQSELM FT TFNIRDAQNTQEDLNRNFFGTIPIGPLFEIANQMNQAVVDIQRGLQQMHKLVTDVEITH FT QARDEQILNEIARIRGLEFMQTEKLMTNMKHVAMTYRNLINTASQPLSIHTMRQLLLDA FT RSDEAYEFDIMRGKGAIAIVAPGVFRKFDKIYSEPGVYNVEWTHLTPGGELRTDFDYLR FT TDLKISQLHDKIHKWPENPLVDETCIVSEGEMSYHLCERVYECFVPIPHIMRVGNPQNP FT " FT mat_peptide 200..1273 FT /product="P1 protein" FT mat_peptide 1274..>2280 FT /product="Hc-Pro protein" XX SQ Sequence 2280 BP; 737 A; 464 C; 515 G; 564 T; 0 other; aaatgtaatt tcaaattgac tacaatcaac tctcttccaa tcgctcaagc tctcacaagc 60 cttcaaaagc gaccaaaaga gcccagtagc cgaactcggg tggagacacg ccgggtgcta 120 ctgtttcaag cgatcaggag agaatttagc tttggccaga gacagtttaa cgataagttc 180 acgagtcgtc tgggaagcta tggctactat cactaagaag caagtgtgga agccgaagga 240 gcgggtggtt agcgaaccac ctaaggctga aattcaagag tcgcgcacga ctcttctttt 300 caacgactat gcggaagttg aggatttcat tcaacgcttc ccagctggaa gcgtcttctg 360 gacggttaaa ggtaagccaa aaacgattgt aaacaatttg tttaaggcta cacaatacgg 420 attagcatac gacattgcag cagaagtata tgtgtgccct atctgtatga cctgcgcacg 480 caacaaggtt tacttcacca ccaaccatca aaactgtggt gaactcttca ggaacaagca 540 ggcatacatt tcaacttctc tgagactcga ggttgtcgac acttttgacg ttttcccacg 600 ctacgcaacc gttgagcaag agaagttagt tggagattgg atggccgata tggaagccta 660 tgcccacgcc gaggatgatt caatcgatat tccatatcaa atcttcaata gtgacactgg 720 cgaagttgaa gaacgcatca agcaagttga tttatcagta catggcgaga ttgaagaagt 780 tgagcgcacg tacaaggtaa agattgcgcg ttctaatgcc acaatgttac cacatcagcg 840 tcgggcaaat cgtgtcatca tgcgcaccaa tgaaattaag gagctgatcg attcgacact 900 cgaaatatgc cacaacagga acatcaaagt aagcttcgtg gatcatgaac gaaagcgcaa 960 attatttcca cgaatcccgt tgaaacacac catagaacct caagccctat gtgacccaca 1020 ccacgacatt atcccagcaa ctgagaaatt tattagtcag tggaaggatg tgggagaacc 1080 aacaatgcac attaatgagc aatgggttca gaaaggatgg agcggtgtag ttctacacaa 1140 agatgatctc gaagcgcacc cgagtctcca agagaaatgt gttgacaact tgtttgtggt 1200 acttggaagg tgtaaacatg gggatttgca aaatgcactg aagccagatt gctgtgaaga 1260 tttagtattt tatactgatg ctcataaggc aaagtcacac attttgtggg acgcgatgat 1320 gaagtgtcat ccagatgatc acaaacccat tattaacgtt tggacagacg aagcttacga 1380 aaacatgggt tattggctaa tggctacata tccctttaaa gcaatatgca aagagtgtgt 1440 aaatgtaaaa tcaattagag attgggttca aaatatgaga gcatctaaag cataccaatt 1500 tttaagagga ggaacatcaa aacactcgcg ggatctcttc aggtggcttg cagttataca 1560 atccgagctg atgactttca acatacgaga tgctcagaac acgcaagagg atctcaatag 1620 aaatttcttt ggaacgatac caattggacc tctgttcgaa attgctaatc aaatgaatca 1680 ggcagttgtt gatattcaac gagggttgca gcaaatgcac aaactggtga cagacgttga 1740 gattacacat caagcccgtg atgagcagat cctgaacgaa attgcacgga ttcgtggctt 1800 agaattcatg caaactgaaa aacttatgac caacatgaag cacgttgcaa tgacatatag 1860 gaatttaatc aacacggcaa gccaaccatt gtcgattcac acaatgaggc aactcttatt 1920 agatgcaagg agtgatgagg catatgagtt tgacataatg cgcggtaaag gagcaattgc 1980 tatcgtagca cctggagtgt tccggaaatt tgataagatt tattcagaac caggtgtgta 2040 caatgtggaa tggacacatt taacgccagg tggagaacta agaactgatt ttgattactt 2100 aagaactgat ctcaaaatct cccagttgca tgataaaatt cataaatggc cagaaaatcc 2160 actagttgac gaaacatgta ttgtctctga gggtgaaatg tcatatcact tgtgtgaacg 2220 agtctatgaa tgctttgtgc ctattccaca tatcatgcga gttggcaatc cacagaatcc 2280 //