ID JQ975077; SV 1; linear; genomic RNA; STD; VRL; 2280 BP. XX AC JQ975077; XX DT 08-OCT-2012 (Rel. 114, Created) DT 08-OCT-2012 (Rel. 114, Last updated, Version 1) XX DE Sugarcane streak mosaic virus isolate M114 polyprotein gene, partial cds. XX KW . XX OS Sugarcane streak mosaic virus OC Viruses; Riboviria; Potyviridae; Poacevirus. XX RN [1] RP 1-2280 RA He Z., Li W., Ma W., Ge B., Huang Y., Wang X., Li S.; RT "Genetic variability and population structure of Sugarcane streak mosaic RT virus (SCSMV)"; RL Unpublished. XX RN [2] RP 1-2280 RA He Z., Li W., Ma W., Ge B., Huang Y., Wang X., Li S.; RT ; RL Submitted (20-APR-2012) to the INSDC. RL State Key Laboratory for Biology of Plant Diseases and Insect Pests, RL Institute of Plant Protection, Chinese Academy of Agricultural Sciences, RL West 2 Yuanmingyuan Road, Haidian, Beijing 100193, China XX DR MD5; 40474ebda073c48dacf798fc92234192. XX FH Key Location/Qualifiers FH FT source 1..2280 FT /organism="Sugarcane streak mosaic virus" FT /host="sugarcane" FT /isolate="M114" FT /mol_type="genomic RNA" FT /country="China" FT /collected_by="Wenfeng Li and Zhen He" FT /collection_date="03-Jun-2011" FT /db_xref="taxon:53954" FT 5'UTR 1..199 FT CDS 200..>2280 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="InterPro:IPR025910" FT /db_xref="UniProtKB/TrEMBL:K0I805" FT /protein_id="AFU50448.1" FT /translation="MATITKKQVWKPKERVVSEPPKAEIQESRTTLLFNDYAEVEDFIQ FT RFPAGSVFWTVRGKPKTIVNNLFKATQYGLAYDIAAEVYVCPICMTCARNKVYFTTNHQ FT NCGELFRNKQAYISTSLRLEVVDTFDVFPRYATVEQEKLVGDWMADMEAYAHAEDDSID FT IPYQIFNSDTGEVEERIKQVDLSVHGEIEEVERTYKVKIARSNATMLPHQRRANRVIMR FT TNEIKELIDSTLEICHNRNIKVSFVDHERKHKLFPRIPLKHTIEPQTLCDPHHDIVPAT FT EKFISQWKDVGEPTMHINEQWVQKGWSGVVLHKDDLEAHPSLQEKCVDNLFVVLGRCKH FT GDLQNALKPDCCEGLVFYTDAHKAKSHILWDAMMKCHPDDHKPIINVWTDEAYENMGYW FT LMATYPFKAICKECVNVKSIRDWVQNMRASKAYQFLRGGTSKHSRDLFRWLAVIQSELM FT TFNIRDAQNTQEDLNRNFLGTIPIGPLFEIANQMNQAVVDIQRGLQQMHKLVTDVEITH FT QARDEQILNEIARIRGLEFMQTEKLITNMKHVAMTYRNLINTASQPLSIHTMRQLLLDA FT RSDEAYEFDIMRGKGAIAIVAPGVFRKFDKIYSEPGVYNVEWTHLTPGGELRTDLDYLR FT TDLKISQLHDKIHKWPENPLVDETCIVSEGEMSYHLCERVYECFVPIPHIMRVGNPQNP FT " FT mat_peptide 200..1273 FT /product="P1 protein" FT mat_peptide 1274..>2280 FT /product="Hc-Pro protein" XX SQ Sequence 2280 BP; 739 A; 466 C; 517 G; 558 T; 0 other; aaatgtaatt tcaaattgac tacaatcaac tctcttccaa tcgctcaagc tctcacaagc 60 cttcaaaagc gaccaaaaga gcccagtagc cgaactcggg tggagacacg ccgggtgcta 120 ctgtttcaag cgatcaggag agaatttagc tttggccaga gacagtttaa cgataagttc 180 acgagtcgtc tgggaagcta tggccactat cactaagaag caagtgtgga agccgaagga 240 gcgggtggtt agcgaaccac ctaaggctga aattcaagag tcgcgcacga ctcttctttt 300 caacgactat gcggaagttg aggatttcat tcaacgcttc ccagctggaa gcgtcttttg 360 gacggttaga ggtaagccaa aaacgattgt aaacaatttg tttaaggcta cacaatacgg 420 attagcatac gacattgcag cagaagtata tgtgtgccct atctgtatga cctgcgcacg 480 caacaaagtt tacttcacca ccaaccatca aaactgtggt gaactcttca ggaacaagca 540 ggcatacatt tcaacttctc tgagactcga agtggttgac acttttgacg ttttcccacg 600 ctacgcaacc gttgagcaag agaagttagt tggagattgg atggctgata tggaagcgta 660 tgcccacgcc gaggatgatt caattgatat cccatatcaa atcttcaata gtgacactgg 720 cgaagttgaa gaacgcatca agcaagttga tttatcagta catggcgaga ttgaagaagt 780 tgagcgcacg tacaaggtaa agattgcgcg ttctaatgct acaatgttac cacatcagcg 840 tcgggcaaac cgtgtcatca tgcgcaccaa tgaaattaag gagctgatcg actcgacact 900 cgaaatatgc cacaacagga acatcaaagt aagcttcgtg gatcatgaac gaaagcacaa 960 attatttcca cgaatcccgt tgaaacacac catagaacct caaaccctat gtgacccaca 1020 ccacgacatt gtcccagcaa ctgagaaatt tattagtcag tggaaggatg tgggagaacc 1080 aacaatgcac attaatgagc aatgggttca gaaaggatgg agcggtgtag ttctacacaa 1140 agatgatctc gaagcgcacc cgagtctcca agagaaatgt gttgacaact tgtttgtggt 1200 acttggaagg tgcaaacatg gggatttgca aaatgcactg aagccagatt gctgtgaagg 1260 tttagtattt tatactgatg ctcataaggc aaagtcacat attttgtggg acgcgatgat 1320 gaagtgtcat ccagatgatc acaaacctat tattaacgtt tggacagacg aagcttacga 1380 gaacatgggt tattggctaa tggctacata tccctttaaa gcaatatgca aagagtgtgt 1440 aaatgtaaaa tcaattagag actgggttca aaatatgaga gcatcaaaag cataccaatt 1500 tttaagagga ggaacatcaa aacactcgcg ggatctcttc aggtggcttg cagttataca 1560 atctgagctg atgactttca acatacgaga tgctcagaac acgcaagagg atctcaatag 1620 aaatttcctt ggaacgatac caattggacc tctgttcgaa attgctaatc aaatgaatca 1680 ggcagttgtt gatattcaac gagggttgca gcaaatgcat aaactggtga cagacgttga 1740 gatcacacat caagcccgtg atgagcagat cctgaacgaa attgcacgga ttcgtggctt 1800 agaattcatg caaactgaaa aacttataac caacatgaag cacgttgcaa tgacatatag 1860 gaatttaatc aacacggcaa gccaaccatt gtcgattcac acaatgaggc aactcttatt 1920 agatgcaaga agcgatgagg catatgagtt tgacataatg cgcggtaaag gagcaattgc 1980 tatcgtagca cctggagtgt tccggaaatt tgataagatt tattcagaac caggtgtgta 2040 caatgtggaa tggacgcatc taacgccagg tggagaacta agaactgatc ttgattactt 2100 gagaactgat ctcaaaatct cccagttgca tgataaaatt cacaaatggc cagaaaatcc 2160 actagttgac gaaacatgta ttgtctctga gggtgaaatg tcatatcact tgtgtgaacg 2220 agtctatgaa tgctttgtgc ctattccaca tatcatgcga gttggaaacc cacagaatcc 2280 //