ID U07219; SV 1; linear; unassigned RNA; STD; VRL; 2789 BP. XX AC U07219; XX DT 15-APR-1994 (Rel. 39, Created) DT 04-MAR-2000 (Rel. 63, Last updated, Version 2) XX DE Sorghum mosaic virus strain Sugarcane mosaic virus H polyprotein (nuclear DE inclusion II protein, coat protein) gene, partial cds. XX KW . XX OS Sorghum mosaic virus OC Viruses; Riboviria; Potyviridae; Potyvirus. XX RN [1] RP 1-2789 RA Jilka J.M., Clark J.M.Jr.; RT "Sequence analysis of the 3' regions of select strains of Maize dwarf RT mosaic virus and Sugarcane mosaic virus"; RL Unpublished. XX RN [2] RP 1-2789 RA Jilka J.M.; RT ; RL Submitted (28-FEB-1994) to the INSDC. RL Joseph M. Jilka, Biotechnology, Pioneer Hi-Bred International, Inc., 7300 RL NW 62nd Ave, Johnston, IA 50131, USA XX DR MD5; b07f7b0a752b4b49d8a151f26bfd21b3. DR EuropePMC; PMC4317601; 25685813. XX FH Key Location/Qualifiers FH FT source 1..2789 FT /organism="Sorghum mosaic virus" FT /strain="Sugarcane mosaic virus H" FT /mol_type="unassigned RNA" FT /note="SrMV-SCH" FT /db_xref="taxon:32619" FT CDS <1..2553 FT /codon_start=1 FT /product="polyprotein" FT /db_xref="GOA:Q88172" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001592" FT /db_xref="InterPro:IPR007094" FT /db_xref="UniProtKB/TrEMBL:Q88172" FT /protein_id="AAA16874.1" FT /translation="GMTHETWLTNDIRDNLQVVAKCPGQLVTKHVVKGQCPHFSLYLST FT HDEANSFFKPLLGKYDKSRLNKAAFIKDLTKYAKPTYIGEVNPQTFQRAVDKVKETLWA FT VGMQTCNYITDEEEIFKSLNMNAAVGALYTGKKKDYFMDFSDADKEEILKQSCERLYKG FT KLGIWNGSLKAEIRPIEKTEANKTRTFTAAPLETLLGGKVCVDDFNNQFYSHHLEGPWT FT VGITKFMEGWNKLLNKLPEGWIYCDADGSQFDSSLTPYLINAVLDIRLHFMEDWSIGEK FT MLRNLYTEIVYTPIATPDGSVIKKFKGNNSGQPSTVVDNTLMVIIAFNYTMLSCGIEAD FT MIDEICKMYANGDDLLLAIRPDYEHFIDNFSKHFADLGLNFDFTSRTRDRTELWFMSTR FT GIKIDNMYIPKLEQERIVAILEWDRSLLPQYRLEAICAAMVESWGYPQLLHEIRKFYAW FT ILEMQPFATLAKEGLAPYIAETALRNLYTGEGIKEGELDVYYTQFLKDLPEYIEDELID FT VRHQAGGGTVDAGATTAEATAQAQRDAAAKAQRDADAKKKADDEAAERQRQDAAAKKKA FT DDDAKAKADAIVKQNQIADAKKKADDEAARKAQNQKDKDVDVGTSGTVAVPKLKAMSKK FT MKLPQAKGKNILHLDFLLGYKPQQQDISNTRATRDEFDRWYDALQKEYELDDTQMTVVA FT SGLMVWVIENGCSPNINGVWTMMDGDEQRKFPLKPVIEYASPTFRQIMHHFSDAAEAYI FT EYRNSTERYMPRYGLQRNLTDYNLARYAFDFYEITSRTPARRREAHMQMKAAAVGGSNT FT RMKGLDGNVGESQENTERHTAGDESRNMHSLLGVQQHH" FT mat_peptide <1..1563 FT /product="nuclear inclusion II protein" FT mat_peptide 1564..2550 FT /product="coat protein" FT 3'UTR 2554..2789 XX SQ Sequence 2789 BP; 956 A; 509 C; 650 G; 674 T; 0 other; ggaatgacac acgagacatg gctaacaaat gatatcagag ataaccttca agtagttgca 60 aaatgtccag gccagttagt cacaaaacac gtagtcaagg gacaatgccc tcatttttct 120 ttgtatttat caacacacga tgaagcgaat tctttcttca aaccactgct agggaaatac 180 gataagagta ggctaaataa agcagcattc ataaaagatc ttacgaaata tgcaaaacca 240 acttatatcg gtgaggttaa tccacaaact ttccaaagag cagttgataa agttaaagaa 300 acactgtggg ccgttggcat gcaaacatgt aactacataa cggatgagga agaaattttt 360 aaatcattga atatgaatgc cgcagtagga gctttgtaca ccggaaagaa gaaggactat 420 ttcatggact tttcggatgc cgataaagaa gaaattctca aacaatcgtg tgagcgctta 480 tataaaggga aacttggaat ttggaatgga tctctaaaag cagagatccg accaattgaa 540 aagacagaag caaataaaac gagaactttt acagcagccc cactagaaac attacttgga 600 ggaaaagtgt gcgtagatga ttttaacaat caattttatt cacaccattt ggaaggacct 660 tggaccgttg gaataacaaa atttatggag gggtggaaca aacttctgaa taaacttcca 720 gaggggtgga tatattgcga tgcagatggt tcacaattcg atagttcact aacaccttat 780 ctcatcaatg cagtgttgga catcagattg cattttatgg aagattggag tatcggagag 840 aaaatgctca ggaaccttta tacagaaatt gtttatactc ctatagcaac accagatgga 900 tccgttataa agaaattcaa aggaaataat agtggacaac catcaaccgt tgttgataac 960 acactaatgg tgatcatagc gtttaactat acgatgttgt catgtgggat cgaagcggat 1020 atgattgatg aaatatgcaa aatgtatgca aatggggacg accttttgtt agcaatacgg 1080 ccagattacg aacattttat tgataatttc tcaaaacact ttgctgatct aggtcttaac 1140 ttcgatttta catcacgcac aagagatagg acggaattgt ggtttatgtc gacacgaggc 1200 attaaaattg acaatatgta catcccaaaa ttggaacagg aaagaattgt tgctatttta 1260 gaatgggata gatcattatt accacaatat agactggaag cgatatgtgc tgcaatggtg 1320 gaatcatggg gatatccaca attattacat gagattagga aattttatgc ttggattctc 1380 gaaatgcagc cattcgccac tctagcgaaa gaaggacttg ccccgtacat agcagaaacg 1440 gctttgcgta atctttatac aggggaagga ataaaggaag gggagttgga tgtttattac 1500 acacaattcc tcaaagattt gcctgaatac atagaggatg aattaattga cgtgcgtcat 1560 caggcaggag gcggtacagt agatgcagga gcaaccacag cagaagcaac agcacaagca 1620 cagcgtgatg cagcagcgaa agctcaacga gatgctgacg cgaagaagaa ggcggatgat 1680 gaagcggcag agaggcagag acaagatgcc gcggcaaaga agaaagctga tgatgatgca 1740 aaagctaaag ctgatgcgat cgtaaagcaa aatcagattg ctgatgcgaa aaagaaagca 1800 gacgacgaag cagcaagaaa agcacaaaat caaaaagaca aagatgtgga tgtcggcaca 1860 tctggcacgg tggcagtgcc taagctcaaa gcaatgtcca agaaaatgaa actaccacaa 1920 gcaaaaggga aaaacatttt acacttggat tttcttttgg gatacaagcc acaacaacaa 1980 gacatttcaa acaccagagc cacacgggat gagttcgata ggtggtatga tgcattgcag 2040 aaggaatatg aactagatga tacgcagatg acagtggtcg caagcggact catggtttgg 2100 gtcatagaga acggatgctc acctaatatt aatggtgttt ggacaatgat ggatggagat 2160 gagcaaagga aatttccgct caagcccgtt attgaatatg catctccaac attcagacag 2220 ataatgcacc actttagtga tgcagctgaa gcgtacatag agtatcggaa ctcgacagag 2280 cgttatatgc caagatacgg acttcagcga aacttaaccg actataacct agcccggtac 2340 gcattcgatt tctatgaaat aacttcgcgt acaccggcga gacgtagaga ggcccacatg 2400 cagatgaaag cagcagcagt cggtggatca aacacgcgca tgaaaggctt ggatgggaat 2460 gtcggtgaga gtcaggagaa tacagaacgt cacacagctg gcgatgagag tcgcaatatg 2520 cactcccttc tcggagtgca gcagcatcac tgatgtactg agatcttcat tgcagtttta 2580 agagtatttt atatatttac tatttcagtg agggtctccc tccttagtat tatatatgta 2640 ctttagaaat agtagtcatt ctgcagggga gtgaggttca cctccaaccc tatggttact 2700 atttgctact agcgtcgaac tacattacgg acaccctgtt gtgtggttct accacgagtc 2760 aggagttgcg agtattgtag caagagaca 2789 //