ID D14995; SV 2; linear; genomic RNA; STD; VRL; 6495 BP. XX AC D14995; S47260; XX DT 21-DEC-1993 (Rel. 38, Created) DT 26-FEB-2008 (Rel. 94, Last updated, Version 7) XX DE Apple stem grooving virus genome, complete sequence. XX KW NTP-binding helicase; RNA-dependent RNA polymerase; serine protease. XX OS Apple stem grooving virus OC Viruses; Riboviria; Tymovirales; Betaflexiviridae; Trivirinae; OC Capillovirus. XX RN [1] RP 1-6495 RA Yoshikawa N.; RT ; RL Submitted (14-APR-1993) to the INSDC. RL Contact:Nobuyuki Yoshikawa Iwate University, Bioscience and Technology; RL Ueda 3-18-8, Morioka, Iwate 020, Japan XX RN [2] RX DOI; 10.1016/0042-6822(92)90170-T. RX PUBMED; 1413530. RA Yoshikawa N., Sasaki E., Kato M., Takahashi T.; RT "The nucleotide sequence of apple stem grooving capillovirus genome"; RL Virology 191(1):98-105(1992). XX DR MD5; 9d423f7c90cb12607d3862eb12ee93ba. DR EuropePMC; PMC4118050; 24998458. XX FH Key Location/Qualifiers FH FT source 1..6495 FT /organism="Apple stem grooving virus" FT /mol_type="genomic RNA" FT /db_xref="taxon:28347" FT CDS 36..6353 FT /codon_start=1 FT /product="241k polyprotein" FT /note="contains two consensus sequences associated with FT RNA-dependent RNA polymerase and NTP-binding helicase." FT /db_xref="GOA:P36309" FT /db_xref="InterPro:IPR001788" FT /db_xref="InterPro:IPR002588" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR008745" FT /db_xref="InterPro:IPR008879" FT /db_xref="InterPro:IPR027351" FT /db_xref="UniProtKB/Swiss-Prot:P36309" FT /protein_id="BAA03639.1" FT /translation="MAFTYRNPLEIAINKLPSKQSDQLLSLTTDEIEKTLEVTNRFFSF FT SITPEDQELLTKHGLTLAPIGFKSHSHPISKMIENHLLYICVPSLLSSFKSVAFFSLRE FT NKVDSFLKMHSVFSHGKIKSLGMYNAIIDGKDKYRYGDVEFSSFRDRVIGLRDQCLTRN FT KFPKVLFLHDELHFLSPFDMAFLFETIPEIDRVVATTVFPIELLFGDKVSKEPRVYTYK FT VHGSSFSFYPDGVASECYEQNLANSKWPFTCSGIQWANRKIRVTKLQSLFAHHVFSFDR FT GRACNEFNHFDKPSCLLAEEMRLLTKRFDKAVINRSTVSSLSTYMACLKTANAASAVAK FT LRQLEKRDLYPDELNFVYSFGEHFKNFGMRDDFDVSVLQWVKDKFCQVMPHFIAASFFE FT PTEFHLNMRKLLNDLATKGIEVPLSVIILDKVNFIETRFHARMFDIAQAIGVNLDLLGK FT RFDYEAESEEYFSENGYIFMPSKSNPERNWILNSGSLKIDYSRLVRARRFRLRRDFLDP FT ISKGKSPRKQLFLESTGNIKSNPNAEKNSESGEIKIEGSAENDQPHEVSHTSMETEDGQ FT GFEGSIPVDLINCFEPEEIKLPKRRRKNDCVFKAISAHLGIDSQDLLNFLVNEDISDEL FT LDCIEEDKGLSHEMIEEVLITKGLSMVYTSDFKEMAVLNRKYGVNGKMYCTIKGNHCEL FT SSKECFIRLLKEGGEAQMSNENLNADSLFDLGRFVHNRDRAVKLAKSMARGTTGLLNEF FT DLEFCKNMVTLSELFPENFSSVVGLRLGFAGSGKTHKVLQWINYTPSVKRMFISPRRML FT ADEVEPQLKGTACQVHTWETALKKIDGTFMEVFVDEIGLYPPGYLTLLQMCAFRKIVKG FT QSENFLKGKLLELSKTCLNIRCFGDPLQLRYYSAEDTNLLDKTHDIDLMIKTIKHKYLF FT QGYRFGQWFQELVNMPTRVDESKFSRKFFADISSVKTEDYGLILVAKREDKGVFAGRVP FT VATVSESQGMTISKRVLICLDQNLFAGGANAAIVAITRSKVGFDFILKGNSLKEVQRMA FT QKTIWQFIIEGKSIPMERIVNMNPGASFYESPLDVGNSSIQDKASNDLFIMPFINLAEE FT EVDPEEVVGDVIQPVEWFKCHVPVFDTDPTLAEIFDKVAAKEKREFQSVLGLSNQFLDM FT EKNGCKIDILPFARQNVFPHHQASDDVTFWAGVQKRIRKSNWRREKSKFEEFESQGKEL FT LQEFISMLPFEFKVNIKEIEDGEKSFLEKRKLKSEKMWANHSERSDIDWKLDHAFLFMK FT SQYCTKEGKMFTEAKAGQTLACFQHIVLFRFGPMLRAIESAFLRSCGDSYYIHSGKNFF FT CLDSFVTKNASVFDGFSIESDYTAFDSSQDHVILAFEMALLQYLGVSKEFQLDYLRLKL FT TLGCRLGSLAIMRFTGEFCTFLFNTFANMLFTQLKYKIDPRRHRILFAGDDMCSLSSLK FT RRRGERATRLMKSFSLTAVEEVRKFPMFCGWYLSPYGIIKSPKLLWARIKMMSERQLLK FT ECVDNYLFEAIFAYRLGERLYTILKEEDFEYHYLVIRFFVRNSKLLTGLSKSLIFEIGE FT GIGSKWLSSTSTASSRRSNLQTSKLMLSRPQSFTRMQPFSNQTCLIASKGLNQTSRFPL FT DLVTASSCLISNCLMTPKLIQSGRKATSTNTYTMESSWLGSKQCCQTLEAWKGESLYMM FT EPAWIRKEATFARIFSSLSLTVATLVSGQSTVCLPQTQIWPKGLDFVWTLIVHNMNRTL FT SCLLLTLELHTDASTLQGFWKPKLAIQDGLHRQSAAVKHLNSMRKSRWPSWIADPRCFW FT KKVHQTCTLKRDCSEVTRLEGHAQFPLKGGQTQGCKKREDLGPSRLELKDLEKMSLEDV FT LQQARRHRVGVYLWKTHIDPAKELLTVPPPEGFKEGESFEGKELYLLLCNHYCKYLFGN FT IAVFGSSDKTQFPAVGFDTPPVHYNLTTTPKEGETDEGRKARAGSSGEKTKIWRIDLSN FT VVPELKTFAATSRQNSLNECTFRKLCEPFADLAREFLHERWSKGLATNIYKKWPKAFEK FT SPWVAFDFATGLKMNRLTPDEKQVIDRMTKRLFRTEGQKGVFEAGSESNLELEG" FT CDS 4787..5749 FT /codon_start=1 FT /product="36K protein" FT /note="contains consensus sequence found in the active site FT of several cellular and viral serine proteases." FT /db_xref="GOA:P36698" FT /db_xref="InterPro:IPR001815" FT /db_xref="InterPro:IPR028919" FT /db_xref="UniProtKB/Swiss-Prot:P36698" FT /protein_id="BAA03640.1" FT /translation="MAIVNVNRFLKEVESTDLKIDAISSSELYKDATFFKPDVLNCIKR FT FESNVKVSSRSGDGLVLSDFKLLDDTEIDSIRKKSNKYKYLHYGVILVGIKAMLPNFRG FT MEGRVIVYDGACLDPKRGHICSYLFKFESDCCYFGLRPEHCLSTTDANLAKRFRFRVDF FT DCPQYEQDTELFALDIGVAYRCVNSARFLETKTGDSGWASQAISGCEALKFNEEIKMAI FT LDRRSPLFLEEGAPNVHIEKRLFRGDKVRRSRSISAKRGPNSRVQEKRGFRSLSARIER FT FGKNEFGRRASASEAPPGRSISMEDSHRPGKGTSDGSSP" FT polyA_site 6495 XX SQ Sequence 6495 BP; 1985 A; 1196 C; 1496 G; 1818 T; 0 other; aaatttaaca ggcttaattt ccgcgcttta cgtcaatggc tttcacttac agaaaccccc 60 tcgaaattgc aatcaacaaa cttcctagta agcagtctga tcaactgctt tccttgacca 120 ccgacgagat tgaaaagacc ttagaagtga ccaaccgctt cttctctttt tcaatcacac 180 cagaagatca agaattgttg actaagcatg gtctaacact tgcacctata gggtttaagt 240 cacactccca tccaatatcc aaaatgatag aaaatcatct cctgtatata tgtgttccga 300 gtcttttatc ctcctttaag tcagttgcct ttttttcact tagggaaaat aaagtagaca 360 gttttcttaa gatgcattca gtcttttccc atggaaaaat taaatctttg gggatgtaca 420 atgctataat tgatgggaaa gataaatata ggtatggtga tgtagagttt tcatctttta 480 gggatagagt gattggtctt agagatcaat gccttacacg taacaaattt ccaaaagttc 540 tgtttcttca cgacgagttg cactttctaa gtccatttga catggctttc ctatttgaga 600 caatcccaga aattgataga gttgttgcaa ccacagtttt tccaatagaa cttttattcg 660 gggacaaggt ctctaaggaa cccagggttt atacctacaa ggtccatggc tcttcatttt 720 cattttatcc ggatggtgtt gcctctgagt gttacgaaca gaatttggca aattctaaat 780 ggcccttcac ctgcagcggc atacaatggg ctaacaggaa aattagggta accaagctac 840 agagtctctt cgcccatcat gttttctcat ttgacagggg gagggcttgt aatgaattta 900 atcatttcga caaacctagc tgtctacttg cggaagaaat gcgccttttg accaaaaggt 960 ttgataaagc agttattaac agaagcacag tctcttccct cagtacatac atggcttgtc 1020 ttaaaactgc aaatgcggct tcagctgttg ccaagctgag gcagttggag aagagggatc 1080 tttacccaga tgagttgaac ttcgtctatt cctttggaga gcatttcaaa aattttggga 1140 tgagagatga ctttgatgtg tcagttctac aatgggtcaa agacaaattt tgccaggtca 1200 tgcctcactt catcgccgcc agtttctttg aaccaacaga atttcattta aacatgcgca 1260 aattgttgaa tgatctggct actaaaggaa tagaggttcc cctttctgtg atcatcctgg 1320 acaaagtcaa cttcatagag accagatttc atgccaggat gttcgacata gcacaggcaa 1380 tcggggtgaa cctagattta ctggggaaaa gatttgatta tgaggctgag agtgaagagt 1440 acttttcaga gaacggttac atctttatgc cctctaaatc aaatccagag agaaattgga 1500 ttctaaattc cggttcgctg aaaattgact attcaagatt ggtaagagcc aggagattta 1560 gattgagaag agatttccta gatcccatat ctaaaggaaa atcccctaga aaacaactct 1620 tcttggagtc aacgggaaac attaaatcaa atcccaatgc tgaaaaaaat agcgagagcg 1680 gcgaaataaa gattgaaggc agtgccgaaa atgatcagcc acatgaggta tcacatactt 1740 caatggaaac cgaggatgga cagggttttg aaggttcaat accagttgat ttaatcaatt 1800 gctttgaacc agaagaaatc aagcttccaa agagaagaag gaaaaatgat tgcgtcttca 1860 aggccatctc tgcacacttg gggattgact ctcaagattt gttgaatttt ttggtaaatg 1920 aagacatatc agatgaatta cttgattgca ttgaagagga caaaggactg tcacatgaaa 1980 tgattgaaga agttttgatc acaaagggtc tttcaatggt ttatacttct gacttcaaag 2040 aaatggcagt tcttaataga aagtatggag tgaatggcaa gatgtactgc acaattaaag 2100 gcaatcactg cgagctgagt tccaaagagt gcttcatcag attattgaaa gaaggtggtg 2160 aagcgcagat gtcaaatgaa aatctaaatg ctgattcctt gttcgacctt ggaagatttg 2220 tgcataatag agacagggct gtcaagctag caaaatcaat ggcaagaggc acaacaggcc 2280 tcctgaatga attcgaccta gaattctgca agaacatggt gaccctttcg gagttgtttc 2340 ctgaaaactt ttcttctgtt gtcgggctaa ggcttgggtt tgcgggttct ggtaaaacgc 2400 ataaggtgct tcaatggatt aattacactc caagtgtcaa aagaatgttt ataagtccaa 2460 ggagaatgct ggcggatgaa gttgaacctc aactcaaggg aacggcctgt caggtgcata 2520 catgggagac tgcacttaaa aaaatcgacg gaacttttat ggaagttttt gttgatgaaa 2580 taggtttgta cccacctgga taccttacac tgctacagat gtgtgctttc agaaagattg 2640 ttaagggaca aagtgaaaat ttcttgaaag gcaaactgtt ggaattgtca aagacttgct 2700 taaacataag atgttttggt gatccattgc aattaaggta ttactcagct gaagacacca 2760 atctattgga caaaacacat gatattgacc tcatgatcaa gacgatcaag cacaaatatc 2820 ttttccaagg gtacaggttc ggtcagtggt ttcaagaact ggtgaacatg cccactagag 2880 tggatgagtc gaaattctca aggaagttct ttgcagacat ttcaagtgta aaaactgaag 2940 attacggact catcctagtt gccaagagag aagataaagg tgtcttcgct ggaagagttc 3000 ctgtagcaac agtgagtgaa tctcagggaa tgaccattag caaaagggtg ttgatatgtt 3060 tggaccaaaa tctttttgcc gggggagcca atgcagccat tgttgcaata acaagatcaa 3120 aggtcggctt tgactttatc cttaaaggga attcattgaa agaggtacag aggatggcac 3180 aaaagacaat ttggcagttc atcattgaag ggaagtctat tccgatggag aggatagtga 3240 acatgaatcc tggagccagc ttttatgaga gtcctttgga tgttggaaat tcatcaattc 3300 aagacaaagc ttctaatgac ctgttcataa tgccttttat aaatttggct gaggaagaag 3360 ttgacccaga ggaagttgtt ggggacgtaa ttcaacctgt tgagtggttc aaatgtcatg 3420 tgcctgtctt cgacacagat ccgacgcttg cggagatttt tgataaggtt gcagcaaaag 3480 aaaaaaggga attccagtct gtgctgggtc tttcaaatca atttcttgac atggaaaaga 3540 atggatgcaa aatagacatc ttgccctttg cgcgacaaaa tgtttttcca catcatcaag 3600 cgtctgatga tgttactttc tgggcaggtg ttcaaaaaag aattagaaag tcgaactgga 3660 gaagggagaa atcgaagttt gaggaatttg aaagccaagg gaaagaactt cttcaagaat 3720 tcatctcaat gctaccgttt gaattcaaag tgaatatcaa ggagattgaa gatggagaga 3780 agagcttttt agaaaaaaga aagctaaaat ctgagaaaat gtgggcaaat cattcggaga 3840 gatcagacat tgactggaaa cttgaccacg cctttctctt catgaaatca caatattgca 3900 cgaaggaagg gaagatgttc accgaagcta aagctggcca aactttggcc tgtttccaac 3960 atatagtcct atttagattt ggacccatgt tgagagcaat tgaaagtgcc tttttgagaa 4020 gctgtggaga ctcatactac atacactccg ggaaaaactt cttctgcctg gatagctttg 4080 tgacaaagaa tgcaagtgtc tttgatggat tttcaattga gtcagactac acggcctttg 4140 actcatctca ggaccacgtc atattggcct ttgaaatggc actgttacaa tacctgggcg 4200 tgtcaaagga gtttcagcta gattacctta gactgaaatt aactctcgga tgccgtctcg 4260 gatcactagc aataatgagg ttcacaggag aattttgcac tttcttattc aacacatttg 4320 ccaatatgct gtttactcaa ttgaagtaca agatagaccc aaggaggcat aggattttat 4380 ttgctgggga cgatatgtgt tccttgagct ctctcaaaag aaggagaggg gagagagcga 4440 caagattgat gaagagcttt tccctaactg cagtagaaga ggtgagaaaa ttcccaatgt 4500 tttgtggatg gtacttaagt ccatatggta tcattaaatc tccaaaattg ctgtgggcca 4560 ggatcaagat gatgagtgag agacagcttt tgaaggaatg tgttgataat tacctatttg 4620 aggcgatatt tgcctacaga ttaggtgaga ggctttacac aattttgaaa gaagaggatt 4680 ttgaatacca ttatcttgtc ataagatttt ttgttagaaa ttcaaaattg ttaacagggt 4740 tgagcaaaag cttgatattt gaaattgggg agggcatcgg gtccaaatgg ctatcgtcaa 4800 cgtcaaccgc ttcctcaagg aggtcgaatc tacagacctc aaaattgatg ctatctcgtc 4860 ctcagagctt tacaaggatg caaccttttt caaaccagac gtgcttaatt gcatcaaaag 4920 gtttgaatca aacgtcaagg tttcctctcg atctggtgac ggcctcgtcc tgtctgattt 4980 caaactgctt gatgacaccg aaattgattc aatcaggaag aaaagcaaca agtacaaata 5040 cttacactat ggagtcatcc tggttgggat caaagcaatg ttgccaaact ttagaggcat 5100 ggaagggaga gtcattgtat atgatggagc ctgcctggat ccgaaaagag gccacatttg 5160 ctcgtatctt ttcaagtttg agtctgactg ttgctacttt ggtctcaggc cagagcactg 5220 tttgtctacc acagacgcaa atttggccaa aaggtttaga tttcgtgtgg actttgattg 5280 tccacaatat gaacaggaca ctgagttgtt tgctcttgac attggagttg catacagatg 5340 cgtcaactct gcaaggtttt tggaaaccaa aactggcgat tcaggatggg cttcacaggc 5400 aatcagcggc tgtgaagcac ttaaattcaa tgaggaaatc aagatggcca tcctggatcg 5460 cagatccccg ctgtttctgg aagaaggtgc accaaacgtg cacattgaaa agagattgtt 5520 cagaggtgac aaggttagaa ggtcacgctc aatttccgct aaaagggggc caaactcaag 5580 ggtgcaagaa aagagaggat ttaggtccct ctcggctaga attgaaagat ttggaaaaaa 5640 tgagtttgga agacgtgctt cagcaagcga ggcgccaccg ggtaggagta tatctatgga 5700 agactcacat agacccggca aaggaacttc tgacggttcc tccccctgaa ggatttaagg 5760 aaggtgaaag ctttgagggc aaagagcttt accttcttct ttgcaaccat tactgtaaat 5820 acttgttcgg taatattgct gtctttgggt catctgataa gacccagttt cccgctgttg 5880 gatttgatac acctccggtt cattataatt tgacaacgac cccaaaggaa ggggagactg 5940 acgaaggaag gaaggccaga gcgggttcgt ctggcgaaaa aacaaaaatt tggaggatcg 6000 atttgtcaaa tgttgttcct gaattgaaaa cctttgctgc cacttccagg cagaactctt 6060 tgaacgaatg tacgttcaga aagctttgcg agccatttgc cgatttggct cgagaatttc 6120 tacatgaaag gtggtctaag ggattggcca ccaatattta caagaaatgg cccaaagctt 6180 tcgaaaaaag tccatgggtg gcctttgatt ttgccactgg tctgaaaatg aatcgtctaa 6240 cacctgatga gaaacaggtg attgatagaa tgaccaaaag actttttcgt actgaaggac 6300 aaaaaggggt tttcgaggca ggttcggaaa gtaacctgga actggagggt taggagtcgt 6360 gtgaaattcc gcaaacttgg tcgcggtctt gcaggttgac atgcctgcct ttatacttaa 6420 ttaaagggtt cccccggttt tctgagcatt tccgggttag tgtggttttt ctagagtcta 6480 gagtttgtcc actct 6495 //