ID   AY237420; SV 2; linear; mRNA; STD; VRL; 7458 BP.
XX
AC   AY237420;
XX
DT   28-FEB-2004 (Rel. 79, Created)
DT   07-JUN-2005 (Rel. 84, Last updated, Version 6)
XX
DE   Sapovirus Mc10 polyprotein precursor, mRNA, complete cds; and unknown mRNA.
XX
KW   .
XX
OS   Sapovirus Mc10
OC   Viruses; Riboviria; Caliciviridae; Sapovirus.
XX
RN   [1]
RP   1-7458
RX   DOI; 10.1128/JVI.79.12.7283-7290.2005.
RX   PUBMED; 15919882.
RA   Oka T., Katayama K., Ogawa S., Hansman G.S., Kageyama T., Ushijima H.,
RA   Miyamura T., Takeda N.;
RT   "Proteolytic processing of sapovirus ORF1 polyprotein";
RL   J. Virol. 79(12):7283-7290(2005).
XX
RN   [2]
RP   1-7458
RA   Hansman G.S., Maneekarn N., Peerakome S., Khamrin P., Tonusin S.,
RA   Okitsu S., Nishio O., Ushijima H.;
RT   ;
RL   Submitted (17-FEB-2003) to the INSDC.
RL   Developmental Medical Sciences, Tokyo University, 7-3-1 Hongo, Bunkyo-Ku,
RL   Tokyo 113-0033, Japan
XX
RN   [3]
RC   Sequence update by submitter
RP   1-7458
RA   Hansman G.S., Maneekarn N., Peerakome S., Khamrin P., Tonusin S.,
RA   Okitsu S., Nishio O., Ushijima H.;
RT   ;
RL   Submitted (18-MAR-2004) to the INSDC.
RL   Developmental Medical Sciences, Tokyo University, 7-3-1 Hongo, Bunkyo-Ku,
RL   Tokyo 113-0033, Japan
XX
DR   MD5; 9af677c8d976213741691bbd90c0357c.
DR   EuropePMC; PMC1143638; 15919882.
DR   EuropePMC; PMC1317165; 16333083.
DR   EuropePMC; PMC1865854; 17267629.
DR   EuropePMC; PMC1933329; 17459935.
DR   EuropePMC; PMC2600344; 18598655.
DR   EuropePMC; PMC2725984; 17553282.
DR   EuropePMC; PMC2738451; 18044044.
DR   EuropePMC; PMC2744225; 19624925.
DR   EuropePMC; PMC2815582; 19940055.
DR   EuropePMC; PMC2851512; 18258001.
DR   EuropePMC; PMC3291391; 16494732.
DR   EuropePMC; PMC3323275; 15504283.
DR   EuropePMC; PMC3367643; 16485479.
DR   EuropePMC; PMC3433708; 22973264.
DR   EuropePMC; PMC4068436; 24637690.
DR   EuropePMC; PMC4284302; 25567221.
DR   EuropePMC; PMC4290914; 25339401.
DR   EuropePMC; PMC4751832; 26655761.
DR   EuropePMC; PMC4859703; 26937032.
DR   EuropePMC; PMC5035092; 27661997.
DR   EuropePMC; PMC553994; 15727685.
DR EuropePMC; PMC5835752; 29305515. DR EuropePMC; PMC5864949; 29567738. XX CC On Mar 18, 2004 this sequence version replaced gi:33359666. XX FH Key Location/Qualifiers FH FT source 1..7458 FT /organism="Sapovirus Mc10" FT /strain="Mc10" FT /mol_type="mRNA" FT /country="Thailand:Chiang Mai" FT /clone="10" FT /db_xref="taxon:234601" FT CDS 14..6850 FT /codon_start=1 FT /product="polyprotein precursor" FT /note="ORF1" FT /db_xref="GOA:Q6XDK8" FT /db_xref="InterPro:IPR000317" FT /db_xref="InterPro:IPR000605" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR004004" FT /db_xref="InterPro:IPR004005" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR014759" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029053" FT /db_xref="InterPro:IPR033703" FT /db_xref="UniProtKB/Swiss-Prot:Q6XDK8" FT /protein_id="AAQ17058.2" FT /translation="MASKPFYPIEFNPSVELQVLRSAHLRVGGREQMFETINDLNDHVR FT GVVAKLWCKHLHRSLAAAPTFTEEGLLDSFLSKPPVDINPDTTFRELFGINPHEQFPLS FT IHDLAKLQGELVDAARNPGHVLRRHYSTDSLTALINKITKFVPVHATLQEMQARRAFER FT ERAELFRELPHADLDVSRQQKSYFYAMWRQVVKKSKEFFIPLVKCTSWRKKFTEPAEIV FT RQVLVHFCEGMRSQFSTNANYINLSLIAKLRPTVLTMILQQHKNTYRGWLATVTALVEV FT YSNLFQDMRDTAVSAVSAITLVFETIKDFVVNVIDLVKSTFQSQGPTSCGWAAIIAGAL FT LILMKLSGCSNTTSYWHRLLKVCGGVTTIAAAARAVVWVRDIIAEADGKARLKKYMART FT AALLELAASRDVTGTDELKRLLDCFTQLIEEGTELIQEFGTSPLAGLTRSYVSELESTA FT NSIRSTILLDTPRKTPVAIILTGPPGIGKTRLAQHLAAGFGKVSNFSVTLDHHDSYTGN FT EVAIWDEFDVDTQGKFVETMIGVVNTAPYPLNCDRVENKGKVFTSDYIICTSNYPTSVL FT PDNPRAGAFYRRVTTIDVSSPTIEDWKKKNPGKKPPPDLYKNDFTHLRLSVRPFLGYNP FT EGDTLDGVRVKPVLTSVDGLSRLMETKFKEQGNEHRNLWITCPRDLVAPAASGLKAYMA FT ANRALAQVFQEPSSQDIGETCTSRVYVSCNNPPPTYSGRVVKITAINPWDASLANSMLS FT MFETTSHIPASIQREIMYRVWDPLVHLQTREPNTQMLPYINRVVPVSSAFDFIRGLRHH FT LGLCSVKGMWRAYQGWNSSSSILEFLSKHMADVAFPHNPECTVFRAPDGDVIFYTFGSY FT ACFVSPARVPFVGEPPKNVHSNITRNMTWAETLRLLAETITESLVHFGPFLLMMHNVSY FT LATRSGREEEAKGKTKHGRGAKHARRGGVSLSDDEYDEWRDLVRDWRQDMTVGEFVELR FT 
ERYALGMDSEDVQRYRAWLELRAMRMGAGAYQHATIIGRGGVQDTIIRTQPMRAPRAPR FT NQGYDEEAPTPIVTFTSGGDHIGYGCHMGNGVVVTVTHVASASDQVEGQDFAIRKTEGE FT TTWVNTNLGHLPHYQIGDGAPVYYSARLHPVTTLAEGTYETPNITVQGYHLRIINGYPT FT KRGDCGTPYFDSCRRLVGLHAATSTNGETKLAQRVTKTSKVENAFAWKGLPVVRGPDCG FT GMPTGTRYHRSPAWPNPVEGETHAPAPFGSGDERYKFSQVEMLVNGLKPYSEPTPGIPP FT ALLQRAATHTRTYLETIIGTHRSPNLSFSEACSLLEKSTSCGPFVAGQKGDYWDEDKQC FT YTGVLAEHLAKAWDAANRGVAPQNAYKLALKDELRPIEKNAQGKRRLLWGCDAGATLVA FT TAAFKGVATRLQAVAPMTPVSVGINMDSYQVEVLNESLKGGVLYCLDYSKWDSTQHPAV FT TAASLGILERLSEATPITTSAVELLSSPARGHLNDIVFITKSGLPSGMPFTSVINSLNH FT MTYFAAAVLKAYEQHGAPYTGNVFQVETVHTYGDDCLYSVCPATASIFQTVLANLTSFG FT LKPTAADKSETIAPTHTPVFLKRTLTCTPRGVRGLLDITSIKRQFLWIKANRTVDINSP FT PAYDRDARGIQLENALAYASQHGHAVFEEVAELARHTAKAEGLVLTNVNYDQALATYES FT WFIGGTGLVQGSPSEETTKLVFEMEGLGQPQPQGGEKTSPQPVTPQDTIGPTAALLLPT FT QIETPNASAQRLELAMATGAVTSNVPNCIRECFASVTTIPWTTRQAANTFLGAIHLGPR FT INPYTAHLSAMFAGWGGGFQVRVTISGSGLFAGRAVTAILPPGVNPASVQNPGVFPHAF FT IDARTTEPILINLPDIRPVDFHRVDGDDATASVGLWVAQPLINPFQTGPVSTCWLSFET FT RPGPDFDFCLLKAPEQQMDNGISPASLLPRRLGRSRGNRMGGRIVGLVVVAAAEQVNHH FT FDARSTTLGWSTLPVEPIAGDISWYGDAGNKSIRGLVSAQGKGIIFPNIVNHWTDVALS FT SKTSNTTTIPTDTSTLGNLPGASGPLVTFADNGDVNESSAQNAILTAANQNFTSFSPTF FT DAAGIWVWMPWATDRPGASDSNIYISPTWVNGNPSHPIHEKCTNMIGTNFQFGGTGTNN FT IMLWQEQHFTSWPGAAEVYCSQLESTAEIFQNNIVNIPMNQMAVFNVETAGNSFQIAIL FT PNGYCVTNAPVGTHQLLDYETSFKFVGLFPQSTSLQGPHGNSGRAVRFLE" FT mat_peptide 5174..6847 FT /product="capsid protein" FT CDS 6850..7350 FT /codon_start=1 FT /product="unknown" FT /note="ORF2" FT /db_xref="GOA:Q6XDK7" FT /db_xref="InterPro:IPR008437" FT /db_xref="UniProtKB/Swiss-Prot:Q6XDK7" FT /protein_id="AAQ17059.2" FT /translation="MSWFTGASLAAGSLVDMAGTISSIVAQQRQIDLMAEANRIQADWV FT RRQEALQIRGQDISRDLAVNGTAQRVESLVNAGFTPVDARRLAGGTETVSYGLLDRPIL FT QRGILSGITETRHLQAMQGALSAFKNGASYGAPPAPSGFVNPNYQPSPPRLKLGPRPPS FT TNV" XX SQ Sequence 7458 BP; 1855 A; 1980 C; 1858 G; 1765 T; 0 other; gtgattggtt agtatggctt ccaagccatt ctacccaata gagttcaacc cgagtgttga 60 gcttcaagtg ctccgatcgg cccacctcag 
ggtgggtggt cgtgagcaaa tgtttgaaac 120 cattaatgac ctcaatgatc atgtcagggg tgtggtggcc aaactgtggt gcaagcattt 180 gcaccgtagt ttggctgccg cccccacatt cacggaggag ggcttgttag actctttcct 240 ttcaaaacca ccggttgaca tcaatcctga cacaacgttc cgtgagctgt ttggtattaa 300 tccccacgag cagttcccgc tgtccattca tgatttggca aaattacagg gtgagcttgt 360 ggatgcggca cgcaacccag gccatgtgtt gcggcgtcat tattccaccg attcgctcac 420 cgccctaatt aacaaaatca cgaaatttgt ccctgtgcat gccacacttc aagaaatgca 480 agcacgccgt gctttcgagc gagagcgcgc ggagctgttt agggaactgc cacatgctga 540 tttggatgta agtcgccaac aaaagtcgta cttttatgcc atgtggcgtc aggtggttaa 600 gaagagcaaa gagtttttca tccccctggt caaatgtaca tcttggcgga agaagtttac 660 agagcctgcg gaaattgtta gacaggttct ggtccacttt tgtgaaggga tgaggtcgca 720 gttttccacc aatgcaaatt acatcaattt gtccctcatt gccaaactcc ggccaacagt 780 cctcacaatg attctccagc aacacaagaa cacctacaga gggtggttgg caacagtcac 840 tgctttggtt gaagtgtact ccaacctgtt tcaagacatg cgggacaccg ctgtctcagc 900 agtgtcagcc attacactgg tgtttgaaac cattaaggac tttgtagtca atgtgataga 960 ccttgttaag agcacgttcc agtcacaagg cccaacatct tgcggctggg ctgctatcat 1020 tgctggtgca ctgctcatct taatgaaatt gtcagggtgc tccaacacca caagttattg 1080 gcaccgactc ctcaaggtgt gtgggggtgt cactaccatt gctgcggcgg cccgtgctgt 1140 cgtgtgggtg cgagacatca tagcagaagc tgatggcaag gctagactga aaaagtacat 1200 ggcccgcaca gcagctctac ttgagcttgc agcatctcga gacgtgacgg gcactgatga 1260 actcaagcgc ctattagatt gtttcacaca gctcatcgag gagggtactg agttgataca 1320 ggaatttggt acatcaccac ttgctggtct gactaggtca tatgtgagtg agcttgagtc 1380 aactgcaaac agtatcagga gcaccatcct cctagacaca ccccgaaaga ctccagttgc 1440 aatcatcctc actggtcctc ctggtatagg caaaacaagg cttgcacagc accttgctgc 1500 aggttttggc aaagtgtcaa acttttccgt cacgttggac caccacgact cttacaccgg 1560 aaatgaagtc gcaatttggg atgagtttga cgttgacaca cagggtaaat ttgtggagac 1620 catgattggt gtagttaata ccgcccccta cccactcaat tgcgaccgag tggagaacaa 1680 aggcaaagtg ttcacatctg attatatcat atgcaccagc aattacccaa cctctgtgtt 1740 acctgacaac ccacgagcgg gggctttcta tcgccgagtc acaacgatag 
atgtgtcatc 1800 tcctaccatt gaagattgga agaagaagaa cccagggaag aaacccccac ccgacttgta 1860 caagaacgat ttcacacacc ttcgcctatc tgttagaccg ttcttggggt acaaccccga 1920 gggggacacc ttggatggtg tccgagtgaa acctgtgctc actagtgtgg atggtctgtc 1980 acgcttgatg gaaaccaagt ttaaggagca gggcaatgaa catcggaacc tgtggataac 2040 atgcccgcgt gacctggtgg cccccgccgc atctggttta aaagcataca tggccgccaa 2100 ccgagcgctc gcacaagtgt tccaggaacc atcttcacag gacattggtg aaacctgcac 2160 gtcccgtgtg tacgtgtcat gtaacaatcc acctcccaca tacagtggac gggtggtgaa 2220 aatcacagcc atcaatccat gggatgcatc actcgccaat tccatgttat caatgtttga 2280 aaccaccagt cacatccctg cctcgattca gcgtgaaatc atgtatagag tttgggaccc 2340 actggttcac ctgcagacac gtgagccaaa tacgcagatg ctcccctaca tcaacagagt 2400 ggtcccggtg tcttctgcat ttgacttcat ccgaggcctc aggcaccatc ttggtctgtg 2460 ttcagtcaaa ggcatgtgga gagcttatca gggttggaac agttccagct cgatcttgga 2520 attcctgtca aagcacatgg ctgatgttgc tttcccacac aaccccgagt gcaccgtttt 2580 ccgggccccg gacggtgatg tgatctttta cacgttcggg tcatatgctt gctttgtgtc 2640 cccagcccgt gtcccatttg ttggagagcc cccgaagaac gtgcattcaa atataacacg 2700 caacatgacg tgggctgaga cactccgcct gttggcagaa actataactg aaagtctggt 2760 gcactttggc cccttcctac tcatgatgca caatgtttca tacctcgcca cccggtctgg 2820 tcgggaggag gaggccaaag gaaagaccaa gcatggccgt ggtgccaaac acgctaggag 2880 gggaggtgtc agcttgtctg atgatgagta tgatgagtgg cgtgacctgg tacgggactg 2940 gcggcaagac atgactgttg gggagtttgt ggagcttcgt gagcgatacg cgctcggaat 3000 ggactctgag gatgtgcaac gttatcgtgc ttggcttgag ctacgagcga tgcgcatggg 3060 tgcaggtgcc taccaacatg ccaccattat tggtagggga ggagtacaag acaccatcat 3120 ccgcacccaa ccaatgcgtg ctccacgtgc gccccgtaat caaggttatg atgaagaagc 3180 tcccacacca attgttacat tcacatctgg gggtgatcac attgggtatg gttgtcacat 3240 gggtaatggg gtggttgtca cagttacaca cgtggcctct gcgtctgacc aagtagaagg 3300 gcaggacttc gcaatcagga agaccgaggg tgaaaccacc tgggtgaaca ccaaccttgg 3360 tcacttgccc cactaccaga tcggtgatgg cgcccctgtc tactactcgg cgcgcctaca 3420 ccctgtcacc acgcttgcgg aggggacgta tgagacaccc aatatcacgg tccaggggta 
3480 tcacctgcgc atcataaatg gatacccaac aaagcgtggg gactgtggca caccctattt 3540 tgactcatgc cgtcgtttgg tcggactgca cgcagccaca tcaacaaatg gagaaaccaa 3600 gcttgctcag cgagtgacta aaacatccaa ggtggagaat gcttttgctt ggaagggtct 3660 accagtggtt cgaggccccg actgtggcgg catgcccacg gggactcgtt accaccgctc 3720 acctgcatgg cccaaccctg tggaaggaga aacacacgcc cctgcgccgt ttggttccgg 3780 tgatgagcgg tacaaatttt cccaggtgga gatgttggtc aacggcttaa agccttactc 3840 agagcccacc cctggcatac cccccgcttt gttacaacgt gcagccacac acacacgcac 3900 gtatctggaa acaataattg gcacccaccg atcaccaaat ttgtcattca gtgaggcatg 3960 ttcactcttg gaaaaatcaa catcgtgtgg tccgttcgtg gctggccaaa agggggacta 4020 ctgggacgag gacaaacagt gttacacagg tgtgttggca gaacatcttg ccaaagcatg 4080 ggatgcagcc aacaggggcg ttgcacccca aaacgcctac aaattggctt tgaaagatga 4140 actgagacca attgaaaaga atgcacaagg aaaaagacgc ctcctgtggg gttgtgatgc 4200 gggtgccaca ttggtggcta ctgcggcctt caagggtgtt gccacccgcc tccaagcagt 4260 tgctccaatg acaccagtta gcgttggcat aaacatggac agttaccagg ttgaggtgct 4320 gaatgagtca ctcaagggtg gggtgcttta ctgtctcgat tatagcaagt gggattcaac 4380 acagcaccct gccgtcacgg ccgcctcact tgggattttg gagagattgt ctgaagccac 4440 tcccattaca acgtcagctg tcgagttgct atcctcccct gctagaggcc atttaaacga 4500 cattgtattt atcacaaaat ctggtctccc atctggcatg ccgtttacca gtgtcatcaa 4560 ctcactcaac cacatgactt actttgcagc tgcagtgctt aaggcgtatg aacaacacgg 4620 agcaccatac acaggtaacg tgtttcaggt tgaaactgtt cacacctacg gggatgactg 4680 tttatactca gtgtgccctg caacagcctc cattttccag acagttctag ccaacttgac 4740 ctcgttcggt ctcaaaccaa cagctgcaga taagagtgag acgatagccc cgacccacac 4800 tcctgtcttc cttaagagaa ctctaacatg cacaccacgt ggtgtgcgtg gcctattaga 4860 catcacatcc ataaagaggc aattcttgtg gatcaaggct aacaggacag ttgacatcaa 4920 ttcaccacca gcatacgatc gcgacgcgcg tggcatccag ttagaaaacg ccctcgcgta 4980 cgcatcgcag catggccatg cagtttttga ggaagttgct gaattggctc gacacacagc 5040 caaggctgag ggactggtgc taaccaatgt caactatgac caggctctcg ccacctacga 5100 atcttggttc ataggtggta caggcctggt acaaggtagc cccagtgaag agaccaccaa 5160 
attagtgttt gaaatggagg gcctaggcca accacagcca cagggtggcg aaaagaccag 5220 cccacagcct gtgacaccac aggacaccat tggccctaca gcggccctct tacttccaac 5280 tcagattgaa acaccaaacg caagtgcaca gcgcttggag ttggccatgg ccacaggggc 5340 agtcacgagt aatgtaccaa actgtattcg tgagtgtttt gcctctgtta ccacaatccc 5400 ctggacaact cgacaggccg ctaacacttt ccttggggct atccaccttg gcccacgcat 5460 caacccatac accgctcacc tgagcgcaat gtttgctggg tggggtggtg gttttcaagt 5520 gagagtgact atatctggtt ctggcctgtt tgctggtcgg gcggtaactg ccatcttgcc 5580 acccggagtg aaccccgcga gtgtccaaaa ccctggggtt ttccctcatg ccttcattga 5640 tgcacgtacc actgagccaa tcttgattaa tctgccagac attcgtcctg ttgacttcca 5700 ccgtgtagac ggagacgatg ccacggcatc tgttggactg tgggtagctc aacccctcat 5760 caacccattt cagacaggcc ctgtgtccac ttgttggttg agttttgaaa caagacccgg 5820 ccctgacttt gacttttgtc tgttgaaggc cccagagcag cagatggaca atggaatttc 5880 gcccgcctct ttgttgcccc gtaggctcgg acgttcccga ggcaacagaa tgggtgggcg 5940 aattgtggga ctggttgtgg tggcggctgc ggaacaggtg aaccatcact ttgatgcccg 6000 gtcaacaaca ttagggtggt ccacattgcc tgttgaacct attgcagggg atatatcctg 6060 gtatggtgat gctgggaaca agtcaatccg agggcttgtt agtgctcagg gcaaaggtat 6120 aatatttcca aacatagtca accactggac tgacgttgca ctgtcctcca agacatctaa 6180 caccacaacc ataccaactg acacatctac tcttggcaat ttaccaggtg cctctggacc 6240 acttgtcact tttgctgaca atggggatgt taatgagagt tccgcccaaa atgccatatt 6300 gacagctgca aatcagaact tcacatcatt ctctccaact tttgatgcgg cagggatatg 6360 ggtgtggatg ccttgggcca cggatcgtcc aggtgcgtca gacagcaaca tctacattag 6420 ccccacctgg gtaaatggca atccctccca cccaatccat gaaaaatgca ctaacatgat 6480 tggcacaaac tttcagtttg gagggaccgg caccaacaac atcatgttgt ggcaggaaca 6540 gcacttcaca tcctggcccg gtgcagcaga ggtgtactgc tcgcaactgg aaagcactgc 6600 cgagattttc cagaacaaca ttgttaacat cccaatgaac caaatggcag tgtttaatgt 6660 tgagactgca ggtaattcat tccaaatagc catcttgccc aatggttatt gtgtcaccaa 6720 cgcaccagtg ggaacacacc aacttcttga ctatgagact agcttcaaat ttgtaggact 6780 cttcccccaa agcacttcac ttcaggggcc ccatgggaac agtggccggg ccgttaggtt 6840 cttagaataa 
tgtcttggtt tactggagca tctctggctg ccggttcact cgtggacatg 6900 gcaggcacca tttcatcaat tgtggcacaa caaagacaaa ttgatctgat ggcagaagca 6960 aatagaatcc aggcagattg ggtgcgccgt caagaggcac tacaaatccg tggccaggac 7020 atctcacggg atcttgctgt taacggcact gcccagcgtg ttgagtcttt agtcaatgca 7080 ggcttcacac ccgtggacgc acgtcggctg gccggcggaa cggaaacggt gagttacggc 7140 ctactggatc gccctatcct acaacggggc atcctttctg gcatcactga gacacgacac 7200 ctccaggcca tgcagggcgc tctaagtgca ttcaaaaatg gtgcctctta cggagccccg 7260 ccagccccat caggctttgt gaatccaaat tatcaacctt cacctccgag attgaaacta 7320 ggccctaggc cccctagcac caatgtttga aatcctatct cttatacaaa ttttctatct 7380 tttcttttct ttctacacgg tacctcacgc gttcgggtgg tcaaatgcaa ttaagcgatt 7440 gcagccgtgc tttcttgg 7458 //