ID Z25771; SV 1; linear; genomic RNA; STD; VRL; 6813 BP. XX AC Z25771; XX DT 01-NOV-1993 (Rel. 37, Created) DT 14-NOV-2006 (Rel. 89, Last updated, Version 16) XX DE Human astrovirus type 1 genes for capsid protein and nonstructural protein XX KW capsid protein; nonstructural protein. XX OS Human astrovirus 1 OC Viruses; Riboviria; Astroviridae; Mamastrovirus. XX RN [1] RP 5779-6813 RX DOI; 10.1007/BF01309809. RX PUBMED; 1605739. RA Willcocks M.M., Carter M.J.; RT "The 3' terminal sequence of a human astrovirus"; RL Arch. Virol. 124(3-4):279-289(1992). XX RN [2] RP 4000-6813 RX DOI; 10.1016/0378-1097(93)90133-M. RX PUBMED; 8293952. RA Willcocks M.M., Carter M.J.; RT "Identification and sequence determination of the capsid protein gene of RT human astrovirus serotype 1"; RL FEMS Microbiol. Lett. 114(1):1-7(1993). XX RN [3] RC (sites) RX PUBMED; 8021608. RA Willcocks M.M., Brown T.O.K., Maddey C.R., Carter M.J.; RT "The complete sequence of a Human Astrovirus"; RL J. Gen. Virol. 75:1785-1788(1994). XX RN [4] RP 1-6813 RA Willcocks M.; RT ; RL Submitted (24-AUG-1993) to the INSDC. RL Willcocks M., University of Newcastle upon Tyne, Framlington Place, RL Newcastle upon Tyne, Tyne and Wear, UK. XX RN [5] RP 1-6813 RA Willcocks M.M., Carter M.J.; RT "Sequence analysis of a human Astrovirus"; RL Unpublished. XX DR MD5; 9d8f0f8eb3b92d1f3bd4f8ff2d472e4b. DR EuropePMC; PMC116360; 10954549. DR EuropePMC; PMC120088; 11773106. DR EuropePMC; PMC1234003; 16081927. DR EuropePMC; PMC1828050; 17343744. DR EuropePMC; PMC229263; 14557623. DR EuropePMC; PMC2832772; 20100346. DR EuropePMC; PMC2944169; 20831802. DR EuropePMC; PMC3630178; 23637878. DR EuropePMC; PMC3700315; 23616657. DR EuropePMC; PMC4236132; 25406089. DR EuropePMC; PMC533902; 15564473. DR GOA; P0C6K4. DR InterPro; IPR009003; Peptidase_S1_PA. DR RFAM; RF00164; s2m. DR RFAM; RF01833; astro_FSE. DR UniProtKB/Swiss-Prot; P0C6K4; NS1A_HASV1. XX FH Key Location/Qualifiers FH FT source 1..6813 FT /organism="Human astrovirus 1" FT /mol_type="genomic RNA" FT /db_xref="taxon:12456" FT CDS join(86..2839,2824..4380) FT /standard_name="Non structural gene" FT /product="Non-structural protein" FT /function="2b like nuclear targetting sequence, serine FT protease" FT /note="Non-structural gene composed of two sections 1a and FT 1b separated by ribosomal frame shift motif at appx FT position 2840 leading to expression of RNA-dependent RNA FT polymerase." FT /db_xref="GOA:Q67726" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR022068" FT /db_xref="UniProtKB/Swiss-Prot:Q67726" FT /citation=[3] FT /protein_id="CAA81033.1" FT /translation="MAYGEPYYSSKPDKDFNFGSTMARRQMTPTMVTKLPKFVRNSPQA FT YDWIVRGLIFPTIGKTYFQRVVVITGGLEDGTYGSFAFDGKEWVGIYPIEHLNLMSSLK FT LIHKANALQERLRLSQEEKATLALDVQFLQHENVRLKEMIPKPEPRKIQMKWIIMGAVL FT TFLSLIPGGYAHSQTNNTIFTDMIAACKYSTETLTENLDLRIKLALANITISDKLDAVR FT QILNFAFVPRAHWLRTVFYYIHYYEMWNIFMFVLAIGTVMRSARPGTDLVTLATSHLSG FT FRMAVLPTIPFHTTMTLWVMNTLMVCYYFDNLLAITLAILAPILGIIFLCFMEDSNYVS FT QIRGLIATAVLIAGGHACLTLTGTTTSLFVVILTCRFIRMATVFIGTRFEIRDANGKVV FT ATVPTRIKNVAFDFFQKLKQSGVRVGVNEFVVIKPGALCVIDTPEGKGTGFFSGNDIVT FT AAHVVGNNTFVNVCYEGLMYEAKVRYMPEKDIAFLTCPGDLHPTARLKLSKNPDYSCVT FT VMAYVNEDLVVSTAAAMVHGNTLSYAVRTQDGMSGAPVCDKYGRVLAVHQTNTGYTGGA FT VIIDPADFHPVKAPSQVELLKEEIERLKAQLNSATENATTVVTQQPSAALEQKSVSDSD FT VVDLVRTAMEREMKVLRDEINGILAPFLQKKKGKTKHGRGRVRRNLRKGVKLLTEEEYR FT ELLEKGLDRETFLDLIDRIIGERSGYPDYDDEDYYDEDDDGWGMVGDDVEFDYTEVINF FT DQAKPIPAPRTTKQKICPEPEVESQPLDLSQKKEKQSEYEQQVVKSTKPQQLEHEQQVV FT KPIKPQKSEPQPYSQTYGKAPIWESYDFDWDEDDAKFILPAPHRLTKADEIVLGSKIVK FT LRTIIETAIKTQNYSALPEAVFELDKAAYEAGLEGFLQRVKSKNKAPEKQGPKKLQRAP FT EDQGAQNYHSLDAWKLLLEPPRERRCVPANFPLLGHLPINRPIFDDKKPRDDLLGLLPE FT PTWHAFEEYGPTTWGPQAFIKSFDKFFYAEPIDFFSEYPQLCAFADWATYREFRYLEDT FT RVIHITATEKNTDSTPAYPKMNYFDTEENYLEAHGWAPYIREFTRVYKGDKPEVLWYLF FT LKKEIIKEEKIRNSDIRQIVCADPIYTRIGACLEAHQNALMKQHTDTSVGQCGWSPMEG FT GFKKTMQRLVNKGNKHFIEFDWTRYDGTIPPALFKHIKEIRWNFINKDQREKYRHVHEW FT YVNNLLNRHVLLPSGEVTLQTRGNPSGQFSTTMDNNMVNFWLQAFEFAYFNGPDRDLWK FT TYDTVVYGDDRLSTTPSVPDDYEERVITMYRDIFGMWVKPGKVICRDSIVGLSFCGFTV FT NENLEPVPTSPEKLMASLLKPYKILPDLESLHGKLLCYQLLAAFMAEDHPFKVYVEHCL FT SRTAKQLRDSGLPARLTEEQLHRIWRGGPKKCDG" FT exon 86..2839 FT /number=1 FT exon 2824..4380 FT /number=2 FT CDS 4373..6733 FT /product="capsid precursor protein" FT /function="structural" FT /db_xref="GOA:O12792" FT /db_xref="InterPro:IPR004337" FT /db_xref="InterPro:IPR022027" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/Swiss-Prot:O12792" FT /citation=[2] FT /protein_id="CAA81032.1" FT /translation="MASKSNKQVTVEVSNNGRSRSKSRARSQSRGRDKSVKITVNSRNR FT ARRQPGRDKRQSSQRVRNIVNKQLRKQGVTGPKPAICQRATATLGTVGSNTSGTTEIEA FT CILLNPVLVKDATGSTQFGPVQALGAQYSMWKLKYLNVKLTSMVGASAVNGTVSGVSLN FT PTTTPTSTSWSGLGARKHLDVTVGKNATFKLKPSDLGGPRDGWWLTNTNDNASDTLGPS FT IEIHTLGRTMSSYKNEQFTGGLFLVELASEWCFTGYAANPNLVNLVKSTDNQVSVTFEG FT SAGSPLIMNVPEGSHFARTVLARSTTPTTLARAGERTTSDTVWQVLNTAVSAAELVTPP FT PFNWLVKGGWWFVKLIAGRTRTGSRSFYVYPSYQDALSNKPALCTGSTPGGMRTRNPVT FT TTLQFTQMNQPSLGHGEAPAAFGRSIPAPGEEFKVVLTFGAPMSPNANNKQTWVNKPLD FT APSGHYNVKIAKDVDHYLTMQGFTSIASVDWYTIDFQPSEAPAPIQGLQVLVNSSKKAD FT VYAIKQFVTAQTNNKHQVTSLFLVKVTTGFQVNNYLSYFYRASATGDATTNLLVRGDTY FT TAGISFTQGGWYLLTNTSIVDGAMPPGWVWNNVELKTNTAYHMDKGLVHLIMPLPESTQ FT MCYEMLTSIPRSRASGHGYESDNTEYLDAPDSADQFKEDIETDTDIESTEDEDEADRFD FT IIDTSDEEDENETDRVTLLSTLVNQGMTMTRATRIARRAFPTLSDRIKRGVYMDLLVSG FT ASPGNAWSHACEEARKAAGEINPCTSGSRGHAE" XX SQ Sequence 6813 BP; 2039 A; 1496 C; 1559 G; 1719 T; 0 other; ccaagagggg ggtggtgatt ggcctttggc ttatcagtgt atagataaca tttctctgac 60 cgtttaccac acaattaaca acaatatggc atacggtgag ccatactata gctctaaacc 120 tgacaaagat ttcaattttg gaagcacaat ggcacgtagg cagatgacac ctactatggt 180 aacaaagctt cccaaatttg ttaggaattc tccacaagct tatgattgga tcgtaagagg 240 tctgatcttt cccaccattg gtaaaactta tttccaacga gttgttgtga ttactggtgg 300 gcttgaggat ggaacatatg gctcattcgc atttgacggt aaagagtggg tagggatcta 360 cccaatagag cacttaaatc tcatgtcatc tttgaaactg atacacaaag ccaacgctct 420 tcaggagaga ctgcgtctct cccaagaaga gaaagccacc ctcgctctgg atgtgcaatt 480 ccttcaacat gaaaacgtgc gattgaagga gatgatccca aagccagagc cacggaaaat 540 acagatgaag tggataataa tgggagcagt gcttacattc ttatctctga tacctggggg 600 ttatgcgcac agccagacca acaacaccat atttactgac atgatagctg cctgcaagta 660 ctcaactgag acactaacag aaaatcttga ccttagaatc aagcttgcac tagcaaacat 720 aaccattagt gataagctag atgctgtgag gcaaattctt aactttgcct ttgtgcccag 780 agcccattgg ttgagaactg tgttctatta tatccattac tatgaaatgt ggaatatttt 840 tatgtttgtt cttgctattg gcactgtcat gaggagcgcc cgccctggta cagacttggt 900 tacacttgca acatcccact tgtctggttt taggatggct gtcctaccca caattccatt 960 tcacaccact atgactttgt gggttatgaa cacacttatg gtttgttatt attttgacaa 1020 cttgctagca ataacattgg caatcttagc accaattctt ggcattatct tcttgtgctt 1080 catggaagac tccaactatg tgagccagat acgtggcctt attgctacag cagtattaat 1140 tgctggtggg catgcatgtt tgacactcac aggcacgacc acgtcattgt ttgttgtcat 1200 actaacctgt aggttcatac gtatggcaac tgttttcatt ggcaccaggt tcgagatccg 1260 tgacgctaat ggaaaggttg tggccactgt accaactagg attaaaaatg ttgcatttga 1320 cttttttcag aagctgaagc agtcaggggt gcgagttgga gtcaacgaat tcgttgtcat 1380 aaaaccaggt gcattatgtg ttatagacac ccctgaagga aaagggacag gtttcttttc 1440 tggcaatgac atagtaacag cagcacatgt tgttggcaat aatacttttg tgaatgtgtg 1500 ctatgagggt ttgatgtatg aagcgaaggt gcggtacatg cccgaaaagg atatagcatt 1560 cttaacttgt cctggtgacc tgcatccaac agcaagatta aaattatcaa agaacccaga 1620 ttatagttgt gtcacagtta tggcttatgt gaatgaggat cttgtggttt caaccgcagc 1680 agctatggtg catggcaaca ctctttcata tgcagttcgc acccaggacg ggatgtcggg 1740 tgcaccagtc tgtgacaaat atggtcgggt gttagcagtc catcaaacaa acactgggta 1800 cactggaggt gctgtcataa tagacccagc agacttccat ccagtgaaag ccccatctca 1860 ggtggaattg ctcaaagagg aaatagagcg gctaaaagct caactgaact ctgccactga 1920 gaacgcaacg actgtagtta cacaacaacc tagtgctgca ctagaacaga aaagtgtcag 1980 cgatagtgat gtagttgacc ttgtcagaac tgcaatggaa cgtgagatga aggtgctgcg 2040 tgatgaaatc aatggaatac ttgcaccatt cctacaaaaa aagaaaggta agaccaagca 2100 tggtaggggt agagtcaggc gtaaccttag aaaaggtgtg aaacttctta ccgaggaaga 2160 gtatcgagaa ctcttagaga aaggtcttga tcgtgagaca ttccttgatc tcatagaccg 2220 cattattggt gagaggtctg gctaccctga ctatgatgat gaagattact atgatgaaga 2280 tgatgatggc tggggaatgg ttggtgatga tgtagaattt gattatactg aagtaattaa 2340 ctttgaccaa gcaaaaccaa ttcctgcccc gagaacaacc aagcaaaaaa tttgccccga 2400 accagaagtc gaatcacaac cacttgattt gtcccaaaag aaagaaaaac aatcagaata 2460 tgaacaacaa gtggtgaagt ctaccaagcc tcaacaatta gaacatgaac aacaagtggt 2520 gaagcctatc aagcctcaga agagtgagcc tcaaccatac tcacaaactt acggcaaggc 2580 accaatctgg gaatcttacg attttgactg ggatgaggat gatgccaagt ttattctgcc 2640 agcgccacac cggttgacta aggcagatga aatagtcctt ggatccaaaa tcgtcaagct 2700 tagaacgatt attgaaacag ccataaagac tcagaattat agtgcattac ctgaagcagt 2760 atttgagctc gacaaagcag cttatgaagc aggtttggaa ggttttctcc aaagggttaa 2820 atcgaaaaac aaggccccaa aaaactacaa agggccccag aagaccaagg ggcccaaaac 2880 taccactcat tagatgcatg gaaattgttg ctagagcctc cgcgggagcg aaggtgcgtg 2940 cctgcgaatt ttccactatt aggccattta ccaattaata gacccatctt tgatgataag 3000 aaacccaggg acgatctcct tggcctactt ccagaaccaa cctggcatgc tttcgaggaa 3060 tacggaccaa ccacatgggg cccacaagct tttatcaaat cttttgataa atttttttat 3120 gcagaaccaa ttgacttttt ctcagaatat ccacagttgt gtgctttcgc tgattgggca 3180 acttatcgcg agtttcggta tctagaggat actagagtga tacacataac tgcaactgag 3240 aaaaacactg attcaacacc tgcttatcct aaaatgaatt attttgacac tgaagaaaat 3300 tacctggaag cacatgggtg ggcaccatat attagagaat tcactagggt ctacaaagga 3360 gacaaacctg aagtactttg gtacctattt cttaagaaag agatcattaa ggaggaaaaa 3420 attaggaatt ctgatatccg gcagatagtg tgtgccgacc ccatttacac caggataggg 3480 gcgtgcttag aagcgcatca gaatgcctta atgaaacagc ataccgacac ctcagttggt 3540 caatgtgggt ggtcaccaat ggaaggcggc tttaaaaaaa ccatgcagcg cctggtaaat 3600 aaagggaata aacacttcat tgagttcgac tggacccgct atgatggaac tataccacca 3660 gcactcttta aacacatcaa agaaattagg tggaatttca tcaataaaga ccaacgtgaa 3720 aagtacagac atgtacatga atggtatgtt aacaacctcc ttaatcgcca tgtacttcta 3780 ccatctggtg aagtcacctt gcagacgcga ggtaatccat ctggtcagtt ttcaacaaca 3840 atggataaca acatggttaa cttttggtta caggcttttg agtttgctta ttttaatgga 3900 ccagacagag acctttggaa gacctatgac actgtagttt atggagatga caggctttct 3960 acaacacctt cggtgcccga tgattatgag gagagagtga tcactatgta tagagacatc 4020 tttggcatgt gggttaagcc tgggaaggtt atctgtagag acagcatagt tggattgtcc 4080 ttttgtggct tcactgttaa tgaaaacctt gagcctgtgc caacttctcc tgaaaagtta 4140 atggcatcac tgttaaaacc ttacaaaata ttacctgatc ttgaatcact ccatgggaaa 4200 ctcctatgct atcagttgct tgctgcgttc atggcagagg accacccctt taaggtgtat 4260 gtggagcact gcctctcgcg gactgcaaag cagcttcgtg actctggcct tccagccaga 4320 ctcacagaag agcaactcca tcgcatttgg aggggaggac caaagaagtg tgatggctag 4380 caagtccaac aagcaagtaa ctgttgaggt cagtaataat ggccgcagca ggagtaaatc 4440 aagggcccgt tcacaatcta ggggccgaga caaatcagtc aagatcacag tcaattcaag 4500 aaacagagcc aggagacagc ccggacgcga caaacgtcaa tcttctcaac gtgtccgtaa 4560 cattgtcaat aagcaactca ggaaacaggg tgtcacagga ccaaaacctg caatatgtca 4620 gagagcaaca gcaacccttg ggacagtcgg gtcaaacacc agtggcacca ctgagattga 4680 ggcgtgtatt ctcctcaacc ctgtcctcgt taaggacgct actggaagca ctcagtttgg 4740 ccctgtgcag gcgctaggtg cacagtattc catgtggaag ttgaagtatt tgaatgtcaa 4800 attgacctct atggttggtg catctgctgt taatggtact gtctcagggg tctcacttaa 4860 ccccactacc acgcctactt ctactagttg gtcaggatta ggtgcacgta aacatcttga 4920 tgttacggtt ggtaagaatg caacatttaa attgaaacct tctgaccttg gtggacctag 4980 ggatggctgg tggctcacaa acaccaacga caatgcatct gacaccctgg gaccctccat 5040 tgaaatacac acactcggac gaacaatgtc ctcatacaag aatgagcagt tcacaggagg 5100 gctatttttg gttgaacttg cttcagagtg gtgcttcaca ggttatgctg ccaacccaaa 5160 ccttgttaac ttggtcaagt caacagataa tcaagtatct gtgacatttg aaggaagtgc 5220 aggatcacca ttaataatga atgtgccaga ggggagccat tttgcacgaa cagttcttgc 5280 acgctcaaca acacccacca ctctagcgcg tgcaggagag agaaccacct cagacacagt 5340 atggcaggtg ctcaatacag ctgtatctgc tgctgagctt gtcacgcctc ctccattcaa 5400 ttggttagtt aaaggaggtt ggtggtttgt gaaactcatt gccgggagaa ccagaactgg 5460 ctctcgtagc ttttatgtgt atcccagcta ccaagatgct ttgtccaaca agccagcact 5520 ctgcaccggc agcactccag gaggcatgag gacgcggaat cctgtgacaa ccaccctaca 5580 attcacacag atgaaccaac ctagtctagg gcacggtgaa gcaccagctg cgtttggtag 5640 atccattcca gcacctggtg aggagtttaa agtcgtcctc acgtttggag ccccaatgag 5700 ccctaatgca aataacaaac agacttgggt taataaacct cttgatgcgc cttcaggcca 5760 ttacaatgtg aagattgcaa aggatgttga ccactatcta accatgcagg gtttcacttc 5820 tatagcttct gttgactggt acaccataga ttttcaacca tctgaggccc ctgccccaat 5880 acaaggcctg caggtacttg tgaacagctc gaaaaaagct gatgtgtatg ccatcaaaca 5940 atttgttaca gcgcagacca acaacaagca ccaggttaca agcctgttcc tagtaaaagt 6000 gacaactggt tttcaggtga acaactacct gagttacttt tacagggcgt ctgccactgg 6060 ggatgccaca actaacctgt tggttagagg agacacatac acagcaggga taagttttac 6120 ccagggtgga tggtatttgt tgacaaatac atctattgtt gatggggcta tgccacctgg 6180 ttgggtctgg aataacgtgg aacttaaaac taacacagca tatcacatgg acaaaggctt 6240 ggtccattta ataatgcctt tgcctgagtc cacgcaaatg tgttatgaga tgctgacatc 6300 tataccacgc tccagggcat ctggtcatgg ttatgagagt gacaacaccg aatacttgga 6360 tgctccagac tctgctgacc agtttaaaga agacatagag acagatacag acattgagag 6420 tacagaggac gaagacgaag cggacaggtt tgacatcata gacacttctg atgaagaaga 6480 tgaaaatgag acagaccgtg taaccctcct ctcaactctc gtaaaccaag gaatgacaat 6540 gacgcgtgcc acaaggatag caaggcgcgc attccccacg ctttccgata ggatcaagcg 6600 tggagtatac atggacctgc ttgtctcggg ggcaagccca ggcaatgcat ggtcccatgc 6660 gtgtgaagaa gcacgcaaag cagcagggga aatcaatccc tgcacatctg gaagccgcgg 6720 ccacgccgag taggaacgag ggtacagctt ccttcttttc tgtctctgtt tagattattt 6780 taatcaccat ttaaaattga tttaatcaga agc 6813 //