ID GQ502193; SV 2; linear; genomic RNA; STD; VRL; 6531 BP. XX AC GQ502193; XX DT 19-OCT-2009 (Rel. 102, Created) DT 27-SEP-2012 (Rel. 114, Last updated, Version 3) XX DE Astrovirus VA2 isolate VA2/human/Stl/WD0680/2009, complete genome. XX KW . XX OS Astrovirus VA2 OC Viruses; Riboviria; Astroviridae; Mamastrovirus; OC unclassified Mamastrovirus. XX RN [1] RC Publication Status: Online-Only RP 1-6531 RX PUBMED; 19814825. RA Finkbeiner S.R., Holtz L.R., Jiang Y., Rajendran P., Franz C.J., Zhao G., RA Kang G., Wang D.; RT "Human stool contains a previously unrecognized diversity of novel RT astroviruses"; RL Virol J 6:161-161(2009). XX RN [2] RP 1-6531 RA Finkbeiner S.R., Holtz L.R., Jiang Y., Rajendran P., Franz C.J., Zhao G., RA Kang G., Wang D.; RT ; RL Submitted (20-AUG-2009) to the INSDC. RL Molecular Microbiology and Immunology, Washington University, Campus Box RL 8230, 660 S. Euclid Ave, Saint Louis, MO 63110, USA XX DR MD5; e3eb1007a292a76ffa91d63eba44ad6e. DR EuropePMC; PMC2765957; 19814825. DR EuropePMC; PMC3545029; 23084422. DR EuropePMC; PMC4047031; 24775733. DR EuropePMC; PMC4465002; 25975198. DR EuropePMC; PMC4495221; 25979884. DR EuropePMC; PMC4537245; 26274322. DR EuropePMC; PMC5625321; 28928418. XX CC On Sep 25, 2012 this sequence version replaced gi:261291335. XX FH Key Location/Qualifiers FH FT source 1..6531 FT /organism="Astrovirus VA2" FT /host="Homo sapiens" FT /isolate="VA2/human/Stl/WD0680/2009" FT /mol_type="genomic RNA" FT /country="USA" FT /isolation_source="stool" FT /collection_date="2009" FT /db_xref="taxon:683173" FT 5'UTR 1..42 FT gene 43..2706 FT /gene="ORF1a" FT CDS 43..2706 FT /codon_start=1 FT /gene="ORF1a" FT /product="putative serine protease" FT /db_xref="GOA:D0EHK1" FT /db_xref="InterPro:IPR009003" FT /db_xref="UniProtKB/TrEMBL:D0EHK1" FT /protein_id="ACX83590.2" FT /translation="MERSYKPGGLNIYDPYDRVLQHGSKSARIKGIQLDKVSGNKLADI FT FSDGGPLCFGYGVLEMVKVDTGSVQPVTTPVKTVYVSGVLAGNEYVTYCFKPGVNQWVE FT VDPDIHQPTALVGTLYQEYKKLVQSERELKETKSGLQLEISLLRHELERSRPTVRTLRP FT YKMANIVFFGLLLGLLMAHAVTGFRTGQCLDPDVSETLKPQTCINWKWDGGIAPDTEVP FT LVERAYAWYTGVKQQMKEYYNDSVIVDWSVYVFKMLCSWTSVAVTIGVFYMIKSENPMY FT MMLTLMLATLSRIQLMAVAAIPGMEITSAFSLWCCMIVYYFNQAAAMASAIIIASVCYV FT SCMFMSDIEYVQVLRGHAVVVFTIICSHIFHILQIPSWVTVLIMVLYRVVRLTSVVIGE FT KIEVRNLDGKVVNTITTQTSWLNKVSKFVQSKFKQNVRVGVSSTARVVPNGVLVVEAKD FT NIGTGFRVQNYVVTAAHVVGSETQVRLKWGDVSAFAKVVYIHPSKDVAYLSLPPEMQNL FT PTYKFAKAVADGTIVITSLEDCGVLAVAITEGVVVSNNMTYAVCTKNGMSGSPVTNVDG FT RVIGVHQTNTGFTGGAVIIRQEDLPPQKKPQRELELEEKVKQLEEALAGKMNQKFSEDQ FT VIELIRMAVGREIGILRHELSMNQAKGKNKGKRRGNVKRKRRRMWTEEEYKELLEKGFT FT RQQLRDMAETLREAEYTDTESEEYESGYPAWSDPEDSDEIDREWFGPKKKILDEVDSGW FT SKGDFWEQCQKVWKETEPMSEEQVNTLPSHLHEKYGLTCYVITKADMEALAKDLLKYQS FT LVEDKIKNNVVRGQWIDGVDPKVIINELDELWLGINHIMWENGLVPFTQRRKINKRKNQ FT KNLKGGLKVSPQQKNN" FT gene <2643..4229 FT /gene="ORF1b" FT CDS <2643..4229 FT /codon_start=1 FT /gene="ORF1b" FT /product="RNA-dependent RNA polymerase" FT /note="RNAP; translated by ribosomal frameshift" FT /db_xref="GOA:D0EHK2" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR007094" FT /db_xref="UniProtKB/TrEMBL:D0EHK2" FT /protein_id="ACX69838.1" FT /translation="KKESKKLEGGAQGEPPTKEQLTLSYWENMLEPGDYYLTPPTYPLL FT GVLPINRPICDYDEPIDDLLNLLPSYDDDMSYGPTVWGPEAYVKSFEKFTYKEPMSNIK FT DKYKREWNFAMRVLRREFDYLIDSVMTDITATSKNSDSTPSYPKCLWWKTEAEYLKERG FT YQDYITQLESIKKGERPKVLWYLFLKKEILKLSKIEDGDIRQIVCSDPIFARIGCVFEE FT HQNNLMKMRTKTRMGQCGWSPFCGGFNDHVKRLVDKGNNLFVEFDWTRYDGTIPNEVFM FT AIKQFRYSCLAEEFKTEENLSIYKWYCESILDRYVMLPSGEVTKQVRGNPSGQVSTTMD FT NNLCNVFFQAFEYAYMHPDKDISQLMHDWERVDSLIYGDDRLSTYPELPGDYVDRVVDM FT YATVFGMWVKPEKVKIPNSIIGLTFCGFTVTESNGVYVPVPTETEKLMAGLVRPTKRLP FT DILSLYGKLLCYRILSHNLPDDHKFKNYILVALEVVARHIRASGGEEPYYITDGMLDRL FT WRGGPKRGDGW" FT CDS 4219..6414 FT /codon_start=1 FT /product="capsid protein" FT /db_xref="GOA:D0EHK3" FT /db_xref="InterPro:IPR004337" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:D0EHK3" FT /protein_id="ACX83591.2" FT /translation="MAGKQPQHAATKAAIKAVAKEVVKEEKKAFKPRQQNKSTRNKYRN FT KWQTKHQVKNEIKTELKKKGLEGPKTKFTVKVSATIGKIGPNVNSGPELQISTFMHPAL FT MKEPNDGTNFGPLQAAAAQWGLWRLSDLKIQFTPLVGSSAVTGSVYRTSLNLTQSPGST FT SWGGLGARKHLDIPVGVSRTWHLKKGDLAGPRETWWLTDTNEEGGQSCGPMLEVHGLGK FT TTSTYKDEAWRGDLFIVEVTGVWQFTNYNAKPALGTLNRVVEDTTATIEVGTDGIMKMS FT IPSTGQLARHMSEQNERASNAANTIGETIWQVVDQGAGLVASVAPEPFGWLIKGGWWFV FT KKLIGRANTTTEQYYVYASLADAQNNKPVEANTFAASSHATTLAVTQINAPNTGPSTAS FT AVVQSRMFPLPQAGIPDGWFWLSGQFEALHTVGLNGTNGSTLPCGMITSFDKWEFKLKK FT GTVEWGCAMQGMAAASNTVTCWSVDRENQLLGWYDVSGIASTGIVVDWVTGPNGGTVHL FT DWADVLAWRTDLWGTLRMTYWLCRTRRAVQGSDFDTTQKSPSPYIARAWPETISNVRLD FT VREVLMVNTTTNATDSNSRVQYIQRVPAGAIVILWCLGNHTFDTSTGQGSIVQKSSFGG FT FGQLPSKSSTTGLWSRALNSVGPSTDWITVSFDSHPPVCDDVVSRLMAEIADRYDLEPR FT RKKPKDATKELTRLKIYEALREADWEHLPAEEVSSVL" FT 3'UTR 6415..6531 XX SQ Sequence 6531 BP; 1962 A; 1212 C; 1640 G; 1717 T; 0 other; ccaagtaggg tttggtgcct ttgggcatct ggtgggttag ctatggagcg ttcctataaa 60 ccagggggtc ttaatatcta tgacccttac gaccgtgttt tgcaacacgg aagtaagagt 120 gccagaataa agggtattca actggacaaa gtttcaggga ataagttggc agacatattc 180 tcagatggtg gcccattgtg ctttggttat ggagttctgg aaatggttaa ggttgacact 240 ggttcagtgc aaccagtgac tacaccagtg aaaaccgtgt acgtctcagg cgtccttgcg 300 ggcaatgagt acgttaccta ctgcttcaaa cctggtgtta accaatgggt tgaagtggat 360 ccagatatac atcaaccaac ggctttggtg ggtacattgt atcaagaata caaaaaactg 420 gtccagtctg aaagggaatt gaaagaaact aagtcaggac tacaattaga aatatccctt 480 ttgaggcacg aattggaacg ctccagaccc actgtcagaa ccctacgccc atacaaaatg 540 gctaacatag tgttttttgg attactactg ggattgttaa tggcacatgc agttacaggc 600 tttagaacag gacaatgcct tgacccagat gtaagtgaaa ctctgaagcc tcagacctgt 660 attaattgga aatgggatgg tggtatagct ccagacaccg aggtaccact agttgaaagg 720 gcatatgcat ggtatacagg agttaaacag cagatgaagg aatactataa tgacagcgtt 780 atagtagatt ggtctgtgta tgtgttcaag atgttatgtt catggacatc agttgcagtg 840 actatagggg ttttctacat gattaagtca gaaaatccca tgtatatgat gttaacattg 900 atgctggcta cactatcaag gatacagcta atggcagtgg ctgccatacc tggaatggaa 960 ataacatcag cattttcatt gtggtgctgt atgatagtgt actactttaa tcaagcagct 1020 gcgatggcat ctgcaattat tatagcttca gtgtgttatg tctcttgcat gttcatgagt 1080 gacattgaat atgtacaagt tttgagaggc catgccgtgg ttgtttttac cataatatgc 1140 tcccacattt tccatatttt acaaatacca tcttgggtta ctgttttaat aatggtttta 1200 tacagggtgg tgaggttaac tagtgttgtg attggagaga aaattgaggt taggaacctc 1260 gacggtaagg tggttaatac aatcacaact cagacatcat ggcttaataa agtttccaaa 1320 tttgtgcaaa gtaaattcaa gcaaaatgtt agggtgggtg tttcatcaac tgctcgagtt 1380 gtaccaaatg gggtattagt tgtggaggcc aaggataaca tcgggactgg ctttagagtg 1440 caaaactatg ttgttacagc agcacatgtt gttggtagtg aaacccaggt tcgcctaaaa 1500 tggggtgatg ttagtgcatt tgctaaggtt gtttacatac atcctagtaa ggatgtggca 1560 tatttaagtt tgccacctga aatgcagaac ttaccaacat acaagtttgc aaaagccgtt 1620 gctgatggga ccatagtcat tacctcatta gaggattgtg gggttttggc tgttgccatc 1680 acagaaggtg ttgtcgtctc taacaacatg acatatgctg tgtgtaccaa aaatggtatg 1740 agtgggtcac cagttaccaa tgttgatggg cgcgtcattg gtgttcatca gacaaacact 1800 ggtttcactg gtggcgcagt cattatacgc caggaagacc tcccacccca aaagaagcca 1860 caacgtgagt tggagctgga ggagaaggtt aagcaacttg aggaagccct agcaggaaaa 1920 atgaatcaaa aattcagtga agaccaggtc atcgaattga taaggatggc tgttggtaga 1980 gaaattggaa tattacgcca cgaattgtct atgaatcaag ccaaaggtaa aaataagggt 2040 aagaggcgtg gcaatgtcaa acgtaagagg aggaggatgt ggactgaaga agagtacaaa 2100 gagctcctag aaaaaggctt taccagacag cagctacggg atatggctga aacgctgcgt 2160 gaggcagaat atactgacac tgagagtgag gaatatgagt caggttatcc tgcttggtct 2220 gatccagaag attctgatga gatagaccga gaatggtttg ggcccaaaaa gaagatactt 2280 gatgaggttg actctggatg gtccaaagga gacttctggg agcagtgtca aaaagtgtgg 2340 aaggaaactg aaccgatgag tgaggagcaa gtgaacactc ttccatcaca cctgcatgaa 2400 aaatacggtt tgacatgtta cgtcattaca aaagcagaca tggaggcatt ggctaaagac 2460 ttgctaaagt atcaatcatt ggttgaagat aaaataaaga acaatgtggt caggggccaa 2520 tggattgatg gtgtggatcc caaggtgatc ataaatgaat tagatgaatt gtggcttggt 2580 atcaatcaca taatgtggga gaatgggctt gtcccattta cccaacgccg taagatcaat 2640 aaaagaaaga atcaaaaaaa cttgaagggg gggctcaagg tgagccccca acaaaagaac 2700 aattaactct ttcttattgg gaaaatatgt tggagccagg ggattattac cttacccctc 2760 caacataccc tttattgggt gttttaccaa tcaaccgacc aatatgtgat tatgatgagc 2820 caattgatga tttgctaaat ttgttgccaa gttatgatga tgatatgtca tatggtccaa 2880 cagtttgggg ccctgaagcc tatgttaagt catttgaaaa gtttacatat aaagaaccaa 2940 tgagtaacat caaggacaaa tataagagag agtggaactt tgccatgaga gtcttgagga 3000 gggagtttga ttacctaatt gatagtgtga tgacagacat cacagcaaca tctaagaatt 3060 cagactccac accctcctac ccaaaatgcc tgtggtggaa aacagaggct gaatacttga 3120 aagagagggg ctaccaagat tatataacac agttggaatc cataaagaaa ggtgaaaggc 3180 ccaaagttct ctggtatttg tttctcaaga aggagatcct aaagctcagc aaaattgagg 3240 atggggatat acgacaaatc gtgtgttcag atccgatatt tgcgcgcatt gggtgtgttt 3300 ttgaggaaca tcagaataac ctaatgaaaa tgaggacaaa gacccgcatg gggcaatgtg 3360 gctggtcccc attctgtggt ggcttcaatg atcatgtcaa gaggctggtg gataaaggta 3420 acaatttgtt tgtcgagttt gattggacac gatatgatgg tacaatacca aatgaagtgt 3480 ttatggcaat taaacaattt aggtactcat gtttagctga ggagtttaaa acagaggaga 3540 atttgagtat ctataaatgg tattgtgaga gtatactaga taggtacgtt atgttacctt 3600 ctggtgaagt caccaaacag gttaggggaa acccttcagg tcaagtgtca acaactatgg 3660 acaataattt gtgtaatgtc ttcttccaag cgtttgagta cgcgtacatg caccctgata 3720 aggatataag tcaacttatg catgattggg agagggttga ttcacttata tatggtgatg 3780 atagattgtc tacttatcct gagttgcctg gagattatgt agacagagtt gttgatatgt 3840 atgccacagt gtttggaatg tgggtcaaac ctgaaaaagt gaaaatccca aatagtataa 3900 taggccttac attttgtggt ttcacagtga cagaatcaaa tggggtatat gtcccagtcc 3960 ccacagaaac agagaaactc atggctggac tagtgcgacc aactaaaaga ttacctgaca 4020 ttttatcgct gtatgggaaa ctcctttgct accgcatact aagtcacaat ctgccagatg 4080 accacaaatt taaaaattac atcttggtgg ccttagaagt tgtggctagg cacatccgtg 4140 ctagtggagg tgaagagcct tactatatta cggatggtat gctggataga ctttggaggg 4200 gcggaccaaa gcgtggagat ggctggtaag cagccccagc atgctgcaac taaggctgcc 4260 attaaggcag ttgctaaaga ggttgtcaag gaggagaaga aggcattcaa acccagacag 4320 caaaacaaaa gcactaggaa taagtataga aataaatggc aaactaaaca tcaagttaag 4380 aatgaaatta agactgaact taagaagaaa ggacttgagg gacccaagac aaaatttaca 4440 gttaaagtct cagcaacaat tggaaaaatt ggaccaaatg taaattcagg gcctgaatta 4500 caaatttcaa catttatgca tcctgctttg atgaaagagc caaatgatgg caccaacttt 4560 ggcccattgc aagctgcagc tgctcagtgg ggtctgtgga gactatcaga cctcaagata 4620 cagtttacac cactcgtggg ttcatcggct gtaactggct cagtgtaccg cacatctttg 4680 aacctaacgc aatcccctgg gtctacatct tggggtggtc taggtgcccg aaaacacctg 4740 gacattccag ttggggtgtc acggacatgg cacctaaaga aaggtgatct tgctgggccc 4800 agggagacat ggtggttaac agataccaat gaagagggag ggcaaagctg cggtcctatg 4860 cttgaagtcc atggcctcgg caaaacaacc tcaacttata aggatgaagc ctggaggggt 4920 gatctcttta tagttgaagt tacaggagtg tggcagttca caaactacaa tgccaaacct 4980 gctttgggaa cactgaacag ggtggttgag gacaccacgg cgactattga ggtcggcaca 5040 gatggcataa tgaagatgtc aataccatca actgggcagc tagcacgcca catgtcagag 5100 caaaatgaaa gggcatcaaa tgcagcgaac acaattggtg agaccatatg gcaggtcgtt 5160 gaccaaggag ctggtcttgt tgccagtgtt gcacctgagc ctttcggctg gctcatcaag 5220 ggtggttggt ggtttgttaa gaaactgatt ggcagggcca ataccactac tgaacagtat 5280 tatgtttatg cttcactagc agatgcacaa aacaacaaac cagttgaagc aaatactttc 5340 gctgcttcaa gccatgctac aacactggct gttactcaaa taaatgctcc taacactggg 5400 ccaagcacag cctcagcggt tgtccaatcc agaatgtttc ccctaccaca agcggggatt 5460 cctgatggtt ggttttggct cagtggacaa tttgaagctt tgcacacggt tggtttgaat 5520 ggtaccaacg ggtctactct tccatgtggt atgatcacat catttgacaa atgggagttc 5580 aaactaaaga aaggcactgt tgagtgggga tgtgctatgc aaggaatggc agcagcaagc 5640 aatactgtta cttgctggtc agttgatcgt gaaaatcagc ttttgggttg gtacgatgtc 5700 agcggcattg cttcaactgg cattgtcgtt gattgggtaa ctggcccaaa tggtggcacc 5760 gtgcatttgg actgggctga tgtccttgct tggaggacag atttgtgggg gacactccgc 5820 atgacatact ggttgtgtcg aactagaaga gccgtgcaag gctctgactt tgatacgaca 5880 cagaagagcc catcgcctta catcgcgcgg gcttggcctg agacaataag caatgtcagg 5940 cttgatgttc gggaggtttt gatggttaat acaaccacca atgccactga ttcaaatagc 6000 cgagtccaat atatccaaag agtgccagct ggcgccatag tcatactctg gtgcctgggg 6060 aatcacacat ttgacacatc aactggccaa ggttccattg tccaaaagtc ctcatttggt 6120 ggatttggac agctgcctag taagtcatct actactggtt tgtggtctcg tgcacttaat 6180 tcagttggtc catctaccga ttggattact gtgtcttttg actcacaccc accagtttgt 6240 gatgacgttg tttcaaggct gatggctgaa attgctgaca ggtatgacct tgagccccgc 6300 agaaagaagc ctaaggatgc aactaaagag ctaaccaggc tcaaaatcta tgaagcactt 6360 agagaagctg attgggaaca cctacctgca gaggaggtga gctcagtgct ttaaattccg 6420 ccgaggccac gccgagtagg atcgagggta cagcggacta ttgattgctt gtggaatgaa 6480 ttagtttatg attataatct gttcatttga tcattagtga atttgattct c 6531 //