ID   GQ891990; SV 1; linear; genomic RNA; STD; VRL; 6584 BP.
XX
AC   GQ891990;
XX
DT   02-JUN-2010 (Rel. 105, Created)
DT   02-JUN-2010 (Rel. 105, Last updated, Version 1)
XX
DE   Astrovirus SG, complete genome.
XX
KW   .
XX
OS   Astrovirus SG
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Astroviridae;
OC   unclassified Astroviridae.
XX
RN   [1]
RP   1-6584
RX   PUBMED; 20507741.
RA   Quan P.-L., Wagner T.A., Briese T., Torgerson T.R., Horning M.,
RA   Tashmukhamedova A., Firth C., Palacios G., Baisre-De-Leon A., Paddock C.D.,
RA   Hutchison S.K., Egholm M., Zaki S.R., Goldman J.E., Ochs H.D., Lipkin W.I.;
RT   "Astrovirus encephalitis in boy with X-linked agammaglobulinemia";
RL   Emerg. Infect. Dis. 16(6):918-925(2010).
XX
RN   [2]
RP   1-6584
RA   Quan P.-L.;
RT   ;
RL   Submitted (08-SEP-2009) to the INSDC.
RL   Center for Infection and Immunity, Columbia University, 722 West 168th
RL   Street, New York, NY 10032, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..6584
FT                   /organism="Astrovirus SG"
FT                   /host="Homo sapiens"
FT                   /isolate="SG"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="2007"
FT                   /db_xref="taxon:767521"
FT   5'UTR           1..39
FT   gene            40..4220
FT                   /locus_tag="AstVgp1"
FT   CDS             join(40..2652,2652..4220)
FT                   /codon_start=1
FT                   /ribosomal_slippage
FT                   /locus_tag="AstVgp1"
FT                   /product="nonstructural protein 1ab"
FT                   /note="ORF1ab polyprotein; contains putative serine
FT                   protease and RNA-dependent RNA polymerase domains"
FT                   /db_xref="GOA:D7P3D2"
FT                   /db_xref="InterPro:IPR001205"
FT                   /db_xref="InterPro:IPR001730"
FT                   /db_xref="InterPro:IPR007094"
FT                   /db_xref="InterPro:IPR009003"
FT                   /db_xref="UniProtKB/TrEMBL:D7P3D2"
FT                   /protein_id="ADH93575.1"
FT                   /translation="MERSYKPSGSSHYDPYDRVLQHGSVKARIQGLQLNKVAKTKLEEI
FT                   FSCGGPLCFGYGDVETTRVLNGVVEPQPSIVKTVYVSGVREGNEYVTYLFKPGLNDWVE
FT                   VDANIHEPTAIVGVLYHEYNRLKSENESLKTERSSLQLDISILRHELERARPPTKIIRP
FT                   FSVRCIILYGLLIGLLFSHISQAFRTGVCLDPDVGETLKPQTCINWKWDGGVELDETTP
FT                   LYDRFTVWYTGLMQQFKSMYNDIVIDLVVQAFGFAYTWTAIALMIGTYYMLKSTNPAYM
FT
LVTLMMATVSRMQLFAISAIPNMEVTSMFSLWCCMVLYYFNQVAAMAASLMIAAMCSVV FT CLFMGDAEYVKVIRGHGVVILTIVVSHIFSVLLVPHWVTVFLIVAFRIVRLIGAVVGEK FT IEVRNAEGKVISVIPAPTSWLNRISGFVQSKFTQKVRTGIMSTARVIPNGVVIVESKES FT SGTGFRVQNYIATAGHVVGNETQIKVKWGDVNVYTKVVYMHPTKDIAYLALPSEYQALP FT TYKFAKVIEDGTVVITSMEDCGVLAVAVTEGVIVKDNITYAVSTRNGMSGSPVTNVDGR FT IVGIHQATTGFTGGAVIIKQEDLPPQKKPQREIDLENKIKELEDALKGQMNQGLNENQI FT IELIRLAVGREIEILRHEINMNQAKGKNKRKNHHKRRRKGKVWTEEEYKDLLEKGFTRQ FT QLRDMAEVLREADYSEDDESDEYDTGYPQWSDPEDSEEVEREWFGPKKKILDEVEGWSN FT TDFWEQCQKVWKEMEPMPEESVNTLPSHLSDKYGITCYVVTKSDMEALARDLQEYQAKV FT EEKIKANVVRGQWLEGVNPKTIISELDELWLKLNHLMWTHGIVPFIQRKKINRKKQQKN FT LEGGPETGAPKPEQLRLGYWRELLQPGEYYLTPPHCPLVGVLPIDRPISDYDEPIDDLL FT NLLPKCEEKPPYAPSTWGPEAYRRSFDKFFYRKPTENIKGKYPREWKFAMSVLRREFDF FT LQDSVLIDITSTSKNADSTPAYPKTLWWKTETDYLKERGYQDYIKELDLIRSGERPDVL FT WYLFLKKEILKISKIEEEDIRQIVCADPIFSRIGCVFEEYQNQLMKNRTLTRMGQCGWS FT PFMGGFHKRIKRLVDKGNDYFIEFDWTRYDGTIPNEVFRVIKDFRFSCLRGDLQTKENR FT DVYNWYCENIFRRYVMLPSGEVTIQDRGNPSGQISTTMDNNICNVFFQAFEFAYLNTEL FT DSDELKENWDKYDSLIYGDDRLTTTPILCDNYVDRVIKMYADVFGMWVKREKVKVSNKI FT NGLTFCGFTVQESNGLFVPIPTDTDKLLAGLITPIKKLPDILSLYGKLLCYRILGHNLP FT DDHKFKNYILVALEVVARHIRASGGEEPYYITDSMLDRLWRGGPKQSHGW" FT CDS 40..2697 FT /codon_start=1 FT /locus_tag="AstVgp1" FT /product="nonstructural protein 1a" FT /note="ORF1a; putative serine protease" FT /db_xref="GOA:D7P3D3" FT /db_xref="InterPro:IPR001730" FT /db_xref="InterPro:IPR009003" FT /db_xref="UniProtKB/TrEMBL:D7P3D3" FT /protein_id="ADH93576.1" FT /translation="MERSYKPSGSSHYDPYDRVLQHGSVKARIQGLQLNKVAKTKLEEI FT FSCGGPLCFGYGDVETTRVLNGVVEPQPSIVKTVYVSGVREGNEYVTYLFKPGLNDWVE FT VDANIHEPTAIVGVLYHEYNRLKSENESLKTERSSLQLDISILRHELERARPPTKIIRP FT FSVRCIILYGLLIGLLFSHISQAFRTGVCLDPDVGETLKPQTCINWKWDGGVELDETTP FT LYDRFTVWYTGLMQQFKSMYNDIVIDLVVQAFGFAYTWTAIALMIGTYYMLKSTNPAYM FT LVTLMMATVSRMQLFAISAIPNMEVTSMFSLWCCMVLYYFNQVAAMAASLMIAAMCSVV FT CLFMGDAEYVKVIRGHGVVILTIVVSHIFSVLLVPHWVTVFLIVAFRIVRLIGAVVGEK FT IEVRNAEGKVISVIPAPTSWLNRISGFVQSKFTQKVRTGIMSTARVIPNGVVIVESKES FT 
SGTGFRVQNYIATAGHVVGNETQIKVKWGDVNVYTKVVYMHPTKDIAYLALPSEYQALP FT TYKFAKVIEDGTVVITSMEDCGVLAVAVTEGVIVKDNITYAVSTRNGMSGSPVTNVDGR FT IVGIHQATTGFTGGAVIIKQEDLPPQKKPQREIDLENKIKELEDALKGQMNQGLNENQI FT IELIRLAVGREIEILRHEINMNQAKGKNKRKNHHKRRRKGKVWTEEEYKDLLEKGFTRQ FT QLRDMAEVLREADYSEDDESDEYDTGYPQWSDPEDSEEVEREWFGPKKKILDEVEGWSN FT TDFWEQCQKVWKEMEPMPEESVNTLPSHLSDKYGITCYVVTKSDMEALARDLQEYQAKV FT EEKIKANVVRGQWLEGVNPKTIISELDELWLKLNHLMWTHGIVPFIQRKKINRKKQQKN FT LKGAPKQGPQNQNN" FT misc_feature 2646..2652 FT /locus_tag="AstVgp1" FT /note="ribosomal frameshift signal" FT gene 4210..6486 FT /gene="ORF2" FT /locus_tag="AstVgp2" FT CDS 4210..6486 FT /codon_start=1 FT /gene="ORF2" FT /locus_tag="AstVgp2" FT /product="capsid protein precursor" FT /db_xref="InterPro:IPR004337" FT /db_xref="UniProtKB/TrEMBL:D7P3D4" FT /protein_id="ADH93577.1" FT /translation="MAGKQPQQALPKAAAKQIAKEVVKQEKKEPVVRKKKQFYSNPKFN FT NRSNKKFVKKQLDKNLKKQGFAGPKPRFAVTVSATIGKVGPNKSQGPELQISTFMHPSL FT MKEPNDGTNFGPLQSAAAQWGLWRLKSLSVTFTPLVGPSAVTGSVFRISLNMAQSPGAT FT SWGGLGARKHKDVTVGKQFTWKLQKGDFTGPRETWWLTDTNEEGAQSCGPLLEIHGLGE FT TTSTYKDAAWVGDLFIVEVRGRWEFANYNSKPALGMLERVTETTNASIEVVGGNMIMTI FT PQNSQLARHMSERFERTTNASTVGETIWQIVDEGAGLVAKVAPPPFTWLIKGGWWFVKR FT LLGKSANTDDQYLVYASLADAQNNRPVEARDYTRVTCQTTLSSTQINAPNTGPNTTIES FT IGNNNQQWPIPLTGVPVGDFYVYGRMTTLHMGGQSGIQATTLVNGMIYRTDHPEPSTSP FT VSNWEFTVLENNTIVGAGMGCVWFQKSEVLVWTLDGQKLSGWNTLDGVGTTQLTVAWRQ FT HNRTIYGWANVVAWNSEEWHTNAERLHQPTLRLTYWLVKINVSSEPEDFDVVQKFPLAY FT LEDYTTAQSKSAIQKLNFQTFQKPEGGGTLRAQYSTVPRQGDFAVIWQIGRHNFDMSTG FT KGTPVESLSDYVMPQQKDARIGMWYRALTSVGPRSDMLTLHFHFPTVEKDLVEQIIDQI FT QHRYRLTPLDSDSDSSSSDSDFEPEDGFEKLEIYEGLGSSGLPHHVSDGAAIAVKKKLR FT RGHAE" FT stem_loop 6463..6505 FT /note="stem loop-2-like sequence" FT 3'UTR 6489..6584 XX SQ Sequence 6584 BP; 2031 A; 1152 C; 1586 G; 1815 T; 0 other; ccaaattttg ttggctgtgc cgattggcac tggtgggtca tggagcgctc atacaagcct 60 agtggcagta gtcactatga tccctatgat agggtactac aacatggtag tgtcaaagca 120 cggatacagg gccttcagct taataaagta gctaagacca agcttgaaga gattttttca 
180 tgtggtgggc ccttgtgttt tggctacggt gatgttgaga ctactcgtgt tttaaatgga 240 gttgttgagc cacaaccttc aattgttaag acggtgtatg tctcaggagt gagagagggt 300 aatgaatatg tcacttacct ttttaaaccg ggacttaacg attgggttga ggttgatgcc 360 aacatacatg aacctactgc cattgttggt gtattgtatc atgagtataa caggcttaaa 420 tcagagaatg aaagtttgaa aacagagcgc tcatctttac aattggatat atcaatcctc 480 agacatgagc ttgagcgtgc aagaccaccg accaaaataa ttaggccttt cagcgttaga 540 tgtatcatac tgtatggttt actgattggc cttttgtttt cacacatctc acaagctttt 600 aggactggtg tgtgtcttga tccagatgtt ggtgaaacat taaagccaca aacctgtata 660 aattggaagt gggatggtgg agttgaatta gatgagacta ccccacttta tgataggttt 720 acagtgtggt atacaggctt aatgcaacag tttaaaagca tgtacaatga cattgtcatt 780 gatttggtgg ttcaggcttt tggcttcgct tacacatgga cagctatagc actgatgata 840 ggcacatatt acatgttgaa atccactaat ccagcatata tgctggtgac attgatgatg 900 gcaactgtgt caagaatgca gttatttgca atatctgcta tacctaacat ggaagtcact 960 tcaatgtttt cactgtggtg ctgcatggta ttatactatt ttaatcaggt tgcagcaatg 1020 gctgcatcat taatgatagc agctatgtgt tctgttgttt gcctttttat gggtgatgct 1080 gaatatgtga aagtgataag gggccatggt gtggttatct taactattgt tgtttctcac 1140 atctttagtg ttttgttagt gccacactgg gtcacagtgt ttctaatagt tgcttttaga 1200 attgttaggt taattggagc agttgttggt gaaaaaatag aagttaggaa tgctgaggga 1260 aaagttataa gtgtcatacc agcaccaaca tcctggttga atcggatttc tggatttgtc 1320 cagtccaaat tcacccaaaa agttagaact ggtataatgt caacagctag agtgatacct 1380 aatggtgttg ttattgttga atcaaaagaa agctcaggca ctggctttag agttcaaaat 1440 tacatagcca cagccggaca tgttgttggc aatgaaacac aaataaaggt taaatggggg 1500 gatgttaatg tttatacaaa agttgtttac atgcatccca ccaaggatat agcctatctt 1560 gccctaccat cagagtatca agcactccca acatacaagt ttgccaaggt gattgaggat 1620 ggcaccgttg ttataacatc aatggaggat tgtggtgttc ttgccgttgc ggttacagaa 1680 ggtgttattg ttaaagataa cataacatat gctgttagca ccagaaacgg catgagtggt 1740 tcacctgtta caaatgttga tggtagaatt gttggtatac atcaagccac cactggattt 1800 acaggcggtg ctgtcatcat aaagcaagag gatttaccac cccaaaagaa gccacaaagg 1860 gagatagacc 
ttgagaacaa gattaaagaa ttagaggatg cccttaaagg tcagatgaat 1920 caaggcctaa atgaaaatca gataattgaa ttgattcggc ttgctgttgg tcgtgagatt 1980 gaaatcttac ggcatgaaat aaatatgaat caagcaaaag gtaaaaataa aaggaagaat 2040 caccacaaga ggcgtaggaa gggaaaagtc tggactgaag aagaatacaa agaccttctg 2100 gaaaagggtt tcaccagaca gcaattacgg gacatggctg aagtgcttag agaggcggac 2160 tattccgaag atgatgagag tgatgagtat gacactggtt acccacaatg gtcagaccca 2220 gaagactctg aagaggttga aagggaatgg tttgggccaa agaaaaaaat acttgatgag 2280 gttgaaggtt ggtccaatac tgatttctgg gagcagtgtc agaaggtgtg gaaggagatg 2340 gagcccatgc cggaagagtc tgttaacact ttaccgtcac acttgagtga taagtatggc 2400 attacatgct atgttgtcac aaagagtgat atggaagcct tagcccgtga tttgcaggaa 2460 taccaagcca aggttgagga gaagattaag gcaaatgttg ttcgtggtca gtggcttgag 2520 ggagtcaatc caaaaactat cataagtgag ttggatgaat tatggctgaa attgaaccac 2580 ttaatgtgga ctcatggcat agtccctttc atacagagaa agaaaattaa cagaaagaaa 2640 cagcaaaaaa acttgaaggg ggccccgaaa caggggcccc aaaaccagaa caactaaggc 2700 ttgggtactg gagagaacta ttacaacctg gtgaatatta tcttaccccc ccacattgcc 2760 ccttggttgg tgttttacca atagataggc ctataagtga ttatgatgag ccaattgatg 2820 atttactaaa tttgttgcca aaatgtgagg aaaagccgcc atacgcacct tccacatggg 2880 gaccagaagc gtataggcgg tcatttgata aattctttta cagaaagcca actgaaaaca 2940 taaaaggaaa atatcctagg gagtggaaat ttgcaatgtc agtgcttaga agagaatttg 3000 atttcttaca ggacagtgtt cttattgata taacatccac ttcaaagaat gctgattcca 3060 caccagctta tccaaagaca ttatggtgga aaactgaaac agattatctt aaagagcggg 3120 gttatcaaga ttatattaaa gagttagatc taataagatc tggagagagg cctgatgtct 3180 tatggtattt atttttgaaa aaagaaattc taaagataag taaaattgag gaagaagaca 3240 ttaggcaaat tgtttgtgct gatcccattt tttctagaat tggttgtgta tttgaagagt 3300 accaaaatca attgatgaaa aatagaaccc tgacacgtat gggtcaatgt ggatggtcgc 3360 cgtttatggg aggtttccat aaacgcataa agcgcttagt tgataaaggc aatgattatt 3420 tcattgaatt cgactggacg cgttatgatg gtaccatccc taatgaagtc tttagggtca 3480 ttaaggactt tagattctcg tgtcttaggg gggatttgca aacaaaggaa aatagagatg 3540 tctataattg gtattgtgag 
aatatattta gaagatatgt gatgttacct tcaggagaag 3600 ttacaatcca ggacaggggg aatccctctg ggcaaatatc cacaactatg gataataaca 3660 tttgtaatgt ctttttccag gcatttgagt ttgcatatct gaatactgaa ttggattctg 3720 atgaattaaa ggaaaattgg gataagtatg attcacttat atatggagat gacaggctaa 3780 ctacaacccc tattttatgt gataattatg tagacagagt tattaaaatg tatgctgatg 3840 tctttgggat gtgggtcaag agagagaaag taaaagtttc aaataaaatt aatggattga 3900 ccttttgtgg ctttactgtt caagagtcaa atggcctttt tgtccccata ccaactgata 3960 cagataaatt acttgctggt ttaataacac caataaagaa attgcctgat attttgtcac 4020 tctatgggaa gctcctttgc taccgcatcc ttggccataa cttgccggat gaccataaat 4080 ttaaaaatta tattttggtc gccttggagg tagtggccag gcacatccgt gctagtggtg 4140 gggaagaacc ctactatatc acggatagca tgctggatag gctttggagg ggaggtccaa 4200 agcaaagtca tggctggtaa gcagccccag caggccctgc ccaaggcagc ggcaaagcaa 4260 atagccaagg aggtggttaa acaggagaag aaggaaccag tggtgcgtaa aaagaaacag 4320 ttttattcaa atccaaagtt taataataga tctaataaga aatttgtgaa gaagcagtta 4380 gataagaatt tgaagaaaca agggtttgca ggaccaaaac ctagatttgc tgttaccgtt 4440 tctgccacca ttggcaaggt tgggccaaat aaaagtcagg gacctgaact ccaaatatct 4500 actttcatgc accccagctt gatgaaagaa ccaaatgatg gtacaaattt tggtccccta 4560 cagtcagcag ctgcacaatg gggtttgtgg cgcttgaaaa gtttaagcgt cacgtttact 4620 cctcttgttg gtccatcagc agttactggg tctgttttcc gcatatccct aaacatggca 4680 cagtcacctg gagccacatc atgggggggt cttggtgcta ggaagcacaa ggatgttact 4740 gtggggaagc agttcacttg gaagctacag aagggagact tcacaggccc cagggaaacc 4800 tggtggctta cagatacaaa tgaggaggga gcacaaagtt gtgggcctct tcttgagatc 4860 catggcttgg gtgaaacgac ctctacctac aaggatgcag catgggttgg agacctcttc 4920 attgttgagg tcaggggccg ctgggagttt gcaaattata acagtaaacc tgcattaggt 4980 atgttggaga gagtgaccga aactaccaat gcctcaattg aagtggttgg tggcaacatg 5040 attatgacaa ttccacagaa ttcccaactc gcaaggcata tgagtgaaag gttcgagagg 5100 accacaaatg caagcactgt tggtgaaaca atatggcaga ttgtggatga gggtgctggc 5160 ttggttgcaa aagttgcacc acccccgttt acttggttga ttaagggggg atggtggttt 5220 gtcaagagac tgctaggtaa atcagcaaat 
actgatgatc aatacctagt ttatgcatca 5280 ttggcagatg cccaaaacaa cagacctgta gaggcacgag attacacaag agtcacatgc 5340 cagacaacac tttcttccac gcaaattaat gcaccaaata caggccctaa taccactata 5400 gagtcaattg gaaataacaa ccaacagtgg ccgatacctc tgacaggggt gccggttggt 5460 gacttttatg tttatggtag gatgacaaca ttgcatatgg gtggtcaatc tggcattcaa 5520 gctacgacct tagtgaatgg gatgatatat cgtacagatc acccagaacc atcaacaagc 5580 ccagtctcca attgggagtt cacagttttg gaaaataata caattgttgg tgctggaatg 5640 gggtgtgtgt ggtttcagaa atctgaagta ctagtgtgga cactagatgg ccagaagctg 5700 tcaggatgga acacgctaga tggcgttggc acaacccaat taacagttgc ctggagacag 5760 cataatagaa caatttatgg atgggctaat gttgttgctt ggaactctga agaatggcac 5820 acaaatgcgg agcggctaca ccagcctaca ttgaggttga catattggct agtaaaaatc 5880 aatgtttcgt ctgaaccaga agattttgat gttgtccaaa aattcccatt agcttattta 5940 gaagattaca ctacagcaca atcaaaatct gccatccaaa agctcaactt ccaaacgttt 6000 cagaaacctg aagggggagg cactttgcgg gcacaatact caactgttcc caggcaaggg 6060 gattttgccg taatatggca gattggtagg cataattttg atatgtctac cggtaaaggt 6120 acaccagttg aaagtttgag tgattatgtt atgccccagc agaaagatgc ccgtattggc 6180 atgtggtatc gtgctttaac tagtgttgga ccaagatcag atatgttaac ccttcatttc 6240 catttcccaa ctgtggaaaa ggatttagtt gagcagatta ttgatcaaat tcagcatcgc 6300 tacagattga ccccactgga ttcggattca gactcctcta gttctgattc cgattttgag 6360 cccgaagatg gatttgagaa gttagaaatc tatgagggtc tcgggtccag tggcttgcca 6420 caccatgtgt ctgatggtgc tgcgatagct gttaagaaaa agttgcgccg aggccacgcc 6480 gagtaggatc gagggtacag cgctaaattg attactagag gtgttaatca ataaatcatt 6540 gatttggtga ttgatatgat caatttgaaa ttgaaatttc cagc 6584 //