ID J02363; SV 1; linear; genomic RNA; STD; VRL; 11703 BP. XX AC J02363; J02364-J02367; XX DT 03-JUL-1991 (Rel. 28, Created) DT 17-APR-2005 (Rel. 83, Last updated, Version 10) XX DE Sindbis virus (hrsp and wild-type strains) complete genome. XX KW complete genome; glycoprotein; nucleocapsid protein; polyprotein; KW RNA replicase. XX OS Sindbis virus OC Viruses; Riboviria; Togaviridae; Alphavirus. XX RN [1] RP 11558-11703 RX DOI; 10.1016/0042-6822(81)90499-2. RX PUBMED; 6259811. RA Ou J.H., Strauss E.G., Strauss J.H.; RT "Comparative studies of the 3'-terminal sequences of several alpha virus RT RNAs"; RL Virology 109(2):281-289(1981). XX RN [2] RP 7629-11703 RX DOI; 10.1073/pnas.78.4.2062. RX PUBMED; 6941270. RA Rice C.M., Strauss J.H.; RT "Nucleotide sequence of the 26S mRNA of Sindbis virus and deduced sequence RT of the encoded virus structural proteins"; RL Proc. Natl. Acad. Sci. U.S.A. 78(4):2062-2066(1981). XX RN [3] RP 7630-11703 RX DOI; 10.1016/0022-2836(81)90550-7. RX PUBMED; 6271974. RA Rice C.M., Strauss J.H.; RT "Synthesis, cleavage and sequence analysis of DNA complementary to the 26 S RT messenger RNA of Sindbis virus"; RL J. Mol. Biol. 150(3):315-340(1981). XX RN [4] RP 11554-11703 RX PUBMED; 6896345. RA Monroe S.S., Ou J.H., Rice C.M., Schlesinger S., Strauss E.G., RA Strauss J.H.; RT "Sequence analysis of cDNA's derived from the RNA of Sindbis virions and of RT defective interfering particles"; RL J. Virol. 41(1):153-162(1982). XX RN [5] RP 11477-11703 RX DOI; 10.1016/0022-2836(82)90138-3. RX PUBMED; 6288962. RA Ou J.H., Trent D.W., Strauss J.H.; RT "The 3'-non-coding regions of alphavirus RNAs contain repeating sequences"; RL J. Mol. Biol. 156(4):719-730(1982). XX RN [6] RP 7350-7675 RX DOI; 10.1073/pnas.79.17.5235. RX PUBMED; 6291034. RA Ou J.H., Rice C.M., Dalgarno L., Strauss E.G., Strauss J.H.; RT "Sequence studies of several alphavirus genomic RNAs in the region RT containing the start of the subgenomic RNA"; RL Proc. Natl. Acad. Sci. U.S.A. 79(17):5235-5239(1982). XX RN [7] RP 1-221 RX DOI; 10.1016/S0022-2836(83)80319-2. RX PUBMED; 6308269. RA Ou J.H., Strauss E.G., Strauss J.H.; RT "The 5'-terminal sequences of the genomic RNAs of several alphaviruses"; RL J. Mol. Biol. 168(1):1-15(1983). XX RN [8] RP 4344-7601 RX DOI; 10.1073/pnas.80.17.5271. RX PUBMED; 6577423. RA Strauss E.G., Rice C.M., Strauss J.H.; RT "Sequence coding for the alphavirus nonstructural proteins is interrupted RT by an opal termination codon"; RL Proc. Natl. Acad. Sci. U.S.A. 80(17):5271-5275(1983). XX RN [9] RP 1-11703 RX DOI; 10.1016/0042-6822(84)90428-8. RX PUBMED; 6322438. RA Strauss E.G., Rice C.M., Strauss J.H.; RT "Complete nucleotide sequence of the genomic RNA of Sindbis virus"; RL Virology 133(1):92-110(1984). XX DR MD5; 1298d08cd0a17b74fea8d22c2639cf72. DR EuropePMC; PMC112110; 10846095. DR EuropePMC; PMC112336; 10933713. DR EuropePMC; PMC136221; 12021349. DR EuropePMC; PMC1698312; 16957044. DR EuropePMC; PMC2569925; 18822126. DR EuropePMC; PMC2798122; 19259792. DR EuropePMC; PMC3205794; 21452926. DR EuropePMC; PMC3302312; 22258240. DR EuropePMC; PMC3323234; 15200824. DR EuropePMC; PMC3950689; 24335283. DR EuropePMC; PMC4992889; 27545976. DR EuropePMC; PMC5279924; 27664855. DR EuropePMC; PMC6025923; 29954885. DR EuropePMC; PMC6074608; 30094371. DR EuropePMC; PMC6522784; 31097499. DR GOA; P0DOK0. DR InterPro; IPR000936; Alpha_E2_glycop. DR InterPro; IPR009003; Peptidase_S1_PA. DR InterPro; IPR042305; Alphavir_E2_B. DR RFAM; RF00470; Toga_5_CRE. DR RFAM; RF01836; weev_FSE. DR UniProtKB/Swiss-Prot; P0DOK0; POLSF_SINDV. XX CC [3] hr strain. CC [2] hr strain. CC [1] hr comp strand. CC [5] wt, sin-1(1. CC [4] wt strain. CC [6] hr strain. CC [7] hr strain. CC [8] hr strain. CC [9] hr strain. CC Sindbis is a single stranded RNA virus of the genus alphavirus CC which includes the Eastern Equine Encephalitis , the CC Highlands J , the Middelburg , the Ross River , the CC Semliki Forest , the Venezuelan Equine Encephalitis and CC the Western Equine Encephalitis viruses. The genome is of CC plus polarity, capped at the 5' end and polyadenylated at the 3' CC end, and, in the case of Sindbis, is 11703 nucleotides in length CC without these modifications. The sequence shown is of the HR small CC plaque strain (HRsp); changes from wild-type and from the HR large CC plaque strain (HRlp) are indicated where known. CC The 49S plus strand shown below is replicated into a 49S minus CC strand which serves as a template for the progeny 49S plus strands CC and for a 26S subgenomic mRNA. Four RNA synthesis activities are CC thought to be required for these replications as summarized in [9]. CC The possible recognition sites for these events are suggested by CC comparative studies (see the loci mentioned in the above paragraph) CC and by analysis of defective interfering particle sequences (see CC ). CC The 49S plus strand RNA serves as an mRNA for the nonstructural CC viral proteins, i.e. the replicase/transcriptase components and CC perhaps a protease for cleavage of structural proteins. These are CC nsp1 (ns60), nsp2 (ns89), nsp3 (ns76) and nsp4 (ns72). The first CC three are produced as a polyprotein which has been designated p230. CC the fourth component is produced by readthrough translation of an CC opal terminator at bases 5748-5750 [8]; this readthrough yields a CC polyprotein which has been designated p270. Multiple in-phase CC termination codons at the start of the 26S transcript (structural CC protein ) region prevent further readthrough. These four peptides CC may or may not simply be the four activities required for RNA CC synthesis and elongation. CC The 26S mRNA transcript (4106 bases) encodes the nucleocapsid CC protein, two glycoproteins and two small peptides not present in CC the mature virion. These are initially produced as a polyprotein CC designated p130; subsquent processing steps have been fully CC determined (see [2] and related literature). CC Comparative studies have revealed conserved sequences at the 3'end CC [3],[4], near the 5' end [7], and at the start of the 26S RNA [6]. CC Complete source information: CC Sindbis genomic 49S RNA [7], subgenomic 26S RNA [6], and cDNA to CC 26S and 49S RNA [3],[2],[1],[5],[4],[6],[7],[8],[9] from HR CC [3],[2],[1],[6],[7],[8],[9] and wild-type [5],[4] strains. all CC virus preparations were obtained from passage through cultured CC chicken embryo fibroblast cells. XX FH Key Location/Qualifiers FH FT source 1..11703 FT /organism="Sindbis virus" FT /mol_type="genomic RNA" FT /db_xref="taxon:11034" FT mRNA 1..11703 FT /product="49S genomic mRNA" FT variation 5 FT /note="a in hrsp strain; g in hrlp and wt [7],[9]" FT variation 25 FT /note="g in hrsp strain and wt; a in hrlp [7],[9]" FT CDS 60..5750 FT /codon_start=1 FT /note="p230 nonstructural polyprotein" FT /db_xref="GOA:P03317" FT /db_xref="InterPro:IPR001788" FT /db_xref="InterPro:IPR002588" FT /db_xref="InterPro:IPR002589" FT /db_xref="InterPro:IPR002620" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR027351" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029063" FT /db_xref="PDB:4GUA" FT /db_xref="UniProtKB/Swiss-Prot:P03317" FT /protein_id="AAA96975.1" FT /translation="MEKPVVNVDVDPQSPFVVQLQKSFPQFEVVAQQVTPNDHANARAF FT SHLASKLIELEVPTTATILDIGSAPARRMFSEHQYHCVCPMRSPEDPDRMMKYASKLAE FT KACKITNKNLHEKIKDLRTVLDTPDAETPSLCFHNDVTCNMRAEYSVMQDVYINAPGTI FT YHQAMKGVRTLYWIGFDTTQFMFSAMAGSYPAYNTNWADEKVLEARNIGLCSTKLSEGR FT TGKLSIMRKKELKPGSRVYFSVGSTLYPEHRASLQSWHLPSVFHLNGKQSYTCRCDTVV FT SCEGYVVKKITISPGITGETVGYAVTHNSEGFLLCKVTDTVKGERVSFPVCTYIPATIC FT DQMTGIMATDISPDDAQKLLVGLNQRIVINGRTNRNTNTMQNYLLPIIAQGFSKWAKER FT KDDLDNEKMLGTRERKLTYGCLWAFRTKKVHSFYRPPGTQTCVKVPASFSAFPMSSVWT FT TSLPMSLRQKLKLALQPKKEEKLLQVSEELVMEAKAAFEDAQEEARAEKLREALPPLVA FT DKGIEAAAEVVCEVEGLQADIGAALVETPRGHVRIIPQANDRMIGQYIVVSPNSVLKNA FT KLAPAHPLADQVKIITHSGRSGRYAVEPYDAKVLMPAGGAVPWPEFLALSESATLVYNE FT REFVNRKLYHIAMHGPAKNTEEEQYKVTKAELAETEYVFDVDKKRCVKKEEASGLVLSG FT ELTNPPYHELALEGLKTRPAVPYKVETIGVIGTPGSGKSAIIKSTVTARDLVTSGKKEN FT CREIEADVLRLRGMQITSKTVDSVMLNGCHKAVEVLYVDEAFACHAGALLALIAIVRPR FT KKVVLCGDPMQCGFFNMMQLKVHFNHPEKDICTKTFYKYISRRCTQPVTAIVSTLHYDG FT KMKTTNPCKKNIEIDITGATKPKPGDIILTCFRGWVKQLQIDYPGHEVMTAAASQGLTR FT KGVYAVRQKVNENPLYAITSEHVNVLLTRTEDRLVWKTLQGDPWIKQPTNIPKGNFQAT FT IEDWEAEHKGIIAAINSPTPRANPFSCKTNVCWAKALEPILATAGIVLTGCQWSELFPQ FT FADDKPHSAIYALDVICIKFFGMDLTSGLFSKQSIPLTYHPADSARPVAHWDNSPGTRK FT YGYDHAIAAELSRRFPVFQLAGKGTQLDLQTGRTRVISAQHNLVPVNRNLPHALVPEYK FT EKQPGPVKKFLNQFKHHSVLVVSEEKIEAPRKRIEWIAPIGIAGADKNYNLAFGFPPQA FT RYDLVFINIGTKYRNHHFQQCEDHAATLKTLSRSALNCLNPGGTLVVKSYGYADRNSED FT VVTALARKFVRVSAARPDCVSSNTEMYLIFRQLDNSRTRQFTPHHLNCVISSVYEGTRD FT GVGAAPSYRTKRENIADCQEEAVVNAANPLGRPGEGVCRAIYKRWPTSFTDSATETGTA FT RMTVCLGKKVIHAVGPDFRKHPEAEALKLLQNAYHAVADLVNEHNIKSVAIPLLSTGIY FT AAGKDRLEVSLNCLTTALDRTDADVTIYCLDKKWKERIDAALQLKESVTELKDEDMEID FT DELVWIHPDSCLKGRKGFSTTKGKLYSYFEGTKFHQAAKDMAEIKVLFPNDQESNEQLC FT AYILGETMEAIREKCPVDHNPSSSPPKTLPCLCMYAMTPERVHRLRSNNVKEVTVCSST FT PLPKHKIKNVQKVQCTKVVLFNPHTPAFVPARKYIEVPEQPTAPPAQAEEAPEVVATPS FT PSTADNTSLDVTDISLDMDDSSEGSLFSSFSGSDNSITSMDSWSSGPSSLEIVDRRQVV FT VADVHAVQEPAPIPPPRLKKMARLAAARKEPTPPASNSSESLHLSFGGVSMSLGSIFDG FT ETARQAAVQPLATGPTDVPMSFGSFSDGEIDELSRRVTESEPVLFGSFEPGEVNSIISS FT RSAVSFPLRKQRRRRRSRRTEY" FT mat_peptide 60..1679 FT /product="nsp1 nonstructural protein" FT CDS 60..7601 FT /codon_start=1 FT /transl_except=(pos:5748..5750,aa:OTHER) FT /note="p270 nonstructural polyprotein (readthrough)" FT /db_xref="GOA:P03317" FT /db_xref="InterPro:IPR001788" FT /db_xref="InterPro:IPR002588" FT /db_xref="InterPro:IPR002589" FT /db_xref="InterPro:IPR002620" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR027351" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR029063" FT /db_xref="PDB:4GUA" FT /db_xref="UniProtKB/Swiss-Prot:P03317" FT /protein_id="AAA96974.1" FT /translation="MEKPVVNVDVDPQSPFVVQLQKSFPQFEVVAQQVTPNDHANARAF FT SHLASKLIELEVPTTATILDIGSAPARRMFSEHQYHCVCPMRSPEDPDRMMKYASKLAE FT KACKITNKNLHEKIKDLRTVLDTPDAETPSLCFHNDVTCNMRAEYSVMQDVYINAPGTI FT YHQAMKGVRTLYWIGFDTTQFMFSAMAGSYPAYNTNWADEKVLEARNIGLCSTKLSEGR FT TGKLSIMRKKELKPGSRVYFSVGSTLYPEHRASLQSWHLPSVFHLNGKQSYTCRCDTVV FT SCEGYVVKKITISPGITGETVGYAVTHNSEGFLLCKVTDTVKGERVSFPVCTYIPATIC FT DQMTGIMATDISPDDAQKLLVGLNQRIVINGRTNRNTNTMQNYLLPIIAQGFSKWAKER FT KDDLDNEKMLGTRERKLTYGCLWAFRTKKVHSFYRPPGTQTCVKVPASFSAFPMSSVWT FT TSLPMSLRQKLKLALQPKKEEKLLQVSEELVMEAKAAFEDAQEEARAEKLREALPPLVA FT DKGIEAAAEVVCEVEGLQADIGAALVETPRGHVRIIPQANDRMIGQYIVVSPNSVLKNA FT KLAPAHPLADQVKIITHSGRSGRYAVEPYDAKVLMPAGGAVPWPEFLALSESATLVYNE FT REFVNRKLYHIAMHGPAKNTEEEQYKVTKAELAETEYVFDVDKKRCVKKEEASGLVLSG FT ELTNPPYHELALEGLKTRPAVPYKVETIGVIGTPGSGKSAIIKSTVTARDLVTSGKKEN FT CREIEADVLRLRGMQITSKTVDSVMLNGCHKAVEVLYVDEAFACHAGALLALIAIVRPR FT KKVVLCGDPMQCGFFNMMQLKVHFNHPEKDICTKTFYKYISRRCTQPVTAIVSTLHYDG FT KMKTTNPCKKNIEIDITGATKPKPGDIILTCFRGWVKQLQIDYPGHEVMTAAASQGLTR FT KGVYAVRQKVNENPLYAITSEHVNVLLTRTEDRLVWKTLQGDPWIKQPTNIPKGNFQAT FT IEDWEAEHKGIIAAINSPTPRANPFSCKTNVCWAKALEPILATAGIVLTGCQWSELFPQ FT FADDKPHSAIYALDVICIKFFGMDLTSGLFSKQSIPLTYHPADSARPVAHWDNSPGTRK FT YGYDHAIAAELSRRFPVFQLAGKGTQLDLQTGRTRVISAQHNLVPVNRNLPHALVPEYK FT EKQPGPVKKFLNQFKHHSVLVVSEEKIEAPRKRIEWIAPIGIAGADKNYNLAFGFPPQA FT RYDLVFINIGTKYRNHHFQQCEDHAATLKTLSRSALNCLNPGGTLVVKSYGYADRNSED FT VVTALARKFVRVSAARPDCVSSNTEMYLIFRQLDNSRTRQFTPHHLNCVISSVYEGTRD FT GVGAAPSYRTKRENIADCQEEAVVNAANPLGRPGEGVCRAIYKRWPTSFTDSATETGTA FT RMTVCLGKKVIHAVGPDFRKHPEAEALKLLQNAYHAVADLVNEHNIKSVAIPLLSTGIY FT AAGKDRLEVSLNCLTTALDRTDADVTIYCLDKKWKERIDAALQLKESVTELKDEDMEID FT DELVWIHPDSCLKGRKGFSTTKGKLYSYFEGTKFHQAAKDMAEIKVLFPNDQESNEQLC FT AYILGETMEAIREKCPVDHNPSSSPPKTLPCLCMYAMTPERVHRLRSNNVKEVTVCSST FT PLPKHKIKNVQKVQCTKVVLFNPHTPAFVPARKYIEVPEQPTAPPAQAEEAPEVVATPS FT PSTADNTSLDVTDISLDMDDSSEGSLFSSFSGSDNSITSMDSWSSGPSSLEIVDRRQVV FT VADVHAVQEPAPIPPPRLKKMARLAAARKEPTPPASNSSESLHLSFGGVSMSLGSIFDG FT ETARQAAVQPLATGPTDVPMSFGSFSDGEIDELSRRVTESEPVLFGSFEPGEVNSIISS FT RSAVSFPLRKQRRRRRSRRTEYXLTGVGGYIFSTDTGPGHLQKKSVLQNQLTEPTLERN FT VLERIHAPVLDTSKEEQLKLRYQMMPTEANKSRYQSRKVENQKAITTERLLSGLRLYNS FT ATDQPECYKITYPKPLYSSSVPANYSDPQFAVAVCNNYLHENYPTVASYQITDEYDAYL FT DMVDGTVACLDTATFCPAKLRSYPKKHEYRAPNIRSAVPSAMQNTLQNVLIAATKRNCN FT VTQMRELPTLDSATFNVECFRKYACNDEYWEEFARKPIRITTEFVTAYVARLKGPKAAA FT LFAKTYNLVPLQEVPMDRFVMDMKRDVKVTPGTKHTEERPKVQVIQAAEPLATAYLCGI FT HRELVRRLTAVLLPNIHTLFDMSAEDFDAIIAEHFKQGDPVLETDIASFDKSQDDAMAL FT TGLMILEDLGVDQPLLDLIECAFGEISSTHLPTGTRFKFGAMMKSGMFLTLFVNTVLNV FT VIASRVLEERLKTSRCAAFIGDDNIIHGVVSDKEMAERCATWLNMEVKIIDAVIGERPP FT YFCGGFILQDSVTSTACRVADPLKRLFKLGKPLPADDEQDEDRRRALLDETKAWFRVGI FT TGTLAVAVTTRYEVDNITPVLLALRTFAQSKRAFQAIRGEIKHLYGGPK" FT mat_peptide 1680..4100 FT /product="nsp2 nonstructural protein" FT mat_peptide 4101..5747 FT /product="nsp3 nonstructural protein" FT mat_peptide 5751..7598 FT /product="nsp4 nonstructural protein (putative start); FT putative" FT mRNA 7598..11703 FT /product="26S subgenomic mRNA" FT CDS 7647..11384 FT /codon_start=1 FT /note="p130 structural polyprotein" FT /db_xref="GOA:P03316" FT /db_xref="InterPro:IPR000336" FT /db_xref="InterPro:IPR000930" FT /db_xref="InterPro:IPR000936" FT /db_xref="InterPro:IPR002533" FT /db_xref="InterPro:IPR002548" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR014756" FT /db_xref="InterPro:IPR036253" FT /db_xref="InterPro:IPR038055" FT /db_xref="InterPro:IPR042304" FT /db_xref="InterPro:IPR042305" FT /db_xref="InterPro:IPR042306" FT /db_xref="PDB:1KXA" FT /db_xref="PDB:1KXB" FT /db_xref="PDB:1KXC" FT /db_xref="PDB:1KXD" FT /db_xref="PDB:1KXE" FT /db_xref="PDB:1KXF" FT /db_xref="PDB:1LD4" FT /db_xref="PDB:1SVP" FT /db_xref="PDB:1Z8Y" FT /db_xref="PDB:2SNV" FT /db_xref="PDB:2SNW" FT /db_xref="PDB:3J0F" FT /db_xref="PDB:3MUU" FT /db_xref="PDB:3MUW" FT /db_xref="UniProtKB/Swiss-Prot:P03316" FT /protein_id="AAA96976.1" FT /translation="MNRGFFNMLGRRPFPAPTAMWRPRRRRQAAPMPARNGLASQIQQL FT TTAVSALVIGQATRPQPPRPRPPPRQKKQAPKQPPKPKKPKTQEKKKKQPAKPKPGKRQ FT RMALKLEADRLFDVKNEDGDVIGHALAMEGKVMKPLHVKGTIDHPVLSKLKFTKSSAYD FT MEFAQLPVNMRSEAFTYTSEHPEGFYNWHHGAVQYSGGRFTIPRGVGGRGDSGRPIMDN FT SGRVVAIVLGGADEGTRTALSVVTWNSKGKTIKTTPEGTEEWSAAPLVTAMCLLGNVSF FT PCDRPPTCYTREPSRALDILEENVNHEAYDTLLNAILRCGSSGRSKRSVIDDFTLTSPY FT LGTCSYCHHTVPCFSPVKIEQVWDEADDNTIRIQTSAQFGYDQSGAASANKYRYMSLKQ FT DHTVKEGTMDDIKISTSGPCRRLSYKGYFLLAKCPPGDSVTVSIVSSNSATSCTLARKI FT KPKFVGREKYDLPPVHGKKIPCTVYDRLKETTAGYITMHRPRPHAYTSYLEESSGKVYA FT KPPSGKNITYECKCGDYKTGTVSTRTEITGCTAIKQCVAYKSDQTKWVFNSPDLIRHDD FT HTAQGKLHLPFKLIPSTCMVPVAHAPNVIHGFKHISLQLDTDHLTLLTTRRLGANPEPT FT TEWIVGKTVRNFTVDRDGLEYIWGNHEPVRVYAQESAPGDPHGWPHEIVQHYYHRHPVY FT TILAVASATVAMMIGVTVAVLCACKARRECLTPYALAPNAVIPTSLALLCCVRSANAET FT FTETMSYLWSNSQPFFWVQLCIPLAAFIVLMRCCSCCLPFLVVAGAYLAKVDAYEHATT FT VPNVPQIPYKALVERAGYAPLNLEITVMSSEVLPSTNQEYITCKFTTVVPSPKIKCCGS FT LECQPAAHADYTCKVFGGVYPFMWGGAQCFCDSENSQMSEAYVELSADCASDHAQAIKV FT HTAAMKVGLRIVYGNTTSFLDVYVNGVTPGTSKDLKVIAGPISASFTPFDHKVVIHRGL FT VYNYDFPEYGAMKPGAFGDIQATSLTSKDLIASTDIRLLKPSAKNVHVPYTQASSGFEM FT WKNNSGRPLQETAPFGCKIAVNPLRAVDCSYGNIPISIDIPNAAFIRTSDAPLVSTVKC FT EVSECTYSADFGGMATLQYVSDREGQCPVHSHSSTATLQESTVHVLEKGAVTVHFSTAS FT PQANFIVSLCGKKTTCNAECKPPADHIVSTPHKNDQEFQAAISKTSWSWLFALFGGASS FT LLIIGLMIFACSMMLTSTRR" FT mat_peptide 7647..8438 FT /product="capsid (c) protein" FT mat_peptide 8439..8630 FT /product="e-3 structural protein" FT mat_peptide 8631..9899 FT /product="e-2 structural protein" FT variation 8644 FT /note="a in hrsp; g in hrlp [2],[1],[9]" FT variation 8698 FT /note="t in hrsp; a in hrlp [2],[1],[9]" FT variation 9782 FT /note="t in hrsp; c in hrlp [2],[1],[9]" FT variation 9884 FT /note="t in hrsp; c in hrlp [2],[1],[9]" FT mat_peptide 9900..10064 FT /product="6k structural protein" FT mat_peptide 10065..11381 FT /product="e-1 structural protein" FT variation 11568 FT /note="t in hrsp; c in sin-1(16) [5]" FT variation 11697 FT /note="a in hrsp; g in sin-1(16) [5]" XX SQ Sequence 11703 BP; 3308 A; 3049 C; 2908 G; 2438 T; 0 other; attgacggcg tagtacacac tattgaatca aacagccgac caattgcact accatcacaa 60 tggagaagcc agtagtaaac gtagacgtag acccccagag tccgtttgtc gtgcaactgc 120 aaaaaagctt cccgcaattt gaggtagtag cacagcaggt cactccaaat gaccatgcta 180 atgccagagc attttcgcat ctggccagta aactaatcga gctggaggtt cctaccacag 240 cgacgatctt ggacataggc agcgcaccgg ctcgtagaat gttttccgag caccagtatc 300 attgtgtctg ccccatgcgt agtccagaag acccggaccg catgatgaaa tacgccagta 360 aactggcgga aaaagcgtgc aagattacaa acaagaactt gcatgagaag attaaggatc 420 tccggaccgt acttgatacg ccggatgctg aaacaccatc gctctgcttt cacaacgatg 480 ttacctgcaa catgcgtgcc gaatattccg tcatgcagga cgtgtatatc aacgctcccg 540 gaactatcta tcatcaggct atgaaaggcg tgcggaccct gtactggatt ggcttcgaca 600 ccacccagtt catgttctcg gctatggcag gttcgtaccc tgcgtacaac accaactggg 660 ccgacgagaa agtccttgaa gcgcgtaaca tcggactttg cagcacaaag ctgagtgaag 720 gtaggacagg aaaattgtcg ataatgagga agaaggagtt gaagcccggg tcgcgggttt 780 atttctccgt aggatcgaca ctttatccag aacacagagc cagcttgcag agctggcatc 840 ttccatcggt gttccacttg aatggaaagc agtcgtacac ttgccgctgt gatacagtgg 900 tgagttgcga aggctacgta gtgaagaaaa tcaccatcag tcccgggatc acgggagaaa 960 ccgtgggata cgcggttaca cacaatagcg agggcttctt gctatgcaaa gttactgaca 1020 cagtaaaagg agaacgggta tcgttccctg tgtgcacgta catcccggcc accatatgcg 1080 atcagatgac tggtataatg gccacggata tatcacctga cgatgcacaa aaacttctgg 1140 ttgggctcaa ccagcgaatt gtcattaacg gtaggactaa caggaacacc aacaccatgc 1200 aaaattacct tctgccgatc atagcacaag ggttcagcaa atgggctaag gagcgcaagg 1260 atgatcttga taacgagaaa atgctgggta ctagagaacg caagcttacg tatggctgct 1320 tgtgggcgtt tcgcactaag aaagtacatt cgttttatcg cccacctgga acgcagacct 1380 gcgtaaaagt cccagcctct tttagcgctt ttcccatgtc gtccgtatgg acgacctctt 1440 tgcccatgtc gctgaggcag aaattgaaac tggcattgca accaaagaag gaggaaaaac 1500 tgctgcaggt ctcggaggaa ttagtcatgg aggccaaggc tgcttttgag gatgctcagg 1560 aggaagccag agcggagaag ctccgagaag cacttccacc attagtggca gacaaaggca 1620 tcgaggcagc cgcagaagtt gtctgcgaag tggaggggct ccaggcggac atcggagcag 1680 cattagttga aaccccgcgc ggtcacgtaa ggataatacc tcaagcaaat gaccgtatga 1740 tcggacagta tatcgttgtc tcgccaaact ctgtgctgaa gaatgccaaa ctcgcaccag 1800 cgcacccgct agcagatcag gttaagatca taacacactc cggaagatca ggaaggtacg 1860 cggtcgaacc atacgacgct aaagtactga tgccagcagg aggtgccgta ccatggccag 1920 aattcctagc actgagtgag agcgccacgt tagtgtacaa cgaaagagag tttgtgaacc 1980 gcaaactata ccacattgcc atgcatggcc ccgccaagaa tacagaagag gagcagtaca 2040 aggttacaaa ggcagagctt gcagaaacag agtacgtgtt tgacgtggac aagaagcgtt 2100 gcgttaagaa ggaagaagcc tcaggtctgg tcctctcggg agaactgacc aaccctccct 2160 atcatgagct agctctggag ggactgaaga cccgacctgc ggtcccgtac aaggtcgaaa 2220 caataggagt gataggcaca ccggggtcgg gcaagtcagc tattatcaag tcaactgtca 2280 cggcacgaga tcttgttacc agcggaaaga aagaaaattg tcgcgaaatt gaggccgacg 2340 tgctaagact gaggggtatg cagattacgt cgaagacagt agattcggtt atgctcaacg 2400 gatgccacaa agccgtagaa gtgctgtacg ttgacgaagc gttcgcgtgc cacgcaggag 2460 cactacttgc cttgattgct atcgtcaggc cccgcaagaa ggtagtacta tgcggagacc 2520 ccatgcaatg cggattcttc aacatgatgc aactaaaggt acatttcaat caccctgaaa 2580 aagacatatg caccaagaca ttctacaagt atatctcccg gcgttgcaca cagccagtta 2640 cagctattgt atcgacactg cattacgatg gaaagatgaa aaccacgaac ccgtgcaaga 2700 agaacattga aatcgatatt acaggggcca caaagccgaa gccaggggat atcatcctga 2760 catgtttccg cgggtgggtt aagcaattgc aaatcgacta tcccggacat gaagtaatga 2820 cagccgcggc ctcacaaggg ctaaccagaa aaggagtgta tgccgtccgg caaaaagtca 2880 atgaaaaccc actgtacgcg atcacatcag agcatgtgaa cgtgttgctc acccgcactg 2940 aggacaggct agtgtggaaa accttgcagg gcgacccatg gattaagcag cccactaaca 3000 tacctaaagg aaactttcag gctactatag aggactggga agctgaacac aagggaataa 3060 ttgctgcaat aaacagcccc actccccgtg ccaatccgtt cagctgcaag accaacgttt 3120 gctgggcgaa agcattggaa ccgatactag ccacggccgg tatcgtactt accggttgcc 3180 agtggagcga actgttccca cagtttgcgg atgacaaacc acattcggcc atttacgcct 3240 tagacgtaat ttgcattaag tttttcggca tggacttgac aagcggactg ttttctaaac 3300 agagcatccc actaacgtac catcccgccg attcagcgag gccggtagct cattgggaca 3360 acagcccagg aacccgcaag tatgggtacg atcacgccat tgccgccgaa ctctcccgta 3420 gatttccggt gttccagcta gctgggaagg gcacacaact tgatttgcag acggggagaa 3480 ccagagttat ctctgcacag cataacctgg tcccggtgaa ccgcaatctt cctcacgcct 3540 tagtccccga gtacaaggag aagcaacccg gcccggtcaa aaaattcttg aaccagttca 3600 aacaccactc agtacttgtg gtatcagagg aaaaaattga agctccccgt aagagaatcg 3660 aatggatcgc cccgattggc atagccggtg cagataagaa ctacaacctg gctttcgggt 3720 ttccgccgca ggcacggtac gacctggtgt tcatcaacat tggaactaaa tacagaaacc 3780 accactttca gcagtgcgaa gaccatgcgg cgaccttaaa aaccctttcg cgttcggccc 3840 tgaattgcct taacccagga ggcaccctcg tggtgaagtc ctatggctac gccgaccgca 3900 acagtgagga cgtagtcacc gctcttgcca gaaagtttgt cagggtgtct gcagcgagac 3960 cagattgtgt ctcaagcaat acagaaatgt acctgatttt ccgacaacta gacaacagcc 4020 gtacacggca attcaccccg caccatctga attgcgtgat ttcgtccgtg tatgagggta 4080 caagagatgg agttggagcc gcgccgtcat accgcaccaa aagggagaat attgctgact 4140 gtcaagagga agcagttgtc aacgcagcca atccgctggg tagaccaggc gaaggagtct 4200 gccgtgccat ctataaacgt tggccgacca gttttaccga ttcagccacg gagacaggca 4260 ccgcaagaat gactgtgtgc ctaggaaaga aagtgatcca cgcggtcggc cctgatttcc 4320 ggaagcaccc agaagcagaa gccttgaaat tgctacaaaa cgcctaccat gcagtggcag 4380 acttagtaaa tgaacataac atcaagtctg tcgccattcc actgctatct acaggcattt 4440 acgcagccgg aaaagaccgc cttgaagtat cacttaactg cttgacaacc gcgctagaca 4500 gaactgacgc ggacgtaacc atctattgcc tggataagaa gtggaaggaa agaatcgacg 4560 cggcactcca acttaaggag tctgtaacag agctgaagga tgaagatatg gagatcgacg 4620 atgagttagt atggatccat ccagacagtt gcttgaaggg aagaaaggga ttcagtacta 4680 caaaaggaaa attgtattcg tacttcgaag gcaccaaatt ccatcaagca gcaaaagaca 4740 tggcggagat aaaggtcctg ttccctaatg accaggaaag taatgaacaa ctgtgtgcct 4800 acatattggg tgagaccatg gaagcaatcc gcgaaaagtg cccggtcgac cataacccgt 4860 cgtctagccc gcccaaaacg ttgccgtgcc tttgcatgta tgccatgacg ccagaaaggg 4920 tccacagact tagaagcaat aacgtcaaag aagttacagt atgctcctcc accccccttc 4980 ctaagcacaa aattaagaat gttcagaagg ttcagtgcac gaaagtagtc ctgtttaatc 5040 cgcacactcc cgcattcgtt cccgcccgta agtacataga agtgccagaa cagcctaccg 5100 ctcctcctgc acaggccgag gaggcccccg aagttgtagc gacaccgtca ccatctacag 5160 ctgataacac ctcgcttgat gtcacagaca tctcactgga tatggatgac agtagcgaag 5220 gctcactttt ttcgagcttt agcggatcgg acaactctat tactagtatg gacagttggt 5280 cgtcaggacc tagttcacta gagatagtag accgaaggca ggtggtggtg gctgacgttc 5340 atgccgtcca agagcctgcc cctattccac cgccaaggct aaagaagatg gcccgcctgg 5400 cagcggcaag aaaagagccc actccaccgg caagcaatag ctctgagtcc ctccacctct 5460 cttttggtgg ggtatccatg tccctcggat caattttcga cggagagacg gcccgccagg 5520 cagcggtaca acccctggca acaggcccca cggatgtgcc tatgtctttc ggatcgtttt 5580 ccgacggaga gattgatgag ctgagccgca gagtaactga gtccgaaccc gtcctgtttg 5640 gatcatttga accgggcgaa gtgaactcaa ttatatcgtc ccgatcagcc gtatcttttc 5700 cactacgcaa gcagagacgt agacgcagga gcaggaggac tgaatactga ctaaccgggg 5760 taggtgggta catattttcg acggacacag gccctgggca cttgcaaaag aagtccgttc 5820 tgcagaacca gcttacagaa ccgaccttgg agcgcaatgt cctggaaaga attcatgccc 5880 cggtgctcga cacgtcgaaa gaggaacaac tcaaactcag gtaccagatg atgcccaccg 5940 aagccaacaa aagtaggtac cagtctcgta aagtagaaaa tcagaaagcc ataaccactg 6000 agcgactact gtcaggacta cgactgtata actctgccac agatcagcca gaatgctata 6060 agatcaccta tccgaaacca ttgtactcca gtagcgtacc ggcgaactac tccgatccac 6120 agttcgctgt agctgtctgt aacaactatc tgcatgagaa ctatccgaca gtagcatctt 6180 atcagattac tgacgagtac gatgcttact tggatatggt agacgggaca gtcgcctgcc 6240 tggatactgc aaccttctgc cccgctaagc ttagaagtta cccgaaaaaa catgagtata 6300 gagccccgaa tatccgcagt gcggttccat cagcgatgca gaacacgcta caaaatgtgc 6360 tcattgccgc aactaaaaga aattgcaacg tcacgcagat gcgtgaactg ccaacactgg 6420 actcagcgac attcaatgtc gaatgctttc gaaaatatgc atgtaatgac gagtattggg 6480 aggagttcgc tcggaagcca attaggatta ccactgagtt tgtcaccgca tatgtagcta 6540 gactgaaagg ccctaaggcc gccgcactat ttgcaaagac gtataatttg gtcccattgc 6600 aagaagtgcc tatggataga ttcgtcatgg acatgaaaag agacgtgaaa gttacaccag 6660 gcacgaaaca cacagaagaa agaccgaaag tacaagtgat acaagccgca gaacccctgg 6720 cgactgctta cttatgcggg attcaccggg aattagtgcg taggcttacg gccgtcttgc 6780 ttccaaacat tcacacgctt tttgacatgt cggcggagga ttttgatgca atcatagcag 6840 aacacttcaa gcaaggcgac ccggtactgg agacggatat cgcatcattc gacaaaagcc 6900 aagacgacgc tatggcgtta accggtctga tgatcttgga ggacctgggt gtggatcaac 6960 cactactcga cttgatcgag tgcgcctttg gagaaatatc atccacccat ctacctacgg 7020 gtactcgttt taaattcggg gcgatgatga aatccggaat gttcctcaca ctttttgtca 7080 acacagtttt gaatgtcgtt atcgccagca gagtactaga agagcggctt aaaacgtcca 7140 gatgtgcagc gttcattggc gacgacaaca tcatacatgg agtagtatct gacaaagaaa 7200 tggctgagag gtgcgccacc tggctcaaca tggaggttaa gatcatcgac gcagtcatcg 7260 gtgagagacc accttacttc tgcggcggat ttatcttgca agattcggtt acttccacag 7320 cgtgccgcgt ggcggatccc ctgaaaaggc tgtttaagtt gggtaaaccg ctcccagccg 7380 acgacgagca agacgaagac agaagacgcg ctctgctaga tgaaacaaag gcgtggttta 7440 gagtaggtat aacaggcact ttagcagtgg ccgtgacgac ccggtatgag gtagacaata 7500 ttacacctgt cctactggca ttgagaactt ttgcccagag caaaagagca ttccaagcca 7560 tcagagggga aataaagcat ctctacggtg gtcctaaata gtcagcatag tacatttcat 7620 ctgactaata ctacaacacc accaccatga atagaggatt ctttaacatg ctcggccgcc 7680 gccccttccc ggcccccact gccatgtgga ggccgcggag aaggaggcag gcggccccga 7740 tgcctgcccg caacgggctg gcttctcaaa tccagcaact gaccacagcc gtcagtgccc 7800 tagtcattgg acaggcaact agacctcaac ccccacgtcc acgcccgcca ccgcgccaga 7860 agaagcaggc gcccaagcaa ccaccgaagc cgaagaaacc aaaaacgcag gagaagaaga 7920 agaagcaacc tgcaaaaccc aaacccggaa agagacagcg catggcactt aagttggagg 7980 ccgacagatt gttcgacgtc aagaacgagg acggagatgt catcgggcac gcactggcca 8040 tggaaggaaa ggtaatgaaa cctctgcacg tgaaaggaac catcgaccac cctgtgctat 8100 caaagctcaa atttaccaag tcgtcagcat acgacatgga gttcgcacag ttgccagtca 8160 acatgagaag tgaggcattc acctacacca gtgaacaccc cgaaggattc tataactggc 8220 accacggagc ggtgcagtat agtggaggta gatttaccat ccctcgcgga gtaggaggca 8280 gaggagacag cggtcgtccg atcatggata actccggtcg ggttgtcgcg atagtcctcg 8340 gtggcgctga tgaaggaaca cgaactgccc tttcggtcgt cacctggaat agtaaaggga 8400 agacaattaa gacgaccccg gaagggacag aagagtggtc cgcagcacca ctggtcacgg 8460 caatgtgttt gctcggaaat gtgagcttcc catgcgaccg cccgcccaca tgctataccc 8520 gcgaaccttc cagagccctc gacatccttg aagagaacgt gaaccatgag gcctacgata 8580 ccctgctcaa tgccatattg cggtgcggat cgtctggcag aagcaaaaga agcgtcattg 8640 acgactttac cctgaccagc ccctacttgg gcacatgctc gtactgccac catactgtac 8700 cgtgcttcag ccctgttaag atcgagcagg tctgggacga agcggacgat aacaccatac 8760 gcatacagac ttccgcccag tttggatacg accaaagcgg agcagcaagc gcaaacaagt 8820 accgctacat gtcgcttaag caggatcaca ccgttaaaga aggcaccatg gatgacatca 8880 agattagcac ctcaggaccg tgtagaaggc ttagctacaa aggatacttt ctcctcgcaa 8940 aatgccctcc aggggacagc gtaacggtta gcatagtgag tagcaactca gcaacgtcat 9000 gtacactggc ccgcaagata aaaccaaaat tcgtgggacg ggaaaaatat gatctacctc 9060 ccgttcacgg taaaaaaatt ccttgcacag tgtacgaccg tctgaaagaa acaactgcag 9120 gctacatcac tatgcacagg ccgagaccgc acgcttatac atcctacctg gaagaatcat 9180 cagggaaagt ttacgcaaag ccgccatctg ggaagaacat tacgtatgag tgcaagtgcg 9240 gcgactacaa gaccggaacc gtttcgaccc gcaccgaaat cactggttgc accgccatca 9300 agcagtgcgt cgcctataag agcgaccaaa cgaagtgggt cttcaactca ccggacttga 9360 tcagacatga cgaccacacg gcccaaggga aattgcattt gcctttcaag ttgatcccga 9420 gtacctgcat ggtccctgtt gcccacgcgc cgaatgtaat acatggcttt aaacacatca 9480 gcctccaatt agatacagac cacttgacat tgctcaccac caggagacta ggggcaaacc 9540 cggaaccaac cactgaatgg atcgtcggaa agacggtcag aaacttcacc gtcgaccgag 9600 atggcctgga atacatatgg ggaaatcatg agccagtgag ggtctatgcc caagagtcag 9660 caccaggaga ccctcacgga tggccacacg aaatagtaca gcattactac catcgccatc 9720 ctgtgtacac catcttagcc gtcgcatcag ctaccgtggc gatgatgatt ggcgtaactg 9780 ttgcagtgtt atgtgcctgt aaagcgcgcc gtgagtgcct gacgccatac gccctggccc 9840 caaacgccgt aatcccaact tcgctggcac tcttgtgctg cgttaggtcg gccaatgctg 9900 aaacgttcac cgagaccatg agttacttgt ggtcgaacag tcagccgttc ttctgggtcc 9960 agttgtgcat acctttggcc gctttcatcg ttctaatgcg ctgctgctcc tgctgcctgc 10020 cttttttagt ggttgccggc gcctacctgg cgaaggtaga cgcctacgaa catgcgacca 10080 ctgttccaaa tgtgccacag ataccgtata aggcacttgt tgaaagggca gggtatgccc 10140 cgctcaattt ggagatcact gtcatgtcct cggaggtttt gccttccacc aaccaagagt 10200 acattacctg caaattcacc actgtggtcc cctccccaaa aatcaaatgc tgcggctcct 10260 tggaatgtca gccggccgct catgcagact atacctgcaa ggtcttcgga ggggtctacc 10320 cctttatgtg gggaggagcg caatgttttt gcgacagtga gaacagccag atgagtgagg 10380 cgtacgtcga attgtcagca gattgcgcgt ctgaccacgc gcaggcgatt aaggtgcaca 10440 ctgccgcgat gaaagtagga ctgcgtattg tgtacgggaa cactaccagt ttcctagatg 10500 tgtacgtgaa cggagtcaca ccaggaacgt ctaaagactt gaaagtcata gctggaccaa 10560 tttcagcatc gtttacgcca ttcgatcata aggtcgttat ccatcgcggc ctggtgtaca 10620 actatgactt cccggaatat ggagcgatga aaccaggagc gtttggagac attcaagcta 10680 cctccttgac tagcaaggat ctcatcgcca gcacagacat taggctactc aagccttccg 10740 ccaagaacgt gcatgtcccg tacacgcagg cctcatcagg atttgagatg tggaaaaaca 10800 actcaggccg cccactgcag gaaaccgcac ctttcgggtg taagattgca gtaaatccgc 10860 tccgagcggt ggactgttca tacgggaaca ttcccatttc tattgacatc ccgaacgctg 10920 cctttatcag gacatcagat gcaccactgg tctcaacagt caaatgtgaa gtcagtgagt 10980 gcacttattc agcagacttc ggcgggatgg ccaccctgca gtatgtatcc gaccgcgaag 11040 gtcaatgccc cgtacattcg cattcgagca cagcaactct ccaagagtcg acagtacatg 11100 tcctggagaa aggagcggtg acagtacact ttagcaccgc gagtccacag gcgaacttta 11160 tcgtatcgct gtgtgggaag aagacaacat gcaatgcaga atgtaaacca ccagctgacc 11220 atatcgtgag caccccgcac aaaaatgacc aagaatttca agccgccatc tcaaaaacat 11280 catggagttg gctgtttgcc cttttcggcg gcgcctcgtc gctattaatt ataggactta 11340 tgatttttgc ttgcagcatg atgctgacta gcacacgaag atgaccgcta cgccccaatg 11400 atccgaccag caaaactcga tgtacttccg aggaactgat gtgcataatg catcaggctg 11460 gtacattaga tccccgctta ccgcgggcaa tatagcaaca ctaaaaactc gatgtacttc 11520 cgaggaagcg cagtgcataa tgctgcgcag tgttgccaca taaccactat attaaccatt 11580 tatctagcgg acgccaaaaa ctcaatgtat ttctgaggaa gcgtggtgca taatgccacg 11640 cagcgtctgc ataactttta ttatttcttt tattaatcaa caaaattttg tttttaacat 11700 ttc 11703 //