ID   M14627; SV 1; linear; genomic RNA; STD; VRL; 3616 BP.
XX
AC   M14627;
XX
DT   19-SEP-1987 (Rel. 13, Created)
DT   14-NOV-2006 (Rel. 89, Last updated, Version 4)
XX
DE   Hantaan virus, complete M RNA segment coding for G1 and G2 proteins,
DE   complete cds.
XX
KW   polyprotein; glycoprotein; envelope protein.
XX
OS   Orthohantavirus hantanense
OC   Viruses; Riboviria; Orthornavirae; Negarnaviricota; Polyploviricotina;
OC   Ellioviricetes; Bunyavirales; Hantaviridae; Mammantavirinae;
OC   Orthohantavirus.
XX
RN   [1]
RP   1-3616
RX   DOI; 10.1016/0042-6822(87)90310-2.
RX   PUBMED; 3103329.
RA   Schmaljohn C.S., Schmaljohn A.L., Dalrymple J.M.;
RT   "Hantaan virus M RNA: coding strategy, nucleotide sequence, and gene
RT   order";
RL   Virology 157(1):31-39(1987).
XX
DR   MD5; 1e86f6a6b39c26ecc725cb635ee12a29.
DR   EuropePMC; PMC109349; 9420200.
DR   EuropePMC; PMC112616; 10364307.
DR   EuropePMC; PMC1151903; 15956394.
DR   EuropePMC; PMC140611; 12477889.
DR   EuropePMC; PMC1578555; 16953877.
DR   EuropePMC; PMC1850951; 17124003.
DR   EuropePMC; PMC2786626; 19828747.
DR   EuropePMC; PMC2958020; 20113591.
DR   EuropePMC; PMC3165796; 21715500.
DR   EuropePMC; PMC3838119; 24067954.
DR   EuropePMC; PMC4113790; 25029493.
DR   EuropePMC; PMC4178757; 25100845.
DR   EuropePMC; PMC4398386; 25874643.
DR   EuropePMC; PMC4912082; 27315053.
DR   EuropePMC; PMC5303042; 27895275.
DR   EuropePMC; PMC87178; 10921972.
DR   EuropePMC; PMC88127; 11376073.
XX
CC   Draft entry and computer-readable sequence of [1] kindly provided
CC   by C.Schmaljohn, 02-MAR-1987.
CC   There are two possible initiation codons at nucleotide positions
CC   41-43 or 65-67, but 'the first codon has more favorable flanking
CC   sequences for initiation of protein synthesis'.
CC   It was not possible to precisely define the carboxy terminus of G1
CC   and G2 mature glycoproteins, but it was established that the
CC   carboxy terminus of G1 and G2 were between positions 1802-1882 and
CC   3419-3445 respectively.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3616
FT                   /organism="Orthohantavirus hantanense"
FT                   /mol_type="genomic RNA"
FT                   /db_xref="taxon:3052480"
FT   CDS             41..3448
FT                   /codon_start=1
FT                   /note="precursor structural polyprotein"
FT                   /db_xref="GOA:P08668"
FT                   /db_xref="InterPro:IPR002532"
FT                   /db_xref="InterPro:IPR002534"
FT                   /db_xref="InterPro:IPR012316"
FT                   /db_xref="InterPro:IPR016402"
FT                   /db_xref="PDB:5LJX"
FT                   /db_xref="PDB:5LJY"
FT                   /db_xref="PDB:5LJZ"
FT                   /db_xref="PDB:5LK0"
FT                   /db_xref="PDB:5LK1"
FT                   /db_xref="PDB:5LK2"
FT                   /db_xref="PDB:5LK3"
FT                   /db_xref="UniProtKB/Swiss-Prot:P08668"
FT                   /func_characterised="identical sequence"
FT                   /protein_id="AAA43836.1"
FT                   /translation="MGIWKWLVMASLVWPVLTLRNVYDMKIECPHTVSFGENSVIGYVE
FT                   LPPVPLADTAQMVPESSCNMDNHQSLNTITKYTQVSWRGKADQSQSSQNSFETVSTEVD
FT                   LKGTCVLKHKMVEESYRSRKSVTCYDLSCNSTYCKPTLYMIVPIHACNMMKSCLIALGP
FT                   YRVQVVYERSYCMTGVLIEGKCFVPDQSVVSIIKHGIFDIASVHIVCFFVAVKGNTYKI
FT                   FEQVKKSFESTCNDTENKVQGYYICIVGGNSAPIYVPTLDDFRSMEAFTGIFRSPHGED
FT                   HDLAGEEIASYSIVGPANAKVPHSASSDTLSLIAYSGIPSYSSLSILTSSTEAKHVFSP
FT                   GLFPKLNHTNCDKSAIPLIWTGMIDLPGYYEAVHPCTVFCVLSGPGASCEAFSEGGIFN
FT                   ITSPMCLVSKQNRFRLTEQQVNFVCQRVDMDIVVYCNGQRKVILTKTLVIGQCIYTITS
FT                   LFSLLPGVAHSIAVELCVPGFHGWATAALLVTFCFGWVLIPAITFIILTVLKFIANIFH
FT                   TSNQENRLKSVLRKIKEEFEKTKGSMVCDVCKYECETYKELKAHGVSCPQSQCPYCFTH
FT                   CEPTEAAFQAHYKVCQVTHRFRDDLKKTVTPQNFTPGCYRTLNLFRYKSRCYIFTMWIF
FT                   LLVLESILWAASASETPLTPVWNDNAHGVGSVPMHTDLELDFSLTSSSKYTYRRKLTNP
FT                   LEEAQSIDLHIEIEEQTIGVDVHALGHWFDGRLNLKTSFHCYGACTKYEYPWHTAKCHY
FT                   ERDYQYETSWGCNPSDCPGVGTGCTACGLYLDQLKPVGSAYKIITIRYSRRVCVQFGEE
FT                   NLCKIIDMNDCFVSRHVKVCIIGTVSKFSQGDTLLFFGPLEGGGLIFKHWCTSTCQFGD
FT                   PGDIMSPRDKGFLCPEFPGSFRKKCNFATTPICEYDGNMVSGYKKVMATIDSFQSFNTS
FT                   TMHFTDERIEWKDPDGMLRDHINILVTKDIDFDNLGENPCKIGLQTSSIEGAWGSGVGF
FT                   TLTCLVSLTECPTFLTSIKACDKAICYGAESVTLTRGQNTVKVSGKGGHSGSTFRCCHG
FT                   EDCSQIGLHAAAPHLDKVNGISEIENSKVYDDGAPQCGIKCWFVKSGEWISGIFSGNWI
FT                   VLIVLCVFLLFSLVLLSILCPVRKHKKS"
FT   sig_peptide     41..94
FT                   /note="precursor structural polyprotein, signal peptide"
FT   mat_peptide     95..1801
FT                   /note="envelope glycoprotein G1 (see comment)"
FT                   /partial
FT   mat_peptide     1985..3418
FT                   /note="envelope glycoprotein G2 (see comment)"
FT                   /partial
XX
SQ   Sequence 3616 BP; 1113 A; 648 C; 773 G; 1082 T; 0 other;
     tagtagtaga caccgcaaaa gaaagcagtc aatcagcaac atggggatat ggaagtggct        60
     agtgatggcc agtttagtat ggcctgtttt gacactgaga aatgtctatg acatgaaaat       120
     tgagtgcccc catacagtaa gttttgggga aaacagtgtg ataggttatg tagaattacc       180
     ccccgtgcca ttggccgaca cagcacagat ggtgcctgag agttcttgta acatggataa       240
     tcaccaatcg ttgaatacaa taacaaaata tacccaagta agttggagag gaaaggctga       300
     tcagtcacag tctagtcaaa attcatttga gacagtgtcc actgaagttg acttgaaagg       360
     aacatgtgtt ctaaaacaca aaatggtgga agaatcatac cgtagtagga aatcagtaac       420
     ctgttacgac ctgtcttgca atagcactta ctgcaagcca acactataca tgattgtacc       480
     aattcatgca tgcaatatga tgaaaagctg tttgattgca ttgggaccat acagagtaca       540
     ggtggtttat gagagaagtt actgtatgac aggagtcctg attgaaggga aatgctttgt       600
     cccagatcaa agtgtggtca gtattatcaa gcatgggatc tttgatattg caagtgttca       660
     tattgtatgt ttctttgttg cagttaaagg gaatacttat aaaatttttg aacaggttaa       720
     gaaatccttt gaatcaacat gcaatgatac agagaataaa gtgcaaggat attatatttg       780
     tattgtaggg ggaaactctg caccaatata tgttccaaca cttgatgatt tcagatccat       840
     ggaagcattt acaggaatct tcagatcacc acatggggaa gatcatgatc tggctggaga       900
     agaaattgca tcttattcta tagtcggacc tgccaatgca aaagttcctc atagtgctag       960
     ctcagataca ttgagcttga ttgcctattc aggtatacca tcttattctt cccttagcat      1020
     cctaacaagt tcaacagaag ctaagcatgt attcagccct gggttgttcc caaaacttaa      1080
     tcacacaaat tgtgataaaa gtgccatacc actcatatgg actgggatga ttgatttacc      1140
     tggatactac gaagctgtcc acccttgtac agttttttgc gtattatcag gtcctggggc      1200
     atcatgtgaa gccttttctg aaggcgggat tttcaacata acctctccca tgtgcttagt      1260
     gtcaaaacaa aatcgattcc ggttaacaga acagcaagtg aattttgtgt gtcagcgagt      1320
     ggacatggac attgttgtgt actgcaacgg gcagaggaaa gtaatattaa caaaaactct      1380
     agttattgga cagtgtatat atactataac aagcttattc tcattactac ctggagtagc      1440
     acattctatt gctgttgaat tgtgtgtacc tgggttccat ggttgggcca cagctgctct      1500
     gcttgttaca ttctgtttcg gatgggttct tataccagca attacattta tcatactaac      1560
     agtcctaaag ttcattgcta atatttttca cacaagtaat caagagaata ggctaaaatc      1620
     agtacttaga aagataaagg aagagtttga aaaaacaaaa ggctcaatgg tatgtgatgt      1680
     ctgcaagtat gagtgtgaaa cctataaaga attaaaggca cacggggtat catgccccca      1740
     atctcaatgt ccttactgtt ttactcattg tgaacccaca gaagcagcat tccaagctca      1800
     ttacaaggta tgccaagtta ctcacagatt cagggatgat ctaaagaaaa ctgttactcc      1860
     tcaaaatttt acaccaggat gttaccggac actaaattta tttagataca aaagcaggtg      1920
     ctacatcttt acaatgtgga tatttcttct tgtcttagaa tccatactgt gggctgcaag      1980
     tgcatcagag acaccattaa ctcctgtctg gaatgacaat gcccatgggg taggttctgt      2040
     tcctatgcat acagatttag agcttgattt ctctttaaca tccagttcca agtatacata      2100
     ccgtaggaag ttaacaaacc cacttgagga agcacaatcc attgacctac atattgaaat      2160
     agaagaacag acaattggtg ttgatgtgca tgctctagga cactggtttg atggtcgtct      2220
     taaccttaaa acatcctttc actgttatgg tgcttgtaca aagtatgaat acccttggca      2280
     tactgcaaag tgccattatg aaagagatta ccaatatgag acgagctggg gttgtaatcc      2340
     atcagattgt cctggggtgg gcacaggctg tacagcatgt ggtttatacc tagatcaact      2400
     gaaaccagtt ggtagtgctt ataaaattat cacaataagg tacagcagga gagtctgtgt      2460
     tcagtttggg gaggaaaacc tttgtaagat aatagacatg aatgattgtt ttgtatctag      2520
     gcatgttaag gtctgcataa ttggtacagt atctaaattc tctcagggtg ataccttatt      2580
     gttttttgga ccgcttgaag gtggtggtct aatatttaaa cactggtgta catccacatg      2640
     tcaatttggt gacccaggag atatcatgag tccaagagac aaaggttttt tatgccctga      2700
     gtttccaggt agtttcagga agaaatgcaa ctttgctact acccctattt gtgagtatga      2760
     tggaaatatg gtctcaggtt acaagaaagt gatggcgaca attgattcct tccaatcttt      2820
     taatacaagc actatgcact tcactgatga aaggatagag tggaaagacc ctgatggaat      2880
     gctaagggac catataaaca ttttagtaac gaaggacatt gactttgata accttggtga      2940
     aaatccttgc aaaattggcc tacaaacatc ttctattgag ggggcctggg gttctggtgt      3000
     ggggttcaca ttaacatgtc tggtatcact aacagaatgt cctacctttt tgacctcaat      3060
     aaaggcttgt gataaggcta tctgttatgg tgcagagagt gtaacattga caagaggaca      3120
     aaatacagtc aaggtatcag ggaaaggtgg ccatagtggt tcaacattta ggtgttgcca      3180
     tggggaggac tgttcacaaa ttggactcca tgctgctgca cctcaccttg acaaggtaaa      3240
     tgggatttct gagatagaaa atagtaaagt atatgatgat ggggcaccgc aatgtgggat      3300
     aaaatgttgg tttgttaaat caggggaatg gatttcaggg atattcagtg gtaattggat      3360
     tgtactcatt gtcctctgtg tatttctatt gttctccttg gttttactaa gcattctctg      3420
     tcccgtaagg aagcataaaa aatcatagct aaattctgtg actatcctgt tcttatgtat      3480
     agctttaaca tatatactaa tttttatatt ccagtatact ctatctaaca cactaaaaaa      3540
     aatagtagct ttctaaccac aaaacttaga ttcttcttct gtatgatgtc ttaacatctt      3600
     gcggtgtcta ctacta                                                      3616
//