ID J02147; SV 1; linear; viral cRNA; STD; VRL; 1565 BP. XX AC J02147; XX DT 13-JAN-1992 (Rel. 30, Created) DT 16-JUL-2016 (Rel. 129, Last updated, Version 4) XX DE Influenza A virus (A/Puerto Rico/8/1934(H1N1)) segment 5, complete DE sequence. XX KW complete genome; nucleoprotein. XX OS Influenza A virus (A/Puerto Rico/8/1934(H1N1)) OC Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Insthoviricetes; OC Articulavirales; Orthomyxoviridae; Alphainfluenzavirus. XX RN [1] RP 1-1517 RX DOI; 10.1111/j.1432-1033.1981.tb05341.x. RX PUBMED; 6166474. RA Van Rompuy L., Min Jou W., Huylebroeck D., Devos R., Fiers W.; RT "Complete nucleotide sequence of the nucleoprotein gene from the human RT influenza strain A/PR/8/34 (HON1)"; RL Eur. J. Biochem. 116(2):347-353(1981). XX RN [2] RP 1-1565 RX DOI; 10.1016/0042-6822(81)90223-3. RX PUBMED; 7292985. RA Winter G., Fields S.; RT "The structure of the gene encoding the nucleoprotein of human influenza RT virus A/PR/8/34"; RL Virology 114(2):423-428(1981). XX RN [3] RP 1-1565 RA van Rompuy L., Min J.W., Huylebroeck D., Devos R., Fiers W.; RT "Complete nucleotide sequence of the nucleoprotein gene from the human RT influenza strain A/PR/8/34 (HON1), Correction"; RL Eur. J. Biochem. 116:645-645(1982). XX DR MD5; 43423d8010258a2583e6af862f174a81. XX CC assignment of coding region by consideration of open reading frames CC and by comparison of predicted mw to nucleoprotein mw. [Eur. J. CC Biochem. 116, 645-645 (1981)] is a CC major revision of [1], so the sequence shown below reflects only CC [2],[Eur. J. Biochem. 116, 645-645 (1981)]. [1] compared with CC grantham's data. CC Complete source information: CC influenza [2]: A/Puerto Rico/8/34 cdna to rna from human; [1],[Eur. CC J. Biochem. 116, 645-645 (1981)]: CC x31 (a laboratory recombinant containing the A/Puerto Rico/8/34 CC nucleoprotein segment) cdna to rna, originally from human. XX FH Key Location/Qualifiers FH FT source 1..1565 FT /organism="Influenza A virus (A/Puerto Rico/8/1934(H1N1))" FT /segment="5" FT /strain="A/Puerto Rico/8/1934(H1N1)" FT /mol_type="viral cRNA" FT /db_xref="taxon:211044" FT CDS 46..1542 FT /codon_start=1 FT /product="nucleoprotein" FT /db_xref="GOA:P03466" FT /db_xref="InterPro:IPR002141" FT /db_xref="PDB:2BST" FT /db_xref="PDB:2WFS" FT /db_xref="PDB:4NQV" FT /db_xref="PDB:4ZDU" FT /db_xref="PDB:5NPZ" FT /db_xref="PDB:5NQ3" FT /db_xref="PDB:5V5O" FT /db_xref="UniProtKB/Swiss-Prot:P03466" FT /protein_id="AAA43467.1" FT /translation="MASQGTKRSYEQMETDGERQNATEIRASVGKMIGGIGRFYIQMCT FT ELKLSDYEGRLIQNSLTIERMVLSAFDERRNKYLEEHPSAGKDPKKTGGPIYRRVNGKW FT MRELILYDKEEIRRIWRQANNGDDATAGLTHMMIWHSNLNDATYQRTRALVRTGMDPRM FT CSLMQGSTLPRRSGAAGAAVKGVGTMVMELVRMIKRGINDRNFWRGENGRKTRIAYERM FT CNILKGKFQTAAQKAMMDQVRESRDPGNAEFEDLTFLARSALILRGSVAHKSCLPACVY FT GPAVASGYDFEREGYSLVGIDPFRLLQNSQVYSLIRPNENPAHKSQLVWMACHSAAFED FT LRVLSFIKGTKVVPRGKLSTRGVQIASNENMETMESSTLELRSRYWAIRTRSGGNTNQQ FT RASAGQISIQPTFSVQRNLPFDRTTVMAAFTGNTEGRTSDMRTEIIRMMESARPEDVSF FT QGRGVFELSDEKAASPIVPSFDMSNEGSYFFGDNAEEYDN" FT unsure 589 FT /note="g in 2 clones, a in 1 clone" XX SQ Sequence 1565 BP; 504 A; 314 C; 412 G; 335 T; 0 other; agcaaaagca gggtagataa tcactcactg agtgacatca aaatcatggc gtcccaaggc 60 accaaacggt cttacgaaca gatggagact gatggagaac gccagaatgc cactgaaatc 120 agagcatccg tcggaaaaat gattggtgga attggacgat tctacatcca aatgtgcaca 180 gaacttaaac tcagtgatta tgagggacgg ttgatccaaa acagcttaac aatagagaga 240 atggtgctct ctgcttttga cgaaaggaga aataaatacc tggaagaaca tcccagtgcg 300 gggaaggatc ctaagaaaac tggaggacct atatacagaa gagtaaacgg aaagtggatg 360 agagaactca tcctttatga caaagaagaa ataaggcgaa tctggcgcca agctaataat 420 ggtgacgatg caacggctgg tctgactcac atgatgatct ggcattccaa tttgaatgat 480 gcaacttatc agaggacaag ggctcttgtt cgcaccggaa tggatcccag gatgtgctct 540 ctgatgcaag gttcaactct ccctaggagg tctggagccg caggtgctgc agtcaaagga 600 gttggaacaa tggtgatgga attggtcagg atgatcaaac gtgggatcaa tgatcggaac 660 ttctggaggg gtgagaatgg acgaaaaaca agaattgctt atgaaagaat gtgcaacatt 720 ctcaaaggga aatttcaaac tgctgcacaa aaagcaatga tggatcaagt gagagagagc 780 cgggacccag ggaatgctga gttcgaagat ctcacttttc tagcacggtc tgcactcata 840 ttgagagggt cggttgctca caagtcctgc ctgcctgcct gtgtgtatgg acctgccgta 900 gccagtgggt acgactttga aagagaggga tactctctag tcggaataga ccctttcaga 960 ctgcttcaaa acagccaagt gtacagccta atcagaccaa atgagaatcc agcacacaag 1020 agtcaactgg tgtggatggc atgccattct gccgcatttg aagatctaag agtattgagc 1080 ttcatcaaag ggacgaaggt ggtcccaaga gggaagcttt ccactagagg agttcaaatt 1140 gcttccaatg aaaatatgga gactatggaa tcaagtacac ttgaactgag aagcaggtac 1200 tgggccataa ggaccagaag tggaggaaac accaatcaac agagggcatc tgcgggccaa 1260 atcagcatac aacctacgtt ctcagtacag agaaatctcc cttttgacag aacaaccgtt 1320 atggcagcat tcactgggaa tacagagggg agaacatctg acatgaggac cgaaatcata 1380 aggatgatgg aaagtgcaag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1440 ctctcggacg aaaaggcagc gagcccgatc gtgccttcct ttgacatgag taatgaagga 1500 tcttatttct tcggagacaa tgcagaggag tacgacaatt aaagaaaaat acccttgttt 1560 ctact 1565 //