ID   V00642; SV 1; linear; mRNA; STD; PHG; 3569 BP.
XX
AC   V00642; J02467;
XX
DT   09-JUN-1982 (Rel. 01, Created)
DT   21-OCT-1996 (Rel. 49, Last updated, Version 24)
XX
DE   phage MS2 genome.
XX
KW   coat protein; genome.
XX
OS   Enterobacteria phage MS2
OC   Viruses; Riboviria; Leviviridae; Levivirus.
XX
RN   [1]
RP   1-125
RX   DOI; 10.1111/j.1432-1033.1971.tb01558.x.
RX   PUBMED; 5125360.
RA   De Wachter R., Merregaert J., den Berghe A., Contreras R.R., Fiers W.;
RT   "studies on the bacteriophage ms2: the untranslated 5'-terminal nucleotide
RT   sequence preceding the first cistron";
RL   Eur. J. Biochem. 22(3):400-414(1971).
XX
RN   [2]
RP   1308-1781
RX   DOI; 10.1038/237082a0.
RX   PUBMED; 4555447.
RA   Jou W.M., Haegeman G., Ysebaert M., Fiers W.;
RT   "Nucleotide sequence of the gene coding for the bacteriophage MS2 coat
RT   protein.";
RL   Nature 237(5350):82-88(1972).
XX
RN   [3]
RX   DOI; 10.1016/0014-5793(72)80386-7.
RX   PUBMED; 11946702.
RA   Contreras R.R., den Berghe A., Volckaert G., Jou W.M., Fiers W.;
RT   "Studies on the bacteriophage MS2. Some nucleotide sequences from the
RT   RNA-polymerase gene";
RL   FEBS Lett. 24(3):339-342(1972).
XX
RN   [4]
RP   3209-3569
RX   DOI; 10.1073/pnas.72.7.2559.
RX   PUBMED; 809766.
RA   den Berghe A., Jou W.M., Fiers W.;
RT   "3'-terminal nucleotide sequence (n=361) of bacteriophage ms2 rna";
RL   Proc. Natl. Acad. Sci. U.S.A. 72(7):2559-2562(1975).
XX
RN   [5]
RP   130-1355
RX   DOI; 10.1038/256273a0.
RX   PUBMED; 806810.
RA   Fiers W., Contreras R.R., Duerinck F., Haegeman G., Merregaert J.,
RA   Jou W.M., Raeymaekers A., Volckaert G., Ysebaert M., Kerckhove J., Nolf F.,
RA   Van Montagu M.M.;
RT   "A-protein gene of bacteriophage MS2";
RL   Nature 256(5515):273-278(1975).
XX
RN   [6]
RP   1761-3398
RX   DOI; 10.1038/260500a0.
RX   PUBMED; 1264203.
RA   Fiers W., Contreras R.R., Duerinck F., Haegeman G., Iserentant D.,
RA   Merregaert J., Jou W.M., Molemans F., Raeymaekers A., Berghe A.,
RA   Volckaert G., Ysebaert M.;
RT   "Complete nucleotide sequence of bacteriophage MS2 RNA: primary and
RT   secondary structure of the replicase gene";
RL   Nature 260(5551):500-507(1976).
XX
RN   [7]
RC   identification of l-protein
RX   DOI; 10.1016/0092-8674(79)90045-X.
RX   PUBMED; 387256.
RA   Beremand M.N., Blumenthal T.;
RT   "Overlapping genes in RNA phage: a new protein implicated in lysis";
RL   Cell 18(2):257-266(1979).
XX
RN   [8]
RP   1-3569
RA   Schwartz R.M., Dayhoff M.O.;
RT   "Genome Nucleic Acids";
RL   (in) Dayhoff M.O. (Eds.);
RL   Atlas of Protein Sequence and Structure, Volume 5, Supplement 3:338-344;
RL   National Biomedical Research Foundation, Silver Spring (1978)
XX
RN   [9]
RP   2270-2271
RA   Min Jou W.;
RT   ;
RL   Submitted (13-JAN-1983) to the INSDC.
XX
RN   [10]
RX   PUBMED; 2462433.
RA   Auerswald E.A., Hoerlein D., Reinhardt G., Schroeder W., Schnabel E.;
RT   "Expression, isolation and characterization of recombinant [Arg-15, Glu-52]
RT   aprotinin";
RL   Biol. Chem. Hoppe-Seyler 369:27-35(1988).
XX
RN   [11]
RP   1-3569
RX   DOI; 10.1111/j.1365-2958.1993.tb01237.x.
RX   PUBMED; 7934914.
RA   de Smit M.H., Van Duin J.;
RT   "Translational initiation at the coat-protein gene of phage MS2: native
RT   upstream RNA relieves inhibition by local secondary structure";
RL   Mol. Microbiol. 9(5):1079-1088(1993).
XX
DR   MD5; b7e4ef1e5fbd470b3910247143b7b9d4.
DR   EuropePMC; PMC1234060; 16145106.
DR   EuropePMC; PMC2223264; 18065626.
DR   EuropePMC; PMC2395109; 18305135.
DR   EuropePMC; PMC2583511; 18820067.
DR   EuropePMC; PMC2997057; 21151977.
DR   EuropePMC; PMC4592258; 26431175.
XX
CC   The sequence was taken from reference [8].
CC   KST MS2
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3569
FT                   /organism="Enterobacteria phage MS2"
FT                   /mol_type="mRNA"
FT                   /db_xref="taxon:12022"
FT   CDS             130..1311
FT                   /transl_table=11
FT                   /note="A-protein"
FT                   /db_xref="GOA:P03610"
FT                   /db_xref="InterPro:IPR005563"
FT                   /db_xref="PDB:5TC1"
FT                   /db_xref="UniProtKB/Swiss-Prot:P03610"
FT                   /protein_id="CAA23988.1"
FT                   /translation="MRAFSTLDRENETFVPSVRVYADGETEDNSFSLKYRSNWTPGRFN
FT                   STGAKTKQWHYPSPYSRGALSVTSIDQGAYKRSGSSWGRPYEEKAGFGFSLDARSCYSL
FT                   FPVSQNLTYIEVPQNVANRASTEVLQKVTQGNFNLGVALAEARSTASQLATQTIALVKA
FT                   YTAARRGNWRQALRYLALNEDRKFRSKHVAGRWLELQFGWLPLMSDIQGAYEMLTKVHL
FT                   QEFLPMRAVRQVGTNIKLDGRLSYPAANFQTTCNISRRIVIWFYINDARLAWLSSLGIL
FT                   NPLGIVWEKVPFSFVVDWLLPVGNMLEGLTAPVGCSYMSGTVTDVITGESIISVDAPYG
FT                   WTVERQGTAKAQISAMHRGVQSVWPTTGAYVKSPFSMVHTLDALALIRQRLSR"
FT   CDS             1335..1727
FT                   /transl_table=11
FT                   /note="coat protein"
FT                   /db_xref="GOA:P03612"
FT                   /db_xref="InterPro:IPR002703"
FT                   /db_xref="InterPro:IPR015954"
FT                   /db_xref="PDB:1AQ3"
FT                   /db_xref="PDB:1AQ4"
FT                   /db_xref="PDB:1BMS"
FT                   /db_xref="PDB:1MSC"
FT                   /db_xref="PDB:1MST"
FT                   /db_xref="PDB:1MVA"
FT                   /db_xref="PDB:1MVB"
FT                   /db_xref="PDB:1U1Y"
FT                   /db_xref="PDB:1ZDH"
FT                   /db_xref="PDB:1ZDI"
FT                   /db_xref="PDB:1ZDJ"
FT                   /db_xref="PDB:1ZDK"
FT                   /db_xref="PDB:1ZSE"
FT                   /db_xref="PDB:2B2D"
FT                   /db_xref="PDB:2B2E"
FT                   /db_xref="PDB:2B2G"
FT                   /db_xref="PDB:2BNY"
FT                   /db_xref="PDB:2BQ5"
FT                   /db_xref="PDB:2BS0"
FT                   /db_xref="PDB:2BS1"
FT                   /db_xref="PDB:2BU1"
FT                   /db_xref="PDB:2C4Q"
FT                   /db_xref="PDB:2C4Y"
FT                   /db_xref="PDB:2C4Z"
FT                   /db_xref="PDB:2C50"
FT                   /db_xref="PDB:2C51"
FT                   /db_xref="PDB:2IZ8"
FT                   /db_xref="PDB:2IZ9"
FT                   /db_xref="PDB:2IZM"
FT                   /db_xref="PDB:2IZN"
FT                   /db_xref="PDB:2MS2"
FT                   /db_xref="PDB:2VTU"
FT                   /db_xref="PDB:2WBH"
FT                   /db_xref="PDB:4BP7"
FT                   /db_xref="PDB:4ZOR"
FT                   /db_xref="PDB:5MSF"
FT                   /db_xref="PDB:5TC1"
FT                   /db_xref="PDB:6MSF"
FT                   /db_xref="PDB:7MSF"
FT                   /db_xref="UniProtKB/Swiss-Prot:P03612"
FT                   /protein_id="CAA23989.1"
FT                   /translation="MASNFTQFVLVDNGGTGDVTVAPSNFANGVAEWISSNSRSQAYKV
FT                   TCSVRQSSAQNRKYTIKVEVPKVATQTVGGVELPVAAWRSYLNMELTIPIFATNSDCEL
FT                   IVKAMQGLLKDGNPIPSAIAANSGIY"
FT   CDS             1678..1905
FT                   /transl_table=11
FT                   /note="L-protein"
FT                   /db_xref="GOA:P03609"
FT                   /db_xref="InterPro:IPR022599"
FT                   /db_xref="UniProtKB/Swiss-Prot:P03609"
FT                   /protein_id="CAA23990.1"
FT                   /translation="METRFPQQSQQTPASTNRRRPFKHEDYPCRRQQRSSTLYVLIFLA
FT                   IFLSKFTNQLLLSLLEAVIRTVTTLQQLLT"
FT   CDS             1761..3398
FT                   /transl_table=11
FT                   /note="replicase, beta subunit"
FT                   /db_xref="GOA:P00585"
FT                   /db_xref="InterPro:IPR005093"
FT                   /db_xref="InterPro:IPR007096"
FT                   /db_xref="UniProtKB/Swiss-Prot:P00585"
FT                   /protein_id="CAA23991.1"
FT                   /translation="MSKTTKKFNSLCIDLPRDLSLEIYQSIASVATGSGDPHSDDFTAI
FT                   AYLRDELLTKHPTLGSGNDEATRRTLAIAKLREANGDRGQINREGFLHDKSLSWDPDVL
FT                   QTSIRSLIGNLLSGYRSSLFGQCTFSNGAPMGHKLQDAAPYKKFAEQATVTPRALRAAL
FT                   LVRDQCAPWIRHAVRYNESYEFRLVVGNGVFTVPKNNKIDRAACKEPDMNMYLQKGVGA
FT                   FIRRRLKSVGIDLNDQSINQRLAQQGSVDGSLATIDLSSASDSISDRLVWSFLPPELYS
FT                   YLDRIRSHYGIVDGETIRWELFSTMGNGFTFELESMIFWAIVKATQIHFGNAGTIGIYG
FT                   DDIICPSEIAPRVLEALAYYGFKPNLRKTFVSGLFRESCGAHFYRGVDVKPFYIKKPVD
FT                   NLFALMLILNRLRGWGVVGGMSDPRLYKVWVRLSSQVPSMFFGGTDLAADYYVVSPPTA
FT                   VSVYTKTPYGRLLADTRTSGFRLARIARERKFFSEKHDSGRYIAWFHTGGEITDSMKSA
FT                   GVRVIRTSEWLTPVPTFPQECGPASSPR"
FT   misc_difference 2001..2008
FT                   /note="ggugauc is gaucggu in [10]"
FT                   /note="conflict"
FT                   /citation=[10]
FT   misc_difference 2270..2271
FT                   /note="cg is gc in [9]"
FT                   /note="conflict"
FT                   /citation=[9]
XX
SQ   Sequence 3569 BP; 835 A; 933 C; 927 G; 874 T; 0 other;
     gggtgggacc cctttcgggg tcctgctcaa cttcctgtcg agctaatgcc atttttaatg        60
     tctttagcga gacgctacca tggctatcgc tgtaggtagc cggaattcca ttcctaggag       120
     gtttgacctg tgcgagcttt tagtaccctt gatagggaga acgagacctt cgtcccctcc       180
     gttcgcgttt acgcggacgg tgagactgaa gataactcat tctctttaaa atatcgttcg       240
     aactggactc ccggtcgttt taactcgact ggggccaaaa cgaaacagtg gcactacccc       300
     tctccgtatt cacggggggc gttaagtgtc acatcgatag atcaaggtgc ctacaagcga       360
     agtgggtcat cgtggggtcg cccgtacgag gagaaagccg gtttcggctt ctccctcgac       420
     gcacgctcct gctacagcct cttccctgta agccaaaact tgacttacat cgaagtgccg       480
     cagaacgttg cgaaccgggc gtcgaccgaa gtcctgcaaa aggtcaccca gggtaatttt       540
     aaccttggtg ttgctttagc agaggccagg tcgacagcct cacaactcgc gacgcaaacc       600
     attgcgctcg tgaaggcgta cactgccgct cgtcgcggta attggcgcca ggcgctccgc       660
     taccttgccc taaacgaaga tcgaaagttt cgatcaaaac acgtggccgg caggtggttg       720
     gagttgcagt tcggttggtt accactaatg agtgatatcc agggtgcata tgagatgctt       780
     acgaaggttc accttcaaga gtttcttcct atgagagccg tacgtcaggt cggtactaac       840
     atcaagttag atggccgtct gtcgtatcca gctgcaaact tccagacaac gtgcaacata       900
     tcgcgacgta tcgtgatatg gttttacata aacgatgcac gtttggcatg gttgtcgtct       960
     ctaggtatct tgaacccact aggtatagtg tgggaaaagg tgcctttctc attcgttgtc      1020
     gactggctcc tacctgtagg taacatgctc gagggcctta cggcccccgt gggatgctcc      1080
     tacatgtcag gaacagttac tgacgtaata acgggtgagt ccatcataag cgttgacgct      1140
     ccctacgggt ggactgtgga gagacagggc actgctaagg cccaaatctc agccatgcat      1200
     cgaggggtac aatccgtatg gccaacaact ggcgcgtacg taaagtctcc tttctcgatg      1260
     gtccatacct tagatgcgtt agcattaatc aggcaacggc tctctagata gagccctcaa      1320
     ccggagtttg aagcatggct tctaacttta ctcagttcgt tctcgtcgac aatggcggaa      1380
     ctggcgacgt gactgtcgcc ccaagcaact tcgctaacgg ggtcgctgaa tggatcagct      1440
     ctaactcgcg ttcacaggct tacaaagtaa cctgtagcgt tcgtcagagc tctgcgcaga      1500
     atcgcaaata caccatcaaa gtcgaggtgc ctaaagtggc aacccagact gttggtggtg      1560
     tagagcttcc tgtagccgca tggcgttcgt acttaaatat ggaactaacc attccaattt      1620
     tcgctacgaa ttccgactgc gagcttattg ttaaggcaat gcaaggtctc ctaaaagatg      1680
     gaaacccgat tccctcagca atcgcagcaa actccggcat ctactaatag acgccggcca      1740
     ttcaaacatg aggattaccc atgtcgaaga caacaaagaa gttcaactct ttatgtattg      1800
     atcttcctcg cgatctttct ctcgaaattt accaatcaat tgcttctgtc gctactggaa      1860
     gcggtgatcc gcacagtgac gactttacag caattgctta cttaagggac gaattgctca      1920
     caaagcatcc gaccttaggt tctggtaatg acgaggcgac ccgtcgtacc ttagctatcg      1980
     ctaagctacg ggaggcgaat ggtgatcgcg gtcagataaa tagagaaggt ttcttacatg      2040
     acaaatcctt gtcatgggat ccggatgttt tacaaaccag catccgtagc cttattggca      2100
     acctcctctc tggctaccga tcgtcgttgt ttgggcaatg cacgttctcc aacggtgctc      2160
     ctatggggca caagttgcag gatgcagcgc cttacaagaa gttcgctgaa caagcaaccg      2220
     ttaccccccg cgctctgaga gcggctctat tggtccgaga ccaatgtgcg ccgtggatca      2280
     gacacgcggt ccgctataac gagtcatatg aatttaggct cgttgtaggg aacggagtgt      2340
     ttacagttcc gaagaataat aaaatagatc gggctgcctg taaggagcct gatatgaata      2400
     tgtacctcca gaaaggggtc ggtgctttca tcagacgccg gctcaaatcc gttggtatag      2460
     acctgaatga tcaatcgatc aaccagcgtc tggctcagca gggcagcgta gatggttcgc      2520
     ttgcgacgat agacttatcg tctgcatccg attccatctc cgatcgcctg gtgtggagtt      2580
     ttctcccacc agagctatat tcatatctcg atcgtatccg ctcacactac ggaatcgtag      2640
     atggcgagac gatacgatgg gaactatttt ccacaatggg aaatgggttc acatttgagc      2700
     tagagtccat gatattctgg gcaatagtca aagcgaccca aatccatttt ggtaacgccg      2760
     gaaccatagg catctacggg gacgatatta tatgtcccag tgagattgca ccccgtgtgc      2820
     tagaggcact tgcctactac ggttttaaac cgaatcttcg taaaacgttc gtgtccgggc      2880
     tctttcgcga gagctgcggc gcgcactttt accgtggtgt cgatgtcaaa ccgttttaca      2940
     tcaagaaacc tgttgacaat ctcttcgccc tgatgctgat attaaatcgg ctacggggtt      3000
     ggggagttgt cggaggtatg tcagatccac gcctctataa ggtgtgggta cggctctcct      3060
     cccaggtgcc ttcgatgttc ttcggtggga cggacctcgc tgccgactac tacgtagtca      3120
     gcccgcctac ggcagtctcg gtatacacca agactccgta cgggcggctg ctcgcggata      3180
     cccgtacctc gggtttccgt cttgctcgta tcgctcgaga acgcaagttc ttcagcgaaa      3240
     agcacgacag tggtcgctac atagcgtggt tccatactgg aggtgaaatc accgacagca      3300
     tgaagtccgc cggcgtgcgc gttatacgca cttcggagtg gctaacgccg gttcccacat      3360
     tccctcagga gtgtgggcca gcgagctctc ctcggtagct gaccgaggga cccccgtaaa      3420
     cggggtgggt gtgctcgaaa gagcacgggt gcgaaagcgg tccggctcca ccgaaaggtg      3480
     ggcgggcttc ggcccaggga cctcccccta aagagaggac ccgggattct cccgatttgg      3540
     taactagctg cttggctagt taccaccca                                        3569
//