ID L00163; SV 1; linear; genomic RNA; STD; VRL; 3644 BP. XX AC L00163; J02000; XX DT 03-JUL-1991 (Rel. 28, Created) DT 03-AUG-2006 (Rel. 88, Last updated, Version 4) XX DE Alfalfa mosaic virus (strain 425 Leiden) RNA 1 of complete genome. XX KW complete genome. XX OS Alfalfa mosaic virus OC Viruses; Riboviria; Bromoviridae; Alfamovirus. XX RN [1] RP 3458-3495 RX DOI; 10.1093/nar/7.7.1887. RX PUBMED; 537914. RA Koper-Zwarthoff E.C., Brederode F.T., Walstra P., Bol J.F.; RT "Nucleotide sequence of the 3'-noncoding region of alfalfa mosaic virus RNA RT 4 and its homology with the genomic RNAs"; RL Nucleic Acids Res. 7(7):1887-1900(1979). XX RN [2] RP 1-61 RX DOI; 10.1093/nar/8.23.5635. RX PUBMED; 6927843. RA Koper-Zwarthoff E.C., Brederode F.T., Veeneman G., van Boom J.H., Bol J.F.; RT "Nucleotide sequences at the 5'-termini of the alfalfa mosaic virus RNAs RT and the intercistronic junction in RNA 3"; RL Nucleic Acids Res. 8(23):5635-5647(1980). XX RN [3] RP 813-3644 RX DOI; 10.1016/0042-6822(83)90208-8. RX PUBMED; 6404055. RA Zuidema D., Bierhuizen M.F., Cornelissen B.J., Bol J.F., Jaspars E.M.; RT "Coat protein binding sites on RNA 1 of alfalfa mosaic virus"; RL Virology 125(2):361-369(1983). XX RN [4] RP 1-3644 RX DOI; 10.1093/nar/11.5.1253. RX PUBMED; 6298738. RA Cornelissen B.J., Brederode F.T., Moormann R.J., Bol J.F.; RT "Complete nucleotide sequence of alfalfa mosaic virus RNA 1"; RL Nucleic Acids Res. 11(5):1253-1265(1983). XX DR MD5; 0dbbffb8e166ff7c253be200b4228fff. DR EuropePMC; PMC3807391; 23903837. DR RFAM; RF00196; AMV_RNA1_SL. DR RFAM; RF00252; Alfamo_CPB. XX CC [4] fragments. CC The one long open reading frame (with a predicted size of 125.7 kd) CC codes for a protein of Mr 115,000 or, in vitro, two proteins of Mr CC 58,000 and 62,000 both with the same N-terminus [5]. [1] also CC report sequences of the 3'-termini of RNAs 2, 3, and 4. [4] CC reports sequences of the ALMV 5'-termini of RNAs 2, 3, and 4. [3] CC reports binding sites for coat protein, which are noted in the CC sites table. Fragments d and d' reported in [3] are not present in CC the complete sequence [5]. XX FH Key Location/Qualifiers FH FT source 1..3644 FT /organism="Alfalfa mosaic virus" FT /strain="425 Leiden" FT /mol_type="genomic RNA" FT /db_xref="taxon:12321" FT modified_base 1 FT /mod_base=OTHER FT /note="m7Gppp cap" FT CDS 101..3481 FT /codon_start=1 FT /product="125.7 kd protein" FT /db_xref="GOA:P03589" FT /db_xref="InterPro:IPR002588" FT /db_xref="InterPro:IPR027351" FT /db_xref="InterPro:IPR027417" FT /db_xref="UniProtKB/Swiss-Prot:P03589" FT /protein_id="AAA46289.1" FT /translation="MNADAQSTDASLSMREPLSHASIQEMLRRVVEKQAADDTTAIGKV FT FSEAGRAYAQDALPSDKGEVLKISFSLDATQQNILRANFPGRRTVFSNSSSSSHCFAAA FT HRLLETDFVYRCFGNTVDSIIDLGGNFVSHMKVKRHNVHCCCPILDARDGARLTERILS FT LKSYVRKHPEIVGEADYCMDTFQKCSRRADYAFAIHSTSDLDVGELACSLDQKGVMKFI FT CTMMVDADMLIHNEGEIPNFNVRWEIDRKKDLIHFDFIDEPNLGYSHRFSLLKHYLTYN FT AVDLGHAAYRIERKQDFGGVMVIDLTYSLGFVPKMPHSNGRSCAWYNRVKGQMVVHTVN FT EGYYHHSYQTAVRRKVLVDKKVLTRVTEVAFRQFRPNADAHSAIQSIATMLSSSTNHTI FT IGGVTLISGKPLSPDDYIPVATTIYYRVKKLYNAIPEMLSLLDKGERLSTDAVLKGSEG FT PMWYSGPTFLSALDKVNVPGDFVAKALLSLPKRDLKSLFSRSATSHSERTPVRDESPIR FT CTDGVFYPIRMLLKCLGSDKFESVTITDPRSNTETTVDLYQSFQKKIETVFSFILGKID FT GPSPLISDPVYFQSLEDVYYAEWHQGNAIDASNYARTLLDDIRKQKEESLKAKAKEVED FT AQKLNRAILQVHAYLEAHPDGGKIEGLGLSSQFIAKIPELAIPTPKPLPEFEKNAETGE FT ILRINPHSDAILEAIDYLKSTSANSIITLNKLGDHCQWTTKGLDVVWAGDDKRRAFIPK FT KNTWVGPTARSYPLAKYERAMSKDGYVTLRWDGEVLDANCVRSLSQYEIVFVDQSCVFA FT SAEAIIPSLEKALGLEAHFSVTIVDGVAGCGKTTNIKQIARSSGRDVDLILTSNRSSAD FT ELKETIDCSPLTKLHYIRTCDSYLMSASAVKAQRLIFDECFLQHAGLVYAAATLAGCSE FT VIGFGDTEQIPFVSRNPSFVFRHHKLTGKVERKLITWRSPADATYCLEKYFYKNKKPVK FT TNSRVLRSIEVVPINSPVSVERNTNALYLCHTQAEKAVLKAQTHLKGCDNIFTTHEAQG FT KTFDNVYFCRLTRTSTSLATGRDPINGPCNGLVALSRHKKTFKYFTIAHDSDDVIYNAC FT RDAGNTDDSILARSYNHNF" FT protein_bind 813..837 FT /bound_moiety="RNA1 coat protein" FT /citation=[3] FT protein_bind 1580..1616 FT /bound_moiety="RNA1 coat protein" FT /citation=[3] FT protein_bind 1764..1786 FT /bound_moiety="RNA1 coat protein" FT /citation=[3] FT protein_bind 1800..1829 FT /bound_moiety="RNA1 coat protein" FT /citation=[3] FT protein_bind 1994..2042 FT /bound_moiety="RNA1 coat protein" FT /citation=[3] FT protein_bind 3564..3644 FT /bound_moiety="RNA1 coat protein" FT /citation=[3] FT protein_bind 3577..3644 FT /bound_moiety="RNA1 coat protein" FT /citation=[3] XX SQ Sequence 3644 BP; 1037 A; 743 C; 805 G; 1059 T; 0 other; gtttttatct tacacacgct tgtgtaagat agttaatcca tttatttttc catgctcttt 60 ccacagcatt acgttcattc aatactgtga agatttcact atgaatgctg acgcccaatc 120 caccgatgcc agccttagta tgcgagaacc tttatctcat gcctccattc aggagatgct 180 tcgacgtgta gtcgaaaagc aagctgcaga cgacacaact gcaatcggaa aagttttttc 240 cgaagcgggt cgtgcctatg cccaggatgc tctcccttca gacaaaggtg aagtcttgaa 300 gatatccttt tccctggacg ccacgcaaca aaacatacta cgcgccaact ttcctggtcg 360 acgcactgta ttttcaaaca gttcgagttc atctcactgt tttgcggctg cccatcgtct 420 actagaaacc gattttgttt accgatgttt cggtaatacg gttgatagta ttatagacct 480 tggaggaaat tttgtttccc atatgaaggt gaagcggcat aatgtacatt gctgctgtcc 540 catattggat gctagagacg gagctaggct cacggagaga atattgtctc taaagtcgta 600 cgtccgaaaa cacccggaaa ttgtgggtga agcagattac tgcatggaca cgtttcagaa 660 atgctcaagg cgagctgact atgcttttgc catccattct actagcgatc tcgacgtggg 720 agagttggca tgtagtttgg accaaaaagg cgttatgaaa ttcatttgca ccatgatggt 780 tgatgcagat atgttaattc ataacgaggg ggaaattcct aactttaatg ttagatggga 840 gatcgatcgt aagaaagatc tcattcattt cgacttcatc gacgagccca atttgggata 900 tagtcatcgg ttttcattgt taaaacacta tttgacttac aatgccgttg atttgggtca 960 tgctgcttat cgaatcgaac gtaagcaaga ttttggaggt gtgatggtta ttgacttaac 1020 ttattccctt ggatttgtcc ccaagatgcc acactccaat gggaggtcct gcgcctggta 1080 taatagagtc aaaggacaaa tggtagtgca caccgttaac gaggggtact atcatcattc 1140 ataccagaca gcagtgaggc ggaaagtact tgtcgataag aaagtgctta ccagagttac 1200 tgaagttgct ttcaggcaat tcagacctaa cgctgatgct cattccgcaa ttcagtccat 1260 agcgactatg ttatcttctt caacgaatca taccatcatc ggtggtgtga ctctgatttc 1320 gggtaaacct ctcagcccgg atgactatat tccagtggca acaacgattt attatagagt 1380 gaaaaaactc tataacgcca ttccagagat gttatccctc ctagacaagg gagagagatt 1440 atcgactgat gctgttttaa aagggtctga aggtccaatg tggtattctg gtcctacctt 1500 tttaagtgcg ctggataagg tcaatgttcc tggtgatttt gtcgccaaag ctctgttgtc 1560 gttgcctaag agagatttga aatctctatt ttctaggtca gcgacttctc attctgaacg 1620 gacaccggtt cgggacgaga gccccattcg atgtacagac ggtgtctttt accctataag 1680 gatgttgttg aaatgcctag gaagtgacaa atttgagtcg gtcactataa ctgatcctag 1740 aagtaacacg gaaactaccg tggatttata ccaatctttt caaaagaaaa ttgaaacggt 1800 tttctcattc attcttggaa agattgatgg tccttcacct ctaatttctg atccagtata 1860 cttccaatca cttgaagatg tgtactatgc tgaatggcat caaggaaatg ccattgatgc 1920 gtcaaattac gcacgtaccc tgttagacga tatcaggaag cagaaagaag agagcttaaa 1980 agctaaagcg aaggaagttg aagatgctca aaaattaaat agagcaattt tgcaagttca 2040 tgcctatttg gaagctcatc cggatggagg aaaaatcgaa ggactggggt tgagttctca 2100 gttcatcgca aaaatccccg agcttgcaat tccaacgcca aaaccgttac ctgaattcga 2160 gaagaacgca gaaactggcg aaattttgcg tatcaatcct cattcagatg ccattcttga 2220 agcaattgat tacttgaagt ccacttcagc caattctatc attaccttga ataaattggg 2280 tgatcattgt cagtggacga caaaaggtct tgatgtagta tgggccggtg acgataaacg 2340 tcgagctttc atcccaaaga aaaatacttg ggtcggacct actgctagaa gttatcccct 2400 tgcaaaatat gaaagagcaa tgagcaagga cggatacgta actctgagat gggacggaga 2460 agttctagat gctaattgcg tcaggagttt atctcaatac gagattgtct ttgttgacca 2520 atcttgcgtc tttgcctcag cggaggctat cattccaagc ctggagaaag ccctaggtct 2580 tgaagcacac ttttcagtta cgattgttga tggagttgct ggttgcggaa aaaccaccaa 2640 tatcaagcaa atagcccgtt catcgggtcg ggatgtggat ttgatcctta ccagcaatcg 2700 tagctctgcc gatgagttga aagaaaccat cgattgttca ccgttgacaa agttgcatta 2760 cattcgtacc tgtgattctt acttgatgtc tgcctcggcg gtaaaagcac agaggttaat 2820 ttttgatgaa tgttttttgc aacatgcagg tttagtctat gccgctgcta ctttagctgg 2880 ttgtagcgaa gtcattggtt ttggtgacac ggaacaaatt ccttttgtct caaggaatcc 2940 gtcatttgtt tttcgtcatc ataagctaac tgggaaagtc gagagaaagt taattacctg 3000 gagatcccca gcagatgcca cctattgcct tgaaaagtat ttttacaaga acaagaagcc 3060 ggtgaagaca aattccagag tactaagatc tatcgaagtt gtgccgataa attcccctgt 3120 gagcgttgag agaaatacca acgctcttta tttgtgtcat actcaagctg aaaaagcagt 3180 tttgaaagct caaacacatc taaagggatg cgataatatc tttactactc atgaagctca 3240 gggtaagact ttcgacaatg tttatttctg tcgtttaact cgtacctcaa cgagtcttgc 3300 tactggtaga gatccaataa atggcccatg caatggatta gttgccttgt cgagacacaa 3360 gaagactttt aaatatttta ccatcgccca tgatagcgat gatgtgatct acaatgcttg 3420 tagagatgcc ggtaataccg acgatagtat tctagcgagg agctataatc ataatttctg 3480 aattagtcat tggtaattca atgccaacct ccactgggtg ggttaaggtt gaggtataga 3540 atcctattcg ctcctgatag gagaaattct atattgctta tatacgtgct tatgcacgta 3600 tataaatgct catgctaaat tgcatgaatg cccctaaggg atgc 3644 //