ID   J02059; SV 1; linear; genomic RNA; STD; VRL; 2193 BP.
XX
AC   J02059;
XX
DT   07-MAY-1992 (Rel. 31, Created)
DT   09-SEP-2004 (Rel. 81, Last updated, Version 3)
XX
DE   Cucumber mosaic virus (Q strain) RNA 3, complete segment including RNA 4
DE   cds.
XX
KW   coat protein; subgenome; unidentified reading frame.
XX
OS   Cucumber mosaic virus (cucumber mosaic cucumovirus)
OC   Viruses; Riboviria; Bromoviridae; Cucumovirus.
XX
RN   [1]
RP   1924-2193
RX   DOI; 10.1093/nar/7.4.825.
RX   PUBMED; 92011.
RA   Symons R.H.;
RT   "Extensive sequence homology at the 3'-termini of the four RNAs of cucumber
RT   mosaic virus";
RL   Nucleic Acids Res. 7(4):825-837(1979).
XX
RN   [2]
RP   1-2193
RX   PUBMED; 6927855.
RA   Gould A.R., Symons R.H.;
RT   "Cucumber mosaic virus RNA 3. Determination of the nucleotide sequence
RT   provides the amino acid sequences of protein 3A and viral coat protein";
RL   Eur. J. Biochem. 126(2):217-226(1982).
XX
DR   MD5; 3f0969997843491de37c5465ea32b78a.
DR   RFAM; RF01084; TLS-PK3.
XX
CC   Cucumber mosaic virus is a single-stranded RNA plant virus with a
CC   functionally divided genome, designated RNAs 1-4. [2] reports that
CC   the nucleotide sequence of RNA 3 also gave the complete sequence of
CC   RNA 4 (bases 1167-2193), which codes for the viral coat protein and
CC   is derived from the 3' end. The sequences from RNA 3 and RNA 4 are
CC   identical [1],[2], which supports the assumption that RNA 4 is
CC   encoded by RNA 3.
CC   [1] also reports the 3'-terminal sequences of CMV RNAs 1 and 2.
CC   There is extensive homology among the four sequences. Ahlquist et
CC   al (Cell 23, 183-189 (1981)) report that the secondary structures
CC   of these CMV sequences and those of bromoviruses (see separate
CC   entries) are nearly identical.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2193
FT                   /organism="Cucumber mosaic virus"
FT                   /mol_type="genomic RNA"
FT                   /db_xref="taxon:12305"
FT   CDS             95..1096
FT                   /codon_start=1
FT                   /note="protein 3a"
FT                   /db_xref="GOA:P03604"
FT                   /db_xref="InterPro:IPR000603"
FT                   /db_xref="UniProtKB/Swiss-Prot:P03604"
FT                   /protein_id="AAA46413.1"
FT                   /translation="MAFQGPSRTLTQQSSAALPDDLQKILFSPDAIKKMATECDLGRHH
FT                   WMRADNAISVRPLVPQVTSNNLLSFFKSGYDAGELRSKGYMSVPQVLCVVTRTGSTDAE
FT                   GSLKIYLADLGDKELSPIDGQCVTLHNHELPALISFQPTYDCPMELVGNRHRCFAVVVE
FT                   RHGYIGYGGTTASVCSNWQAQFSSKNNNYTHAAAAKTLVLPYNRLAEHSKPSAVARLLK
FT                   SQLNNVSSSRYLLPNVALNQNASGHESEILKESPPIAIGSLSASRNNSFRSQVVNGLCA
FT                   LLRCTYVYIYYVYLPYVNTCESRVPCELSTVDICDDALEDFDHGVSKYTSQF"
FT   mRNA            1167..2193
FT                   /note="RNA 4 mRNA"
FT   CDS             1220..1930
FT                   /codon_start=1
FT                   /note="viral coat protein"
FT                   /db_xref="GOA:Q83267"
FT                   /db_xref="InterPro:IPR000247"
FT                   /db_xref="InterPro:IPR023800"
FT                   /db_xref="InterPro:IPR037137"
FT                   /db_xref="UniProtKB/TrEMBL:Q83267"
FT                   /protein_id="AAA46414.1"
FT                   /translation="MDKSGSPNASRTSGRRPRRGSRSASGADAGLRALTQQMLRLNKTL
FT                   AIGRPTLNHQPSWVVKAVNPGYTFTSITLKPPEIEKGSCFGRRLSLPDSVTDYDKKLVS
FT                   RIQIRINPLPKFDSTVWVTVRKVPSSSDLSVAAISAMFGDGNSPVLVYQYAASGVQANN
FT                   KLLYDLSEMRADIGDMRKYAVLVYSKDDKLEKDEIVFMSTSSINEFLSHGCSRLSPCVY
FT                   RRPKTLNYTLNREC"
XX
SQ   Sequence 2193 BP; 498 A; 511 C; 529 G; 655 T; 0 other;
     gtaatcttac cactttcttt cacgtcgtgt cgcgtcagtg acgctgtgtg tgtgtgtgtg        60
     ttagttagtg tgtcgtgttt agattacgaa ggttatggct ttccaaggtc ccagtaggac       120
     gttaactcaa cagtcctcgg cggcgttgcc tgacgactta cagaagatat tattcagccc       180
     cgatgccatc aagaagatgg ctactgagtg tgacctaggt cgacatcatt ggatgcgcgc       240
     ggataacgcc atctctgtca gacctctcgt tccccaagta accagtaaca atttattgtc       300
     cttctttaaa tctgggtatg atgccggtga attgcgctct aaaggctata tgagcgttcc       360
     tcaagtgttg tgtgtcgtta ccaggacagg ttctacggat gctgagggtt ctttgaaaat       420
     ttatttggct gacctaggtg acaaagaatt atccccaatt gatgggcagt gtgttacttt       480
     acataatcat gagctccctg ctttgatatc tttccaacct acctacgatt gccccatgga       540
     attagttggt aatcggcatc ggtgtttcgc ggtagtcgtt gagagacatg gttatattgg       600
     ttacggtggt accactgcta gcgtgtgtag taactggcaa gctcagtttt cttcaaagaa       660
     taataattac acacacgcag ctgctgctaa gactcttgtg ttgccttaca acagattagc       720
     cgagcattcg aaaccgtcag ccgtcgctcg cctgttgaag tcgcagttaa acaacgttag       780
     ctcatcgcgt taccttttac cgaacgttgc tcttaatcaa aatgcgtctg ggcacgagtc       840
     cgagatttta aaggaaagcc ctcccatcgc tatagggagt ctgtccgcgt cccgtaacaa       900
     tagcttcaga tcgcaggtgg ttaacggtct ttgcgctttg ttacgttgta cctacgtata       960
     tatatactac gtttatcttc cgtatgtaaa tacatgtgag tctagagtcc cgtgtgagtt      1020
     gtcaacggta gacatctgtg acgatgcgct tgaagatttc gatcatgggg ttagtaagta      1080
     cacatcacag ttttaaggtt caattcaatt tgcatccctg ttaggcaagg ccttactttc      1140
     tcatggatgc ttctccgcga gttagcgttt agttgttcac ctgagtcgtg ttttctttgt      1200
     tttgcgtctc agtgtgccta tggacaaatc tggctctccc aatgctagta gaacctccgg      1260
     tcgtcgcccg cgtagaggtt ctcggtccgc ttctggtgcg gatgcagggt tgcgtgcttt      1320
     gactcagcag atgctgagac tcaataaaac cctcgccatt ggtcgtccca ctcttaacca      1380
     ccaaccttcg tgggtagtga aagctgtaaa cccgggttac actttcacat ctattaccct      1440
     gaaaccgcct gaaattgaga aaggttcgtg ttttggtaga aggttgtctt tgccagattc      1500
     agtcacggac tatgataaga agcttgtttc gcgcattcaa atcaggatta atcctttgcc      1560
     gaaatttgat tctaccgtgt gggttacagt tcggaaagtg ccttcatcat ccgatctttc      1620
     cgtcgccgcc atctctgcta tgtttggcga tggtaactca ccggttttgg tttatcagta      1680
     tgctgcgtcc ggagttcagg ccaacaataa gttactttat gacctgtccg agatgcgtgc      1740
     tgatatcggc gacatgcgta agtacgccgt cctggtttac tcgaaagacg ataaactaga      1800
     gaaggacgag attgtcttca tgtcgacgtc gagcatcaac gaattcctat ctcacggatg      1860
     ctcccgactt agtccgtgtg tttaccggcg tccgaagacg ttaaactaca ctctcaatcg      1920
     cgagtgctga gttggtagtg ctgctccaaa ctgcctgaag tccctaaacg tgttgttgcg      1980
     cggggaacgg gttgtccatc cagcttacgg ctaaaatggt cagtcgtgtc tttcacacgc      2040
     cgatgtctta caagatgtcg aggtaccctt gaaatcacct cctagatttc ttcggaaggg      2100
     cttcgtgaga agctcgtgca cggtaataca ctgatattac caagagtgcg ggtatcgcct      2160
     gtggttttcc acaggttctc cataaggaga cca                                   2193
//