ID M11216; SV 1; linear; genomic RNA; STD; VRL; 2324 BP. XX AC M11216; XX DT 04-OCT-1988 (Rel. 17, Created) DT 04-MAR-2000 (Rel. 63, Last updated, Version 3) XX DE Tobacco etch virus capsid protein gene. XX KW . XX OS Tobacco etch virus OC Viruses; Riboviria; Potyviridae; Potyvirus. XX RN [1] RP 1-2324 RX DOI; 10.1073/pnas.82.12.3969. RX PUBMED; 16593574. RA Allison R.F., Sorenson J.C., Kelly M.E., Armstrong F.B., Dougherty W.G.; RT "Sequence determination of the capsid protein gene and flanking regions of RT tobacco etch virus: Evidence for synthesis and processing of a polyprotein RT in potyvirus genome expression"; RL Proc. Natl. Acad. Sci. U.S.A. 82(12):3969-3972(1985). XX DR MD5; e99a889b941bd722f30b179f6c09d702. XX FH Key Location/Qualifiers FH FT source 1..2324 FT /organism="Tobacco etch virus" FT /mol_type="genomic RNA" FT /db_xref="taxon:12227" FT mat_peptide <1..1346 FT /note="nuclear inclusion protein" FT CDS <1..2138 FT /codon_start=3 FT /note="viral polyprotein" FT /db_xref="GOA:P04517" FT /db_xref="InterPro:IPR001205" FT /db_xref="InterPro:IPR001456" FT /db_xref="InterPro:IPR001592" FT /db_xref="InterPro:IPR001650" FT /db_xref="InterPro:IPR001730" FT /db_xref="InterPro:IPR002540" FT /db_xref="InterPro:IPR007094" FT /db_xref="InterPro:IPR009003" FT /db_xref="InterPro:IPR011492" FT /db_xref="InterPro:IPR013648" FT /db_xref="InterPro:IPR014001" FT /db_xref="InterPro:IPR027417" FT /db_xref="InterPro:IPR031159" FT /db_xref="InterPro:IPR039560" FT /db_xref="InterPro:IPR042308" FT /db_xref="PDB:1LVB" FT /db_xref="PDB:1LVM" FT /db_xref="PDB:1Q31" FT /db_xref="UniProtKB/Swiss-Prot:P04517" FT /protein_id="AAA47908.1" FT /translation="NREAFLKDILKYASEIEIGNVDCDLLELATSMLITKLKALGFPTV FT NYITDPEEIFSALNMKAAMGALYKGKKKEALSELTLDEQEAMLKASLVRLYTGKLGIWN FT GSLKAELRPIEKVENNKTRTFTAAPIDTLLAGKVCVDDFNNHFYDFNIKAPWTVGMTKF FT YQGWNELMEALPSGWVYCDADGSQFDSSLTPFLINAVLKVRLAFMEEWDVGGQMLRNLY FT TEIVYTPILTPDGTVIKKHKGNNSGQPSTVVDNTLMVIIAMLYTCEKCGINKEEIVYYV FT NGDDLLIAIHPDKAERLNRFKESFGELGLKYEFDCTTRDKTQLWFMSHRALERDGMYIP FT KLEEERIVSILEWDRSKEPSHRLEAICASMIEAWGYDKLVEEIRNFYAWVLEQAPYSQL FT AEEGKAPYLAETALKFLYTSQHGTNSEIEEYLKVLYDYDIPTTETLYFQGGTVDASADV FT GKKKDQKDDKVAEQASKDRDVNAGTSGTFSVPRINAMATKLQYPRMKGEVVVNLNHLLG FT YKPQQIDLSNARATHEQFAAWHQAVMTAYGVNEEQMKILLNGFMVWCIENGTSPNLNGT FT WVMMDGEEQVSYPLKPMIENAQPTLRQIMTHFSDLAEAYIEMRNRERPYMPRYGLQRNI FT TDMSLSRYAFDFYELTSKTPVRAREAHMQMKAAAVRNSGTRLFGLDGNVGTAEEDTERH FT TAHDVNRNMHTLLGVRQ" FT mat_peptide 1347..2135 FT /note="capsid protein" XX SQ Sequence 2324 BP; 717 A; 426 C; 583 G; 598 T; 0 other; ttaatagaga ggcgttcctc aaagatattc taaaatatgc tagtgaaatt gagattggga 60 atgtggattg tgacttgcta gagctcgcaa caagcatgct catcacgaag ctcaaggcat 120 taggattccc aactgtaaac tacatcactg acccagagga aatttttagt gcattgaata 180 tgaaagcagc tatgggagca ctatacaagg gcaagaagaa agaagctctc agcgagctca 240 cactagatga gcaggaggcg atgctcaaag caagtctcgt gcggctgtat acgggaaagc 300 ttggaatttg gaatggctca ttgaaagcag agttgcgtcc aattgagaag gttgaaaaca 360 acaaaacgcg aactttcaca gcagcaccaa tagacactct ccttgctggt aaagtttgcg 420 tggatgattt taataatcac ttctatgatt tcaacataaa agcaccatgg acagttggta 480 tgaccaagtt ttatcagggg tggaatgaat tgatggaggc tttaccaagt ggatgggtgt 540 attgtgacgc tgatggttcg caattcgaca gttccttgac tccattcctc attaatgctg 600 tattgaaggt gcgactcgcc ttcatggagg aatgggatgt tggtggacaa atgctgcgaa 660 atttgtacac tgagatagtg tatacaccaa tcctcacacc cgatggcact gtcattaaga 720 agcataaagg caacaatagt gggcaacctt caacagtggt ggacaacaca ctcatggtca 780 ttattgcaat gttatacaca tgtgagaagt gtggaatcaa caaggaagag attgtgtatt 840 acgttaatgg cgatgactta ctgattgcca tccacccaga caaagctgag aggctgaata 900 gattcaagga atctttcgga gagttgggcc tgaaatatga atttgactgt accaccaggg 960 acaagacaca gttgtggttt atgtcacaca gggctttgga gagagatggc atgtatatac 1020 caaagctaga agaagaaagg attgtttcta ttttggaatg ggacagatcc aaggagccat 1080 cacataggct tgaagctatc tgtgcatcaa tgattgaagc atggggttat gacaagctgg 1140 ttgaagaaat ccgcaatttc tatgcatggg ttttggaaca agcgccgtat tcacagcttg 1200 cagaggaagg aaaagcgcca tacctggctg agactgcgct taagtttttg tacacatctc 1260 agcacggaac aaactctgag atagaagagt atttgaaagt attgtatgat tacgatattc 1320 caacgactga gactctttat tttcagggtg gcactgtgga tgctagtgct gatgttggta 1380 agaagaaaga tcaaaaggat gataaagtcg ctgagcaggc ttcaaaggat agggatgtca 1440 atgctggaac ttcaggaaca ttctcagttc cacgaataaa tgctatggcc acaaaacttc 1500 aatatccaag gatgaaaggg gaggtagttg taaacttgaa tcacctttta ggatacaagc 1560 cacagcaaat tgacttgtca aatgctcgag ccacacatga gcagtttgcc gcgtggcacc 1620 aggcagtgat gacagcctat ggagtgaacg aagagcaaat gaaaatattg ctaaatggat 1680 tcatggtgtg gtgcatagaa aatgggactt ccccaaattt gaatggaact tgggtcatga 1740 tggatggtga ggagcaagtt tcatacccgc tgaaaccaat gatcgaaaac gcgcagccaa 1800 cattgaggca aattatgacg cacttcagtg acctggctga ggcgtatatt gaaatgagaa 1860 acagggaacg accatacatg cctaggtatg gtctacagag aaacattaca gacatgagtt 1920 tgtcacgcta tgcgttcgac ttctatgagc taacttcaaa aacacctgtt agagcgaggg 1980 aggcgcatat gcaaatgaaa gctgctgcag tacgaaacag tggaactagg ttatttggtc 2040 ttgatggcaa cgtgggtact gcagaggaag acactgaacg gcacacagcg catgatgtga 2100 accgtaacat gcacacacta ttaggggtcc gccagtgata gtttctgcgt gtctttgctt 2160 tccgctttta agcttattgt aatatatatg aatagctatt cacagtggga cttggtcttg 2220 tgttgaatag tatcttatat gttttaatat gtcttattag tctcattact taggcgaacg 2280 acaaagtgag gtcacctcgg tctaactctc ctatgtagtg cgag 2324 //