ID M34549; SV 1; linear; genomic DNA; STD; FUN; 5510 BP. XX AC M34549; XX DT 27-JUL-1990 (Rel. 24, Created) DT 14-FEB-2020 (Rel. 143, Last updated, Version 11) XX DE Saccharomyces cerevisiae tRNA-Cys gene, complete sequence; 5' sigma element DE long terminal repeat, complete sequence; gag3 (gag3) gene, complete cds; DE POL3 (POL3) gene, partial cds; and 3' sigma element long terminal repeat, DE complete sequence. XX KW capsid; integrase; LTR; protease; retrotransposon; reverse transcriptase; KW transfer RNA; transfer RNA-Cys; transposon. XX OS Saccharomyces cerevisiae (baker's yeast) OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes; OC Saccharomycetales; Saccharomycetaceae; Saccharomyces. XX RN [1] RP 1-5510 RX PUBMED; 2159534. RA Hansen L.J., Sandmeyer S.B.; RT "Characterization of a transpositionally active Ty3 element and RT identification of the Ty3 integrase protein"; RL J. Virol. 64(6):2599-2607(1990). XX DR MD5; abff4adcde6327527cafc46a7adc1b0e. DR EuropePMC; PMC2258933; 18094177. DR EuropePMC; PMC2789563; 18279991. DR EuropePMC; PMC60222; 11600699. DR RFAM; RF00005; tRNA. XX CC Draft entry and computer-readable sequence for [1] kindly submitted CC by S.B.Sandmeyer, 24-MAY-1990. XX FH Key Location/Qualifiers FH FT source 1..5510 FT /organism="Saccharomyces cerevisiae" FT /strain="AB950" FT /mol_type="genomic DNA" FT /note="contains ICTV exemplar Saccharomyces cerevisiae Ty3 FT virus" FT /db_xref="taxon:4932" FT tRNA complement(31..105) FT /product="tRNA-Cys" FT /anticodon="(pos:63..65,aa:Cys,seq:gtt)" FT repeat_region 116..120 FT /note="5' insertion target sequence" FT repeat_region 121..460 FT /rpt_type=LONG_TERMINAL_REPEAT FT /note="5' sigma element" FT repeat_region 121..128 FT /rpt_type=INVERTED FT /note="terminal" FT mRNA 343..5404 FT repeat_region 453..460 FT /rpt_type=INVERTED FT /note="terminal" FT gene 485..1408 FT /gene="gag3" FT CDS 485..1408 FT /codon_start=1 FT /gene="gag3" FT /db_xref="GOA:Q12173" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR005162" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/Swiss-Prot:Q12173" FT /protein_id="AAA98434.2" FT /translation="MFKRTIEKPNLATPHPSMSFMDQIPGGGNYPKLPVECLPNFPIQP FT SLTFRGRNDSHKLKNFISEIMLNMSMISWPNDASRIVYCRRHLLNPAAQWANDFVQEQG FT ILEITFDTFIQGLYQHFYKPPDINKIFNAITQLSEAKLGIERLNQRFRKIWDRMPPDFM FT TEKAAIMTYTRLLTKETYNIVRMHKPETLKDAMEEAYQTTALTERFFPGFELDADGDTI FT IGATTHLQEEYDSDYDSEDNLTQNGYVHTVRTRRSYNKPMSNHRNRRNNNPSREECIKN FT RLCFYCKKEGHRLNECRARKASSNRS" FT mat_peptide 536..1402 FT /gene="gag3" FT /product="capsid protein" FT /note="carboxy terminal not determined; 26 kDa protein" FT mat_peptide 1235..1402 FT /gene="gag3" FT /product="nucleocapsid" FT /note="9 kDa protein" FT gene <1368..5180 FT /gene="POL3" FT CDS <1368..5180 FT /codon_start=1 FT /gene="POL3" FT /db_xref="GOA:Q99315" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR024650" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036875" FT /db_xref="InterPro:IPR041577" FT /db_xref="InterPro:IPR041588" FT /db_xref="PDB:4OL8" FT /db_xref="UniProtKB/Swiss-Prot:Q99315" FT /protein_id="AAA98435.1" FT /translation="TNVEHVRRVLTDLELESKDQQTPFIKTLPIVHYIAIPEMDNTAEK FT TIKIQNTKVKTLFDSGSPTSFIRRDIVELLKYEIYETPPLRFRGFVATKSAVTSEAVTI FT DLKINDLHITLAAYILDNMDYQLLIGNPILRRYPKILHTVLNTRESPDSLKPKTYRSET FT VNNVRTYSAGNRGNPRNIKLSFAPTILEATDPKSAGNRGDSRTKTLSLATTTPAAIDPL FT TTLDNPGSTQSTFAQFPIPEEASILEEDGKYSNVVSTIQSVEPNATDHSNKDTFCTLPV FT WLQQKYREIIRNDLPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQ FT KLLDNKFIVPSKSPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRI FT GNAQIFTTLDLHSGYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMAD FT TFRDLRFVNVYLDDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLG FT YSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPIQLFIC FT DKSQWTEKQDKAIDKLKDALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKL FT VGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNK FT NEPARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAVYTITPETSRPIDTESWKSYY FT KSDPLCSAVLIHMKELTQHNVTPEDMSAFRSYQKKLELSETFRKNYSLEDEMIYYQDRL FT VVPIKQQNAVMRLYHDHTLFGGHFGVTVTLAKISPIYYWPKLQHSIIQYIRTCVQCQLI FT KSHRPRLHGLLQPLPIAEGRWLDISMDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATR FT KTLDATQLIDLLFRYIFSYHGFPRTITSDRDVRMTADKYQELTKRLGIKSTMSSANHPQ FT TDGQSERTIQTLNRLLRAYASTNIQNWHVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLP FT NTPAIKSDDEVNARSFTAVELAKHLKALTIQTKEQLEHAQIEMETNNNQRRKPLLLNIG FT DHVLVHRDAYFKKGAYMKVQQIYVGPFRVVKKINDNAYELDLNSHKKKHRVINVQFLKK FT FVYRPDAYPKNKPISSTERIKRAHEVTALIGIDTTHKTYLCHMQDVDPTLSVEYSEAEF FT CQIPERTRRSILANFRQLYETQDNPEREEDVVSQNEICQYDNTSP" FT mat_peptide 1464..>1466 FT /gene="POL3" FT /product="protease" FT /note="16 kDa protein; Carboxy terminal not determined. FT Approx 10 kDa coding region between protease and reverse FT transcriptase. The identity of the 10kDa protein is not FT known." FT mat_peptide 2142..5177 FT /gene="POL3" FT /product="reverse transcriptase" FT /note="carboxy terminal not determined; 55 kDa protein" FT mat_peptide 3570..5177 FT /gene="POL3" FT /product="integrase" FT /note="61 kDa protein" FT repeat_region 5132..5471 FT /rpt_type=LONG_TERMINAL_REPEAT FT /note="3' sigma element" FT repeat_region 5132..5139 FT /gene="POL3" FT /rpt_type=INVERTED FT /note="terminal" FT repeat_region 5464..5471 FT /rpt_type=INVERTED FT /note="terminal" FT repeat_region 5472..5476 FT /note="3' insertion target sequence" XX SQ Sequence 5510 BP; 1955 A; 1306 C; 919 G; 1330 T; 0 other; aactttcatg gaaggaccac ctagttaata aaaagctcgc actcaggatc gaactaagga 60 ccaacagatt tgcaatctgc tgcgctacca ctgcgccata cgagcttgat tttctgaaag 120 tgttgtatct caaaatgaga tatgtcagta tgacaatacg tcaccctgaa cgttcataaa 180 acacatatga aacaacctta taacaaaacg aacaacatga gacaaaaccc gaccttccct 240 agctgaacta cccaaagtat aaatgcctga acaattagtt tagatccgag attccgcgct 300 tccaccactt agtatgattc atattttata taatatataa gataagtaac attccgtgaa 360 ttaatctgat aaactgtttt gacaactggt tacttcccta agactgttta tattaggatt 420 gtcaagacac tccggtatta ctcgagcccg taatacaaca cctggtagcg ttaaaggtta 480 ctaattgttc aaacgaacca tcgaaaagcc gaacctagct acaccacacc ccagtatgag 540 ctttatggat caaatcccag gaggaggaaa ttatccaaaa ctcccagtag aatgccttcc 600 taacttcccg atccaaccat ctttgacctt cagaggtaga aatgactcgc ataaactgaa 660 aaactttatc tccgaaataa tgttaaacat gtctatgata tcttggccga atgatgccag 720 tcgtattgtg tactgcagaa gacatttatt aaaccccgct gctcagtggg ctaatgactt 780 tgtacaagaa caaggtatac ttgaaataac attcgacaca ttcatacaag gattatatca 840 gcatttctat aagccaccag atatcaataa aatctttaat gcaatcacgc aactttccga 900 agctaaactt ggtattgagc gtctcaacca acgattcaga aagatttggg acagaatgcc 960 accagacttc atgaccgaaa aagctgccat aatgacatat actaggctat tgacaaagga 1020 aacctataat attgtcagaa tgcacaaacc agagacatta aaagacgcca tggaagaggc 1080 ttaccagaca actgcactaa ctgaaagatt cttcccagga ttcgaacttg atgctgatgg 1140 agacactatc atcggtgcca caacccactt acaagaagaa tacgactctg actatgattc 1200 agaagataat ctgacccaga atggatacgt ccataccgta aggacaagaa gatcttacaa 1260 taaaccaatg tcaaatcatc gaaacaggag aaataacaac ccatctagag aagaatgtat 1320 aaaaaatcgg ctatgcttct attgtaagaa agagggacat cgcctgaacg aatgtagagc 1380 acgtaaggcg agttctaacc gatcttgaac tcgaatcaaa agaccaacaa actcctttta 1440 tcaaaacctt accaattgta cactatatcg ccatccccga gatggacaat accgccgaaa 1500 aaaccataaa aatacaaaac acgaaagtaa aaaccctgtt tgacagtgga tcacccacgt 1560 catttatccg aagagatatt gtagaacttc tcaaatacga aatctacgag acccctccac 1620 tccgttttag aggattcgta gccaccaaat ccgccgttac atccgaagca gtcaccattg 1680 acctcaaaat caatgacctg catataactt tagccgcgta catactggat aacatggact 1740 accaattgtt aattggaaat ccaatcttac gccgctaccc gaaaatcctg cacacagtac 1800 tgaataccag agagagcccc gactccttaa agcccaagac ttatcgctcc gaaaccgtta 1860 ataacgttag aacctactcc gctggtaatc gtggtaaccc cagaaacata aaactgtctt 1920 ttgcccccac cattctcgaa gcaactgacc cgaaatccgc tggtaatcgt ggtgactcca 1980 gaaccaaaac cctgtctctt gcaaccacta ctcctgcagc aattgacccg cttacgaccc 2040 ttgataaccc aggtagtact caaagtacat ttgcgcaatt cccgatacct gaagaagcga 2100 gcatcctaga agaggatgga aaatactcca acgttgtctc aaccattcag agtgtagaac 2160 ctaatgctac tgatcacagc aataaggaca ccttttgcac tttgccagtt tggttacaac 2220 agaagtatag agagatcata cgtaatgatc tcccaccaag acctgccgac attaataaca 2280 tccccgtaaa acatgatatt gaaattaaac ctggcgcaag actacctcga ctacagccat 2340 accatgttac agaaaagaac gaacaagaaa tcaacaaaat agttcaaaaa ctgctcgata 2400 acaagttcat tgttccctca aagtcgcctt gcagctcccc tgtagtcctc gtcccgaaga 2460 aagacggtac cttccgactc tgcgtcgatt accgcaccct gaacaaagct accatctccg 2520 acccattccc attacccaga atcgacaacc tattgagccg tattggaaat gcccagatat 2580 ttaccacgct agatttgcat agtggttacc accagatccc gatggaaccc aaagaccgct 2640 acaaaaccgc ctttgtcaca ccatccggta agtatgaata taccgtcatg ccatttggct 2700 tagtcaatgc acctagtaca ttcgcaagat acatggctga tacatttaga gacctgagat 2760 tcgtcaatgt ttaccttgat gatatattaa tattctccga atctccagaa gaacattgga 2820 aacatttaga cacggtacta gaaagattaa agaacgagaa cctcattgtt aagaagaaaa 2880 aatgtaaatt tgcatctgaa gaaactgagt ttttaggcta tagtattgga atccagaaaa 2940 tagctccact acagcacaaa tgtgcagcaa tccgagactt tccgacgcct aaaacagtaa 3000 aacaagcaca gagattttta ggaatgatta attactacag acgattcatt ccaaattgct 3060 ccaagattgc acagccaatc caactgttta tttgtgacaa aagtcaatgg acagaaaaac 3120 aagacaaggc aattgataaa ctaaaagacg ccttgtgtaa ctcccccgtc ctagtaccat 3180 tcaacaacaa agcaaactac cgacttacaa cagacgcctc aaaagacggc attggtgctg 3240 ttctagaaga agtcgacaac aagaacaaac ttgttggtgt cgtcggttac ttctctaaat 3300 ccttagagag tgcccagaaa aactatcctg ctggcgaatt agaactactt ggaattatca 3360 aagcactcca ccacttccga tatatgcttc acggaaagca tttcacgtta agaacagacc 3420 acattagttt gttatcatta caaaacaaga acgaacccgc acgacgcgtg caacgctggt 3480 tagatgacct agccacatat gacttcacct tagaatacct agctggaccc aagaacgttg 3540 tcgcagatgc catatcccgt gccgtatata ctataacccc cgaaacatcc cgacctatcg 3600 acacagaaag ctggaaatct tactacaaat cagacccatt atgtagtgct gtcttaattc 3660 atatgaaaga attgacacaa cacaacgtca cacctgaaga tatgtcagcc ttccgtagtt 3720 accagaagaa actcgaacta tcagagacct tccgaaagaa ttattcccta gaagacgaaa 3780 tgatctatta ccaagaccga ctagtagtac caataaaaca acagaacgca gttatgagac 3840 tatatcatga ccatacctta tttggaggac attttggtgt aacagtgacc cttgcgaaaa 3900 tcagcccaat ttactattgg ccaaaattac aacattcgat catacaatac atcaggacct 3960 gcgtacaatg tcaactaata aaatcacacc gaccacgctt acatggacta ttacaaccac 4020 tccctatagc agaaggaaga tggcttgata tatcaatgga ttttgtgaca ggattacccc 4080 cgacatcaaa taacttgaat atgatcctcg tcgtagttga tcgtttttcg aaacgcgctc 4140 acttcatagc tacaaggaaa accttagacg caacacaact aatagatcta ctctttcgat 4200 acattttttc atatcatggt tttcccagga caataaccag tgatagagat gtccgtatga 4260 ccgccgacaa atatcaagaa ctcacgaaaa gactaggaat aaaatcgaca atgtcttccg 4320 cgaaccaccc ccaaacagat ggacaatccg aacgaacgat acagacatta aacaggttac 4380 taagagccta tgcttcaacc aatattcaga attggcatgt atatttacca caaatcgaat 4440 ttgtttacaa ttctacacct actagaacac ttggaaaatc accatttgaa attgatttag 4500 gatatttacc gaatacccct gctattaagt cagatgacga agtcaacgca agaagtttta 4560 ctgccgtaga acttgccaaa cacctcaaag cccttaccat ccaaacgaag gaacagctag 4620 aacacgctca aatcgaaatg gaaactaata acaatcaaag acgtaaaccc ttattgttaa 4680 acataggaga tcacgtatta gtgcatagag atgcatactt caagaaaggt gcttatatga 4740 aagtacaaca aatatacgtc ggaccatttc gagttgtcaa gaaaataaac gataacgcct 4800 acgaactaga tttaaactct cacaagaaaa agcacagagt tattaatgta caattcctga 4860 aaaagtttgt ataccgtcca gacgcgtacc caaagaataa accaatcagc tccactgaaa 4920 gaattaagag agcacacgaa gttactgcac tcataggaat agatactaca cacaaaactt 4980 acttatgtca catgcaagat gtagacccaa cactttcagt agaatactca gaagctgaat 5040 tttgccaaat tcccgaaaga acacgaagat caatattagc caactttaga caactctacg 5100 aaacacaaga caaccctgag agagaggaag atgttgtatc tcaaaatgag atatgtcagt 5160 atgacaatac gtcaccctga acgttcataa aacacatatg aaacaacctt ataacaaaac 5220 gaacaacatg agacaaaacc cgaccttccc tagctgaact acccaaagta taaatgcctg 5280 aacaattagt ttagatccga gattccgcgc ttccaccact tagtatgatt catattttat 5340 ataatatata agataagtaa cattccgtga attaatctga taaactgttt tgacaactgg 5400 ttacttccct aagactgttt atattaggat tgtcaagaca ctccggtatt actcgagccc 5460 gtaatacaac agaaagttcc attttggatg ctctatttat gggaatatga 5510 //