ID M94164; SV 1; linear; genomic DNA; STD; FUN; 6727 BP. XX AC M94164; XX DT 20-MAY-1992 (Rel. 31, Created) DT 14-FEB-2020 (Rel. 143, Last updated, Version 8) XX DE S.cerevisciae TY4 retrotransposon endogenous protease, integrase, reverse DE transcriptase protein (TY4B) gene, complete cds and gag protein (TY4A) DE pseudogene. XX KW gag gene; integrase; protease; retrotransposon; reverse transcriptase. XX OS Saccharomyces cerevisiae (baker's yeast) OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes; OC Saccharomycetales; Saccharomycetaceae; Saccharomyces. XX RN [1] RP 1-6727 RX DOI; 10.1016/0378-1119(92)90039-R. RX PUBMED; 1333437. RA Stucka R., Schwarzlose C., Lochmuller H., Hacker U., Feldmann H.; RT "Molecular analysis of the yeast Ty4 element: homology with Ty1, copia, and RT plant retrotransposons"; RL Gene 122(1):119-128(1992). XX RN [2] RP 1-6727 RA Feldmann H.; RT ; RL Submitted (18-MAY-1992) to the INSDC. RL Horst Feldmann, Adolf-Butenandt-Institut fur Physiologische Chemie, RL Schillerstrasse 44, Munchen, D-80336, Germany XX RN [3] RP 1-6727 RA Feldmann H.; RT ; RL Submitted (15-FEB-1996) to the INSDC. RL Horst Feldmann, Adolf-Butenandt-Institut fur Physiologische Chemie, RL Schillerstrasse 44, Munchen, D-80336, Germany XX DR MD5; 3280fc6f3ba60a076b8269b8669f3659. DR EuropePMC; PMC1951106; 17513563. DR EuropePMC; PMC2666490; 19153256. XX CC On Mar 8, 1996 this sequence version replaced gi:173091. XX FH Key Location/Qualifiers FH FT source 1..6727 FT /organism="Saccharomyces cerevisiae" FT /strain="C836" FT /mol_type="genomic DNA" FT /tissue_lib="Ty4-90" FT /note="contains ICTV exemplar Saccharomyces cerevisiae Ty4 FT virus" FT /db_xref="taxon:4932" FT mobile_element 153..6362 FT /mobile_element_type="transposon:retrotransposon TY4" FT repeat_region 153..523 FT /rpt_type=LONG_TERMINAL_REPEAT FT repeat_region 153..157 FT /rpt_type=INVERTED FT repeat_region 519..523 FT /rpt_type=INVERTED FT gene 523..1758 FT /gene="TY4A" FT CDS 523..1758 FT /pseudo FT /codon_start=1 FT /gene="TY4A" FT /product="gag protein" FT /note="replace 'taa' at 1180-1182 with 'tta' in Ty4-476 and FT Ty4-832" FT gene <1529..5872 FT /gene="TY4B" FT CDS <1529..5872 FT /codon_start=1 FT /gene="TY4B" FT /product="endogenous protease, integrase, reverse FT transcriptase protein" FT /note="5' end of coding region undetermined" FT /db_xref="GOA:V9GZW0" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR013103" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR039537" FT /db_xref="UniProtKB/TrEMBL:V9GZW0" FT /protein_id="AAA91746.1" FT /translation="KIAACIANLFSIAQLTAKRNQIGNLGLTRPISQKPIIYKVHRDNN FT HLSPVQNEQKSWNKTQKRSNKVYNSKKLVIIDTGSGVNITNDKTLLHNYEDSNRSTRFF FT GIGKNSSVSVKGYGYIKIKNGHNNTDNKCLLTYYVPEEESTIISCYDLAKTTKMVLSRK FT YTRLGNKIIKIKTKIVNGVIHVKMNELIERFLSDDSKINAIKPTSSPGFKLNKRSITLE FT DAHKRMGHTGIQQIENSIKHNHYEESLDLIKEPNEFWCQTCKISKATKRNHYTGSMNNH FT STDHEPGSSWCMDIFGPVSSSNADTKRYMLIMVDNNTRYCMTSTHFNKNAETILAQVRK FT NIQYVETQFDRKVREINSDRGTEFTNDQIEEYFISKGIHHILTSTQDHAANGRAERYIR FT TIITDATTLLRQSNLRVKFWEYAVTSATNIRNCLEHKSTGKLPLKAISRQPVTVRLMSF FT LPFGEKGIIWNHNHKKLKPSGLPSIILCKDPNSYGYKFFIPSKNKIVTSDNYTIPNYTM FT DGRVRNTQNINKSHQFSSHNDDEEDQIETVTNLCEALENYEDDNKPITRLEDLFTEEEL FT SQIDSNAKYPSPSNNLEGDLDYVFSDVEESGDYDVESELSTTNNSISTDKNKILSNKDF FT NSELASTEISISGIDKKGLINTSHIDEDKYDEKVHRIPSIIQEKLVGSKNTIKINDENK FT ISDRIRSKNIGSILNTGLSRCVDITDESITNKDESMHNAKPELIQEQLKKTNHETSFPK FT EGSIGTNVKFRNTNNEISLKTGDTSLPIKTLESINNHHSNDYSTNKVEKFEKENHHPPP FT IEDIVDMSDQTDMESNCQDGNNLKELKVTDKNVPTDNGTNVSPRLEQNIERSGSPVQTV FT NKSAFLNKEFSSLNMKRKRKRHDKNNSLTSYELERDKKRSKKNRVKLIPDNMETVSAPK FT IRAIYYNEAISKNPDLKEKHEYKQAYHKELQNLKDMKVFDVDVKYSRSEIPDNLIVPTN FT TIFTKKRNGIYKARIVCRGDTQSPDTYSVITTESLNHNHIKIFLMMQTTEICLWTLDIN FT HAFLYAKLEEEIYIPHPLIGDVYVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYT FT PGLYQTEDKNLMIAVYVDDCVIAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDIL FT GMDLVYNKRLGTIDLTLKSFINRMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSE FT EEFRQGVLKLQQLLGELNYVRHKCRYDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVR FT YKDIGIHYDRDCNKDKKVIAITDASVGSEYDAQSRIGVILWYGMNIFNVYSNKSTNRCV FT SSTEAELHAIYEGYRDSETLKVTLKELGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKF FT TWIKTEIIKEKLKRSITVKITGKGNIADLLTNQYQHLILKDLYKY" FT repeat_region 5992..6362 FT /rpt_type=LONG_TERMINAL_REPEAT FT repeat_region 5992..5996 FT /rpt_type=INVERTED FT repeat_region 6358..6362 FT /rpt_type=INVERTED XX SQ Sequence 6727 BP; 2661 A; 1063 C; 1178 G; 1825 T; 0 other; gaggagatac ctggcaaaac atttcttgtg agcacaacct caattaaagt tagacaagta 60 ggtgcactat tggttgcttg ttggctcatc tcgttgagga atgtaataag tacttgttat 120 taacctgttt ttgtgccatc tatagtggag agtgttggaa cgagagtaat taatagtgac 180 atgagttgct atggtaacaa tctaatgctt acatcgtata ttaatgtaca actcgtatac 240 gtttaagtgt gattgcgcct attgcagaag gaatgttaaa cgagaagctc agacaatact 300 gaagctgtgt taaagaccta ttagttgaac atgttatggt aggtacatat atgaggaata 360 tgagtcgtca catcaatgta tagtaactac cggaatcact attatattgg tcataattaa 420 tatgaccaat cggcgtgtgt tttatatacc tctcttattt agtataagaa gatcagtact 480 cacttcttca ttaatactaa tttttaacct ctaattatca acatggcgac cccagtgagg 540 ggtgaaacaa gaaatgttat tgacgacaac atttctgcgc ggattcaatc gaaagtcaaa 600 acaaatgata ctgtcagaca gacgccatta agaaaagttt ctattaaaga tgaacaggtg 660 agacaatatc aaagaaattt aaataggttt aaaaccatac taaatggttt aaaggcagaa 720 gaggaaaaac tttctgaggc tgatgatatt cagatgctag ctgaaaaatt attaaaactc 780 ggagaaacca ttgacaaggt tgagaatagg attgtggatc tagttgaaaa gatacaatta 840 ttggaaacaa acgagaacaa taatatatta catgaacata tagatgctac agggacttac 900 tatttattcg atacgttaac ttcaaccaac aaaagattct accctaagga ttgtgttttt 960 gattatagga ctaataatgt cgagaacatt cctattctct taaacaattt taaaaaattc 1020 atcaagaaat atcaatttga tgatgtcttt gaaaatgata tcatagaaat cgatcctcgt 1080 gaaaatgaaa tcttgtgcaa gataatcaaa gaaggactcg gtgaaagttt agatatcatg 1140 aacacaaata caactgacat ttttaggata atcgatggtt aaaaaaacaa atatagaagt 1200 ttgcatggta gagatgtcag aattagagcc tgggaaaagg ttttggttga tacaacatgt 1260 agaaattccg cattgttaat gaataaactt caaaagttgg tactaatgga aaaatggatt 1320 ttttctaaat gctgccaaga ttgtcctaat ctaaaggatt acctacaaga agctatcatg 1380 ggaaccttac atgaatcctt aagaaattct gtgaaacaac gtttgtacaa cattccacat 1440 gacgtaggaa ttgatcacga agaatttcta atcaatactg ttattgaaac agtaattgat 1500 ttgagcccaa ttgcagacga tcaaatagaa aatagctgca tgtattgcaa atctgttttc 1560 cattgctcaa ttaactgcaa aaagaaacca aatagggaac ttaggcctga ctcgaccaat 1620 ttctcaaaaa cctattatct acaaggtgca cagagacaac aaccacttaa gtccagtgca 1680 aaacgaacaa aagtcttgga acaagacaca aaaaaggtcg aacaaagtgt acaacagcaa 1740 aaaactggta attattgata ccggttccgg cgtaaacatt accaatgaca aaaccttact 1800 gcataattac gaagacagta atcgcagtac acgatttttt ggtattggga aaaacagttc 1860 agtgtctgtt aaagggtatg gctatataaa aatcaagaat ggtcacaaca atactgacaa 1920 taagtgtcta ttaacttact atgtaccgga agaagaatcc actataatca gctgttatga 1980 cttagccaag acaaccaaaa tggttttaag tcgaaaatat accagattgg gaaacaaaat 2040 cataaaaatt aaaaccaaga tagttaatgg tgtcattcac gtaaaaatga acgagttaat 2100 tgaacgcttc ctctccgatg attcaaaaat aaatgcaata aaacctactt cttctcctgg 2160 atttaaacta aataaaaggt ctattacctt ggaagatgct cataaaagaa tgggccatac 2220 aggaattcaa caaattgaaa attccataaa acataatcat tatgaagaat cccttgactt 2280 aatcaaagaa ccaaatgaat tttggtgtca aacctgtaaa atctctaaag ccacgaaacg 2340 aaatcattat accgggtcta tgaataatca tagtactgat catgaaccag gctcatcatg 2400 gtgcatggat atatttggcc ctgtatcaag ttcaaacgcg gacactaaaa ggtacatgct 2460 tattatggtg gataacaaca cgagatattg catgacctcc acacacttca ataagaatgc 2520 tgaaactatt ttagctcaag ttagaaagaa tattcagtac gtggaaacac aatttgacag 2580 gaaagtcaga gaaattaatt cagacagagg tactgaattc acaaatgatc agatagaaga 2640 atattttatt tcaaaaggaa tacatcacat acttacttct acacaagatc atgctgctaa 2700 cggaagagca gaaagataca taagaacaat aataactgat gcaacaacac tcctaagaca 2760 aagtaactta agagtaaaat tttgggaata cgcagtaact tctgctacca atataagaaa 2820 ttgcctggaa cacaaaagta caggtaaact accattgaag gcaatctcac gtcaacctgt 2880 gacagtgaga ttaatgtcat tcttaccatt tggcgaaaaa ggaataattt ggaatcataa 2940 tcacaaaaaa ttgaaaccat ctggacttcc ttctataatt ctatgcaaag atccaaatag 3000 ttatggatac aaattcttta taccatccaa aaataaaatt gtcacatctg ataattatac 3060 aattcccaac tatacaatgg acggtagagt aagaaatact cagaatatta acaagagtca 3120 tcaattcagt tcacataatg atgatgaaga agatcaaatc gaaacggtca caaacttatg 3180 tgaagctttg gaaaactacg aagatgataa taaaccaatt actcgcctgg aagatttgtt 3240 cacagaggaa gagttatctc aaatagactc aaacgcaaaa tacccatctc ctagtaataa 3300 cctagaaggg gacttggatt acgtattttc tgatgttgag gaatctggag attatgacgt 3360 tgaatctgaa ctttcaacga caaataattc aatctcaact gataaaaaca aaattttgtc 3420 aaacaaggat tttaattcag aacttgcatc gactgaaata tccatcagtg gaatcgataa 3480 gaaaggatta ataaatacaa gtcatattga tgaagataag tatgatgaaa aagtacacag 3540 aattccatcg attatacaag agaaactggt aggaagtaaa aatactatta aaatcaatga 3600 cgaaaacaaa atctccgaca gaattcgtag taaaaacatt gggagtattt taaacactgg 3660 actcagtaga tgtgtagata tcaccgatga atctattact aacaaagatg agtcaatgca 3720 caacgcaaaa cccgaactaa ttcaggagca gttaaaaaaa acaaatcatg aaacttcgtt 3780 tcctaaagaa gggagcattg gaacaaatgt aaaattccga aatacaaaca atgagatttc 3840 tttaaaaaca ggcgatacga gtttaccaat aaaaacttta gaaagcatta acaatcacca 3900 tagtaatgat tattccacaa acaaagttga aaagtttgag aaggaaaatc atcatccgcc 3960 cccgattgag gacattgtgg atatgagtga tcaaactgat atggaatcaa actgtcagga 4020 tggtaataac ttaaaagaat taaaagtcac cgataaaaat gtaccaactg acaatggaac 4080 aaatgtgtca ccaaggttgg aacaaaatat tgaacgatct ggatcaccag tacaaacagt 4140 taataaaagt gccttcttaa acaaagaatt cagttctttg aacatgaaaa gaaaacggaa 4200 aagacacgat aaaaacaata gtctaacaag ctatgaatta gaaagagata agaagcgttc 4260 aaaaaagaat cgagtgaaat taattccaga taatatggaa acagtttcag caccaaaaat 4320 tcgagccata tattataatg aagctatttc aaaaaatcct gacctcaaag aaaaacatga 4380 atacaaacag gcatatcata aagaattaca gaatttaaaa gatatgaagg tatttgatgt 4440 cgatgtgaag tacagtagat cagaaattcc tgataattta atagtaccca ccaacacgat 4500 attcacaaag aaaagaaatg ggatttataa ggctaggata gtctgcagag gtgatactca 4560 gtcaccagac acttacagtg taataactac agaatcttta aatcacaatc atattaagat 4620 attcttaatg atgcaaacaa cagaaatatg tttatggacc ctggatatca atcatgcatt 4680 cctatatgct aaattggaag aagaaatata catcccacat ccgctgatag gagatgtgta 4740 cgtcaagcta aataaggcgt tatatggtct aaaacagagt cctaaagaat ggaatgatca 4800 tctaagacaa tacttgaatg gaattggact gaaagataac tcttatactc cgggattata 4860 ccaaaccgag gataaaaatc taatgattgc agtctatgtt gatgactgcg taattgcggc 4920 aagcaatgaa cagagattgg atgaattcat aaacaaattg aaaagtaatt ttgaactgaa 4980 aattacagga acattaatag acgatgtact cgatacagat atattaggaa tggatctagt 5040 atacaacaaa agacttggta ctatcgattt aacattaaaa tcattcataa atagaatgga 5100 taaaaaatac aacgaggaat tgaaaaagat tagaaaaagt tcaattccgc atatgtcaac 5160 ttataaaata gatcctaaga aagacgtact gcaaatgtca gaagaagagt ttagacaagg 5220 tgttctaaag ctacaacaat tactaggtga actaaactat gtcagacaca aatgcagata 5280 cgacattgaa tttgctgtta agaaagtggc tagactagta aattacccac atgaaagagt 5340 cttttatatg atttacaaaa taatccagta cttggttcgg tataaagata ttggaataca 5400 ctatgaccga gactgtaata aagacaaaaa ggttattgct ataactgatg catcagttgg 5460 atcagaatat gatgctcaat caaggattgg agttatatta tggtacggta tgaatatttt 5520 taatgtttat tctaacaaga gcacaaacag atgtgtatca tcaacagaag cagagcttca 5580 tgccatttat gaaggctatc gagactcaga aacgttgaag gtaacattaa aggagctagg 5640 agaaggagac aataatgaca ttgtcatgat cactgactca aagccagcca ttcaaggatt 5700 aaatcgcagc tatcaacaac caaaagagaa attcacttgg ataaaaactg aaataataaa 5760 agaaaaatta aagagaagta taactgttaa aattaccggc aaaggtaata ttgctgattt 5820 actaacaaac cagtatcagc atctgatttt aaaagattta tacaagtatt aaaaaataaa 5880 ataacatcac aggatatttt ggcctcaaca gactattgat aattaattaa tgaagttcta 5940 aacacacaat gaatatctgt tgaagtacaa taatatatct ttaagggagc atgttggaac 6000 gagagtaatt aatagtgaca tgagttgcta tggtaacaat ctaatgctta catcgtatat 6060 taatgtacaa ctcgtatacg tttaagtgtg attgcgccta ttgcagaagg aatgttaaac 6120 gagaagctca gacaatactg aagctgtgtt aaagacctat tagttgaaca tgttatggta 6180 ggtacatata tgaggaatat gagtcgtcac atcaatgtat agtaactacc ggaatcacta 6240 ttatattggt cataattaat atgaccaatc ggcgtgtgtt ttatatacct ctcttattta 6300 gtataagaag atcagtactc acttcttcat taatactaat ttttaacctc taattatcaa 6360 cagagagctt attgcaattg tttttatttc ttggctgcat atatcagtct tgacaggctg 6420 catggggatg acagttagaa actagccaaa ttacccctta tgtagataac aatcattgct 6480 tattcgctct tcccccattt tttttcttgc tcttgctgtt ttttctttta gcgttcgttt 6540 caaggaacaa gagaggaaaa aaaatcaaaa gtagaaaaga agaagaaaaa aacaacgtaa 6600 cacaagttaa caccacaact gaaaaaaaaa ataagaggtg aacgaacgag taactgggga 6660 gaggaaagca gattccacaa tatacattca aattaaagaa atggactcac aaccagttga 6720 cgttgat 6727 //