ID X13291; SV 1; linear; genomic DNA; STD; PLN; 5258 BP. XX AC X13291; XX DT 23-NOV-1989 (Rel. 21, Created) DT 06-JUL-2002 (Rel. 72, Last updated, Version 7) XX DE Arabidopsis thaliana DNA for copia-like transposable element Ta1-3 XX KW copia-like element; integrase; polyprotein; protease; repetitive sequence; KW retrotransposon; reverse transcriptase; RNA binding protein. XX OS Arabidopsis thaliana (thale cress) OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. XX RN [1] RP 1-5258 RA Voytas D.F.; RT ; RL Submitted (17-OCT-1988) to the INSDC. RL Voytas D.F., Harvard University, 10 Wellman, Massachusetts General RL Hospital, Boston, MA 02114. XX RN [2] RX DOI; 10.1038/336242a0. RX PUBMED; 2904123. RA Voytas D.F., Ausubel F.M.; RT "A copia-like transposable element family in Arabidopsis thaliana"; RL Nature 336(6196):242-244(1988). XX DR MD5; ba98ec0d5b2774f1517d28629dd01a29. DR EuropePMC; PMC136960; 12456661. DR EuropePMC; PMC1383729; 12045146. DR EuropePMC; PMC2576258; 18842156. DR EuropePMC; PMC2633040; 19194510. DR EuropePMC; PMC2666490; 19153256. DR EuropePMC; PMC2717514; 14699263. DR EuropePMC; PMC5258746; 28174588. XX CC The put. polyprotein contains domains for CC RNA-binding (RB) : starting at AA 257, CC protease (P) : starting at AA 318, CC integrase (INT) : starting at AA 463 and CC reverse transcriptase (RT): starting at AA 856. CC CC Data kindly reviewed (22-aug-1989) by Voytas D.F. XX FH Key Location/Qualifiers FH FT source 1..5258 FT /organism="Arabidopsis thaliana" FT /strain="La-O" FT /mol_type="genomic DNA" FT /clone_lib="Lambda FIX." FT /clone="L2B" FT /db_xref="taxon:3702" FT repeat_region 16..20 FT /note="target site for Ta1-3 integration" FT mobile_element 21..5238 FT /mobile_element_type="transposon:Ta1-3" FT misc_feature 21..534 FT /note="5' LTR" FT CDS 567..4442 FT /product="polyprotein" FT /db_xref="GOA:V9H1C3" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR013103" FT /db_xref="InterPro:IPR025724" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR039537" FT /db_xref="UniProtKB/TrEMBL:V9H1C3" FT /protein_id="CAA31653.1" FT /translation="MANDPNQNTILKTSFQVFNENSDFSLWKTCMKAHLGLAGLKGIID FT DFDLTMTVPIPKSEGKKIEDGDEQGDSSQTKIVPDLVKIEKSENAMNIIIAHVGDAVLR FT KIDHCKSAAEMWETLNKQYMETSLPNRIYVQLKFYSFKMNDTKSINENVNEFLKIVAEL FT SSLEINVVEEVRAILFLNRLSSRYSQLKHTLKYGNKALSLKDVISAARSLERELNEQKE FT TDKNTSTVLYTNERSRPQTRNQNHNKGGQGRGRSKSNSNAKLTCWYCKKEGHVKKDYFA FT RKRKLESENPGEAGVITEKLVFSEALSVNDLAVRDIWVLDSGCTSHMSARRDWFCSFRE FT DGGPTILLGDDHSVKSQGQGSIKIETHGGTIIGLENVKYVPELRRNLISTGTLDKRGYK FT HEGGDGKVRYFKNQKTALRGELVNGLYILDGNTVLSETCVAEGSKGKTELWHSRLGHIG FT LNNMKVLAGKGLVSKEEIRVLDFCENCVMGKAKKVSFNVGKHNSEDVLRYVHADLWGST FT NVTPSLSGNKYFLSIIDDKTRKVWLYFLRSKDETFDRFCEWKELVENQQNKKVKCLRTD FT NGLEFCNLKFDAYCKEHGIERHKTCTYTPQQNGVAERMNRTIMEKVRCMLNESGLGEEF FT WAEAAATAAYLINRSPASAIDHNVPEELWLNKKPGYKHLRRFGSIAYVHIDQGKLKPRA FT LKGIFIGYPAGTKGYKIWLLEEHKCVISRNVLFHEESVYKDTMKKERVVESEAEPASHS FT KSTLIKVKTPGNLNSGEVIQVSDEEESDESVEEEQEPETQVELPETQTTSSLANYQLAR FT DRERRQIHPPARFTEESGVAFALVTVETLSMEEPQSYQEATSDKEWKKWKLATHEEMDS FT LIKNGTWVLVDKPQNRKIIGCRWLFKLKSGSPGVEPVRYKAQLVAKGYTHREGVDYQEI FT FALVVKHTSIRILMSVVVDQDLELEQMDVKTAFLHGELEEELYMEQPEGCISEDGENKV FT CLLKKSLYGLKQSPRQWNKRFNRFMIDQNFIRSEHDACVYVKQVSEQEHLYLLLYVDDM FT LIAGKSKSEINKVKEQLSMEFEMKDMGPASRILGIDIIRDMKNGVLRMSQASYIHNVVQ FT RFNMAEAKVTRSPIGAHFKLAAVRDDDECIDNNAVPYASAVGSIMYAMIGIRPDLAYVI FT CLVSRYMARPGSIHWEAVKWILRYMRGSQDLNLVFTKEKEFRVTGYCDSDYAADLDRRR FT SVSGYVFTVGGNTVSWKANLQSVTALSTTEAEFMALTEAAKEALWIKGLMKDLGLEQDK FT VTLWCDS" FT misc_feature 4725..5238 FT /note="3'LTR" FT repeat_region 5239..5243 FT /note="target site for Ta1-3 integration" XX SQ Sequence 5258 BP; 1682 A; 846 C; 1301 G; 1429 T; 0 other; tttcccaaaa gggaaatcaa tgttggagtt atgatccaat tcctaagttg ctaaagtcta 60 atgtcgacta ttaacttaag ttgagtttga atttggattg gaggagctaa accggtctgg 120 ttaagtttgg ttattgaaag aaggaaggat tgagttcggt ttaatctttg aagctgaaga 180 ctcgacttgg tttagagggt ttgacgttga ctataaaaag gactcgtctt cttcttttct 240 gtttcatcct ctgtaacaaa cattgtatct tcttcttctt cctctgatct tgagcttgta 300 acggtgtgtg taaaagcttg agaaactcca ttgatatagt gaattgctgg tcagaatcca 360 gccgagacgt aggcttactc attccgagta gctgaactcg taaatcctct gtgtcacttt 420 attctttgaa tgtttcttgt tttgagagtg agagattaca aattgagaga cgagagagag 480 gttcgtgtgc gtgagatcac aaatcgatca aggtttaagg ttcgtttggt aacaagtggt 540 atcagagcca ttggttcttg cgagctatgg cgaacgatcc aaatcagaac acgatcctga 600 agacctcgtt tcaagtcttt aacgagaatt cagatttttc gctatggaag acgtgtatga 660 aggcacatct gggattggca ggacttaaag gcatcatcga tgattttgat cttacgatga 720 cagtgccaat tccaaaatct gagggaaaga agattgaaga tggtgacgaa caaggagatt 780 cgtctcaaac aaagattgtt cctgatctcg tgaagattga gaaatctgaa aacgcgatga 840 acattatcat cgctcatgtt ggtgatgcag tattgagaaa gatcgatcac tgcaagagtg 900 cagctgagat gtgggaaact ttgaacaagc aatacatgga aacctcattg cctaatcgga 960 tctatgtaca gctcaagttc tattcattca agatgaatga tactaagtcg atcaacgaaa 1020 acgtgaatga attcttaaag atcgtcgcag aattgagtag cttggagatc aatgtggttg 1080 aagaagtaag agccatcttg ttcttgaatc gtttgtcttc aagatattca caactcaaac 1140 atacactcaa gtatgggaac aaggcattgt cactgaaaga tgtgatatca gctgcacgtt 1200 ctcttgaaag agaacttaat gaacaaaagg aaactgataa gaacacctct acagttttgt 1260 atactaatga gagaagcaga cctcagacta gaaatcaaaa tcacaacaaa ggaggtcaag 1320 ggagaggcag aagcaaatcc aactctaatg caaagcttac gtgctggtac tgcaagaaag 1380 agggacatgt caaaaaggac tattttgcta ggaaaaggaa actagaaagt gaaaatccag 1440 gagaagctgg agtcatcact gaaaagctgg tgttttctga agcactcagt gtcaatgatc 1500 tagcagtaag agacatttgg gtacttgact caggttgcac gtctcacatg tctgcaagaa 1560 gggattggtt ctgcagtttt agagaagatg gtggccctac tattcttctg ggagatgacc 1620 actcggttaa atctcaagga caaggatcta ttaagataga aactcatgga ggcactataa 1680 tagggcttga gaatgtgaag tatgtacctg aacttagaag gaacctaatc tccacaggta 1740 ctcttgacaa aaggggatac aaacatgaag gtggtgatgg taaagtgagg tatttcaaga 1800 atcagaaaac agctttaaga ggagagcttg ttaacggact atacatactt gatggaaaca 1860 cagtattatc tgaaacgtgt gttgctgaag gatctaaggg aaaaacagaa ctctggcaca 1920 gtaggctcgg tcatatcggt ctaaacaata tgaaggtgtt agcaggaaaa gggctagtga 1980 gcaaagaaga aataagggta ctggacttct gtgaaaattg tgtcatggga aaggccaaga 2040 aagtgagctt taatgtggga aagcacaact cagaagatgt tctccgctat gtccatgcag 2100 atctgtgggg ttccacaaac gtcacacctt cattgtcagg taacaagtat ttcttgtcaa 2160 taattgatga taaaacacgc aaagtttggt tgtattttct caggtctaaa gacgaaacat 2220 ttgatcgctt ctgcgagtgg aaagagctcg ttgagaatca acaaaacaag aaagtcaagt 2280 gtttgagaac tgacaacgga ttggagtttt gcaacttgaa gtttgatgct tactgtaaag 2340 agcatggaat agaaagacac aagacctgca cctatactcc tcagcagaat ggagtagcag 2400 aacgcatgaa taggacaatc atggagaagg tgaggtgcat gttgaatgag tcagggttgg 2460 gagaagagtt ttgggcagaa gctgctgcaa ctgcagccta tttgataaac aggtccccag 2520 cgtctgcaat tgatcataat gtccctgagg aattatggtt gaataagaaa cctggttaca 2580 aacatttgag gcggtttggt tctattgcat atgtccacat agaccaaggg aagttgaagc 2640 ctagagcttt aaagggaatc tttattggat acccagctgg aacaaaaggg tataagatct 2700 ggcttctaga agaacataaa tgtgtgataa gccgaaatgt gttatttcat gaggaatcag 2760 tgtataagga tactatgaaa aaagaaagag ttgtagaaag tgaagcagaa cctgctagtc 2820 actcaaagag tacactgata aaagtaaaaa ctccagggaa tctgaattca ggtgaagtaa 2880 ttcaagtatc agatgaagaa gaatctgatg aaagtgttga agaagaacag gaacctgaaa 2940 ctcaggtgga gttaccagaa actcaaacaa ctagttcttt agctaactat caactagcta 3000 gagatcgaga aagaaggcag atccatcctc ctgctaggtt tacagaagaa agtggtgtag 3060 catttgcact agtaactgtt gagactttga gtatggagga gccgcagagt tatcaggaag 3120 caacttctga taaagaatgg aagaaatgga aacttgctac tcatgaggag atggattctc 3180 tgattaagaa cggtacatgg gtgttggttg ataaacccca gaaccgaaag atcattggtt 3240 gcagatggtt gtttaaactg aagagtggca gtccaggagt tgagcctgtg agatacaagg 3300 ctcagttagt ggcaaaaggg tacactcata gagagggtgt tgattaccaa gagatctttg 3360 ctctagtggt taaacacaca tctatcagga tattgatgtc tgttgttgtt gatcaagacc 3420 tagagttgga acagatggac gtaaagacag ctttccttca tggagagtta gaagaagaac 3480 tttatatgga acaaccagag ggttgcatat ctgaagatgg tgagaataag gtttgcttat 3540 tgaagaagtc gttgtatggg ttaaaacaat ccccaagaca gtggaacaaa cgcttcaata 3600 gattcatgat tgatcaaaac ttcattagaa gtgagcatga tgcttgtgta tatgtgaagc 3660 aggtcagtga acaagaacac ctgtacctgt tgctatacgt ggatgatatg ttgattgcag 3720 gaaagagcaa atcagaaatt aacaaggtta aagagcagct gagcatggaa tttgaaatga 3780 aagatatggg accagcgagt agaattctcg gcattgacat tataagagac atgaagaatg 3840 gagttctacg catgtctcag gctagctaca ttcacaatgt ggtccagcgg ttcaacatgg 3900 ctgaagccaa agtcacacgg tcaccaatag gagctcattt caagctagct gcagtgaggg 3960 acgatgatga gtgcattgac aacaatgctg taccttatgc cagtgcagtt ggcagtatca 4020 tgtacgccat gataggtata cgtcctgact tagcttatgt tatatgtctg gtaagcaggt 4080 acatggcaag accaggcagt attcactggg aagcagtcaa gtggattctc aggtacatgc 4140 gaggatctca ggacttaaat cttgtgttta caaaagagaa agaattcaga gttacggggt 4200 attgtgattc ggactatgct gctgatttgg atagaagaag atcagtaagt ggatacgtgt 4260 ttacagtagg tggtaacaca gtaagttgga aggcaaattt gcagtcagtg actgcattat 4320 caactacaga agccgagttc atggcactta cagaagctgc caaagaagct ttatggatta 4380 aaggcttaat gaaggacttg ggacttgagc aggataaggt aaccctttgg tgtgattcct 4440 agtcagctat ttgcttgttt aaaaacagta ctcatcatga aaggactaag catatagatg 4500 tcagatacaa cttcataaga gatgttgtgg aagcaggaga tgtggatgta cttaagatac 4560 acacttcaag aaatcctgcg gatgctttaa ccaagagcat tccggtaaac aagtttcagt 4620 cagctttaga gttgctgaag ctggttaagt gggactgagg tgattcagcc actgctatga 4680 ccatggagag taattcacgg ttggaatagg atcaaggtgg agattgttgg agttatgatc 4740 caattcctaa gttgctaaag tctaatgtcg actattgact taagctgagt ttgaatttgg 4800 attggaggag ctaaaccggt ctggttaagt ttggttattg aaagaaggaa ggattgagtt 4860 cggtttaatc tttgaagctg aagactcgac ttggtttaga gggtttgacg ttgactataa 4920 aaaggattcg tcttcttctt ttctgtttca tcctctgtaa caaacattgt atcttcttct 4980 tcttcctctg atcttgagct tgtaacggtg tgtgtaaaag cttgagaaac tccactgata 5040 tagtgaattg ctggtcagaa tccagccgag acgtaggctt actcattccg agtagctgaa 5100 cccgtaaatc ttctgtgtca ctttattctt tgagtgtttc ctgttttgag agtgagagat 5160 tacaaattga gagacgagag agaggttcgt gtgcgtgaga tcacaaatcg atcaaggttt 5220 aaggttcgtt tggtaacaat caatataact tcaatgta 5258 //