ID AF053008; SV 1; linear; genomic DNA; STD; PLN; 8301 BP. XX AC AF053008; XX DT 21-MAY-1998 (Rel. 55, Created) DT 14-FEB-2020 (Rel. 143, Last updated, Version 6) XX DE Glycine max env pseudogene, partial sequence; uncharacterized long terminal DE repeat, complete sequence; gag-pol polyprotein (pol) gene, complete cds; DE and envelope-like gene, partial cds. XX KW . XX OS Glycine max (soybean) OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade; OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine; Soja. XX RN [1] RP 1-8301 RX DOI; 10.1073/pnas.95.12.6897. RX PUBMED; 9618510. RA Laten H.M., Majumdar A., Gaucher E.A.; RT "SIRE-1, a copia/Ty1-like retroelement from soybean, encodes a retroviral RT envelope-like protein"; RL Proc. Natl. Acad. Sci. U.S.A. 95(12):6897-6902(1998). XX RN [2] RP 1-8301 RA Gaucher E.A., Laten H.M.; RT ; RL Submitted (09-MAR-1998) to the INSDC. RL Biology, Loyola University of Chicago, 6525 N. Sheridan Rd., Chicago, IL RL 60656, USA XX RN [3] RC Sequence update by submitter RP 1-8301 RA Gaucher E.A., Laten H.M.; RT ; RL Submitted (19-OCT-1998) to the INSDC. RL Biology, Loyola University of Chicago, 6525 N. Sheridan Rd., Chicago, IL RL 60656, USA XX DR MD5; 77f87b2660b5271eaee5e223c993a7dd. DR EuropePMC; PMC22677; 9618510. XX CC On Oct 22, 1998 this sequence version replaced gi:3142377. XX FH Key Location/Qualifiers FH FT source 1..8301 FT /organism="Glycine max" FT /mol_type="genomic DNA" FT /note="contains ICTV exemplar Glycine max SIRE1 virus" FT /db_xref="taxon:3847" FT gene <1..621 FT /pseudo FT /gene="env" FT CDS <1..621 FT /pseudo FT /codon_start=1 FT /gene="env" FT repeat_region 729..2150 FT /rpt_type=LONG_TERMINAL_REPEAT FT /note="5' end ambiguous" FT repeat_region 749..950 FT /rpt_type=DIRECT FT primer_bind 2153..2174 FT /note="tRNA primer binding site" FT gene 2290..6942 FT /gene="pol" FT CDS 2290..6942 FT /codon_start=1 FT /gene="pol" FT /product="gag-pol polyprotein" FT /note="encodes gag, protease, integrase, reverse FT transcriptase, ribonuclease H" FT /db_xref="GOA:O65147" FT /db_xref="InterPro:IPR001584" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR012337" FT /db_xref="InterPro:IPR013103" FT /db_xref="InterPro:IPR025724" FT /db_xref="InterPro:IPR036397" FT /db_xref="InterPro:IPR036875" FT /db_xref="InterPro:IPR039537" FT /db_xref="UniProtKB/TrEMBL:O65147" FT /protein_id="AAC64917.1" FT /translation="MVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNELKPEEDWTKE FT EDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVKMSRLQLLA FT TKFENLKMKEEECIHEFHMNILEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAI FT EEAQDICNMRVDELIGSLQTFELGLSDRTEKKSKNLAFVSNDEGEEDEYDLDTDEGLTN FT AVVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGSEYQKRSDEKPSHSKGIQCHGCEGY FT GHIKAECPTHLKKQRKGLSVCRSDDTESEQESDSDRDVNALTGRFESAEDSSDTDSEIT FT FDELAISYRELCIKSEKILQQEAQLKKVIANLEAEKEAHEDEISELKGEIGFLNSKLEN FT MTKSIKMLNKGSDLLDEVLQLGKNVGNQRGLGFNHKSAGRTTMTEFVPAKNSTGATMSQ FT HRSRHHGTQQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSSSGRKMMWVPKH FT KTVSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITG FT MGKLVHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGS FT RSKDNCYLWTPQETSYSSTCLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLK FT IEEGRICGECQIGKQVKMSNQKLQHQTTSRVLELLHMDLMGPMQVESLGRKRYAYVVVD FT DFSRFTWVNFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSE FT GITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTL FT RRGTPTTLYEIWKGRKPTVKHFHICGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRA FT YRVFNSRTRTVMESINVVVDDLTPARKKDVEEDVRTSGDNVADTAKSAENAENSDSATD FT EPNINQPDKRPSIRIQKMHPKELIIGDPNRGVTTRSREIEIISNSCFVSKIEPKNVKEA FT LTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLV FT AQGYTQIEGVDFDETFAPGARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYV FT EQPKGFVDPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQD FT AENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIF FT LSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTAS FT RPDITYAVGGCARYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDADW FT AGSVDDRKSTFGGCFYLGTNFISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLK FT EYNVEQDVMTLYCDNLSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEE FT QIADIFTKALDANQFEKLRGKLGICLLEDL" FT repeat_region <4441..>8301 FT /rpt_family="copia/Ty1-like retroelement" FT /rpt_type=DISPERSED FT /note="SIRE-1" FT CDS <6970..>8301 FT /codon_start=1 FT /product="envelope-like" FT /note="similar to viral envelope protein" FT /db_xref="UniProtKB/TrEMBL:Q7G127" FT /protein_id="AAC64918.1" FT /translation="TLIARSLLGQNKFDRCFTRPSTFLIQTHIFVVISFSAFPNSSQRF FT TKPFQRLCFSMATSPKDTSSPGSPSVPSSPSSTKAPSNQEQPEFHIQPIQMIPGQAPVP FT EKLVPKRQQGVKISENPSIATSPREVDTEMDKKIRSIVSSILKNASVPDADKDVPTSST FT PNAEVLSSSSKEESTEEEEQATEETPAPRAPEPAPGDLIDLEEVESDEEPIANKLAPGI FT AERLQSRKGKTPITRSGRIKTMAQKKSTPITPTTSRWSKVAIPSKKRKEFSSSDSDDDV FT ELDVPDIKRAKKSGKKVPGNVPDAPLDNISFHSIGNVERWKFVYQRRLALERELGRDAL FT DCKEIMDLIKAAGLLKTVTKLGDCYESLVREFIVNIPSDITNRKSDEYQKVFVRGKCVR FT FSPAVINKYLGRPTEGVVDIAVSEHQIAKEITAKQVQHWPKKGKL" XX SQ Sequence 8301 BP; 2703 A; 1575 C; 1939 G; 2084 T; 0 other; aagctttctg cagggaagct aagtgtgaag tatgcaatcc tgcacaggat tggcgctgca 60 aactgggtac ccaccaatca tacttccact gttgccacag gtttgggtaa atttctgtat 120 gctgttggaa ccaagtccaa atttaatttt ggaaagtata tttttgatca aactgttaag 180 cattcagaat catttgctgt caaattaccc attgccttcc caactgtatt gtgtggcatt 240 atgttgagtc aacatcccaa tattttaaac aacattgact ctgtgatgaa gagagaatcg 300 gctctgtccc tgcattacaa actgtttgag gggacacatg tcccagacat tgtctcgaca 360 tcagggaaag ctgctgcttc aggtgctgta tccaaggatg ctttgattgc tgaactcaag 420 gacacatgca aggtgctgga agcaaccatc aaagccacca cagagaagaa aatggagctg 480 gaacgcctga tcaaaagact ctcagacagt ggcattgatg atggtgaagc agctgaggaa 540 gaagaagaag ccgctgagga agagaaagat gcagcagagg atacagaatc agatgatgat 600 gattctgatg ccaccccatg accatcagac ctttattttt gctttttact cttactagct 660 ataggggcat gtccctttga acaattgatt gctattggtc tgtaatattt gcatgcattc 720 tacttttgtc aaattctgtc taaaaagggg gagtaatagt attatgcatg attttgagta 780 gtaggatact atgtatgcaa tagtagtatt atgcataatt tatgattttg agtagtagga 840 tacgatgtat gcatgattca tgattttgag ggggagttgt atgtatatga ttttgagggg 900 gagtagtatc tgatgatgct gatagaagat ggcatggaga cagggggagc agaaagctga 960 tgtcacgtga gatgtcttga catcctggaa acgacttgca acttgcagaa ttttgctgtc 1020 gcccctacag ataccgctgt gcttgattac tctgataatg aaagttgctg atcccacttg 1080 cataactgct cgtacctgct caggaagtgt ctaagtatgt tttagacaaa atttgccaaa 1140 gggggagatt gttagtgctt agctttactg agttttaaaa gattggctaa aattttgtta 1200 aaacataagc acttagacaa tgaaggaaag ctggagttgc tgcacatgat gtccaacgtt 1260 atgtcaagga atcagattgg gctccacaat gcacaaggca agataaaagg tcaaatgaag 1320 aattgaagct gcaggatcca cgatgtcgga tacaatgtcc aggacatcct gcccgaaaat 1380 actggacaca taaatctgtt atatctttaa cagattaatg tgcagttagc aacagatttg 1440 gcgatctatc tttaggaacg aattaaaaga taattaaagt tcgaattaca aacttgaata 1500 gttcgttcag ggattaaaga ttaaagataa aaactaaaag atcaaactgt atcttttaga 1560 tctttaagtg cagatttttc aggagaatga tagatcttat ccaagcgcaa gatgttgcag 1620 cccagatacg cacactgcta tataaacatg aaggctgcac gagttttcta ccaagtccgg 1680 gattgaagag ttattttgtg agttttggga cttgagtgtt ttgtgagcca ccttgatgtt 1740 accctaacat caagtgttgg acctgagtgt gtagagttga tctctattgt tcagagagca 1800 atctctggtg tgtctttgat ttatttgtaa acacgggaga gtgattgaga gggagtgaga 1860 ggggttctca tatctaagag tggctcttag gtagaggttg cacgggtagt ggttaggtga 1920 gaaggttgta aacagtggct gttagatctt cgaactaaca ctattttagt ggatttcctc 1980 cctggcttgg tagcccccag atgtaggtga cgttgcaccg aactgggtta acaattctct 2040 tgtgttattt acttgtttaa tctgttcata ctgtcaaata taatctgcat gttctgaagc 2100 gtgatgtcgt gacatccggt acgacatctg tcattggtat cagaatttca attggtatca 2160 gagcgggcac tctaaatcac tgagtgagat ctagggagat aaattctgat gaacatggag 2220 aaagaaggag gaccagtgaa caaaccaccc attctggatg gaaccaacta tgaatactgg 2280 taagcaagga tggtggcctt cctcaaatca ctggatagca gaacctggaa agctgtcatc 2340 aaaggctggg aacatcccaa gatgctggac acagaaggaa agcccactaa tgaattgaag 2400 ccagaagaag actggacaaa agaagaagac gaattggcac ttggaaactc caaagccttg 2460 aatgctctat tcaatggagt tgacaagaat atcttcagac tgatcaacac atgcacagtg 2520 gccaaggatg catgggagat cctgaaaacc actcatgaag gaacctccaa agtgaagatg 2580 tccagattgc aactattggc tacaaaattc gaaaatctga agatgaagga ggaagagtgt 2640 attcatgaat tccacatgaa cattcttgaa attgccaatg cttgcactgc cttgggagaa 2700 aggatgacag atgaaaagct ggtgagaaag atcctcagat ctttgcctaa gagatttgac 2760 atgaaagtca ctgcaataga ggaggcccaa gacatttgca acatgagagt agatgaactc 2820 attggttccc ttcaaacctt tgagctagga ctctcggata ggactgaaaa gaagagcaag 2880 aatctggcgt tcgtgtccaa tgatgaagga gaagaagatg agtatgacct ggatactgat 2940 gaagggctga ctaacgcagt tgtgctcctt ggaaaacagt tcaacaaagt gctgaacaga 3000 atggacagga ggcagaaacc acatgtccgg aacatccctt tcgacatcag gaaaggtagt 3060 gaataccaga aaaggtcaga tgaaaagccc agtcacagca aaggaattca atgccatggg 3120 tgtgaaggct atggacacat caaagctgaa tgtcccactc atctcaagaa gcagaggaaa 3180 ggactttctg tatgtcggtc tgatgataca gagagtgaac aagaaagtga ttctgacaga 3240 gatgtgaatg cactcactgg gagatttgaa tctgctgaag attcaagtga tacagatagt 3300 gaaatcactt ttgatgagct tgctatatcc tatagagaac tatgcatcaa aagtgagaag 3360 attcttcagc aagaagcaca actaaagaag gtcattgcaa atctggaggc tgagaaggag 3420 gcacatgaag atgaaatctc tgaacttaaa ggagaaattg gttttctgaa ctctaaactg 3480 gaaaacatga caaaatcaat aaagatgctg aataaaggct cagatttgct tgatgaggtg 3540 ctacagcttg ggaagaatgt tggaaaccag agaggacttg gatttaatca taaatctgct 3600 ggcagaacaa ccatgacaga atttgttcct gccaaaaaca gcactggagc cacgatgtca 3660 caacatcggt ctcgacatca tggaacgcag cagaaaaaga gcaaaagaaa gaagtggagg 3720 tgtcactact gtggcaagta tggtcacata aagccctttt gctatcattt acatggccat 3780 ccacatcatg gaactcaaag tagcagcagc ggaaggaaga tgatgtgggt tccaaaacac 3840 aagactgtta gtcttgttgt tcatacttca cttagagcat cagctaagga agattggtac 3900 ctagatagcg gctgttccag acacatgaca ggagttaaag aattcctggt gaacattgaa 3960 ccttgctcca ctagctatgt gacatttgga gatggctcta aaggaaagat cactggaatg 4020 ggaaagctag tccatgatgg acttcctagt ctaaacaaag tactgctggt gaagggactg 4080 actgcgaact tgatcagcat cagtcagttg tgtgatgaag gattcaatgt aaacttcaca 4140 aagtcagaat gcttggtgac aaatgagaag agtgaagtcc taatgaaggg cagcagatca 4200 aaggacaact gttacctatg gacacctcaa gaaaccagtt actcctccac atgtctattc 4260 tccaaagaag atgaagtcaa aatatggcat caaagatttg gacatctgca cttaagaggc 4320 atgaagaaaa tcattgacaa aggtgctgtt agaggcattc ccaatctgaa aatagaagaa 4380 ggcagaatct gtggtgaatg tcagattgga aagcaagtca agatgtccaa ccagaagctt 4440 caacatcaga ccacttccag ggtgctggaa ctacttcaca tggacttgat ggggcctatg 4500 caagttgaaa gccttggaag aaaaaggtat gcctatgttg ttgtggatga tttctccaga 4560 tttacctggg tcaactttat cagagagaaa tcagacacct ttgaagtatt caaggagttg 4620 agtctaagac ttcaaagaga aaaagactgt gtcatcaaga gaatcaggag tgaccatggc 4680 agagagtttg aaaacagcaa gtttactgaa ttctgcacat ctgaaggcat cactcatgag 4740 ttctctgcag ccattacacc acaacaaaat ggcatagttg aaaggaaaaa caggaccttg 4800 caagaagctg ctagggtcat gcttcatgcc aaagaacttc cctataatct ctgggctgaa 4860 gccatgaaca cagcatgcta catccacaac agagtcacac ttagaagagg gactccaacc 4920 acactgtatg aaatctggaa agggaggaag ccaactgtca agcacttcca catctgtgga 4980 agtccatgtt acattttggc agatagagag caaaggagaa agatggatcc caagagtgat 5040 gcagggatat tcttgggata ctctacaaac agcagagcat atagagtatt caattccaga 5100 accagaactg tgatggaatc catcaatgtg gttgttgatg atctaactcc agcaagaaag 5160 aaggatgtcg aagaagatgt cagaacatcg ggagacaatg tagcagatac agctaaaagt 5220 gcagaaaatg cagaaaactc tgattctgct acagatgaac caaacatcaa tcaacctgac 5280 aagagaccct ccattagaat ccagaagatg caccccaagg agctgattat aggagatcca 5340 aacagaggag tcactacaag atcaagggag attgagatta tctccaattc atgttttgtc 5400 tccaaaattg agcccaagaa tgtgaaagag gcactgactg atgagttctg gatcaatgct 5460 atgcaagaag aattggagca attcaaaagg aatgaagttt gggagctagt tcctaggccc 5520 gagggaacta atgtgattgg caccaagtgg atcttcaaga acaaaaccaa tgaagaaggt 5580 gttataacca gaaacaaggc cagacttgtt gctcaaggct acactcagat tgaaggtgta 5640 gactttgatg aaacttttgc ccctggtgct agacttgagt ccatcagact gttacttggt 5700 gtagcttgca tcctcaaatt caagctgtac cagatggatg tgaagagcgc atttctgaat 5760 ggatacctga atgaagaagc ctatgtggag cagccaaagg gatttgtaga tccaactcat 5820 ccagatcatg tatacaggct caagaaggct ctctatggat tgaagcaagc tccaagagct 5880 tggtatgaaa ggctaacaga gttccttact cagcaagggt ataggaaggg aggaattgac 5940 aagaccctct ttgtcaaaca agatgctgaa aacttgatga tagcacagat atatgttgat 6000 gacattgtgt ttggagggat gtcgaatgag atgcttcgac attttgtcca acagatgcaa 6060 tctgaatttg agatgagtct tgttggagag ctgacttatt ttctgggact ccaagtgaag 6120 cagatggaag actccatatt cctctcacaa agcaagtatg caaagaacat tgtcaagaag 6180 tttgggatgg aaaatgccag ccataaaaga acacctgcac ctactcactt gaagctgtca 6240 aaagatgaag ctggcaccag tgttgatcaa agtctgtaca gaagcatgat tgggagctta 6300 ctatatttaa cagctagcag acctgacatc acctatgcag taggtggttg tgcaagatat 6360 caagccaatc ctaagataag tcacttgaat caagtaaaga gaattttgaa atatgtaaat 6420 ggcaccagtg actatgggat tatgtactgt cattgttcag attcaatgct ggttgggtat 6480 tgtgatgctg attgggctgg aagtgtagat gacagaaaaa gcacttttgg tggatgtttt 6540 tatttgggaa ccaattttat ttcatggttc agcaagaagc agaactgtgt gtccctatcc 6600 actgcagaag cagagtatat tgcagcagga agcagctgtt cacaactagt ttggatgaag 6660 cagatgctca aggagtacaa tgtcgaacaa gatgtcatga cattgtactg tgacaacttg 6720 agtgctatta atatttctaa aaatcctgtt caacacagca gaaccaagca cattgacatt 6780 agacatcact atattagaga tcttgttgat gataaagtta tcacactgga gcatgttgac 6840 actgaggaac aaatagcaga tattttcaca aaggcattgg atgcaaatca gtttgaaaaa 6900 ctgaggggca agctgggcat ttgtctgcta gaggatttat agcaattact tttatctgaa 6960 cgtgcttaaa cgttaatagc gcgttctcta ctgggccaaa acaaattcga ccgttgcttc 7020 acacgtccct ctacattcct cattcaaact catattttcg tggtaatctc gttttcagca 7080 ttccccaaca gctctcagag atttacgaaa ccattccaaa ggctctgctt ctccatggct 7140 acctcaccaa aagatacttc atctcctggt tcaccctctg taccatcatc tccatcatcc 7200 accaaagcac catcaaacca ggaacaacct gaattccata tccaacccat acaaatgatt 7260 cctggtcaag cccctgttcc tgagaaactg gtccccaaaa gacaacaggg agtgaagatt 7320 tctgaaaacc ctagcattgc aacaagtcct agggaagtag acacggagat ggataagaag 7380 atccgcagta ttgtgagtag tattctgaaa aatgcttctg tccctgatgc tgataaagat 7440 gttccaacat cttccacccc aaatgctgaa gtcctctctt catccagtaa agaggaatca 7500 acagaggaag aggaacaagc cacagaggag acccctgcac caagggcacc agaacctgct 7560 ccaggtgacc tcattgacct agaagaagta gaatctgatg aggaacccat tgccaacaag 7620 ttggcacctg gcattgcaga aagattacaa agcagaaagg gaaaaacccc cattactagg 7680 tctggacgaa tcaaaactat ggcacagaag aagagcacac caatcactcc taccacatcc 7740 agatggagca aagttgcaat cccttccaag aagaggaaag aattttcctc atctgattct 7800 gatgatgatg tcgaactaga tgttcccgac atcaagaggg ccaagaaatc tgggaaaaag 7860 gtgcctggaa atgtccctga tgcaccattg gacaacattt cattccactc cattggcaat 7920 gttgaaaggt ggaaatttgt atatcaacgc agacttgcct tagaaagaga actgggaaga 7980 gatgccttgg attgcaagga gatcatggac ctcatcaagg ctgctggact gctgaaaaca 8040 gtcaccaagt tgggagattg ttatgaaagc ctagtcaggg aattcattgt caacattccc 8100 tctgacataa caaacagaaa gagtgatgag tatcagaaag tgtttgtcag aggaaaatgt 8160 gttagattct cccctgctgt aatcaacaaa tacctgggca gacctactga aggagtggtg 8220 gatattgctg tttctgagca tcaaattgcc aaggaaatca ctgccaaaca agtccagcat 8280 tggccaaaga aagggaagct t 8301 //