![]() |
EBI DbfetchID L00950; SV 1; linear; genomic DNA; STD; INV; 6974 BP. XX AC L00950; XX DT 06-SEP-1992 (Rel. 33, Created) DT 04-MAR-2000 (Rel. 63, Last updated, Version 6) XX DE Nasonia vitripennis retrotransposable element R2, complete sequence. XX KW . XX OS Nasonia vitripennis (jewel wasp) OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; OC Endopterygota; Hymenoptera; Apocrita; Chalcidoidea; Pteromalidae; OC Pteromalinae; Nasonia. XX RN [1] RP 1-6974 RX PUBMED; 8383793. RA Burke W.D., Eickbush D.G., Xiong Y., Jakubczak J.L., Eickbush T.H.; RT "Sequence relationship of retrotransposable elements R1 and R2 within and RT between divergent insect species"; RL Mol. Biol. Evol. 10(1):163-185(1993). XX RN [2] RP 1-6974 RX DOI; 10.1038/32330 RX PUBMED; 9515960. RA Burke W.D., Malik H.S., Lathe W.C.III., Eickbush T.H.; RT "Are retrotransposons long-term hitchhikers?"; RL Nature 392(6672):141-142(1998). XX RN [3] RP 1-6974 RA Burke W.D., Malik H.S., Eickbush T.H.; RT "Identification of a conserved domain structure and retrotransposition RT mechanism of R2 retrotransposons from diverse arthropods"; RL Unpublished. XX RN [4] RP 1-6974 RA Burke W.D.; RT ; RL Submitted (06-SEP-1992) to the EMBL/GenBank/DDBJ databases. RL Biology, Univ. of Rochester, Rochester, NY 14627, USA XX RN [5] RC Sequence update by submitter RP 1-6974 RA Burke W.D.; RT ; RL Submitted (09-SEP-1998) to the EMBL/GenBank/DDBJ databases. RL Biology, Univ. of Rochester, Rochester, NY 14627, USA XX CC On Sep 9, 1998 this sequence version replaced gi:2317817. XX FH Key Location/Qualifiers FH FT source 1..6974 FT /organism="Nasonia vitripennis" FT /mol_type="genomic DNA" FT /tissue_lib="Charon 35 (from W. Burke)" FT /db_xref="taxon:7425" FT 5'UTR 1..699 FT repeat_region 1..6974 FT /standard_name="retrotransposon R2" FT /rpt_type=DISPERSED FT CDS <700..2166 FT /codon_start=1 FT /product="putative chimeric R1/R2 retrotransposon" FT /note="ORF1; similar to the ORF1 sequence of R1 elements FT from arthropods; contains three cysteine motifs" FT /db_xref="GOA:O76962" FT /db_xref="InterPro:IPR001878" FT /db_xref="UniProtKB/TrEMBL:O76962" FT /protein_id="AAC34928.1" FT /translation="DTQNPTCTDASKLRSYYSQNASKEQQLISEGHDEDLGMTEGLFSP FT PVDIGKAKDVEDEIRGHLYFLETVDLSRLSKESQAGIAKAVTGLTKGMDLVMYHVRTMT FT KEAGLAGAFSKVLQETMRKVILKEREQHLQRVQNAVKSVGEKVQKAIAEISEIGTGMSS FT NVDINALAESTANKIARGWKESEKQQMSKLEELKQSIDEAKTTGNNITYAQAAGNQWTT FT VGQKRKRILSSAGLEVTVTETEDVLIKPEQERGEEYPNAAVIIKKLKATIDPDEQNITV FT ERMIPRTNMVVAVVKKGDGPKLIEGITRKGIGLLAETRKKLQPRILVQDIPEEMEEEEL FT MARLKKNVSLEAQRDEVRLIRMIKTRRGNKLAVIELPARAHEDLTHLQKVKIGWSICRI FT ATDIRPNQCYKCQAFGHHAARCASDAVCAKCAQNHETKTCRNKGARKCANCSKACRADC FT NHPAFDATKCPIFRAELEKSARNIDYSYSE" FT CDS <2230..5307 FT /codon_start=1 FT /product="reverse transcriptase" FT /note="ORF2; contains 2 cysteine motifs (putative DNA FT binding), reverse transcriptase and site-specific FT endonuclease" FT /db_xref="GOA:Q03278" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR007087" FT /db_xref="InterPro:IPR015706" FT /db_xref="InterPro:IPR015880" FT /db_xref="UniProtKB/Swiss-Prot:Q03278" FT /protein_id="AAC34927.1" FT /translation="NQIKKSNTSTGARIPKAMTNPADNFAGGQWKPPGRRSARTSATGM FT FVCEHCLRAFTTNTGRGLHIKRAHEEQANEAITTERSRARWTNEEMEAVQAEIDCEGRT FT AINQEILRIIPYQRTIDAIKCLRKQQKYKTIRERVANRRAENRARETELTRLETADEDP FT ASQEQDNPNMSLKNWLKEVIESDDDRLCADLRTAIEMALAGQSPLDVCTVGCYQYTMTN FT LPLVPVRLGGPIYWCNAQSRSNPGETQRRQTIKESNNSWKKNMSKAAHIVLDGDTDACP FT AGLEGTEASGAIMRAGCPTTRHLRSRMQGEIKNLWRPISNDEIKEVEACKRTAAGPDGM FT TTTAWNSIDECIKSLFNMIMYHGQCPRRYLDSRTVLIPKEPGTMDPACFRPLSIASVAL FT RHFHRILANRIGEHGLLDTRQRAFIVADGVAENTSLLSAMIKEARMKIKGLYIAILDVK FT KAFDSVEHRSILDALRRKKLPLEMRNYIMWVYRNSKTRLEVVKTKGRWIRPARGVRQGD FT PLSPLLFNCVMDAVLRRLPENTGFLMGAEKIGALVFADDLVLLAETREGLQASLSRIEA FT GLQEQGLEMMPRKCHTLALVPSGKEKKIKVETHKPFTVGNQEITQLGHADQWKYLGVVY FT NSYGPIQVKINIAGDLQRVTAAPLKPQQRMAILGMFLIPRFIHKLVLGRTSNADVRKGD FT KIIRKTVRGWLRLPHDTPIGYFHAPIKEGGLGIPAFESRIPELLKSRIEALGASNMQTA FT RSLLGGDWVAERKKWINTQKIKNSEWAQKLHLTTDGKDLRDTRKAEASYSWIRDIHVAI FT PASVWIKYHHTRINALPTLMRMSRGRRTNGNALCRAGCGLPETLYHVVQQCPRTHGGRV FT LRHDKIAEQVAIFMQEKGWLVLREAHIRTSVGLRKPDIIARKGQDCKIIDCQIVTTGND FT IRIQHERKIQYYASNWELRRSAATMIGHQGQVSVEAITISWKGVWEPRSYCLLRDCGIP FT KVKIKGLTTRVLLGAYLNFNTFSKATYRTERRRTAN" FT 3'UTR 5308..6974 XX SQ Sequence 6974 BP; 2321 A; 1496 C; 1776 G; 1381 T; 0 other; agagcggttc gatcgcacaa tggctctgtg taaaagtgtc taattagtgg tttaggagtc 60 cggagtggac cagaaagtga ataacgagtg ctgttggtga ttctgcatgt gtggtaataa 120 taattgttgc aatacctgct aagtggacgc gaaaagaaca gaaaatgtta aactaacctc 180 aaaatctacc acgcgccaca tggccgtaca ttacttgaaa gacgctaaac aaaccgtatg 240 tgaacctatg gaacaatata tatcaaaatg ttcggaaaaa agtcgcgcga ggttgtgaac 300 tgttagcaga atgttattct aacaatatct cgagcaatca tcggaaaaag tgaacacgtg 360 gactaacata acctcaaaaa gtgcgtggcc atctgccggg ggatatggta ctgccgtgtg 420 ggctcggttg gtaagacagt atactatctt atcgactgca ggtcaaactg tgtgtactcg 480 gtaagttatc tttgtggcct tccgacagtg ggtactgcag ttgcgtaggc aggttgtcgc 540 aacataacct ccaccaagac acggtgcttc tggcgatacg atggggacac caccaaaaag 600 gtcaggctcg gaagattcca gaatcaatcg gaagtcaaca tcaaaggata cgatgaagaa 660 accaccaaat gtccagaata gccaagataa gaagcatgag acacccagaa cccaacatgt 720 actgacgcca gtaaactcag gtcctactat tcccagaacg ccagcaagga gcagcagctt 780 atttctgagg gacacgatga ggacctcggc atgactgagg ggctgttctc acccccagtt 840 gacatcggaa aggcgaagga cgtggaagat gagattcgag gtcaccttta cttcctggaa 900 acggttgatc tctcgaggct gagcaaggaa tcgcaagcgg gcatagcaaa ggcagtgact 960 ggactaacca agggtatgga cttggtaatg taccatgtca ggacgatgac gaaagaggcc 1020 ggactggcag gtgcattttc aaaggtactg caagagacca tgagaaaagt tattctgaaa 1080 gagagagagc aacatctgca aagagtacag aatgctgtca aatcggtagg agagaaagtg 1140 cagaaagcta ttgctgaaat aagcgaaata ggaacaggga tgtcatcaaa cgtggacata 1200 aatgcgctag ctgaatccac agcaaacaag atagccagag gatggaagga gagcgagaaa 1260 cagcagatgt caaagctgga ggaactgaag cagtccatcg atgaagcaaa gaccaccgga 1320 aacaacatca cgtatgctca agctgctgga aaccaatgga ccacggtagg acagaaaaga 1380 aagagaatac tttctagcgc gggcctagag gttacggtga ccgagaccga agatgtgctt 1440 ataaaaccgg aacaggaaag aggggaagag taccccaatg cagccgtgat aattaagaag 1500 ctcaaagcaa ccatcgaccc agatgaacag aacatcacgg tggaaaggat gatcccgaga 1560 actaatatgg ttgtagcagt cgtgaagaaa ggagatggac cgaaactgat tgagggtata 1620 acccggaaag gtattggtct tctggctgaa acaaggaaga agctacaacc aagaatcctg 1680 gttcaagaca tcccagaaga aatggaagaa gaagaactaa tggctcggtt gaaaaagaac 1740 gtatcactag aagcgcaaag agatgaggtc agactaatca gaatgatcaa aaccagaaga 1800 ggtaacaagc ttgcagtaat tgaactgcca gccagagcgc atgaagacct aacgcatctc 1860 caaaaggtga aaattggatg gtctatctgc aggatagcga cagacataag gccaaaccaa 1920 tgctataaat gccaggcatt cgggcaccat gcagccaggt gtgcctcgga cgcggtgtgt 1980 gcaaaatgcg cccagaatca tgagaccaaa acatgcagaa ataaaggcgc taggaaatgt 2040 gccaactgca gcaaggcctg cagagctgac tgcaaccacc cggcattcga tgccactaag 2100 tgtccaatct ttagggcaga gctagagaag agtgcgagga acattgacta ctcctattcg 2160 gaatagtcag gttacgtata caggggtgac tccctgccgc aaaggcgccg caaggaacac 2220 ctgcgataaa accaaattaa aaagtcaaac acatccactg gcgcacgcat acctaaagct 2280 atgacaaacc ctgcggacaa tttcgcaggg ggccagtgga aaccacccgg gcgaagatct 2340 gcccggacta gcgcgacagg catgtttgtc tgcgaacact gcctaagagc gttcaccacc 2400 aacacgggaa gaggactaca tataaagaga gcccacgaag aacaagcaaa cgaagcaata 2460 acaacagaaa gaagcagggc aaggtggacc aacgaagaaa tggaagcggt gcaagctgaa 2520 attgactgcg aggggagaac ggctatcaac caggagatcc taaggataat accctaccag 2580 aggaccatcg atgccataaa atgcctaagg aaacaacaga aatacaagac cattagggaa 2640 agagtcgcaa acagaagagc ggaaaacaga gccagggaga ctgagctgac aagactggaa 2700 acagcagatg aagacccagc aagccaggag caggacaatc caaatatgtc cctgaagaat 2760 tggctgaaag aagtcattga gagcgacgat gacaggttgt gcgcggacct gcggacagcc 2820 atagaaatgg cactagcagg tcagtcacca cttgatgtct gcaccgttgg ctgctatcaa 2880 tacacaatga cgaatctgcc actggtacca gtgcgactgg gaggacccat ctactggtgc 2940 aacgcccaaa gccgcagcaa tccaggagaa acgcaaagaa ggcagactat aaaagaatcc 3000 aacaactctt ggaagaagaa catgagcaaa gcagcccaca tagtacttga cggagacact 3060 gacgcgtgtc cagctggtct tgaaggaaca gaagcatctg gtgcgattat gagggcaggc 3120 tgtccaacaa cacgacactt gcgatcaagg atgcaaggtg aaattaagaa cttatggagg 3180 ccaataagta atgatgaaat caaggaggtt gaagcctgca agcggactgc ggctggtcct 3240 gacggaatga caacgacagc atggaacagc atagatgagt gcataaaaag cctttttaac 3300 atgataatgt accatgggca atgccccagg agatatcttg actcaagaac tgtactcatc 3360 ccaaaggagc ctggaacaat ggacccagca tgctttaggc cgctgtccat tgcatcagtt 3420 gcactgcgac acttccacag aatactggca aatagaatag gtgagcatgg actcctcgac 3480 acaagacaaa gagcgttcat tgtggctgac ggtgttgcgg aaaacacttc gctactatcg 3540 gccatgatca aagaggccag aatgaagata aaaggcttat acatcgctat actcgacgta 3600 aagaaagcgt ttgactccgt agagcacagg tcaatcttag atgccctgag aagaaagaaa 3660 ctaccacttg aaatgaggaa ctacatcatg tgggtgtaca gaaactccaa aaccaggctg 3720 gaagtagtaa aaacgaaggg cagatggatt cgcccggcga ggggagtgag acagggtgac 3780 ccgctctcgc cactcctgtt caactgcgtg atggatgctg tccttcggag gctgccagag 3840 aatacaggct tcttgatggg tgcagaaaag attggtgctc tcgtcttcgc ggacgacctg 3900 gttctccttg cagagacgag agagggtctg caggcgtctc taagtaggat tgaggctgga 3960 ctacaggagc aaggcctaga aatgatgcca aggaaatgcc acactcttgc gctggtgccg 4020 tccggaaaag agaaaaagat aaaggttgaa acgcataaac cgtttactgt aggcaaccag 4080 gaaataacgc agcttggaca tgcggaccag tggaagtatc taggtgtggt gtacaactcc 4140 tacggaccaa ttcaggttaa gatcaacatc gcgggtgacc ttcagagagt aactgctgcc 4200 ccactaaagc cacagcagag aatggctatc ctgggtatgt tcctgatacc cagatttata 4260 cacaaactcg tgcttggcag gacatcaaat gcggacgtgc gtaaaggaga caagatcatt 4320 aggaagaccg tcagagggtg gctcagactg ccacatgata ctccgatcgg gtacttccac 4380 gcgccgatta aggaaggtgg tttgggcatt ccagcgtttg agtccaggat tccagagctt 4440 ctaaaatcaa gaatagaagc acttggagca tccaacatgc aaactgcaag aagccttctt 4500 gggggcgact gggtggccga aaggaagaag tggatcaaca cccagaaaat caagaactcg 4560 gaatgggctc agaaactaca cctaacaacg gatggcaaag acctacggga caccaggaaa 4620 gcagaggcgt catacagttg gataagggac atacatgttg ctataccagc tagcgtctgg 4680 ataaagtacc accacaccag aatcaacgca cttcccacac tgatgagaat gagcagaggc 4740 agacgaacaa atggaaatgc tctgtgcaga gccggatgtg gacttccgga gaccctctac 4800 cacgttgttc aacagtgccc ccgcacccat ggaggaagag tattacgtca tgacaaaata 4860 gctgagcaag tagccatctt catgcaagag aaaggttggc tggtgctaag agaggcacac 4920 attagaactt cagtgggact aagaaagcca gacattattg cacgaaaagg acaggattgt 4980 aaaatcatcg actgccaaat cgttacaacg gggaatgata tacggataca acatgagagg 5040 aaaattcagt attacgccag caactgggag ctgcgaagat cagcagcaac catgatcggg 5100 catcaagggc aagtaagtgt ggaggctata acaatatcct ggaagggcgt gtgggagcca 5160 cgatcatact gtttgctcag ggattgcggc ataccaaaag tcaagatcaa agggctaacg 5220 actagagttc ttcttggggc gtatctaaat tttaacacct ttagcaaggc gacatataga 5280 actgaaagga gacgaacagc aaactaagag aagtaccaca ccagctataa acgcctaaaa 5340 tatgcattaa agaaaaattt tgcttagaca tccggaaagg ccagcaccct tgggagcatc 5400 accaggaact ggagagtggc cacacatagt aagaacgaag aaatagcgct caaaaaccgc 5460 aaaaactgag caatcttagc aaattataaa agacggaaaa acagcctaaa taacaaaaat 5520 aacggaaaat cttgtttagt catcgtagaa agtactccac cagacgaaca catacaaaca 5580 agaaagacat cggaaaaata attatatcaa aagttatgac taaaagagta aacaagtagt 5640 gcagccctgc aaaggaaaac taagtggtga agccctacat gtagtcaggc agaaacccgc 5700 aagtgattct gccaactgtg gggcatataa gtaaagcagt gtaagaatag ggacattgaa 5760 actagaggac atctacaggc aagaaagcga tagaaatgac aactagacgt accagaagcc 5820 ggagtgcagc cacgaaagtg aaaatggcaa catcaggccc agcaaaggaa cctgaagaag 5880 ggaaacctga cctggttaag gcagcattat ggctaaatgg acggcttttt ggcaacaagt 5940 aaaggggttt acctaccgct ggggaccaag gatgacctcc atgtatgcat tcacgaacac 6000 cagaaccagg gagattgttg taagctggga cctgctaagg gagctgccga tggccgtctt 6060 cgaaggggtg ctggcgcacg agatgctgca cgctttgatg ttcctagaaa actatgacat 6120 tttctggaaa cgtaccagta aggagctgat cagcgtgtgg aaaaagagaa gtcaaaggaa 6180 gccacccaga gctacgcaga cttcgtcaaa gaacatggtg aggtgctaga cgaccatggc 6240 ccggagttca agcgtcgatg catccccttg tctaaggcct tgggagtaaa tgtggcagat 6300 gtctctgaca aggacaaaac cgagaagaac ttccatctaa ggcggtttga atgggaatgt 6360 aggtcatgcc atgagatggc atacctctcc actaggcgca agcctggctc ccttaccaag 6420 cacaaggatg gatgcaagaa ggcaaattgg aaactcgtgg aggagtatga gaggccgcca 6480 aaagtcccga accacgaatc ctacgcaaga aacgctcgga agtacgacca atagtagagg 6540 agtttggaca acagggacaa tgctgcagta catgaaccac ctacacccag gcatcatgaa 6600 gactcaaatg caattattta aaatcatctt tttgtttttt ttgcttattt tattttagcc 6660 ttatcaagtg aacgctatgt cgcgctagta ttaattctta ttattatttc ttttcactgt 6720 cctaaacttc tccttctttt ctttttcttt tgcattctct atatctttag ccttgttaaa 6780 tagaaactat aaattttttg ttttcttttc ttttctatca agaaaaggga gagccttcaa 6840 tatattttgt tattctagtt ttattttgta aatactataa ttacaatatg taaacaatgc 6900 acgaacaaaa aagtgcattt cttttgttag ctgtacccca tgcaggagtg ctatgggcaa 6960 taaatcatat tatc 6974 // ![]() |