EBI Dbfetch

ID   AF164610; SV 1; linear; genomic DNA; STD; HUM; 9178 BP.
XX
AC   AF164610;
XX
DT   01-SEP-1999 (Rel. 60, Created)
DT   07-SEP-1999 (Rel. 61, Last updated, Version 2)
XX
DE   Homo sapiens endogenous retrovirus HERV-K102, complete sequence.
XX
KW   .
XX
OS   Homo sapiens (human)
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
XX
RN   [1]
RP   1-9178
RX   DOI; 10.1016/S0960-9822(99)80390-X
RX   PUBMED; 10469592.
RA   Barbulescu M., Turner G., Seaman M.I., Deinard A.S., Kidd K.K., Lenz J.;
RT   "Many human endogenous retrovirus K (HERV-K) proviruses are unique to
RT   humans";
RL   Curr. Biol. 9(16):861-868(1999).
XX
RN   [2]
RP   1-9178
RA   Barbulescu M., Turner G., Seaman M.I., Deinard A.S., Kidd K.K., Lenz J.;
RT   ;
RL   Submitted (02-JUL-1999) to the EMBL/GenBank/DDBJ databases.
RL   Molecular Genetics, Albert Einstein College of Medicine, 1300 Morris Park
RL   Avenue, Bronx, NY 10461, USA
XX
DR   GOA; P61567.
DR   GOA; P61582.
DR   GOA; P63131.
DR   GOA; P63135.
DR   InterPro; IPR000467; G_patch.
DR   InterPro; IPR000477; Reverse_transcriptase.
DR   InterPro; IPR001037; Integrase_C_retrovir.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR001969; Peptidase_aspartic_AS.
DR   InterPro; IPR001995; Peptidase_A2_cat.
DR   InterPro; IPR002156; RNase_H.
DR   InterPro; IPR003308; Integrase_Zn-bd_dom_N.
DR   InterPro; IPR004295; GP36.
DR   InterPro; IPR009007; Peptidase_aspartic_catalytic.
DR   InterPro; IPR010661; RVT_thumb.
DR   InterPro; IPR012337; PolynucTfrase_RNaseH_fold.
DR   InterPro; IPR018061; Pept_A2A_retrovirus_sg.
DR   UniProtKB/Swiss-Prot; P61567; ENK12_HUMAN.
DR   UniProtKB/Swiss-Prot; P61582; NP12_HUMAN.
DR   UniProtKB/Swiss-Prot; P63131; VPK12_HUMAN.
DR   UniProtKB/Swiss-Prot; P63135; POK12_HUMAN.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..9178
FT                   /organism="Homo sapiens"
FT                   /mol_type="genomic DNA"
FT                   /db_xref="taxon:9606"
FT   repeat_region   1..9178
FT                   /rpt_family="human endogenous retrovirus HERV-K102"
FT   LTR             1..968
FT   CDS             1112..2596
FT                   /codon_start=1
FT                   /gene="gag"
FT                   /product="Gag protein"
FT                   /db_xref="GOA:P63130"
FT                   /db_xref="InterPro:IPR000721"
FT                   /db_xref="InterPro:IPR001878"
FT                   /db_xref="InterPro:IPR003322"
FT                   /db_xref="InterPro:IPR008916"
FT                   /db_xref="InterPro:IPR008919"
FT                   /db_xref="InterPro:IPR010999"
FT                   /db_xref="InterPro:IPR013084"
FT                   /db_xref="UniProtKB/Swiss-Prot:P63130"
FT                   /protein_id="AAD51792.1"
FT                   /translation="MGQTKSKIKSKYASYLSFIKILLKRGGVKVSTKNLIKLFQIIEQF
FT                   CPWFPEQGTLDLKDWKRIGKELKQAGRKGNIIPLTVWNDWAIIKAALEPFQTEEDSVSV
FT                   SDALGSCIIDCNENTRKKSQKETEGLHCEYVAEPVMAQSTQNVDYNQLQEVIYPETLKL
FT                   EGKGPELVGPSESKPRGTSHLPAGQVPVTLQPQKQVKENKTQPPVAYQYWPPAELQYRP
FT                   PPESQYGYPGMPPAPQGRAPYPQPPTRRLNPTAPPSRQGSELHEIIDKSRKEGDTEAWQ
FT                   FPVTLEPMPPGEGAQEGEPPTVEARYKSFSIKMLKDMKEGVKQYGPNSPYMRTLLDSIA
FT                   HGHRLIPYDWEILAKSSLSPSQFLQFKTWWIDGVQEQVRRNRAANPPVNIDADQLLGIG
FT                   QNWSTISQQALMQNEAIEQVRAICLRAWEKIQDPGSTCPSFNTVRQGSKEPYPDFVARL
FT                   QDVAQKSIADEKARKVIVELMAYENANPDVNQPLSH"
FT   LTR             8212..9178
XX
SQ   Sequence 9178 BP; 2952 A; 1869 C; 1959 G; 2398 T; 0 other;
     tgtggggaaa agcaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat        60
     gggagactcc attttgttat gtgctaagaa aaattcttct gccttgagat tctgttaatc       120
     tatgacctta cccccaaccc cgtgctctct gaaacgtgtg ctgtgtcaac tcagggttga       180
     atggattaag ggcggtgcag gatgtgcttt gttaaacaga tgcttgaagg cagcatgctc       240
     cttaagagtc atcaccactc cctaatctca agtacccagg gacacaaaaa ctgcggaagg       300
     ccgcagggac ctctgcctag gaaagccagg tattgtccaa ggtttctccc catgtgatag       360
     tctgaaatat ggcctcgtgg gaagggaaag acctgaccgt cccccagccc gacacccgta       420
     aagggtctgt gctgaggagg attagtataa gaggaaggaa tgcctcttgc agttgagaca       480
     agaggaaggc atctgtctcc tgcctgtccc tgggcaatgg aatgtctcgg tataaaaccc       540
     gattgtatgc tccatctact gagataggga aaaaccgcct tagggctgga ggtgggacct       600
     gcgggcagca atactgcttt gtaaagcact gagatgttta tgtgtatgca tatccaaaag       660
     cacagcactt aatcctttac attgtctatg atgccaagac ctttgttcac gtgtttgtct       720
     gctgaccctc tccccacaat tgtcttgtga ccctgacaca tccccctctt tgagaaacac       780
     ccacagatga tcaataaata ctaagggaac tcagaggctg gcgggatcct ccacatgctg       840
     aacgctggtt ccccgggtcc ccttatttct ttctctatac tttgtctctg tgtctttttc       900
     ttttccaaat ctctcatccc accttacgag aaacacccac aggtgtgtag gggcaaccca       960
     cccctacatc tggtgcccaa cgtggaggct tttctctagg gtgaaggtac gctcgagcgt      1020
     ggtcattgag gacaagtcga cgagagatcc cgagtacatc tacagtcagc cttacggtaa      1080
     gcttgtgcgc tcggaagaag ctagggtgat aatggggcaa actaaaagta aaattaaaag      1140
     taaatatgcc tcttatctca gctttattaa aattctttta aaaagagggg gagttaaagt      1200
     atctacaaaa aatctaatca agctatttca aataatagaa caattttgcc catggtttcc      1260
     agaacaagga actttagatc taaaagattg gaaaagaatt ggtaaggaac taaaacaagc      1320
     aggtaggaag ggtaatatca ttccacttac agtatggaat gattgggcca ttattaaagc      1380
     agctttagaa ccatttcaaa cagaagaaga tagcgtttca gtttctgatg cccttggaag      1440
     ctgtataata gattgtaatg aaaacacaag gaaaaaatcc cagaaagaaa cggaaggttt      1500
     acattgcgaa tatgtagcag agccggtaat ggctcagtca acgcaaaatg ttgactataa      1560
     tcaattacag gaggtgatat atcctgaaac gttaaaatta gaaggaaaag gtccagaatt      1620
     agtggggcca tcagagtcta aaccacgagg cacaagtcat cttccagcag gtcaggtgcc      1680
     cgtaacatta caacctcaaa agcaggttaa agaaaataag acccaaccgc cagtagccta      1740
     tcaatactgg cctccggctg aacttcagta tcggccaccc ccagaaagtc agtatggata      1800
     tccaggaatg cccccagcac cacagggcag ggcgccatac cctcagccgc ccactaggag      1860
     acttaatcct acggcaccac ctagtagaca gggtagtgaa ttacatgaaa ttattgataa      1920
     atcaagaaag gaaggagata ctgaggcatg gcaattccca gtaacgttag aaccgatgcc      1980
     acctggagaa ggagcccaag agggagagcc tcccacagtt gaggccagat acaagtcttt      2040
     ttcgataaaa atgctaaaag atatgaaaga gggagtaaaa cagtatggac ccaactcccc      2100
     ttatatgagg acattattag attccattgc tcatggacat agactcattc cttatgattg      2160
     ggagattctg gcaaaatcgt ctctctcacc ctctcaattt ttacaattta agacttggtg      2220
     gattgatggg gtacaagaac aggtccgaag aaatagggct gccaatcctc cagttaacat      2280
     agatgcagat caactattag gaataggtca aaattggagt actattagtc aacaagcatt      2340
     aatgcaaaat gaggccattg agcaagttag agctatctgc cttagagcct gggaaaaaat      2400
     ccaagaccca ggaagtacct gcccctcatt taatacagta agacaaggtt caaaagagcc      2460
     ctatcctgat tttgtggcaa ggctccaaga tgttgctcaa aagtcaattg ccgatgaaaa      2520
     agcccgtaag gtcatagtgg agttgatggc atatgaaaac gccaatcctg atgtcaatca      2580
     gccattaagc cattaaaagg aaaggttcct gcaggatcag atgtaatctc agaatatgta      2640
     aaagcctgtg atggaatcgg aggagctatg cataaagcta tgcttatggc tcaagcaata      2700
     acaggagttg ttttaggagg acaagttaga acatttggag gaaaatgtta taattgtggt      2760
     caaattggtc acttaaaaaa gaattgtcca gtcttaaata aacagaatat aactattcaa      2820
     gcaactacaa caggtagaga gccacctgac ttatgtccaa gatgtaaaaa aggaaaacat      2880
     tgggctagtc aatgtcgttc taaatttgat aaaaatgggc aaccattgtc gggaaacgag      2940
     caaaggggcc agcctcaggc cccacaacaa actggggcat tcccaattca gccatttgtt      3000
     cctcagggtt ttcaggaaca acaaccccca ctgtcccaag tgtttcaggg aataagccag      3060
     ttaccacaat acaacaattg tcccccgcca caagcggcag tgcagcagta gatttatgta      3120
     ctatacaagc agtctctctg cttccagggg agcccccaca aaaaatccct acaggggtat      3180
     atggcccact gcctgagggg actgtaggac taatcttggg aagatcaagt ctaaatctaa      3240
     aaggagttca aattcatact agtgtggttg attcagacta taaaggcgaa attcagttgg      3300
     ttattagctc ttcaattcct tggagtgcca gtccaggaga caggattgct caattattac      3360
     tcctgccata tattaagggt ggaaatagtg aaataaaaag aataggaggg cttggaagca      3420
     ctgatccaac aggaaaggct gcatattggg caagtcaggt cacagagaac agacctgtgt      3480
     gtaaggccat tattcaagga aaacagtttg aagggttggt agacactgga gcagatgtct      3540
     ctatcattgc tttaaatcag tggccaaaaa attggcctaa acaaaaggct gttacaggac      3600
     ttgtcggcat aggcacagcc tcagaagtgt atcaaagtac tgagatttta cattgcttag      3660
     ggccagataa tcaagaaagc actgttcagc caatgattac ttcaattcct cttaatctgt      3720
     ggggtcgaga tttattacaa caatggggtg cggaaatcac catgcctgcc ccattatata      3780
     gccccacgag tcaaaaaatc atgaccaaga tgggatatat accaggaaag ggactaggga      3840
     aaaatgaaga tggcattaaa gttccagttg aggctaaaat aaatcaagaa agagaaggaa      3900
     tagggtatcc tttttagggg cggccactgt agagcctcct aagcccatac cactaacttg      3960
     gaaaacagaa aaaccggtgt gggtaaatca gtggccgcta ccaaaacaaa aactggaggc      4020
     tttacattta ttagcaaatg aacagttaga aaagggtcac attgagcctt cgttctcacc      4080
     ttggaattct cctgtgtttg taattcagaa gaaatcaggc aaatggcgta tgttaactga      4140
     cttaagggct gtaaacgccg taattcaacc catggggcct ctccaacctg ggttgccctc      4200
     tccagccatg atcccaaaag attggccttt aattataatt gatctaaagg attgcttttt      4260
     taccatccct ctggcagagc aggattgcga aaaatttgcc tttactatac cagccataaa      4320
     taataaagaa ccagccacca ggtttcagtg gaaagtgtta cctcagggaa tgcttaatag      4380
     tccaactatt tgtcagactt ttgtaggttg agctcttcaa ccagttagag aaaagttttc      4440
     agactgttat attattcatt atattgatga tattttatgt gctgcagaaa cgagagataa      4500
     attaattgac tgttatacat ttctgcaagc agaggttgcc aatgctggac tggcaatagc      4560
     atctgataag atccaaacct ctactccttt tcattattta gggatgcaga tagaaaatag      4620
     aaaaattaag ccacaaaaag tagaaataag aaaagacaca ttaaaaacac taaatgattt      4680
     tcaaaaatta ctaggagata ttaattggat tcggccaact ctaggcattc ctacttatgc      4740
     catgtcaaat ttgttctcta tcttaagagg agactcagac ttaaatagta aaagaatatt      4800
     aaccccagag gcaacaaaag aaattaaatt agtggaagaa aaaattcagt cagcgcaaat      4860
     aaatagaata gatcccttag ccccactcca acttttgatt tttgccactg cacattctcc      4920
     aacaggtatc attattcaaa atactgatct tgtggagtgg tcattccttc ctcacagtac      4980
     agttaagact tttacactgt acttggatca aatagctaca ttaattggtc agacaagatt      5040
     acgaataata aaattatatg gaaatgaccc agacaaaata gttgtccctt taaccaagga      5100
     acaagttaga caagccttta tcaattctgg tgcatggcag attggtcttg ctaattttgt      5160
     gggaattatt gataatcatt acccaaaaac aaagatcttc cagttcttaa aactgactac      5220
     ttggattcta cctaaaatta ccagacgtga acctttagaa aatgctctaa cagtatttac      5280
     tgatggttcc agcaatggaa aagcagctta cacaggaccg aaagaacgag taatcaaaac      5340
     tccatatcaa tcggctcaaa gagcagagtt ggttgcagtc attacagtgt tacaagattt      5400
     tgaccaacct atcaatatta tatcagattc tgcatatgta gtacaggcta caagggatgt      5460
     tgagacagct ctaattaaat atagcatgga tgatcagtta aaccagctat tcaatttatt      5520
     acaacaaact gtaagaaaaa gaaatttccc attttatatt actcatattc gagcacacac      5580
     taatttacca gggcctttga ctaaagcaaa tgaacaagct gacttactgg tatcatctgc      5640
     actcataaaa gcacaagaac ttcatgcttt gactcatgta aatgcagcag gattaaaaaa      5700
     caaatttgat gtcacatgga aacaggcaaa agatattgta caacattgca cccagtgtca      5760
     aatcttacac ctgcccactc aagaggcagg agttaatccc agaggtctgt gtcctaatgc      5820
     attatggcaa atggatgtca cgcatgtacc ttcatttgga agattatcat atgttcacgt      5880
     aacagttgat acttattcac atttcatatg ggcaacttgc caaacaggag aaagtacttc      5940
     ccatgttaaa aaacatttat tgtcttgttt tgctgtaatg ggagttccag aaaaaatcaa      6000
     aactgacaat ggaccaggat attgtagtaa agctttccaa aaattcttaa gtcagtggaa      6060
     aatttcacgt acaacaggaa ttccttataa ttcccaagga caggccatag ttgaaagaac      6120
     taatagaaca ctcaaaactc aattagttaa acaaaaagaa gggggagaca gtaaggagtg      6180
     taccactcct cagatgcaac ttaatctagc actctatact ttaaattttt taaacattta      6240
     tagaaatcag actactactt ctgcagaaca acatcttact ggtaaaaaga acagcccaca      6300
     tgaaggaaaa ctaatttggt ggaaagataa taaaaataag acatgggaaa tagggaaggt      6360
     gataacgtgg gggagaggtt ttgcttgtgt ttcaccagga gaaaatcagc ttcctgtttg      6420
     gatacccact agacatttga agttctacaa tgaacccatt ggagatgcaa agaaaagggc      6480
     ctccacggag atggtaacac cagtcacatg gatggataat cctatagaaa tatatgttaa      6540
     tgatagtgta tgggtacctg gacccataga tgatcgctgc cctgccaaac ctgaggaaga      6600
     agggatgatg ataaatattt ccattgggta tcgttatcct cctatttgcc tagggagagc      6660
     accaggatgt ttaatgcctg cagtccaaaa ttggttggta gaagtaccta ctgtcagtcc      6720
     catcagtaga ttcacttatc acatggtaag cgggatgtca ctcaggccac gggtaaatta      6780
     tttacaagac ttttcttatc aaagatcatt aaaatttaga cctaaaggga aaccttgccc      6840
     caaggaaatt cccaaagaat caaaaaatac agaagtttta gtttgggaag aatgtgtggc      6900
     caatagtgcg gtgatattac aaaacaatga atttggaact attatagatt gggcacctcg      6960
     aggtcaattc taccacaatt gctcaggaca aactcagtcg tgtccaagtg cacaagtgag      7020
     tccagctgtt gatagcgact taacagaaag tttagacaaa cataagcata aaaaattgca      7080
     gtctttctac ccttgggaat ggggagaaaa aagaatctct accccaagac caaaaatagt      7140
     aagtcctgtt tctggtcctg aacatccaga attatggagg cttactgtgg cctcacacca      7200
     cattagaatt tggtctggaa atcaaacttt agaaacaaga gattgtaagc cattttatac      7260
     tatcgaccta aattccagtc taacagttcc tttacaaagt tgcgtaaagc ccccttatat      7320
     gctagttgta ggaaatatag ttattaaacc agactcccag actataacct gtgaaaattg      7380
     tagattgctt agttgcattg attcaacttt taattggcaa caccgtattc tgctggtgag      7440
     agcaagagag ggcgtgtgga tccctgtgtc catggaccga ccatgggagg cctcaccatc      7500
     cgtccatatt ttgactgaag tattaaaagg tgttttaaat agatccaaaa gattcatttt      7560
     tactttaatt gcagtgatta tgggattaat tgcagtcaca gctacggctg ctgtagcagg      7620
     agttgcattg cactcttctg ttcagtcagt aaactttgtt aatgattggc aaaagaattc      7680
     tacaagattg tggaattcac aatctagtat tgatcaaaaa ttggcaaatc aaattaatga      7740
     tcttagacaa actgtcattt ggatgggaga cagactcatg agcttagaac atcgtttcca      7800
     gttacaatgt gactggaata cgtcagattt ttgtattaca ccccaaattt ataatgagtc      7860
     tgagcatcac tgggacatgg ttagacgcca tctacaggga agagaagata atctcacttt      7920
     agacatttcc aaattaaaag aacaaatttt cgaagcatca aaagcccatt taaatttggt      7980
     gccaggaact gaggcaattg caggagttgc tgatggcctc gcaaatctta accctgtcac      8040
     ttgggttaag accattggaa gtactacgat tataaatctc atattaatcc ttgtgtgcct      8100
     gttttgtctg ttgttagtct gcaggtgtac ccaacagctc cgaagagaca gcgaccatcg      8160
     agaacgggcc atgatgacga tggcggtttt gtcgaaaaga aaagggggaa atgtggggaa      8220
     aagcaagaga gatcagattg ttactgtgtc tgtgtagaaa gaagtagaca tgggagactc      8280
     cattttgtta tgtgttaaga aaaattcttc tgccttgaga ttctgttaat ctatgacctt      8340
     acccccaacc ccgtgctctc tgaaacgtgt gctgtgtcaa ctcagggttg aatggattaa      8400
     gggcggtgca ggatgtgctt tgttaaacag atgcttgaag gcagcatgct ccttaagagt      8460
     catcaccact ccctaatctc aagtacccag gacacaaaaa ctgcggaagg ccgcagggac      8520
     ctctgcctag gaaagccagg tattgtccaa ggtttctccc catgtgatag tctgaaatat      8580
     ggcctcgtgg gaagggaaag acctgaccgt cccccagccc gacacccgta aagggtctgt      8640
     gctgaggagg attagtataa gaggaaggaa tgcctcttgc agttgagaca agaggaaggc      8700
     atctgtctcc tgcctgtccc tgggcaatgg aatgtctcgg tataaaaccc gattgtatgc      8760
     tccatctact gagataggga aaaaccgcct tagggctgga ggtgggacct gcgggcagca      8820
     atactgcttt gtaaagcact gagatgttta tgtgtatgca tatccaaaag cacagcactt      8880
     aatcctttac attgtctatg atgccaagac ctttgttcac gtgtttgtct gctgaccctc      8940
     tccccacaat tgtcttgtga ccctgacaca tccccctctt tgagaaacac ccacagatga      9000
     tcaataaata ctaagggaac tcagaggctg gcgggatcct ccatatgctg aacgctggtt      9060
     ccccgggtcc ccttatttct ttctctatac tttgtctctg tgtctttttc ttttccaaat      9120
     ctctcgtccc accttacgag aaacacccac aggtgtgtag gggcaaccca cccctaca        9178
//