![]() |
EBI DbfetchID AAD51797; SV 1; linear; genomic DNA; STD; HUM; 5640 BP. XX PA AF164614.1 XX DE Homo sapiens (human) Gag-Pro-Pol protein XX OS Homo sapiens (human) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; OC Homo. OX NCBI_TaxID=9606; XX FH Key Location/Qualifiers FH FT source 1..5640 FT /organism="Homo sapiens" FT /mol_type="genomic DNA" FT CDS join(AF164614.1:1112..3108,AF164614.1:3108..3915, FT AF164614.1:3915..6749) FT /codon_start=1 FT /product="Gag-Pro-Pol protein" FT /note="-1 frameshifts presumed to occur within the codons FT for the last amino acids in the Gag and Pro open reading FT frames" FT /db_xref="GOA:Q9Y6I0" FT /db_xref="HGNC:13915" FT /db_xref="HSSP:1C1A" FT /db_xref="InterPro:IPR010661" FT /db_xref="UniProtKB/Swiss-Prot:Q9BXR3" FT /func_characterised="similar sequence" FT /protein_id="AAD51797.1" FT /translation="MGQTKSKIKSKYASYLSFIKILLKRGGVKVSTKNLIKLFQIIEQF FT CPWFPEQGTLDLKDWKRIGKELKQAGRKGNIIPLTVWNDWAIIKAALEPFQTEEDSVSV FT SDAPGSCIIDCNENTRKKSQKETEGLHCEYVAEPVMAQSTQNVDYNQLQEVIYPETLKL FT EGKGPELVGPSESKPRGTSPLPAGQVPVTLQPQKQVKENKTQPPVAYQYWPPAELQYRP FT PPESQYGYPGMPPAPQGRAPYPQPPTRRLNPTAPPSRQGSKLHEIIDKSRKEGDTEAWQ FT FPVTLEPMPPGEGAQEGEPPTVEARYKSFSIKKLKDMKEGVKQYGPNSPYMRTLLDSIA FT HGHRLIPYDWEILAKSSLSPSQFLQFKTWWIDGVQEQVRRNRAANPPVNIDADQLLGIG FT QNWSTISQQALMQNEAIEQVRAICLRAWEKIQDPGSTCPSFNTVRQGSKEPYPDFVARL FT QDVAQKSIADEKARKVIVELMAYENANPECQSAIKPLKGKVPAGSDVISEYVKACDGIG FT GAMHKAMLMAQAITGVVLGGQVRTFGRKCYNCGQIGHLKKNCPVLNKQNITIQATTTGR FT EPPDLCPRCKKGKHWASQCRSKFDKNGQPLSGNEQRGQPQAPQQTGAFPIQPFVPQGFQ FT GQQPPLSQVFQGISQLPQYNNCPPPQAAVQQVDLCTIQAVSLLPGEPPQKTPTGVYGPL FT PKGTVGLILGRSSLNLKGVQIHTSVVDSDYKGEIQLVISSSIPWSASPRDRIAQLLLLP FT YIKGGNSEIKRIGGLGSTDPTGKAAYWASQVSENRPVCKAIIQGKQFEGLVDTGADVSI FT IALNQWPKNWPKQKAVTGLVGIGTASEVYQSTEILHCLGPDNQESTVQPMITSIPLNLW FT GRDLLQQWGAEITMPAPSYSPTSQKIMTKMGYIPGKGLGKNEDGIKIPVEAKINQEREG FT IGNPCLGAATVEPPKPIPLTWKTEKPVWVNQWPLPKQKLEALHLLANEQLEKGHIEPSF FT SPWNSPVFVIQKKSGKWRMLTDLRAVNAVIQPMGPLQPGLPSPAMIPKDWPLIIIDLKD FT CFFTIPLAEQDCEKFAFTIPAINNKEPATRFQWKVLPQGMLNSPTICQTFVGRALQPVR FT EKFSDCYIIHCIDDILCAAETKDKLIDCYTFLQAEVANAGLAIASDKIQTSTPFHYLGM FT QIENRKIKPQKIEIRKDTLKTLNDFQKLLGDINWIRPTLGIPTYAMSNLFSILRGDSDL FT NSKRMLTPEATKEIKLVEEKIQSAQINRIDPLAPLQLLIFATAHSPTGIIIQNTDLVEW FT SFLPHSTVKTFTLYLDQIATLIGQTRLRIIKLCGNDPDKIVVPLTKEQVRQAFINSGAW FT KIGLANFVGIIDNHYPKTKIFQFLKLTTWILPKITRREPLENALTVFTDGSSNGKAAYT FT GPKERVIKTPYQSAQRAELVAVITVLQDFDQPINIISDSAYVVQATRDVETALIKYSMD FT DQLNQLFNLLQQTVRKRNFPFYITHIRAHTNLPGPLTKANEQADLLVSSALIKAQELHA FT LTHVNAAGLKNKFDVTWKQAKDIVQHCTQCQVLHLPTQEAGVNPRGLCPNALWQMDVTH FT VPSFGRLSYVHVTVDTYSHFIWATCQTGESTSHVKKHLLSCFAVMGVPEKIKTDNGPGY FT CSKAFQKFLSQWKISHTTGIPYNSQGQAIVERTNRTLKTQLVKQKEGGDSKECTTPQMQ FT LNLALYTLNFLNIYRNQTTTSAEQHLTGKKNSPHEGKLIWWKDNKNKTWEIGKVITWGR FT GFACVSPGENQLPVWIPTRHLKFYNEPIRDAKKSTSAETETSQSSTVDSQDEQNGDVRR FT TDEVAIHQEGRAANLGTTKEADAVSYKISREHKGDTNPREYAACSLDDCINGGKSPYAC FT RSSCS" XX SQ Sequence 5640 BP; 1975 A; 1104 C; 1151 G; 1410 T; 0 other; 3961988406 CRC32; atggggcaaa ctaaaagtaa aattaaaagt aaatatgcct cttatctcag ctttattaaa 60 attcttttaa aaagaggggg agttaaagta tctacaaaaa atctaatcaa gctatttcaa 120 ataatagaac aattttgccc atggtttcca gaacaaggaa ctttagatct aaaagattgg 180 aaaagaattg gtaaggaact aaaacaagca ggtaggaagg gtaatatcat tccacttaca 240 gtatggaatg attgggccat tattaaagca gctttagaac catttcaaac agaagaagat 300 agcgtttcag tttctgatgc ccctggaagc tgtataatag attgtaatga aaacacaagg 360 aaaaaatccc agaaagaaac ggaaggttta cattgcgaat atgtagcaga gccggtaatg 420 gctcagtcaa cgcaaaatgt tgactataat caattacagg aggtgatata tcctgaaacg 480 ttaaaattag aaggaaaagg tccagaatta gtggggccat cagagtctaa accacgaggc 540 acaagtcctc ttccagcagg tcaggtgcct gtaacattac aacctcaaaa gcaggttaaa 600 gaaaataaga cccaaccgcc agtagcctat caatactggc ctccggctga acttcagtat 660 cggccacccc cagaaagtca gtatggatat ccaggaatgc ccccagcacc acagggcagg 720 gcgccatacc ctcagccgcc cactaggaga cttaatccta cggcaccacc tagtagacag 780 ggtagtaaat tacatgaaat tattgataaa tcaagaaagg aaggagatac tgaggcatgg 840 caattcccag taacgttaga accgatgcca cctggagaag gagcccaaga gggagagcct 900 cccacagttg aggccagata caagtctttt tcgataaaaa agctaaaaga tatgaaagag 960 ggagtaaaac agtatggacc caactcccct tatatgagga cattattaga ttccattgct 1020 catggacata gactcattcc ttatgattgg gagattctgg caaaatcgtc tctctcaccc 1080 tctcaatttt tacaatttaa gacttggtgg attgatgggg tacaagaaca ggtccgaaga 1140 aatagggctg ccaatcctcc agttaacata gatgcagatc aactattagg aataggtcaa 1200 aattggagta ctattagtca acaagcatta atgcaaaatg aggccattga gcaagttaga 1260 gctatctgcc ttagagcctg ggaaaaaatc caagacccag gaagtacctg cccctcattt 1320 aatacagtaa gacaaggttc aaaagagccc tatcctgatt ttgtggcaag gctccaagat 1380 gttgctcaaa agtcaattgc tgatgaaaaa gcccgtaagg tcatagtgga gttgatggca 1440 tatgaaaacg ccaatcctga gtgtcaatca gccattaagc cattaaaagg aaaggttcct 1500 gcaggatcag atgtaatctc agaatatgta aaagcctgtg atggaatcgg aggagctatg 1560 cataaagcta tgcttatggc tcaagcaata acaggagttg ttttaggagg acaagttaga 1620 acatttggaa gaaaatgtta taattgtggt caaattggtc acttaaaaaa gaattgccca 1680 gtcttaaata aacagaatat aactattcaa gcaactacaa caggtagaga gccacctgac 1740 ttatgtccaa gatgtaaaaa aggaaaacat tgggctagtc aatgtcgttc taaatttgat 1800 aaaaatgggc aaccattgtc gggaaacgag caaaggggcc agcctcaggc cccacaacaa 1860 actggggcat tcccaattca gccatttgtt cctcagggtt ttcagggaca acaaccccca 1920 ctgtcccaag tgtttcaggg aataagccag ttaccacaat acaacaattg tcccccgcca 1980 caagcggcag tgcagcaagt agatttatgt actatacaag cagtctctct gcttccaggg 2040 gagcccccac aaaaaacccc cacaggggta tatggacccc tgcctaaggg gactgtagga 2100 ctaatcttgg gacgatcaag tctaaatcta aaaggagttc aaattcatac tagtgtggtt 2160 gattcagact ataaaggcga aattcaattg gttattagct cttcaattcc ttggagtgcc 2220 agtccaagag acaggattgc tcaattatta ctcctgccat acattaaggg tggaaatagt 2280 gaaataaaaa gaataggagg gcttggaagc actgatccaa caggaaaggc tgcatattgg 2340 gcaagtcagg tctcagagaa cagacctgtg tgtaaggcca ttattcaagg aaaacagttt 2400 gaagggttgg tagacactgg agcagatgtc tctatcattg ctttaaatca gtggccaaaa 2460 aattggccta aacaaaaggc tgttacagga cttgtcggca taggcacagc ctcagaagtg 2520 tatcaaagta cggagatttt acattgctta gggccagata atcaagaaag tactgttcag 2580 ccaatgatta cttcaattcc tcttaatctg tggggtcgag atttattaca acaatggggt 2640 gcggaaatca ccatgcccgc tccatcatat agccccacga gtcaaaaaat catgaccaag 2700 atgggatata taccaggaaa gggactaggg aaaaatgaag atggcattaa aattccagtt 2760 gaggctaaaa taaatcaaga aagagaagga atagggaatc cttgcctagg ggcggccact 2820 gtagagcctc ctaaacccat accattaact tggaaaacag aaaaaccagt gtgggtaaat 2880 cagtggccgc taccaaaaca aaaactggag gctttacatt tattagcaaa tgaacagtta 2940 gaaaagggtc atattgagcc ttcgttctca ccttggaatt ctcctgtgtt tgtaattcag 3000 aagaaatcag gcaaatggcg tatgttaact gacttaaggg ctgtaaacgc cgtaattcaa 3060 cccatggggc ctctccaacc cgggttgccc tctccggcca tgatcccaaa agattggcct 3120 ttaattataa ttgatctaaa ggattgcttt tttaccatcc ctctggcaga gcaggattgc 3180 gaaaaatttg cctttactat accagccata aataataaag aaccagccac caggtttcag 3240 tggaaagtgt tacctcaggg aatgcttaat agtccaacta tttgtcagac ttttgtaggt 3300 cgagctcttc aaccagttag agaaaagttt tcagactgtt atattattca ttgtattgat 3360 gatattttat gtgctgcaga aacgaaagat aaattaattg actgttatac atttctgcaa 3420 gcagaggttg ccaatgctgg actggcaata gcatctgata agatccaaac ctctactcct 3480 tttcattatt tagggatgca gatagaaaat agaaaaatta agccacaaaa aatagaaata 3540 agaaaagaca cattaaaaac actaaatgat tttcaaaaat tactaggaga tattaattgg 3600 attcggccaa ctctaggcat tcctacttat gccatgtcaa atttgttctc tatcttaaga 3660 ggagactcag acttaaatag taaaagaatg ttaaccccag aggcaacaaa agaaattaaa 3720 ttagtggaag aaaaaattca gtcagcgcaa ataaatagaa tagatccctt agccccactc 3780 caacttttga tttttgccac tgcacattct ccaacaggca tcattattca aaatactgat 3840 cttgtggagt ggtcattcct tcctcacagt acagttaaga cttttacatt gtacttggat 3900 caaatagcta cattaatcgg tcagacaaga ttacgaataa taaaattatg tgggaatgac 3960 ccagacaaaa tagttgtccc tttaaccaag gaacaagtta gacaagcctt tatcaattct 4020 ggtgcatgga agattggtct tgctaatttt gtgggaatta ttgataatca ttacccaaaa 4080 acaaagatct tccagttctt aaaattgact acttggattc tacctaaaat taccagacgt 4140 gaacctttag aaaatgctct aacagtattt actgatggtt ccagcaatgg aaaagcagct 4200 tacacaggac cgaaagaacg agtaatcaaa actccatatc aatcggctca aagagcagag 4260 ttggttgcag tcattacagt gttacaagat tttgaccaac ctatcaatat tatatcagat 4320 tctgcatatg tagtacaggc tacaagggat gttgagacag ctctaattaa atatagcatg 4380 gatgatcagt taaaccagct attcaattta ttacaacaaa ctgtaagaaa aagaaatttc 4440 ccattttata ttacacatat tcgagcacac actaatttac cagggccttt gactaaagca 4500 aatgaacaag ctgacttact ggtatcatct gcactcataa aagcacaaga acttcatgct 4560 ttgactcatg taaatgcagc aggattaaaa aacaaatttg atgtcacatg gaaacaggca 4620 aaagatattg tacaacattg cacccagtgt caagtcttac acctgcccac tcaagaggca 4680 ggagttaatc ccagaggtct gtgtcctaat gcattatggc aaatggatgt cacgcatgta 4740 ccttcatttg gaagattatc atatgttcac gtaacagttg atacttattc acatttcata 4800 tgggcaactt gccaaacagg agaaagtact tcccatgtta aaaaacattt attgtcttgt 4860 tttgctgtaa tgggagttcc agaaaaaatc aaaactgaca atggaccagg atattgtagt 4920 aaagctttcc aaaaattctt aagtcagtgg aaaatttcac atacaacagg aattccttat 4980 aattcccaag gacaggccat agttgaaaga actaatagaa cactcaaaac tcaattagtt 5040 aaacaaaaag aagggggaga cagtaaggag tgtaccactc ctcagatgca acttaatcta 5100 gcactctata ctttaaattt tttaaacatt tatagaaatc agactactac ttctgcagaa 5160 caacatctta ctggtaaaaa gaacagccca catgaaggaa aactaatttg gtggaaagat 5220 aataaaaata agacatggga aatagggaag gtgataacgt gggggagagg ttttgcttgt 5280 gtttcaccag gagaaaatca gcttcctgtt tggataccca ctagacattt gaagttctac 5340 aatgaaccca tcagagatgc aaagaaaagc acctccgcgg agacggagac atcgcaatcg 5400 agcaccgttg actcacaaga tgaacaaaat ggtgacgtca gaagaacaga tgaagttgcc 5460 atccaccaag aaggcagagc cgccaacttg ggcacaacta aagaagctga cgcagttagc 5520 tacaaaatat ctagagaaca caaaggtgac acaaacccca gagagtatgc tgcttgcagc 5580 cttgatgatt gtatcaatgg tggtaagtct ccctatgcct gcaggagcag ctgcagctaa 5640 // ![]() |