![]() |
EBI DbfetchID M33152; SV 1; linear; genomic DNA; STD; PRO; 3800 BP. XX AC M33152; XX DT 26-MAY-1990 (Rel. 24, Created) DT 17-APR-2005 (Rel. 83, Last updated, Version 5) XX DE A.vinelandii H2 uptake hydrogenase (hoxK), complete cds, and H2 uptake DE hydrogenase (hoxG), complete cds. XX KW H2 uptake hydrogenase. XX OS Azotobacter vinelandii OC Bacteria; Proteobacteria; Gammaproteobacteria; Pseudomonadales; OC Pseudomonadaceae; Azotobacter. XX RN [1] RP 1-3800 RX DOI; 10.1016/0378-1119(90)90342-O RX PUBMED; 2265761. RA Menon A.L., Stults L.W., Robson R.L., Mortenson L.E.; RT "Cloning, sequencing and characterization of the [NiFe]hydrogenase-encoding RT structural genes (hoxK and hoxG) from Azotobacter vinelandii"; RL Gene 96(1):67-74(1990). XX CC Draft entry and computer-readable sequence for [Unpublished (1990) CC U of Georgia, Dep Biochemistry, Athens, GA 30602] kindly submitted CC by R.L.Robson, 22-MAR-1990. XX FH Key Location/Qualifiers FH FT source 1..3800 FT /organism="Azotobacter vinelandii" FT /strain="OP" FT /mol_type="genomic DNA" FT /clone="pALM21." FT /db_xref="taxon:354" FT sig_peptide 149..283 FT /note="H2 uptake hydrogenase signal peptide (put.); FT putative" FT CDS 149..1225 FT /codon_start=1 FT /transl_table=11 FT /note="H2 uptake hydrogenase (hoxK) precursor" FT /db_xref="GOA:P21950" FT /db_xref="HSSP:1FRF" FT /db_xref="InterPro:IPR017909" FT /db_xref="UniProtKB/Swiss-Prot:P21950" FT /protein_id="AAA82505.1" FT /translation="MSRLETFYDVMRRQGITRRSFLKYCSLTAAALGLGPAFAPRIAHA FT METKPRTPVLWLHGLECTCCSESFIRSAHPLVKDVVLSMISLDYDDTLMAAAGHQAEAA FT LEETMRKYKGEYILAVEGNPPLNEDGMFCIVGGKPFIEQLRHVAKDAKAVIAWGSCASW FT GCVQAARPNPTQAVPIHKVITDKPIVKVPGCPPIAEVMTGVITYMLTFGKLPELDRQGR FT PKMFYGQRIHDKCYRRPHFDAGQFVEHWDDEGARKGYCLYKVGCKGPTSYNACSTVRWN FT EGTSFPIQAGHGCIGCSEDGFWDKGSFYERLTTIPQFGIEKNADEIGAAVAGGVGAAIA FT AHAAVTAIKRLQNKGDRP" FT mat_peptide 284..1222 FT /note="H2 uptake hydrogenase" FT CDS 1222..3030 FT /codon_start=1 FT /transl_table=11 FT /note="H2 uptake hydrogenase (hoxG)" FT /db_xref="GOA:P21949" FT /db_xref="HSSP:2FRV" FT /db_xref="InterPro:IPR018194" FT /db_xref="UniProtKB/Swiss-Prot:P21949" FT /protein_id="AAA82506.1" FT /translation="MSSLPNASQLDKSGRRIVVDPVTRIEGHMRCEVNVDASNVITNAV FT STGTMWRGLEVILKGRDPRDAWAFVERICGVCTGTHALTSVRAVEDALDIRIPYNAHLI FT RNLMDKTLQVHDHIVHFYHLHALDWVNPVNALKADPKATSALQQAVSPAHAKSSPGYFR FT DVQTRLKKFVESGQLGLFSNGYWDNPAYKLPPEADLMAVAHYLEALDLQKDIVKIHTIF FT GGKNPHPNYMVGGVACAINLDDVGAAGAPVNMTSLNFVLERIHEAREFTRNVYLPDVLA FT VAGIYKDWLYGGGLAAHNLLSYGTFTKVPYDKSSDLLPAGAIVGGNWDEVLPVDVRDPE FT EIQEFVSHSWYSYADETKGLHPWDGVTEPKFELGPNTKGSRTHIQEIDEAHKYSWIKAP FT RWRGHAMEVGPLARYIIAYASGREYVKEQVDRSLAAFNQSTGLNLGLKQFLPSTLGRTL FT ARALECELAVDSMLDDWQALVGNIKAGDRATANVEKWDPSTWPKEAKGVGINEAPRGAL FT GHWIRIKDGKIENYQAIVPTTWNGTPRDHLGNIGAYEAALLNTRMERPDEPVEILRTLH FT SFDPCLACSTHVMSPDGQELTRVKVR" FT CDS 3047..3769 FT /codon_start=1 FT /transl_table=11 FT /product="unknown protein" FT /note="ORF3; putative" FT /db_xref="GOA:P23000" FT /db_xref="InterPro:IPR000516" FT /db_xref="UniProtKB/Swiss-Prot:P23000" FT /protein_id="AAA82507.1" FT /translation="MALEKSLETGDGQEKVRKQTAVYVYEAPLRLWHWVTALSIVVLGV FT TGYFIGAPLPTMPGEAMDNYLMGYIRFAHFAAGYVLAIGFLGRVYWAFVGNHHARELFL FT VPVHRKAWWKELWHEVRWYLFLEKTPKKYIGHNPLGQLAMFCFFVVGAVFMSVTGFALY FT AEGLGRDSWADRLFGWVIPLFGQSQDVHTWHHLGMWYLVVFVMVHVYLAVREDIVSRQS FT LISTMVGGWRMFKDDRPD" XX SQ Sequence 3800 BP; 686 A; 1318 C; 1213 G; 583 T; 0 other; tgtatcaagc catgacaaaa acatggcatt ggcgcattat tcgtgcggtt ttcattcagc 60 aaccgtgggc catacaaccg gcgcgccgtc atagccgaag gacggtgcgc aggggcgccg 120 ataacgacct ggccacaagg gtaacggcat gtctcgactc gaaactttct atgacgtgat 180 gcggcgtcag ggcatcacgc gccgcagctt tctcaaatat tgcagcctga ccgccgcggc 240 cctgggcctc ggcccggcct tcgccccgcg gatcgcccac gcgatggaaa ccaagccgcg 300 cactccggtg ctctggctgc acggcctgga gtgcacctgc tgctccgagt cgttcatccg 360 ttcggcccac ccgctggtca aggacgtggt gctgtcgatg atctcgctgg actacgacga 420 caccctgatg gccgccgccg gccaccaggc cgaggccgcc ctcgaagaga ccatgcgcaa 480 gtacaagggc gagtacatcc tcgccgtgga gggcaacccg ccgctcaacg aggacggcat 540 gttctgcatc gtcggcggca agccgttcat cgagcagctc aggcatgtgg cgaaggacgc 600 caaggcggtg atcgcctggg gcagttgcgc cagttggggc tgcgtgcagg cggcccggcc 660 caacccgacc caggcggtgc cgatccacaa ggtcatcacc gacaagccga tcgtcaaggt 720 gcccggctgc ccgccgatcg ccgaggtgat gaccggggtg atcacctaca tgctgacctt 780 cggcaagctg cccgagctgg accgccaggg gcggccgaag atgttctacg gccagcgcat 840 ccacgacaag tgctaccgcc gcccgcactt cgacgccggc cagttcgtcg agcactggga 900 cgacgagggc gcgcgcaagg gctactgcct gtacaaggtc ggctgcaagg gcccgaccag 960 ctacaacgcc tgctcgacgg tgcgctggaa cgagggcact tccttcccga tccaggccgg 1020 ccacggctgc atcggctgct cggaggacgg tttctgggac aagggctcgt tctatgaacg 1080 cctgaccacc attccgcagt tcggcatcga gaagaacgcc gacgaaatcg gcgccgccgt 1140 cgccggcggg gtcggcgcgg ccatcgccgc gcatgccgcg gtcaccgcca tcaagcgcct 1200 gcagaacaag ggggatcgcc catgagcagc ctgccgaacg ccagccaact ggacaagtcc 1260 ggcaggcgca tcgtcgtcga cccggtgacc cgcatcgagg gccacatgcg ctgcgaggtc 1320 aacgtcgacg ccagcaacgt gatcaccaac gccgtctcca ccggcaccat gtggcgcggc 1380 ctggaggtca tcctcaaggg ccgcgacccg cgcgacgcct gggccttcgt cgagcgcatc 1440 tgcggcgtct gcaccggcac ccatgcgctg acctcggtgc gcgcggtgga ggatgccctg 1500 gacatccgca tcccctacaa cgcccacctg atccgcaacc tgatggacaa gacgctgcag 1560 gtgcacgacc acatcgtgca cttctaccac ctgcacgcgc tggactgggt caacccggtc 1620 aacgccctga aggccgatcc caaggctacc tccgccctgc agcaggcggt ttcgccggcc 1680 catgccaagt ccagccccgg ctacttccgc gacgtgcaga cgcgcctgaa gaagttcgtc 1740 gagagcggcc agctcggcct gttctccaac ggctactggg acaatccggc ctacaagctg 1800 ccgcccgagg cggacctgat ggccgtggcc cactacctgg aggcgctgga cctgcagaag 1860 gacatcgtca agatccatac catcttcggc ggcaagaacc cgcatccgaa ctacatggtc 1920 ggcggcgtgg cctgcgccat caacctggac gacgtcggcg ccgccggcgc gccggtcaac 1980 atgaccagcc tgaacttcgt cctcgaacgc atccacgagg cccgcgagtt caccaggaac 2040 gtctacctgc cggacgtgct ggcggtcgcc gggatctaca aggactggct gtacggcggc 2100 ggtctggccg cgcacaacct gctgtcctac ggcaccttca ccaaggtgcc ctacgacaag 2160 tccagcgacc tgttgccggc cggcgccatc gtcggcggca attgggacga ggtgctgccg 2220 gtcgacgtgc gcgatcccga ggagatccag gagttcgtca gccactcctg gtacagctac 2280 gccgacgaaa ccaaggggct gcatccctgg gacggcgtca ccgagccgaa attcgagctc 2340 ggcccgaaca ccaagggcag ccgcacccac atccaggaaa tcgacgaggc gcacaagtac 2400 agctggatca aggcgccgcg ctggcgcggc cacgctatgg aggtcggccc gctggcacgt 2460 tacatcatcg cctacgcttc gggccgcgaa tacgtgaagg aacaggtcga ccgctcgctg 2520 gccgccttca accagagcac cggcctgaac ctcggcctca agcagttcct gccctcgacc 2580 ctcggccgca ccctggcgcg cgccctggag tgcgagctgg cggtggacag catgctcgac 2640 gactggcagg ccctggtcgg caacatcaag gccggcgacc gcgccaccgc caacgtcgag 2700 aagtgggacc cgagcacctg gccgaaggag gccaagggcg tgggcatcaa cgaggcgccg 2760 cgcggcgccc tgggccactg gatcaggatc aaggacggca agatcgagaa ctaccaggcg 2820 atcgtgccga ccacctggaa cggcaccccg cgcgaccatc tgggcaacat cggcgcctac 2880 gaggccgcgc tgctcaacac caggatggag cgcccggacg agccggtgga gatcctgcgc 2940 accctgcaca gcttcgaccc ctgcctggcc tgttcgaccc acgtgatgtc gccggacggc 3000 caggagctga cccgggtgaa ggtccgctga accggaggat tgcgcgatgg cactggaaaa 3060 atccctggaa accggcgacg gccaggagaa ggtccgcaag cagaccgcgg tgtacgtcta 3120 cgaggcgccg ctgcgcctct ggcactgggt cacggcgctg tccatcgtcg tgctcggcgt 3180 gaccggctac ttcatcggcg cgccgctgcc gacgatgccc ggcgaggcga tggacaacta 3240 cctgatgggc tacatccgct tcgcccactt cgccgccggc tacgtgctgg cgatcggctt 3300 cctcggccgg gtctactggg ccttcgtcgg caaccaccac gcccgcgagc tgttcctcgt 3360 gccggtgcac cgcaaggcct ggtggaagga gctgtggcac gaggtgcgct ggtacctgtt 3420 cctggaaaag accccgaaga agtacatcgg ccacaacccc ctgggccagt tggcgatgtt 3480 ctgcttcttc gtggtcggcg cggtgttcat gagcgtcacc ggcttcgccc tctacgccga 3540 ggggctgggg cgggacagct gggccgaccg gctgttcggc tgggtgatcc cgctgttcgg 3600 ccagagccag gacgtgcaca cctggcacca cctgggcatg tggtacctcg tcgtcttcgt 3660 catggtgcat gtctacctgg ccgtgcgcga agacatcgtt tcccggcagt cgctgatctc 3720 caccatggtc ggcggctggc ggatgttcaa ggacgaccgg ccggattgag ccccgtgtcg 3780 tcccttccgt ccgggccggt 3800 // ![]() |