![]() |
EBI DbfetchID Z93338; SV 1; linear; genomic DNA; STD; PRO; 3289 BP. XX AC Z93338; XX DT 25-MAR-1997 (Rel. 51, Created) DT 18-APR-2005 (Rel. 83, Last updated, Version 3) XX DE A.simplex ksdI genes and three open reading frames XX KW ketosteroid isomerase; ksdI gene. XX OS Pimelobacter simplex OC Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales; OC Propionibacterineae; Nocardioidaceae; Pimelobacter. XX RN [1] RC (sites) RX DOI; 10.1111/j.1365-2958.1995.tb02359.x RX PUBMED; 7596291. RA Molnar I., Choi K., Yamashita M., Murooka Y.; RT "Molecular cloning, expression in Streptomyces lividans, and analysis of a RT gene cluster from Arthrobacter simplex encoding 3-ketosteroid-delta RT 1-dehydrogenase, 3-ketosteroid-delta 5-isomerase and a hypothetical RT regulatory protein"; RL Mol. Microbiol. 15(5):895-905(1995). XX RN [2] RP 1-3289 RA Dziadek J., Yamashita M., Murooka Y.; RT "Cloning, sequencing and characterization of the downstream region of KsdDI RT operon of Arthrobacter simplex"; RL Unpublished. XX RN [3] RP 1-3289 RA Dziadek J.; RT ; RL Submitted (19-MAR-1997) to the EMBL/GenBank/DDBJ databases. RL Dziadek J., Polish Academy of Sciences, Centre of Microbiology & Virology, RL 104, Lodowa St., Lodz, Poland, 93-231 XX FH Key Location/Qualifiers FH FT source 1..3289 FT /organism="Pimelobacter simplex" FT /strain="IFO12069" FT /mol_type="genomic DNA" FT /db_xref="taxon:2045" FT CDS 53..424 FT /transl_table=11 FT /gene="ksdI" FT /product="4,5delta ketosteroid isomerase" FT /db_xref="GOA:P77816" FT /db_xref="InterPro:IPR002075" FT /db_xref="UniProtKB/Swiss-Prot:P77816" FT /citation=[1] FT /citation=[2] FT /protein_id="CAB07540.1" FT /translation="MSAEVKAAVARYLDAVAGGSPAAIAALYAPDATLEDPVGADLVRG FT RAAIEEFYGALAGAKVSTELLAVRAVAGHAAFSFRVTTDAGDQQYVVEPIDVMTFDADG FT QITSMRAFWAPGDMVVTPA" FT -35_signal 466..471 FT -10_signal 499..504 FT RBS 572..581 FT CDS 591..2228 FT /transl_table=11 FT /gene="ORF2" FT /product="hypothetical protein" FT /note="low similarity to phytoene dehydrogenase from FT Aphanocapsa sp., 18.9% identity in 376 aa overlap and 63% FT in 11 aa overlap in dehydrogenase specific 30 aa region of FT C-terminal end" FT /db_xref="UniProtKB/TrEMBL:O05089" FT /protein_id="CAB07541.1" FT /translation="MVAPAPRVAAPEDVARGVADRVEPLDRAAVGAQRTAELVGAEAAA FT VPRSLSTTLTAYRSPRSRGAQVRGSASRSGRRSSGRRRCCPGGSRRRCRRRHRVEPAQG FT RVNASSGRPAARASSARCRAGARSPRRTTPGGGPSAGRASRRARSCCGTTVEDQPHRAD FT RVAGPVAAEHGLPEGHVVGRLVDEPAAVAVDHDRAGQRALGQQHLRRPAGQRGDGGEPP FT GLVHQRHSGAHLGGELDRVAGVALVAQAPVVEELGLVAVPHVHVVVEPAGREDDAPARG FT HGDPAAVALEHRADHPVALGDQLDQRGLAPDRDAGAQRAVEQPGRERLPAGQVVAADHH FT AADALSGAAQDARQALAGLARDQVHPLVVRTGDRHRDRCLDDARPQQRPGLAQHRRVER FT LALDAAPRGVAARQLRVVVGVAARPHQLERRRPPQHPDGLGAVPQERLPAGPSGRRHPP FT PPRGRSRRARACRASRASVSAGLAASRRRPERAVEPPTYSVFSTTRTWSPLLARAEGSG FT EPGARADDEQVHGGVGHDGVLSQVAALN" FT -35_signal 2231..2236 FT -10_signal 2246..2255 FT RBS 2270..2273 FT CDS 2291..2929 FT /transl_table=11 FT /gene="ORF3" FT /product="hypothetical protein" FT /function="potential regulatory protein" FT /note="43.1% homology in 58 aa overlap with repressor FT protein of AcrAB and 45.7% homology in 46 aa overlap with FT repressor protein of AcrEF from E. coli" FT /db_xref="GOA:O08306" FT /db_xref="InterPro:IPR001647" FT /db_xref="InterPro:IPR009057" FT /db_xref="InterPro:IPR012287" FT /db_xref="UniProtKB/TrEMBL:O08306" FT /protein_id="CAB07542.1" FT /translation="MGDDAVMSSTAERIRPGRSGILAAATRLFATHGVSGTSLQQIADA FT AGITKAAVYHHFPTKEEVVVVAVLAPALEGEPGHRPHRRRPRGPSGQRPRPPSPRRTRP FT SPTARAGPCSSRTQQIEEYVRNNPDHDELFDRCAMLLTGPDPTPGTGLQVSLFLSGLLG FT PAQDPSCADIDDDALRGHRPGRTPAPAGRRRRLTWRGVEPAARRAGSRP" FT CDS complement(804..2192) FT /transl_table=11 FT /gene="ORF4" FT /product="hypothetical protein" FT /function="potential steroid modyfication enzyme" FT /note="30.4% and 38% homology in 168 aa overlap with FT 3-oxosteroid dehydrogenase from P. testosteroni and A. FT simplex, respectively. 90% homology in 12 aa overlap of FT specific region for enzyme activity" FT /db_xref="GOA:O05090" FT /db_xref="InterPro:IPR003953" FT /db_xref="UniProtKB/TrEMBL:O05090" FT /protein_id="CAB07543.1" FT /translation="MSDTTVDLLVIGSGTGLAAALSAREQGAPRPRRREDRVRRRLDRP FT LGPPSGCGQPGAHRGPARATRSSAARPTSRRWWVTTPRRPRWQAFLRHGPETIRMLRRT FT TSLQLMWARGYSDYHPELPGGDAAGRSIESKPFDASVLGESRALLRPGVVEAPVPMPVT FT GADYKWMNLVARKPGKGLPRVLRRAAQGIGGMVIGRDYLAGGQALAAGLFDGALRAGIP FT IWRETTLVELVTEGDRVVGAVLERDGGRVTVTARRGVVLAAGGFDHDMDMRHRYQAEFL FT DNWSLGNEGNTGDAIKLAAEVGARVTLMDQTWWFPAVAPLPRGTPQVLLAERSLPGSIM FT VDGHGRRFINESTDYMTFGQTVLGRDRAGDPVGSMWLVFDSRTATATCSPARSSRGWPS FT PRSGTTRGSRTRPAPRRAGPRRRPARGRVHATLRRFNTMAAPASTTTSTGATAPTTATT FT ATRP" FT RBS complement(2200..2206) FT -10_signal complement(2222..2227) FT -35_signal complement(2255..2263) XX SQ Sequence 3289 BP; 421 A; 1245 C; 1169 G; 454 T; 0 other; gcatgctctt cagccacctc gccgtgcggc acatgggcac cgaggacgcg cgatgagcgc 60 cgaggtgaag gccgccgtgg cgcgctacct cgatgctgtc gccggcggct cgccggccgc 120 gatcgccgcg ctctacgccc ccgacgccac gctcgaggac cccgtcggcg ccgacctcgt 180 ccgcggccgc gcggcgatcg aagagttcta cggcgccctc gccggcgcga aggtcagcac 240 cgagctgctc gccgtccgcg ccgtcgcggg ccacgccgcg ttctcgttcc gggtcaccac 300 cgacgccggc gaccagcagt acgtcgtcga gccgatcgac gtgatgacgt tcgacgcgga 360 cggccagatc acgtccatgc gggcgttctg ggcgcccggg gacatggtcg tcacgccggc 420 ctgacggtcc cgctgtaaca cgctgtccac cgcgcttccc ggcggttgtc gacgcgctct 480 cggcgtgtcg cacgcgtgtc gcgccgtgga cagcgtgtta cagcggcggg ggccgtcagg 540 cggtggccgc gtgggtggcg acgatgtggc cgaagaccag accctggccg atggtcgcgc 600 cggccccccg ggtagctgcg cccgaagacg ttgcccgcgg tgttgccgat cgcgtagagc 660 ccctcgatcg ggctgccgtc ggcgcgcagc ggacggccga gctcgtcggc gctgaggccg 720 ccgcagtgcc gaggtcgctc agcaccacct tgacggcgta caggtcgccg cggtcgaggg 780 gggcgcaggt tcggggttcg gcgtcacggt cgggtcgccg tagtagcggt cgtaggcgct 840 gttgccccgg tggaagtcgt cgtcgatgcc ggcgccgcca tcgtgttgaa ccggcgcagg 900 gtcgcgtgaa cgcgtcctcg ggcaggccgg cggcgcgggc cagctcggcg cggtgccggg 960 cgggtgcgcg atccccgcgt cgtaccactc ctgggggagg gccatccgcg ggaagagcga 1020 gccggcgagc acgtagctgt tgcggtacga ctgtcgaaga ccagccacat cgagccgacc 1080 gggtcgccgg cccggtcgcg gccgagcacg gtctgcccga aggtcatgta gtcggtcgac 1140 tcgttgatga accggcggcc gtggccgtcg accatgatcg agccgggcag cgagcgctcg 1200 gccagcagca cctgcggcgt cccgcggggc agcggggcga cggcggggaa ccaccaggtc 1260 tggtccatca gcgtcactcg ggcgcccacc tcggcggcga gcttgatcgc gtcgccggtg 1320 ttgccctcgt tgcccaggct ccagttgtcg aggaactcgg cctggtagcg gtgccgcatg 1380 tccatgtcgt ggtcgaaccc gccggccgcg aggacgacgc cccggcgcgc ggtcacggtg 1440 acccggccgc cgtcgcgctc gagcaccgcg ccgaccaccc ggtcgccctc ggtgaccagc 1500 tcgaccagcg tggtctcgcg ccagatcggg atgccggcgc gcagcgcgcc gtcgaacagc 1560 ccggccgcga gcgcctgccc gccggccagg tagtcgcggc cgatcaccat gccgccgatg 1620 ccctgagcgg cgcggcgcag gacgcgcggc aggcccttgc cgggcttgcg cgcgaccagg 1680 ttcatccact tgtagtccgc accggtgacc ggcatcggga ccggtgcctc gacgacgccc 1740 ggccgcagca gcgcccggga ctcgcccagc accgacgcgt cgaacggctt gctctcgatg 1800 ctgcgccccg cggcgtcgcc gcccggcagc tccgggtggt agtcggagta gccgcgcgcc 1860 cacatcagct ggagcgacgt cgtccgccgc agcatccgga tggtctcggg gccgtgccgc 1920 aggaacgcct gccagcgggg ccttcggggc gtcgtcaccc accaccgcct cgaggtaggt 1980 ctcgccgcgc tcgagcgtgt cgcgcgagcc gggcctcggt gagcgccggg ttggccgcat 2040 ccagaaggcg gcccgagcgg gcggtcgagc ctccgacgta ctcggtcttc tcgacgacga 2100 ggacgtggag ccccctgctc gcgcgcgctg agggcagcgg cgagcccggt gcccgagccg 2160 atgacgagca ggtccacggt ggtgtcggac atgacggggt cctttcgcag gtcgcggccc 2220 taaactagcg ttcggctagt tattctagac cgcatgacag ggaaagccca ggatccccgg 2280 gcgaccgacc atgggcgatg atgcggtcat gagcagcacc gccgaacgca tccgcccggg 2340 ccgcagcggc atcctcgccg ccgcgacccg gctcttcgcc acgcacggcg tctccggcac 2400 ctcgctgcag cagatcgcgg acgccgccgg gatcaccaag gccgccgtct accaccactt 2460 ccccaccaag gaggaggtcg tcgtcgtcgc cgtcctggcg cccgccctcg agggcgagcc 2520 agggcatcgt ccgcaccgcc ggcgccctcg aggacccagc gggcagcgac cgaggccgcc 2580 atcgcctcgc cggaccaggc cgtcaccaac cgccagagct gggccgtgct cctccaggac 2640 gcaacagatc gaggagtacg tccgcaacaa ccccgaccac gacgagctct tcgatcgctg 2700 cgccatgctc ctcaccggcc cggatcccac cccgggcacc gggctccagg tctccctctt 2760 cctctccggc ctgctcgggc ccgcgcagga ccccagctgc gccgacatcg acgacgacgc 2820 gctgcgcggg catcgtccgg gccggacgcc ggctcctgct ggccgacgac gacgcctgac 2880 ctggcggggg gttgagcccg cagcgaggcg ggcaggatcg cgtccatgac cgctcctgag 2940 gcgttccccc cgccgccgcc gtacgctccc acgcccgcgc cgacgcccgg gtcgggcccg 3000 tcccccggtc acccgcccgg agctcgatgg tgctgcgcca ctcgcggcgc gtcatcgggc 3060 tggtcgtcac actgtcggca tcctggtgtt cgactacggg tcgggcaagt acctccagga 3120 gcgcgcccgc aacttcggtg acgccgcgat cacgggcaac ctcgtcgtga tggccctcgg 3180 cgccctgatc ctcctgccct cgcggccagc gcccggctct tccgggctcg gcccggtcct 3240 cgccggtctc gtctggggag ggctgccgtt cgtctggtac ctcgtcgac 3289 // ![]() |