EBI Dbfetch

ID   M87018; SV 2; linear; genomic DNA; STD; PRO; 7534 BP.
XX
AC   M87018;
XX
DT   22-OCT-1992 (Rel. 33, Created)
DT   17-APR-2005 (Rel. 83, Last updated, Version 10)
XX
DE   Clostridium cellulolyticum cellulase gene cluster, complete sequence.
XX
KW   .
XX
OS   Clostridium cellulolyticum
OC   Bacteria; Firmicutes; Clostridia; Clostridiales; Clostridiaceae;
OC   Clostridium.
XX
RN   [1]
RP   1-7534
RX   DOI; 10.1016/0378-1119(92)90062-T
RX   PUBMED; 1398087.
RA   Bagnara-Tardif C., Gaudin C., Belaich A., Hoest P., Citard T.,
RA   Belaich J.P.;
RT   "Sequence analysis of a gene cluster encoding cellulases from Clostridium
RT   cellulolyticum";
RL   Gene 119(1):17-28(1992).
XX
RN   [2]
RP   1-7534
RX   DOI; 10.1128/JB.182.7.1910-1915.2000
RX   PUBMED; 10714996.
RA   Gaudin C., Belaich A., Champ S., Belaich J.P.;
RT   "CelE, a multidomain cellulase from Clostridium cellulolyticum: a key
RT   enzyme in the cellulosome?";
RL   J. Bacteriol. 182(7):1910-1915(2000).
XX
RN   [3]
RP   1-7534
RA   Belaich A., Belaich J.P.;
RT   ;
RL   Submitted (26-APR-1993) to the EMBL/GenBank/DDBJ databases.
RL   CNRS, 31 Chemin Joseph Aiguier, Marseille 13009, France
XX
RN   [4]
RC   Sequence update by submitter
RP   1-7534
RA   Belaich A.;
RT   ;
RL   Submitted (27-JUL-1999) to the EMBL/GenBank/DDBJ databases.
RL   CNRS, 31 Chemin Joseph Aiguier, Marseille 13009, France
XX
CC   On Jul 27, 1999 this sequence version replaced gi:926996.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..7534
FT                   /organism="Clostridium cellulolyticum"
FT                   /mol_type="genomic DNA"
FT                   /db_xref="taxon:1521"
FT   CDS             <1..1011
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="unknown protein"
FT                   /note="ORF1"
FT                   /db_xref="GOA:P37698"
FT                   /db_xref="InterPro:IPR000556"
FT                   /db_xref="PDB:1F9D"
FT                   /db_xref="UniProtKB/Swiss-Prot:P37698"
FT                   /protein_id="AAA73866.1"
FT                   /translation="EFYQWLQSAEGAIAGGATNSWNGRYEAVPSGTSTFYGMGYVENPV
FT                   YADPGSNTWFGMQVWSMQRVAELYYKTGDARAKKLLDKWAKWINGEIKFNADGTFQIPS
FT                   TIDWEGQPDTWNPTQGYTGNANLHVKVVNYGTDLGCASSLANTLTYYAAKSGDETSRQN
FT                   AQKLLDAMWNNYSDSKGISTVEQRGDYHRFLDQEVFVPAGWTGKMPNGDVIKSGVKFID
FT                   IRSKYKQDPEWQTMVAALQAGQVPTQRLHRFWAQSEFAVANGVYAILFPDQGPEKLLGD
FT                   VNGDETVDAIDLAILKKYLLNSSTTINTANADMNSDNAIDAIDYALLKKALLSIQ"
FT   RBS             1122..1126
FT                   /gene="CelCCC"
FT   CDS             1140..2522
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /gene="CelCCC"
FT                   /product="endo-beta-1,4-glucanase precursor"
FT                   /EC_number="3.2.1.4"
FT                   /db_xref="GOA:P37699"
FT                   /db_xref="HSSP:1IS9"
FT                   /db_xref="InterPro:IPR019834"
FT                   /db_xref="UniProtKB/Swiss-Prot:P37699"
FT                   /protein_id="AAA73867.1"
FT                   /translation="MIKGSSLKRFKSLVMAAIFSVSIISTAIASSAADQIPFPYDAKYP
FT                   NGAYSCLADSQSIGNNLVRSEWEQWKSAHITSNGARGYKRVQRDATTNYDTVSEGLGYG
FT                   LLLSVYFGEQQLFDDLYRYVKVFLNSNGLMSWRIDSSGNIMGKDSIGAATDADEDIAVS
FT                   LVFAHKKWGTSGGFNYQTEAKNYINNIYNKMVEPGTYVIKAGDTWGGSNVTNPSYFAPA
FT                   WYRIFADFTGNSGWINVANKCYEIADKARNSNTGLVPDWCTANGTPASGQGFDFYYDAI
FT                   RYQWRAAIDYSWYGTAKAKTHCDAISNFFKNIGYANIKDGYTISGSQISSNHTATFVSC
FT                   AAAAAMTGTDTTYAKNIYNECVKVKDSGNYTYFGNTLRMMVLLYTTGNFPNLYTYNSQP
FT                   KPDLKGDVNNDGAIDALDIAALKKAILTQTTSNISLTNADMNNDGNIDAIDFAQLKVKL
FT                   LN"
FT   sig_peptide     1140..1235
FT                   /gene="CelCCC"
FT   mat_peptide     1236..2519
FT                   /gene="CelCCC"
FT                   /product="endo-beta-1,4-glucanase"
FT                   /EC_number="3.2.1.4"
FT   sig_peptide     2607..2711
FT                   /gene="CelCCG"
FT   CDS             2607..4784
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /gene="CelCCG"
FT                   /product="endo-beta-1,4-glucanase precursor"
FT                   /EC_number="3.2.1.4"
FT                   /db_xref="GOA:P37700"
FT                   /db_xref="InterPro:IPR018221"
FT                   /db_xref="PDB:1G87"
FT                   /db_xref="UniProtKB/Swiss-Prot:P37700"
FT                   /protein_id="AAA73868.1"
FT                   /translation="MLKTKRKLTKAIGVALSISILSSLVSFIPQTNTYAAGTYNYGEAL
FT                   QKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLTGGWYDAGDHVKFNLPMSYT
FT                   SAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGGKDHSWW
FT                   GPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHAKNLFD
FT                   MADKAKSDAGYTAASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVPNWGKEQQ
FT                   TDIIAYKWGQCWDDVHYGAELLLAKLTNKQLYKDSIEMNLDFWTTGVNGTRVSYTPKGL
FT                   AWLFQWGSLRHATTQAFLAGVYAEWEGCTPSKVSVYKDFLKSQIDYALGSTGRSFVVGY
FT                   GVNPPQHPHHRTAHGSWTDQMTSPTYHRHTIYGALVGGPDNADGYTDEINNYVNNEIAC
FT                   DYNAGFTGALAKMYKHSGGDPIPNFKAIEKITNDEVIIKAGLNSTGPNYTEIKAVVYNQ
FT                   TGWPARVTDKISFKYFMDLSEIVAAGIDPLSLVTSSNYSEGKNTKVSGVLPWDVSNNVY
FT                   YVNVDLTGENIYPGGQSACRREVQFRIAAPQGRRYWNPKNDFSYDGLPTTSTVNTVTNI
FT                   PVYDNGVKVFGNEPAGGSENPDPEILYGDVNSDKNVDALDFAALKKYLLGGTSSIDVKA
FT                   ADTYKDGNIDAIDMATLKKYLLGTITQLPQG"
FT   mat_peptide     2712..4781
FT                   /gene="CelCCG"
FT                   /product="endo-beta-1,4-glucanase"
FT                   /EC_number="3.2.1.4"
FT   terminator      4801..4862
FT                   /gene="CelCCG"
FT   RBS             4866..4872
FT                   /gene="CelCCE"
FT   sig_peptide     4877..4954
FT                   /gene="CelCCE"
FT                   /note="putative"
FT   CDS             4877..7534
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /gene="CelCCE"
FT                   /product="cellulase precursor"
FT                   /note="putative"
FT                   /db_xref="GOA:Q46002"
FT                   /db_xref="HSSP:1DAV"
FT                   /db_xref="InterPro:IPR018201"
FT                   /db_xref="UniProtKB/TrEMBL:Q46002"
FT                   /protein_id="AAA73869.2"
FT                   /translation="MKKRLVKKVAMLIAIVLVLSSSIGQAFALVGAGDLIRNHTFDNRV
FT                   GLPWHVVESYPAKASFEITSDGKYKITAQKIGEAGKGERWDIQFRHRGLALQQGHTYTV
FT                   KFTVTASRACKIYPKIGDQGDPYDEYWNMNQQWNFLELQANTPKTVTQTFTQTKGDKKN
FT                   VEFAFHLAPDKTTSEAQNPASFQPITYTFDEIYIQDPQFAGYTEDPPEPTNVVRLNQVG
FT                   FYPNADKIATVATSSTTPINWQLVNSTGAAVLTGKSTVKGADRASGDNVHIIDFSSYTT
FT                   PGTDYKIVTDVSVTKAGDNESMKFNIGDDLFTQMKYDSMKYFYHNRSAIPIQMPYCDQS
FT                   QWARPAGHTTDILAPDPTKDYKANYTLDVTGGWYDAGDHGKYVVNGGIATWTVMNAYER
FT                   ALHMGGDTSVAPFKDGSLNIPESGNGYPDILDEARYNMKTLLNMQVPAGNELAGMAHHK
FT                   AHDERWTALAVRPDQDTMKRWLQPPSTAATLNLAAIAAQSSRLWKQFDSAFATKCLTAA
FT                   ETAWDAAVAHPEIYATMEQGAGGGAYGDNYVLDDFYWAACELYATTGSDKYLNYIKSSK
FT                   HYLEMPTELTGGENTGITGAFDWGCTAGMGTITLALVPTKLPAADVATAKANIQAAADK
FT                   FISISKAQGYGVPLEEKVISSPFDASVVKGFQWGSNSFVINEAIVMSYAYEFSDVNGTK
FT                   NNKYINGALTAMDYLLGRNPNIQSYITGYGDNPLENPHHRFWAYQADNTFPKPPPGCLS
FT                   GGPNSGLQDPWVKGSGWQPGERPAEKCFMDNIESWSTNEITINWNAPLVWISAYLDEKG
FT                   PEIGGSVTPPTNLGDVNGDGNKDALDFAALKKALLSQDTSTINVANADINKDGSIDAVD
FT                   FALLKSFLLGKITL"
FT   mat_peptide     4955..7531
FT                   /gene="CelCCE"
FT                   /product="cellulase"
XX
SQ   Sequence 7534 BP; 2454 A; 1361 C; 1627 G; 2092 T; 0 other;
     gaattctatc agtggttgca gtcagcagaa ggtgctattg ccggtggagc tacaaactca        60
     tggaacggac gttatgaagc agttccttca ggtacatcaa cattctatgg aatgggttat       120
     gtagaaaacc ctgtatatgc tgacccaggt agtaacactt ggtttggtat gcaggtatgg       180
     tcaatgcagc gtgtagctga attgtactat aagactggcg atgccagagc taagaaactc       240
     ttagacaaat gggcaaaatg gattaatggc gaaatcaagt tcaatgctga cggaacattc       300
     cagattccta gcacaattga ttgggaagga cagccggata cttggaatcc aacacaggga       360
     tacaccggaa atgcaaactt gcatgttaaa gttgttaact atggtactga cctaggttgt       420
     gcttcttcac ttgcaaacac attgacttac tatgctgcta aatcaggaga tgaaacttca       480
     aggcagaatg cacagaaatt acttgacgct atgtggaata actatagcga ttcaaagggt       540
     atatcaactg ttgaacagcg tggtgattac catagattcc ttgatcagga agtttttgta       600
     ccagctggtt ggactggaaa aatgcctaac ggcgacgtaa tcaaatctgg tgtcaagttc       660
     atagacattc gttccaagta caagcaggat cctgaatggc agacaatggt tgctgcatta       720
     caggcaggac aggttccaac tcagagatta caccgtttct gggctcagag tgaatttgca       780
     gttgcaaatg gagtttatgc aatactcttc ccagatcaag gtccagaaaa attattgggt       840
     gatgtaaacg gtgacgaaac tgtagacgct attgaccttg ctatacttaa aaaatatctt       900
     ttaaacagca gtactacaat aaatactgca aatgcagata tgaatagtga taatgctatt       960
     gacgctattg actatgctct tttaaagaaa gcacttcttt ctatccaata gtatttaata      1020
     ctatgacgca tatgtaacct taaagtccgg acagtatttg gtttgattaa attactcatt      1080
     cttgtactgt ccgggcttat gagttacaaa gaaaaaaaga aaaggattaa ggtaagaaca      1140
     tgatcaaagg ttcaagctta aagagattta aatcgcttgt tatggcggct atatttagtg      1200
     tttcaataat ctcaactgcc atcgcttcaa gtgcagctga tcaaattcct ttcccatatg      1260
     acgcaaaata tccaaatgga gcttacagtt gtctggcgga tagtcagtca atcggaaata      1320
     atttagttcg cagtgaatgg gaacagtgga aaagtgcaca tattacatca aatggagcaa      1380
     gaggctataa aagagtccag agagacgcaa ctacaaacta tgatacggtt tctgaaggac      1440
     ttggatacgg tttgctgctt tcagtttact ttggagaaca acaattattt gacgatttat      1500
     atcgctatgt taaagtattt ttaaattcga acggacttat gtcctggcgt attgactcca      1560
     gcggaaatat aatgggaaaa gacagtattg gtgccgcaac agatgcagat gaagatattg      1620
     cggtatccct tgtgtttgct cataaaaaat ggggaacaag tggaggattt aattaccaga      1680
     ctgaggctaa aaattacata aataacatat acaataaaat ggtagaaccg ggtacatatg      1740
     taataaaagc aggagatacg tggggagggt caaatgtaac taatccgtca tattttgccc      1800
     cagcttggta cagaatcttt gctgacttta caggtaattc cggatggatc aatgtagcaa      1860
     ataaatgtta tgaaatagct gataaagcaa gaaacagtaa tacaggactt gttcctgatt      1920
     ggtgtacagc aaatggtact ccggcatcag gacaaggttt tgatttctat tatgatgcaa      1980
     ttcgttacca gtggagagca gctattgatt acagttggta tggcactgca aaagctaaaa      2040
     cacattgcga tgctatctca aacttcttca agaatattgg gtatgctaat ataaaagatg      2100
     gctacacaat atcaggaagt cagataagtt caaatcatac cgctactttt gtcagctgtg      2160
     ctgctgctgc tgcaatgacg ggtactgaca ctacatatgc aaagaacatt tataacgagt      2220
     gtgttaaagt aaaagattca ggtaactaca cctatttcgg caatacttta agaatgatgg      2280
     ttcttctata tactacgggt aacttcccaa atctatacac ctacaactct caaccaaaac      2340
     cggatttaaa aggcgatgtc aataatgatg gtgctataga tgcacttgat attgctgcac      2400
     tcaagaaggc tattttaact caaacaactt ctaatataag tttaacaaat gctgatatga      2460
     ataatgacgg taatatagat gccattgatt ttgctcagct aaaagttaaa ctgcttaact      2520
     agaataaata aaaataattg agtgagcatc tcaggttaaa tttgtcttaa aaatgtttaa      2580
     atttaatttt agggagtgat ggcaagttgc ttaagactaa aagaaaattg acaaaagcaa      2640
     tcggtgttgc attatcgatt tcaatattat cttcgctagt atcgtttata cctcaaacaa      2700
     atacatatgc agcaggaaca tataactatg gagaagcatt acagaaatca ataatgttct      2760
     atgaattcca gcgttcggga gatcttccgg ctgataaacg tgacaactgg agagacgatt      2820
     ccggtatgaa agacggttct gatgtaggag ttgatcttac aggaggatgg tacgatgcag      2880
     gtgaccatgt gaaatttaat ctacctatgt catatacatc tgcaatgctt gcatggtcct      2940
     tatatgagga taaggatgct tatgataaga gcggtcagac aaaatatata atggacggta      3000
     taaaatgggc taatgattat tttattaaat gtaatccgac acccggtgta tattattacc      3060
     aagtaggaga cggcggaaag gaccactctt ggtggggccc tgcggaagta atgcagatgg      3120
     aaagaccgtc ttttaaggtt gacgcttcta agcccggttc tgcagtatgt gcttccactg      3180
     cagcttctct ggcatctgca gcagtagtct ttaaatccag tgatcctact tatgcagaaa      3240
     agtgcataag ccatgcaaag aacctgtttg atatggctga caaagcaaag agtgatgctg      3300
     gttatactgc ggcttcaggc tactacagct caagctcatt ttacgatgat ctctcatggg      3360
     ctgcagtatg gttatatctt gctacaaatg acagtacata tttagacaaa gcagaatcct      3420
     atgtaccgaa ttggggtaaa gaacagcaga cagatattat cgcctacaag tggggacagt      3480
     gctgggatga tgttcattat ggtgctgagc ttcttcttgc aaagcttaca aacaaacaat      3540
     tgtataagga tagtatagaa atgaaccttg acttctggac aactggtgtt aacggaacac      3600
     gtgtttctta cacgccaaag ggtttggcgt ggctattcca atggggttca ttaagacatg      3660
     ctacaactca ggctttttta gccggtgttt atgcagagtg ggaaggctgt acgccatcca      3720
     aagtatctgt atataaggat ttcctcaaga gtcaaattga ttatgcactt ggcagtaccg      3780
     gaagaagttt tgttgtcgga tatggagtaa atcctcctca acatcctcat cacagaactg      3840
     ctcacggttc atggacagat caaatgactt caccaacata ccacaggcat actatttatg      3900
     gtgcgttggt aggaggaccg gataatgcag atggctatac tgatgaaata aacaattatg      3960
     tcaataatga aatagcctgc gattataatg ccggatttac aggtgcactt gcaaaaatgt      4020
     acaagcattc tggcggagat ccgattccaa acttcaaggc tatcgaaaaa ataaccaacg      4080
     atgaagttat tataaaggca ggtttgaatt caactggccc taactacact gaaatcaagg      4140
     ctgttgttta taaccagaca ggatggcctg caagagttac ggacaagata tcatttaaat      4200
     attttatgga cttgtctgaa attgtagcag caggaattga tcctttaagc cttgtaacaa      4260
     gttcaaatta ttctgaaggt aagaatacta aggtttccgg tgtgttgcca tgggatgttt      4320
     caaataatgt ttactatgta aatgttgatt tgacaggaga aaatatctac ccaggcggtc      4380
     agtctgcgtg cagacgagaa gttcagttca gaattgccgc accacaggga agaagatatt      4440
     ggaatccgaa aaatgatttc tcatatgatg gattaccaac caccagtact gtaaatacgg      4500
     ttaccaacat acctgtttat gataacggcg taaaagtatt tggtaacgaa cccgcaggtg      4560
     gatcggaaaa ccctgatcct gaaatcttgt atggagacgt aaacagcgac aaaaatgtag      4620
     atgcattgga ctttgctgca ttgaagaaat atttacttgg aggcacttcc agcatagatg      4680
     ttaaggctgc agatacatac aaggatggga atattgacgc tatagatatg gctaccttga      4740
     agaagtattt attgggaaca atcacccaat tacctcaagg ctaatagaag ttcagtttgg      4800
     aagtttaatg agtttttaat gcttgcatta ctaaatgtaa gctttaaaaa ataaaaaatt      4860
     ttactaggag gtaaatatga aaaaaaggtt agtgaagaaa gttgcgatgc tcatcgcaat      4920
     agtgctggtt ctatcttctt caataggaca agcatttgcc cttgttgggg caggagattt      4980
     gattcgaaac catacctttg acaacagagt aggtcttcca tggcacgtgg ttgaatcata      5040
     ccctgcaaag gcaagttttg aaattacatc tgatggtaag tacaagataa ctgctcaaaa      5100
     gatcggtgag gcaggaaaag gtgaaagatg ggatatacaa ttccgtcaca gaggactcgc      5160
     attgcaacaa ggtcatactt atacagtaaa gtttactgtt actgctagca gagcttgtaa      5220
     aatttatcct aaaataggtg accagggtga tccatatgat gaatactgga atatgaatca      5280
     acaatggaat ttcctggaat tacaggctaa tactccaaaa actgtaactc agacatttac      5340
     acagactaag ggagataaga agaacgttga atttgctttt caccttgctc ccgataaaac      5400
     tacatctgag gcacagaatc cagcaagttt ccaacctata acatatactt ttgatgaaat      5460
     ttatattcag gaccctcaat ttgcaggata tactgaagat ccacctgaac ctactaatgt      5520
     tgtacgtttg aatcaggtag gtttctatcc taatgctgat aagattgcaa cagtagcaac      5580
     aagttcaaca actccaatta actggcagtt ggttaatagt actggagcag ctgttttaac      5640
     aggtaaatca actgttaaag gtgccgaccg tgcatcaggt gataatgtcc atatcattga      5700
     tttctctagt tacacaacac ctggtaccga ctataagata gtaacagatg tatcagtaac      5760
     aaaagccgga gacaatgaaa gtatgaagtt caatattgga gatgaccttt ttactcaaat      5820
     gaaatacgat tcaatgaagt atttctatca caacagaagt gctattccaa tacaaatgcc      5880
     atactgtgat caatcacaat gggcacgtcc tgcaggacac acaactgata tacttgctcc      5940
     agatccaaca aaggattaca aggctaacta cacacttgac gttacaggtg gttggtatga      6000
     tgccggtgac catggtaagt atgttgttaa tggtggtatt gcaacctgga ccgtaatgaa      6060
     tgcatatgag cgtgcactac acatgggtgg agacacttca gttgctccat ttaaagacgg      6120
     ttcattgaac atacctgaaa gcggaaatgg ctatcctgac atactggacg aagctcgtta      6180
     caatatgaaa acattattaa atatgcaggt tccagcagga aatgaattag ccggtatggc      6240
     tcaccacaaa gctcatgacg aacgttggac agctcttgct gtacgtcccg accaggatac      6300
     aatgaaacgt tggttgcagc ctccaagtac agcagctaca ttaaatctgg ctgctattgc      6360
     tgcacaaagt tcacgtcttt ggaaacagtt tgattctgct ttcgcaacta agtgtttaac      6420
     tgcagcagaa actgcatggg atgcagctgt agctcatcca gaaatatatg caactatgga      6480
     acagggtgcc ggtggtggag catacggaga caactatgtt cttgatgatt tctactgggc      6540
     agcatgtgaa ttgtatgcaa ctacaggcag tgacaagtat ttgaactaca taaagagctc      6600
     aaagcattat ctcgaaatgc ctacagaatt aacaggcggt gagaatactg gaattacagg      6660
     ggcttttgac tggggttgta cagcaggtat gggaacaata acacttgcac ttgtacctac      6720
     aaagcttccg gcagcagatg ttgctacagc taaagctaat attcaagctg cagctgataa      6780
     gttcatatca atttcaaaag cacaaggcta tggtgtacca ctagaagaaa aagtaatttc      6840
     atctcctttt gatgcatctg ttgttaaagg tttccaatgg ggatcaaact cattcgttat      6900
     taatgaagca atagttatgt catatgctta tgaattcagc gatgttaatg gcacaaagaa      6960
     taataaatat attaatggtg ctttaacagc aatggattac ctcctcggac gtaacccaaa      7020
     tattcaaagc tatataactg gttatggtga caacccactt gaaaatcctc atcaccgttt      7080
     ctgggcatac caggcagaca acacattccc aaaaccacct ccgggatgtc tgtcaggagg      7140
     acctaactcc ggcttgcagg atccttgggt taagggttca ggctggcagc caggtgaaag      7200
     acctgctgaa aaatgcttca tggacaatat tgaatcttgg tcaacaaacg aaataaccat      7260
     caactggaat gctcctcttg tatggatatc agcttacctt gatgaaaagg ggccagagat      7320
     tggtgggtca gtgactcctc caactaattt aggagatgtt aacggcgatg gaaacaagga      7380
     tgcattggac ttcgctgcat tgaagaaagc cttgttaagc caggatactt ctactataaa      7440
     tgttgctaat gctgatataa acaaagatgg ttctattgat gcagttgact ttgcattact      7500
     caaatcattc ttgttaggaa aaatcacact gtaa                                  7534
//