![]() |
EBI DbfetchID X60545; SV 1; linear; genomic DNA; STD; PRO; 2590 BP. XX AC X60545; XX DT 07-AUG-1991 (Rel. 29, Created) DT 18-APR-2005 (Rel. 83, Last updated, Version 6) XX DE C.thermocellum celF gene for endo-1,4-beta-glucanase XX KW celF gene; cellulase. XX OS Clostridium thermocellum OC Bacteria; Firmicutes; Clostridia; Clostridiales; Clostridiaceae; OC Clostridium. XX RN [1] RP 1-2590 RA Beguin P.; RT ; RL Submitted (18-JUL-1991) to the EMBL/GenBank/DDBJ databases. RL P. Beguin, Institut Pasteur, 28 Rue du Dr Roux, 75724 Paris, Cedex 15, RL France XX RN [2] RP 1-2590 RX DOI; 10.1016/0923-2508(91)90002-R RX PUBMED; 1805307. RA Navarro A., Chebrou M.C., Beguin P., Aubert J.P.; RT "Nucleotide sequence of the cellulase gene celF of Clostridium RT thermocellum"; RL Res. Microbiol. 142(9):927-936(1991). XX FH Key Location/Qualifiers FH FT source 1..2590 FT /organism="Clostridium thermocellum" FT /strain="NCIB 10682" FT /mol_type="genomic DNA" FT /db_xref="taxon:1515" FT mRNA 53..>2490 FT sig_peptide 271..351 FT CDS 271..2490 FT /transl_table=11 FT /gene="celF" FT /product="cellulase" FT /EC_number="3.2.1.4" FT /db_xref="GOA:P26224" FT /db_xref="InterPro:IPR001701" FT /db_xref="InterPro:IPR001956" FT /db_xref="InterPro:IPR002105" FT /db_xref="InterPro:IPR008928" FT /db_xref="InterPro:IPR008965" FT /db_xref="InterPro:IPR012341" FT /db_xref="InterPro:IPR016134" FT /db_xref="InterPro:IPR018221" FT /db_xref="InterPro:IPR018242" FT /db_xref="InterPro:IPR018247" FT /db_xref="UniProtKB/Swiss-Prot:P26224" FT /protein_id="CAA43035.1" FT /translation="MKKILAFLLTVALVAVVAIPQAVVSFAADFNYGEALQKAIMFYEF FT QRSGKLPENKRNNWRGDSALNDGADNGLDLTGGWYDAGDHVKFNLPMAYAVTMLAWSVY FT ESRDAYVQSGQLPYILDNIKWATDYFIKCHPSPNVYYYQVGDGALDHSWWGPAEVMQMP FT RPSFKVDLTNPGSTVVAETAAAMAASSIVFKPTDPEYAATLLRHAKELFTFADTTRSDA FT GYRAAEGYYSSHSGFYDELTWASIWLYLATGDQSYLDKAESYEPHWERERGTTLISYSW FT AHCWDNKLYGSLLLLAKITGKSYYKQCIENHLDYWTVGFNGSRVQYTPKGLAYLDRWGS FT LRYATTQAFLASVYADWSGCDPAKAAVYKEFAKKQVDYALGSTGRSFVVGFGKNPPRNP FT HHRTAHSSWSALMTEPAECRHILVGALVGGPDGSDSYVDRLDDYQCNEVANDYNAGFVG FT ALAKMYEKYGGEPIPNFVAFETPGEEFYVEAAVNAAGPGFVNIKASIINKSGWPARGSD FT KLSAKYFVDISEAVAKGITLDQITVQSTTNGGAKVSQLLPWDPDNHIYYVNIDFTGINI FT FPGGINEYKRDVYFTITAPYGEGNWDNTNDFSFQGLEQGFTSKKTEYIPLYDGNVRVWG FT KVPDGGSEPDPTPTITVGPTPSVTPTSVPGIMLGDVNFDGRINSTDYSRLKRYVIKSLE FT FTDPEEHQKFIAAADVDGNGRINSTDLYVLNRYILKLIEKFPAEQ" FT misc_feature 352..1680 FT /note="catalytic domain" FT mat_peptide 352..2487 FT /product="cellulase" FT /EC_number="3.2.1.4" FT misc_feature 1783..2184 FT /note="cellulose-binding domain" FT misc_feature 2278..2466 FT /note="duplicated segment" XX SQ Sequence 2590 BP; 781 A; 491 C; 617 G; 701 T; 0 other; tatataatat tcaattaatg ttcaaatttg atgtaaatca atagaaatta atcgaagtaa 60 taccattcag caatccgttt ttcctgtcat cactgtaaaa acaaagtatc cgttaaatgt 120 tgaaaaattt ccttgtaaga aatgcatagt gttgaaaaaa cagagtttgt gaaaggggag 180 atgcagcaaa accgatatta cgaaaaaaat aaatggttaa ataatttatt aataattttt 240 aaagttttta ataaaagggg gagatttaag ttgaagaaaa ttttggcgtt tttgctgaca 300 gttgcgctgg tggcagtagt ggccattcca caagccgtgg taagttttgc tgcggatttc 360 aactatggtg aggcacttca gaaagcaata atgttttatg agttccagcg ctcgggaaaa 420 ctgcccgaaa acaaaagaaa caactggcgt ggagattccg ctcttaatga cggcgcagac 480 aacggtttgg accttacagg cggttggtat gatgccggtg accatgtaaa gttcaacctt 540 ccgatggcct atgccgttac catgctcgca tggagtgttt atgaatcccg ggatgcgtat 600 gtacaaagcg gacagcttcc ttacatactg gacaatatta aatgggctac cgactacttt 660 ataaaatgcc atccaagtcc aaatgtatat tattatcagg tgggagacgg agcattggac 720 cattcatggt ggggacctgc tgaagtaatg cagatgccaa gaccgtcctt caaagtggat 780 ttgaccaatc cgggttcgac tgtggttgct gagacggcag cggctatggc tgcatcctca 840 attgttttca agcctacaga cccggaatat gctgccacac ttttaaggca tgcgaaagaa 900 ctctttactt ttgccgacac cacaagaagt gacgcaggat atagagcggc agagggatac 960 tattcatccc acagcggttt ttatgatgaa cttacctggg cgagtatatg gctgtatctt 1020 gcaacaggag accagtctta tcttgataaa gcagaatcct atgaacctca ttgggaaagg 1080 gaaagaggta caactttaat tagttattcc tgggctcatt gctgggataa caaattgtac 1140 ggttctttgc ttttgttagc aaaaattacc ggcaagtctt attacaagca atgtattgaa 1200 aaccatcttg actattggac cgtcggattt aacggaagca gagttcaata tactccaaaa 1260 ggactcgcat atcttgacag atggggttca ttgagatatg caaccacaca ggcgttcctt 1320 gccagcgttt atgcggactg gtccggctgt gacccggcta aggcggctgt ctacaaggaa 1380 tttgcaaaaa aacaggtgga ttatgcatta ggaagcacag gaagaagctt tgtagtaggt 1440 tttggaaaaa atccgccaag aaatcctcac cacaggacgg cccacagctc atggagcgct 1500 ttaatgaccg aacctgcgga gtgcagacat attctggtgg gtgcattggt tggcggaccg 1560 gacggttcgg attcatatgt tgacaggctc gatgattatc agtgcaatga ggtggccaac 1620 gactataatg ctggatttgt aggtgctctt gccaagatgt atgagaagta tggcggagaa 1680 ccgattccga atttcgttgc ttttgaaaca ccgggggaag aattttatgt tgaagctgcg 1740 gtaaatgctg caggacccgg ttttgtaaat atcaaagctt caataatcaa caagtccggt 1800 tggccggcaa gaggttcaga taaattgtca gccaagtatt ttgtcgatat ttccgaagct 1860 gttgcaaaag gcattacttt ggatcaaatt accgttcagt cgactactaa tggcggagcc 1920 aaggtttcac agcttcttcc gtgggatccg gacaatcata tttattatgt aaacattgac 1980 tttacgggaa taaacatatt ccccggagga ataaatgaat acaagaggga tgtatatttc 2040 actattacgg cgccgtatgg agagggtaac tgggacaata ccaacgactt ctccttccag 2100 ggacttgagc agggctttac aagcaaaaag actgaatata taccgttgta tgacggtaat 2160 gtgagagtat ggggtaaagt accggacgga ggttcggagc ccgatccgac gccgacaatc 2220 accgttggcc ccactccttc ggttacaccg acatcagtac ctggaataat gctcggagat 2280 gtgaattttg acggaagaat aaactcgacg gattattcac gcttaaaaag atatgtaata 2340 aagtctttgg aattcacaga tcctgaagag caccagaagt tcattgcagc tgcggatgtt 2400 gacgggaacg gaagaataaa ctccacagat ttgtatgtgc tcaacaggta catattaaaa 2460 cttattgaaa aattcccggc tgaacagtaa cggcattaat attcaataac aattatcagg 2520 ctgctgcgaa agttgctttt tgcggagttt tatttttatt gcaaaactac ctgtgattat 2580 tgataaaata 2590 // ![]() |