![]() |
EBI DbfetchID M29047; SV 1; linear; genomic DNA; STD; PRO; 2310 BP. XX AC M29047; M29681; XX DT 20-APR-1990 (Rel. 23, Created) DT 17-APR-2005 (Rel. 83, Last updated, Version 7) XX DE F.succinogenes endoglucanase 3 (cel3) gene, complete cds. XX KW cellobiosidase; endoglucanase. XX OS Fibrobacter succinogenes OC Bacteria; Fibrobacteres; Fibrobacterales; Fibrobacteraceae; Fibrobacter. XX RN [1] RP 1-2310 RX PUBMED; 2676979. RA McGavin M.J., Forsberg C.W., Crosby B., Bell A.W., Dignard D., Thomas D.Y.; RT "Structure of the cel-3 gene from Fibrobacter succinogenes S85 and RT characteristics of the encoded gene product, endoglucanase 3"; RL J. Bacteriol. 171(10):5587-5595(1989). XX CC Draft entry and computer-readable sequence for [1] kindly submitted CC by D.Dignard, 14-OCT-1989. XX FH Key Location/Qualifiers FH FT source 1..2310 FT /organism="Fibrobacter succinogenes" FT /mol_type="genomic DNA" FT /db_xref="taxon:833" FT RBS 167..172 FT /note="ribosome binding site" FT sig_peptide 177..245 FT /note="endoglucanase 3 signal peptide A (alt.)" FT sig_peptide 177..251 FT /note="endoglucanase 3 signal peptide A' (alt.)" FT CDS 177..2153 FT /codon_start=1 FT /transl_table=11 FT /note="endoglucanase 3 precursor" FT /db_xref="GOA:P14250" FT /db_xref="HSSP:1CEO" FT /db_xref="InterPro:IPR013781" FT /db_xref="UniProtKB/Swiss-Prot:P14250" FT /protein_id="AAA24893.1" FT /translation="MQLKNFYPKMSVLGIATVMALTACGDENTQALFANNPVPGAENQV FT PVSSSDMSPTSSDAVIDPTSSSAAVVDPSTLPAEGPITMPEGLGTLVDDFEDGDNLSKI FT GDYWYTYNDNDNGGASIITTPLNEEENIIPGRVNNGSNYALQVNYTLDRGDYEYDPYVG FT WGVQVAPDEANGHFGGLTYWYKGGAHEVHIEITDVEDYDVHLAKFPASRTWKQAVVRFK FT DLVQGGWGKEIPFDAKHIMAISFQAKGNKSKLVTDSLFIDNIYLQDSSEVEKDQPDMEI FT KDPVIPVVEFTEAEITVTNPLQEKAMKYLNKGVNFTNWLENADGKFKSFELGESDVKIL FT ADNGFKSLRLPIDLDLYATNRDAFIAGTDTELKFDDDTLFLVLDSFVEWTAKYNMSFVI FT DYHEYDNSYNTTSAKDPNYIKMMAETWKHVAAHYAESPREDLFFELLNEPDMSDGKVTA FT ATWTTAAQAMIDAIRTVDTKHTILFGDAQWYSITLLAKRTPFTDDNIIYVIHTYEPFAF FT THQGGSWTDYATIHDIPFPYDPAKWSTVSGDFGVNKSTKSYVKTNIKNYYKTGSKEAIL FT EQILKAKKWAATNNVPVIINEFGALNLRSTAESRLNYLTAMREICDTLQIPWTHWGYTG FT NFSVIENGKLIEGLDKALGVGSK" FT mat_peptide 246..2150 FT /note="endoglucanase 3 A (alt.)" FT mat_peptide 252..2150 FT /note="endoglucanase 3 A' (alt.)" FT misc_feature 2172..2213 FT /note="region of dyad symmetry" XX SQ Sequence 2310 BP; 649 A; 653 C; 529 G; 479 T; 0 other; ggatccgggt gcgtcagtta aataaaatat tttttaacgt ttttcgtaca gaaagtggac 60 ttttagacca aaacacttat tacacttttt attccgatat atcattttac atagcataaa 120 accgaccccc aaatatatct ttggtaaaaa agaaaaaatc accttaagag ggttttatgc 180 aactcaagaa tttctatccc aaaatgagcg ttctcggtat cgcaaccgtg atggcactta 240 ccgcctgtgg cgatgaaaat acccaggcac tgttcgccaa caatccggtt ccgggtgccg 300 aaaatcaggt tccggtttct agcagcgaca tgagcccgac ctctagcgac gctgtcattg 360 acccgacctc cagctctgcc gcagtggtcg acccgtctac gctccctgca gaaggtccta 420 ttaccatgcc ggaaggtctc ggcactttgg tcgatgactt tgaagatggc gataacttga 480 gcaaaatcgg tgattactgg tacacctaca acgataacga caacggtggt gcatccatca 540 tcacgactcc gctaaacgaa gaagaaaaca tcatcccggg ccgcgtcaac aacggttcca 600 actacgcctt gcaagtcaac tacacgcttg atagaggcga ttacgaatac gatccgtacg 660 taggctgggg cgtgcaggtc gcaccggacg aagccaacgg acatttcggc ggccttacct 720 actggtacaa gggcggcgca cacgaagtac atatcgaaat caccgacgtc gaagactacg 780 acgtgcatct cgccaagttc ccggcatccc gcacatggaa gcaggctgtc gtccgcttca 840 aggacctcgt tcaaggtggc tggggcaagg aaattccgtt cgacgccaag cacatcatgg 900 caatcagctt ccaggccaag ggaaacaaga gcaagctcgt gaccgactcc ctcttcatcg 960 acaacatcta cctgcaggat tcttccgaag ttgaaaagga ccagccggat atggaaatca 1020 aggacccggt cattccggtc gttgaattta ccgaagctga aatcactgtg acgaacccgt 1080 tgcaggaaaa ggccatgaag tacctcaaca agggtgtcaa ctttaccaac tggctcgaaa 1140 acgcagatgg caagttcaag tcctttgaat tgggcgaaag cgacgtcaag attcttgccg 1200 acaacggatt caagagcctc cgcttgccga ttgaccttga cctctatgcc acaaaccgtg 1260 acgcattcat cgcaggcacc gacacagaac tcaagttcga tgacgacacc ttgttcctgg 1320 ttctcgactc cttcgtagaa tggaccgcca agtacaacat gtctttcgtg attgactacc 1380 atgaatatga caacagctac aacaccacca gcgctaagga ccccaactac atcaagatga 1440 tggcagaaac gtggaagcat gttgcagccc actacgccga aagcccccgc gaagacttgt 1500 tcttcgaact cttgaacgaa ccggacatga gcgatggtaa ggtcactgca gcaacatgga 1560 ccaccgcagc ccaggccatg attgacgcca tccgcacggt tgataccaag cacaccatcc 1620 tcttcggtga tgcccagtgg tactccatca cgctcctcgc caagcgcact ccgttcaccg 1680 atgacaacat catctacgtg atccacacct acgaaccgtt cgccttcacg catcagggcg 1740 gttcctggac ggactacgcc accatccacg atattccgtt cccctacgat ccggcaaagt 1800 ggtctacggt ttctggcgac ttcggtgtca acaagagcac aaagtcctac gtgaaaacca 1860 acatcaagaa ctactacaag accggcagca aggaagccat cttggaacag attctcaagg 1920 ccaagaagtg ggccgccacc aacaacgtac cggtgatcat caacgaattc ggcgcattga 1980 acctccgctc taccgctgaa tcccgcctca actacctcac ggccatgcgc gaaatctgcg 2040 ataccctcca gattccttgg acgcactggg gctacaccgg caacttctcc gtgatcgaaa 2100 acggcaagtt gattgaaggc ctcgacaagg cactcggcgt cggtagcaaa taagtctctc 2160 cttaaaaccc cctcaaaaaa aggtcacgca gaaatgcgtg gcttttttag taggaagtag 2220 acggtaggaa gttggaagtt agaagtagga agtaacagga atggcgcaat ggatacagtt 2280 gacacagata cattacaaaa ccccggatcc 2310 // ![]() |