![]() |
EBI DbfetchID L03425; SV 1; linear; genomic DNA; STD; PRO; 3695 BP. XX AC L03425; XX DT 22-OCT-1992 (Rel. 33, Created) DT 14-NOV-2006 (Rel. 89, Last updated, Version 5) XX DE Bacillus circulans beta-D-galactosidase (bgaB) gene, complete cds and DE unidentified orfs. XX KW beta-D-galactosidase; bgaB gene; direct repeat; Shine-Dalgarno sequence. XX OS Bacillus circulans OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. XX RN [1] RP 1-3695 RA Nelms J., Fotheringham I.G.; RT "Two new beta-d-galactosidases from Bacillus circulans: A common sequence RT homology surrounding the putative active site region in several RT B-glycosidase families"; RL Unpublished. XX CC Original source text: Bacillus circulans (individual_isolate G1) CC DNA. XX FH Key Location/Qualifiers FH FT source 1..3695 FT /organism="Bacillus circulans" FT /isolate="G1" FT /mol_type="genomic DNA" FT /db_xref="taxon:1397" FT CDS <1..1102 FT /codon_start=2 FT /transl_table=11 FT /standard_name="orf1" FT /product="unknown" FT /db_xref="GOA:P48843" FT /db_xref="HSSP:1FOB" FT /db_xref="InterPro:IPR013781" FT /db_xref="UniProtKB/Swiss-Prot:P48843" FT /protein_id="AAA22259.1" FT /translation="LKEEAFILGMDVSFMDEIEQHGGSYRDENGQQEDLLTLLKMGDAN FT AIRLRIWNDPVGGFCNLERTVAVAKRVKEHGLHFLLDFHYSDRWADPANQWKPKAWEKL FT SYEELQRAVCNYTADVLRTLKEHDALPDMVQVGNEITPGMLWDEGRVSGEEHDTDEQWE FT RFAGLVKYGIAAVKSVDSEIKIMIHIDRGGDNAESRKFYDRFEALGVEFDIIGLSYYPW FT WHGTLDALRDNLHDLAERYGKPINVVETAYPWTLEQPDGHEWILNQEELLLPGYPASVE FT GQTRYLKDLLQIVREVPGGLGAGFYYWEPAWIPSKEEWSVGHPNNWGNLTMFDFKGQKL FT QSFSALKAGLENETEWDEQPNAALIK" FT gene 1094..3210 FT /gene="bgaB" FT repeat_region 1094..1118 FT /rpt_type=DIRECT FT /rpt_unit_range=1094..1105 FT /rpt_unit_range=1107..1118 FT RBS 1124..1129 FT /gene="bgaB" FT /standard_name="Shine-Dalgarno sequence" FT CDS 1138..3210 FT /codon_start=1 FT /transl_table=11 FT /gene="bgaB" FT /product="beta-D-galactosidase" FT /db_xref="GOA:Q45093" FT /db_xref="HSSP:1KWG" FT /db_xref="InterPro:IPR013529" FT /db_xref="UniProtKB/TrEMBL:Q45093" FT /protein_id="AAA22260.1" FT /translation="MTYKYSPVSSKVPRMLHGADYNPEQWLRYPEVLEEDIRLMKLAKC FT NVMSIGIFSWVSLEPEEGVYTFEWLDQVLDRFAANGIYAFLATPSGARPAWMSAKYPEV FT LRVGANRVRNLHGFRHNHCYTSPVYREKVTAINTKLAERYSDHPAVIGWHISNEFGGDC FT HCDYCQDAFRGWVKNKYGTLDELNHSWWTTFWSHTVTDWSQVESPAPHGETQVHAMNLD FT WRRFVTDQTADFIVHETKPLKAKNPDLPVTTNLMEFYEGLNYWKFADILDFLSWDSYPT FT WHDADEEDKLASRIAMMHDIVRSIKGGQPFLLMESTPSSTNWQEVSKLKKPGMHLLSSL FT QAVAHGSDSVQYFQWRKSRGSSEKLHGAVVDHVGTEHTRVFQDVTDVGTALEGMEAIVG FT TSVPAEVGIIFDWENRWAVNDSQGPRNIGVKYEQTVEAHYEAFWKKGVAVDVIDMDADL FT SKYKLLVAPMLYLVREGVGERIEQFVENGGTFVATYWSGIVNENDLCFLGGFPGPLRKT FT LGIWSEEIDGLHDRDLNGVIPVKGNALQLNAEYDAIELCDLIHLEGAEALATYRSDFYA FT GRPALTVNRLGAGKAYYIATRTKAPFYDDFYGSLIADLGIERALETQLPAGVTAHIRTD FT GTADYVFVQNYTPETKQVQLDEQSYSDLLSGDMMEGSLELQPYDIQVLRRATERK" FT RBS 3409..3413 FT /standard_name="Shine-Dalgarno sequence (for orf2)" FT CDS 3421..>3695 FT /codon_start=1 FT /transl_table=11 FT /standard_name="orf2" FT /product="unknown" FT /note="product is 46% homologous to B. stearothermophilus FT malic enzyme; product unknown" FT /db_xref="UniProtKB/TrEMBL:Q45094" FT /protein_id="AAA22261.1" FT /translation="MNQRNLDGNSFIIRLEMTTKDIKFGEVASAISEAGGDIIAIDVIS FT TNQDVSVRDLTVAVTDAQDNSKIIEGVRQLKGVIIINVSDRTFLLH" XX SQ Sequence 3695 BP; 990 A; 774 C; 1044 G; 887 T; 0 other; tttaaaggaa gaggcattca ttcttggaat ggatgtgtca tttatggatg aaattgagca 60 gcatggtggg agctatcgtg atgagaacgg gcagcaggaa gacttgctga cccttctcaa 120 gatgggagac gccaacgcaa ttcgtttgcg tatatggaac gaccctgtag gcggattctg 180 taatctggag cgaacggtgg cggttgccaa acgggtcaag gagcacggcc tgcatttctt 240 gcttgatttc cattattccg atcgctgggc tgatcctgcc aatcaatgga agccaaaggc 300 ctgggagaaa ctgtcttatg aggaattgca acgtgcggtg tgtaactata cggcagatgt 360 gctgagaaca ctcaaggagc atgatgccct gccggatatg gtacaggtag ggaatgaaat 420 tacgccgggc atgttatggg atgaagggcg agtcagcgga gaagaacatg atacggatga 480 acagtgggag cgttttgctg ggcttgtgaa gtatggtatt gctgcagtta aatccgttga 540 ttcggaaatc aagattatga tccatattga ccgcggcggg gataatgcag agagccgcaa 600 gttctatgat cgctttgaag cgcttggggt ggagtttgat atcattggac tctcttatta 660 tccctggtgg catggaacac tggacgcgtt gcgggacaat ctgcacgact tggctgaacg 720 gtacgggaaa ccgatcaacg ttgttgaaac ggcttatcct tggacactgg agcaacctga 780 tggccacgag tggattctga atcaggaaga attgctgttg ccagggtatc cggcaagtgt 840 ggaaggacag acacgttatc tgaaggatct gctgcaaatt gttcgtgaag ttcccggcgg 900 tctcggtgcc ggattctact attgggagcc tgcctggatt ccaagcaagg aagaatggtc 960 tgttggccat ccgaataact gggggaacct gacgatgttt gacttcaagg gccagaagtt 1020 gcaatcgttt tcagcactca aggccggact ggaaaatgaa acggaatggg atgagcagcc 1080 gaatgctgcg ttgatcaaat agataaatca aattgaaatg attaaggagt tgtcagcatg 1140 acatacaaat attcacccgt aagttcgaaa gtgccgcgca tgttacatgg cgcagactat 1200 aatccggagc aatggcttcg ctatcctgaa gtgcttgaag aagatattcg attgatgaag 1260 cttgccaaat gtaatgtgat gtccattggc atcttctcat gggtatccct tgagccggag 1320 gaaggcgtgt acacgtttga atggctggat caagtactgg atcgttttgc tgccaatgga 1380 atctatgcat tcctggctac cccaagcgga gcaagacccg cctggatgtc ggcaaaatac 1440 cctgaagtgc tccgtgtagg ggccaatcgg gttcgcaacc tgcatggctt ccgtcataac 1500 cactgctaca cctccccggt gtaccgggag aaggttacag caatcaatac aaaacttgca 1560 gaacgttact cggatcatcc cgcagtcatt ggctggcaca tctccaatga atttggcggg 1620 gattgccact gtgattattg ccaggacgct tttcgcggat gggtgaaaaa caaatacggc 1680 acgctggatg agctgaatca ttcctggtgg acgacattct ggagtcatac cgtaacggac 1740 tggagtcaag tggagtcacc tgccccacac ggagagacac aggtccacgc gatgaatttg 1800 gactggcgca gattcgttac ggatcagacc gctgatttta tcgtgcacga aaccaagccg 1860 ctgaaggcca aaaatccgga cctgccggtg acaacaaatc tgatggagtt ttatgaaggg 1920 ttaaactatt ggaagttcgc ggacattctg gatttcctct cctgggacag ctacccaacc 1980 tggcatgatg cagacgaaga agacaagctg gcatcgagaa tagctatgat gcacgatatc 2040 gttcgttcaa ttaaaggtgg gcagcctttc ctgttgatgg agagtactcc gagttcaacc 2100 aactggcaag aggtgagcaa gctgaaaaag cctggcatgc atctgctttc ttctctccag 2160 gcggtagcac acggttcaga tagtgtacag tacttccaat ggagaaaaag tcgaggttcc 2220 agtgagaaac ttcacggcgc ggtagtagat catgttggaa cagaacacac tcgtgtgttc 2280 caggatgtaa cggacgtggg aacggctctt gaaggcatgg aagccattgt aggaacatcg 2340 gtaccggcag aggtgggcat catcttcgat tgggaaaatc gttgggccgt taatgattca 2400 caaggtccgc gcaatatcgg ggtgaagtat gagcaaacgg ttgaagccca ttacgaagca 2460 ttctggaaaa agggagttgc tgttgacgta attgatatgg atgccgacct gtccaaatac 2520 aagctgctgg tcgcacctat gctctacctg gtgcgtgaag gggtcgggga acggattgaa 2580 caattcgttg aaaatggcgg tacattcgta gctacgtatt ggtcaggcat cgtcaatgag 2640 aatgatctgt gtttcctggg cggtttcccg ggaccgctcc gcaaaacgct tggaatctgg 2700 tcggaagaaa tcgacggatt gcatgatcgg gatttgaacg gggtgatccc agtgaagggc 2760 aacgcgcttc aactgaatgc cgagtatgat gcaattgaat tatgcgacct gattcatctg 2820 gaaggtgctg aagcactggc tacgtatcgc tccgactttt atgctggccg accagcgtta 2880 acggttaacc gtctgggagc aggtaaagca tattatatag caacacgtac caaagcaccg 2940 ttttatgatg atttctatgg aagtctaatc gctgacctgg gcattgaacg tgcgcttgaa 3000 acgcagcttc ctgccggagt tacggcacat atccgaacgg atggaacagc tgattatgtg 3060 tttgtacaga actacacacc agaaacgaag caagttcaat tggacgagca gtcctacagt 3120 gatctgttga gcggtgatat gatggagggg agtttggagc tccagccata cgatatccag 3180 gtcctgcgca gagcaacgga gcgaaaatag gatatggtga tccattaagc agaatgaatg 3240 gagtccgcgt acagaaggct gcccttaccg tcttattaga cgataaaagg gtggcctttt 3300 tgtcatccaa agggcaagtt gagtactttt tctgggcaag ataaaggata aacgtgtccg 3360 cgcagcgaag acaaacaggt aacattgtaa ctcatgactt tcagtctggg aggaattgga 3420 atgaatcaaa ggaatctaga cggcaacagc ttcattattc ggctggaaat gacaacaaag 3480 gatatcaaat ttggcgaagt ggcttcggcc atctcggaag ctggcgggga tatcattgcc 3540 atcgacgtga tttcgaccaa tcaggatgta agtgtgcgcg acttgactgt cgccgtaacg 3600 gatgcacaag ataacagcaa aattatagaa ggggtgcgcc agcttaaagg tgtaatcatt 3660 attaacgtat cggatcggac gttcctgctt cattt 3695 // ![]() |