spacer
spacer

EBI Dbfetch

ID   U70664; SV 3; linear; genomic DNA; STD; PRO; 5385 BP.
XX
AC   U70664;
XX
DT   12-JAN-1997 (Rel. 50, Created)
DT   18-APR-2005 (Rel. 83, Last updated, Version 5)
XX
DE   Haloferax alicantei glucose-fructose oxido-reductase (gfo) gene, partial
DE   cds; 2-dehydro-3-deoxyphosphogluconate aldolase, 2-keto-3-deoxygluconate
DE   kinase, and beta-D-galactosidase (bgaH) genes, complete cds; and unknown
DE   gene.
XX
KW   .
XX
OS   Haloferax lucentense
OC   Archaea; Euryarchaeota; Halobacteria; Halobacteriales; Halobacteriaceae;
OC   Haloferax.
XX
RN   [1]
RP   2364-4355
RX   DOI; 10.1016/S0167-4838(96)00174-4
RX   PUBMED; 9048905.
RA   Holmes M.L., Scopes R.K., Moritz R.L., Simpson R.J., Englert C.,
RA   Pfeifer F., Dyall-Smith M.L.;
RT   "Purification and analysis of an extremely halophilic beta-galactosidase
RT   from Haloferax alicantei";
RL   Biochim. Biophys. Acta 1337(2):276-286(1997).
XX
RN   [2]
RP   1-5385
RX   DOI; 10.1046/j.1365-2958.2000.01832.x
RX   PUBMED; 10760168.
RA   Holmes M.L., Dyall-Smith M.L.;
RT   "Sequence and expression of a halobacterial beta-galactosidase gene";
RL   Mol. Microbiol. 36(1):114-122(2000).
XX
RN   [3]
RP   1-5385
RA   Holmes M.L., Dyall-Smith M.L.;
RT   ;
RL   Submitted (12-SEP-1996) to the EMBL/GenBank/DDBJ databases.
RL   Microbiology, University of Melbourne, Grattan, Melbourne, Victoria 3052,
RL   Australia
XX
RN   [4]
RC   Sequence update by submitter
RP   1-5385
RA   Holmes M.L., Dyall-Smith M.L.;
RT   ;
RL   Submitted (11-MAY-1999) to the EMBL/GenBank/DDBJ databases.
RL   Microbiology, University of Melbourne, Grattan, Melbourne, Victoria 3052,
RL   Australia
XX
RN   [5]
RC   Sequence update by submitter
RP   1-5385
RA   Holmes M.L., Dyall-Smith M.L.;
RT   ;
RL   Submitted (31-AUG-2001) to the EMBL/GenBank/DDBJ databases.
RL   Microbiology, University of Melbourne, Grattan, Melbourne, Victoria 3052,
RL   Australia
XX
CC   On Aug 31, 2001 this sequence version replaced gi:4773896.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..5385
FT                   /organism="Haloferax lucentense"
FT                   /strain="SB1"
FT                   /mol_type="genomic DNA"
FT                   /note="from a mutant with increased beta-galactosidase
FT                   activity"
FT                   /db_xref="taxon:2254"
FT   CDS             <1..361
FT                   /codon_start=2
FT                   /transl_table=11
FT                   /gene="gfo"
FT                   /product="glucose-fructose oxido-reductase"
FT                   /db_xref="HSSP:1EVJ"
FT                   /db_xref="UniProtKB/TrEMBL:P94801"
FT                   /protein_id="AAB40120.1"
FT                   /translation="VPDEHSSFMVEFDDGTYLAATVSQNAQATTSLRIVGTNGEILVEP
FT                   AFHMETEIRVTRDDVSVTLDTPQVNQMTELFDYAADRILTDAPIGPDGEHGVLDMRLIE
FT                   AIYDAGESGRVVTLD"
FT   CDS             443..1486
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="2-dehydro-3-deoxyphosphogluconate aldolase"
FT                   /note="KDPG aldolase"
FT                   /db_xref="GOA:P94802"
FT                   /db_xref="InterPro:IPR000887"
FT                   /db_xref="UniProtKB/TrEMBL:P94802"
FT                   /protein_id="AAB40121.3"
FT                   /translation="MSHHPARRIRETGLIAIIRGTDADTAIETVEALTRGGVSTVEITA
FT                   NTDGVLGMLRDVSASFTADEVTIGAGTVLDGETARAALLAGAEYLVTPTFDEGVIRTGN
FT                   RYGAPSMVGIASPTEAVNAYEAGAEMVKVFPAGTLGPEFVSALGGPLGHIPTVPTGGVA
FT                   LDTVDEFFDAGATAVGVGSAIVDNDAVFARRFRDHRDQRAFLRRSGRTRTPRLTHTHRS
FT                   EVRRSRLSAVRAANRNVQRKWSLSVDSGVASAEDAFDFVRRHDREVAVNCPFERGGRRA
FT                   VGERCFDWFVRDEPRQERADKGVAGPDGIDRVRLEDRLLVHRSAVSGECTVFPASDHDR
FT                   LEIEIAG"
FT   CDS             complement(1175..2131)
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="2-keto-3-deoxygluconate kinase"
FT                   /note="similar to fructokinases"
FT                   /db_xref="GOA:P94803"
FT                   /db_xref="HSSP:1J5V"
FT                   /db_xref="InterPro:IPR011611"
FT                   /db_xref="UniProtKB/TrEMBL:P94803"
FT                   /protein_id="AAB40122.1"
FT                   /translation="MTAELVTFGETMIRLSPPAGERIETARSLEFRTAGAESNVAVAAS
FT                   RLGCSAAWLSKLPDSPLGRRVTTELRTHGVEPYVRWDDDARQGAYYIEQGRAPRPTNVI
FT                   YDRADAAVTTARPDELAVDIVEDAAAFYTSGITPALSETLRETTGELLQTATEAGTTTA
FT                   FDLNYRSKLWSPSDARDACESLFPKVDVLVAAERDIRTVLELDGDAPTLASELAGDFDF
FT                   ETVVVTRGEDGALARHGGTVYEQPVFETDTVDAIGTGDAFVGAFLSRLIADEPVETALA
FT                   YGAATAALKRTVHGDLAVVTPDEVERVLRGGDAGIDR"
FT   CDS             2364..4355
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /gene="bgaH"
FT                   /product="beta-D-galactosidase"
FT                   /db_xref="GOA:P94804"
FT                   /db_xref="HSSP:1KWG"
FT                   /db_xref="InterPro:IPR013781"
FT                   /db_xref="UniProtKB/TrEMBL:P94804"
FT                   /protein_id="AAB40123.2"
FT                   /translation="MTVGVCYFPEHWSRERWETDISQMAEAGIEYVRMGEFAWRRIEPE
FT                   RGTFDFAWLDEAVELIGKFGMKAVLCTPTATPPKWLVDEHPDVRQREQDGTPREWGSRR
FT                   FTCFNSPTYRSETERIVSVLTDRYADNPHVAGWQTDNEFGCHETVTCYCEDCGEAFSEW
FT                   LADRYESVADLNDAWGTTFWSQQYDDFESIDPPKPTPAANHPSRVLAYERFSNDSVAEY
FT                   NRLHAALIREANDEWFVTHNFMGGFSLDAFRLAADLDFLSWDSYPTGFVQDRQPDTPTV
FT                   DELRAGNPDQVSMNHDLQRGAKGKPFWVMEQQPGDINWPPQSPQPADGAMRLWAHHAVA
FT                   HGADAVVYFRWRRCRQGQEQYHAGLRRQDGSPDRGYREASTAADELFDLDSVDASVALV
FT                   HDYESLWATRSQPLSPDWDYWNHLRTYYDALRARGVQVDIVSPEATLERYDAVVAPTLY
FT                   LVGDELSTALTDYVDSGGCLLLGARTGEKDPYNRLHESLAPGPLTALTGAQVARHETLP
FT                   DHVETRLSYDGATYEFRTWASWLAPEVGVPRGEYRTGEAAGNTAIVRNAAGDGSVTYCG
FT                   CWPGDDLADALVTELLDAAGVEYTERFPDGVRVMERDGYTWALNFTSDPVTLTVPDSTG
FT                   FLLGESTVDAFDTAVLDGSIRGVGLASE"
FT   CDS             4827..5300
FT                   /codon_start=1
FT                   /transl_table=11
FT                   /product="unknown"
FT                   /db_xref="InterPro:IPR003961"
FT                   /db_xref="UniProtKB/TrEMBL:P94805"
FT                   /protein_id="AAB40124.1"
FT                   /translation="MDRWTLYCLHEGPFCGVDGRKHDRYRWEHERVLSIPLERPRENGT
FT                   AFALQEYSIVGPNDPSAPSLSVSDESSSSVELSWTEARGAESYTLHRRPPDGDFEVVAD
FT                   EIAENSFYTQYTDAGVSSGSDYEYRLTAVNSVGKTDSTIVRVTTGGSSNAAMD"
FT   misc_feature    5001..5273
FT                   /note="encodes fibronectin type III motif (Fn3)"
XX
SQ   Sequence 5385 BP; 973 A; 1714 C; 1751 G; 947 T; 0 other;
     ggtacccgac gagcacagct cgttcatggt ggagttcgac gacggaacgt acttggccgc        60
     gaccgtcagt caaaacgccc aagcgacgac gagcctccga atcgtcggga ccaacggcga       120
     gattctcgtc gagcctgcct tccacatgga aaccgaaatc cgagtgacgc gcgacgacgt       180
     ttcggtcacg ctcgacacgc cgcaggtcaa tcagatgacc gagttgttcg actacgccgc       240
     agaccgcatc ttgaccgacg cgccgatcgg tcccgacggc gaacacggcg tcctcgatat       300
     gcgactcatc gaggccatct acgacgcagg cgagtccggc cgtgtggtca cgctcgactg       360
     agccgtcggc tcgacgggtc gagatggtat cgagggacgg ggcgaccacc gcgtttaaac       420
     cgttcgtccg agaggagcta gtatgagcca ccacccagca cgccgcattc gggagaccgg       480
     cctcatcgcc atcatccgtg gaaccgacgc cgacacggcg atcgaaaccg tcgaagcgct       540
     cacgcgcggc ggcgtctcga ccgtcgagat aaccgcgaac acggacggcg tcctcggcat       600
     gctccgagac gtgtccgcgt cgttcacggc ggacgaggtc accatcggcg cgggaaccgt       660
     cctcgacggc gagacggctc gcgcggcgct gctcgcaggt gcggagtact tagtcacgcc       720
     gacgttcgac gagggggtca tccggaccgg taacaggtac ggtgcccctt cgatggtcgg       780
     catcgcgtca ccgacggagg cggtcaacgc ctacgaggcc ggtgccgaga tggtgaaggt       840
     gttcccggcc ggaacgctcg gccccgagtt tgtctccgcg ctaggcggtc cgctcggaca       900
     catcccgacg gttccgacgg gtggcgtggc cctcgatacc gtcgacgagt tcttcgacgc       960
     cggagcgact gccgtcggtg tcggcagcgc aatcgtagac aacgacgccg tcttcgcgcg      1020
     gcgatttcgc gaccatcgag accaacgcgc gttccttcgt cgaagcggtc gaacgcgcac      1080
     gccgcgactg acgcacaccc accgctcgga agttcggcgc tcccgtctca gcgcggtccg      1140
     cgcggcgaac cgaaacgttc agcgaaagtg gtcgttatcg gtcgattccg gcgtcgcctc      1200
     cgcggaggac gcgttcgact tcgtcaggcg tcacgaccgc gaggtcgccg tgaactgtcc      1260
     gtttgagcgc ggcggtcgcc gcgccgtagg cgagcgctgt ttcgactggt tcgtccgcga      1320
     tgagccgcga caggaacgcg ccgacaaagg cgtcgccggt cccgatggca tcgaccgtgt      1380
     ccgtctcgaa gaccggttgc tcgtacaccg ttccgccgtg tcgggcgagt gcaccgtctt      1440
     ccccgcgagt gaccacgacc gtctcgaaat cgaaatcgcc ggctaactcc gatgcgagtg      1500
     tcggtgcgtc gccgtcgagt tccagaacag tgcggatgtc tcgttctgcg gcgacgagga      1560
     cgtcgacctt tggaaagagc gattcgcacg cgtcgcgtgc gtcggacggc gaccagagct      1620
     tcgaccggta gttgaggtcg aacgcggtcg tcgtccccgc ttcggtcgcc gtttgcagaa      1680
     gctcgccggt cgtctcccga agcgtctcgg agagcgccgg cgtgatgccg ctggtgtaga      1740
     acgccgccgc gtcttccacg atgtcgacag cgagttcgtc cggtcgcgcg gtcgtgactg      1800
     ccgcatcggc gcggtcgtat atcacgttcg tcggacgcgg cgcgcgcccc tgttcgatgt      1860
     agtacgcgcc ttggcgagcg tcgtcgtccc agcggacgta cggctcgacg ccgtgtgttc      1920
     ggagttccgt ggtaacgcgt cgaccgagcg gcgagtctgg aagcttcgag agccacgcgg      1980
     cggagcagcc gagacgggag gcggcgaccg cgacgttgct ttccgcaccg gcggtccgga      2040
     attcgaggga ccgggcggtc tcgattcgct cgcccgccgg cgggctcagc cgaatcatcg      2100
     tctcgccgaa cgtgacgagt tcagccgtca tcgacgcccc tcggcggctt gcgaccgggt      2160
     ctcgcgttcg atagcggagg cgttcacgcc gacagttaag ccgttcatgt gttcgcacga      2220
     ctgtctctca cggtggtaca taactgcggg cggtggggcc gaaatttgtg acggcacccc      2280
     cgtacgggag gacccgagta gtggatatca atcggtgctc agacaccgga aagaactata      2340
     tctcaccacg ttgatcattg tgtatgacag ttggtgtctg ctatttcccg gagcactggt      2400
     cgcgagagcg ctgggagacc gatatcagtc agatggccga ggctggaatc gaatacgttc      2460
     gaatggggga gttcgcgtgg cgacgaatcg aaccggagcg agggacgttc gatttcgcgt      2520
     ggttagacga ggccgtcgaa ctcatcggga agttcggtat gaaagcggtt ctgtgcacgc      2580
     cgaccgcgac gccgccgaaa tggctcgtcg acgaacatcc cgacgttcga cagcgagagc      2640
     aagacggtac gccgcgtgag tgggggagcc gtcggttcac ctgtttcaac tcacccacct      2700
     accgttccga gaccgaacgc atcgttagcg tgctgaccga ccgatacgcc gacaaccccc      2760
     acgtcgccgg gtggcagact gacaacgaat tcggctgtca cgagacggtt acctgctact      2820
     gcgaggactg tggcgaggca tttagcgaat ggctcgccga ccgctatgag agcgttgccg      2880
     acctcaacga cgcgtgggga acgacgtttt ggagccagca gtacgacgat ttcgagagca      2940
     tcgacccccc aaaaccgaca ccggccgcca accacccttc gcgggtactc gcctacgaac      3000
     ggtttagtaa cgacagcgtg gccgagtaca accgcctgca cgcagccctc atccgcgaag      3060
     caaacgacga gtggttcgtc acgcacaact tcatgggtgg tttttcactc gacgcctttc      3120
     gcctcgccgc cgacctcgat ttcctctcgt gggactccta tccgacgggg ttcgtgcagg      3180
     accgacagcc ggacacgccg acggtcgacg aattacgagc ggggaacccc gaccaagtga      3240
     gcatgaacca cgacctccag cgcggcgcga aaggaaagcc gttctgggtg atggagcaac      3300
     agccgggaga catcaactgg ccgccgcagt cgccgcaacc ggccgacggg gcgatgcgcc      3360
     tgtgggctca ccacgccgtc gcccacggcg cggacgccgt cgtctacttc cgctggcgtc      3420
     gctgtcgtca agggcaagaa cagtaccacg ccgggcttcg gcggcaggac ggctctccgg      3480
     accgcgggta ccgcgaagca tcgaccgccg ccgacgaact gttcgacttg gattcggtcg      3540
     atgcgtcagt cgcgctcgtc cacgactacg agagcctgtg ggcgacgcgt tcgcaacctc      3600
     tctcgcccga ctgggactac tggaaccact tacggacgta ctacgatgcg cttcgcgccc      3660
     gcggcgtgca ggtcgacatc gtctcgccgg aagcgactct cgaacggtac gacgcggtcg      3720
     tcgcaccgac tttgtatctc gtcggcgacg aactgtcgac cgcgctgacc gactacgtcg      3780
     attcgggtgg ctgtctcctc ctcggtgctc gaacggggga gaaagacccg tacaaccggc      3840
     ttcacgagtc gctcgcgccg ggaccactca ccgctctcac cggcgcgcag gtggcgcgtc      3900
     acgaaacgct tcccgaccac gtcgagacgc gactctccta cgacggtgcg acgtacgagt      3960
     tccgaacgtg ggcttcgtgg ctggctcccg aagtcggagt tccacgaggc gagtatcgga      4020
     cgggtgaagc agccggaaac accgcaatcg tgcggaacgc cgccggagac gggagcgtga      4080
     cgtactgtgg ctgctggccg ggggacgacc ttgccgacgc actcgtgaca gagttactgg      4140
     acgccgccgg cgtcgagtac actgagcgat tcccggacgg cgtgcgcgtg atggagcgcg      4200
     acggctatac gtgggcgctt aacttcacga gcgacccggt gacgttgacc gtccccgatt      4260
     ccaccgggtt cctgctcggt gagtccaccg tcgacgcgtt cgataccgcg gtactcgacg      4320
     ggtccatccg aggtgtcgga ctcgcgtccg agtgagtcgg cacgggacca ctcgccaacg      4380
     cgactctcgc ccggcgtgtt ctcgaacgcc gaagagaata acggccggac cgtgccccga      4440
     tgaggcaccg accggacgca ggttcgccgt cacggcgcgt tgaacgtcca cttctggttg      4500
     tcgccgccga cttcggtgta ctggggtgtt gattcctcgc gtgtacacac gcggaagcgc      4560
     gtcacaactg cgtctgtggg gcgttgaggg tgtccgtctt cgtcttgggg ccgtcccgga      4620
     ctgcgcgtcg aagaggtagg ccgcctcgtc tcggaagcta attccgacgc ggtcgccggt      4680
     ctcggggcga atcgacgcgt cggttcgggc gacaatctct tcttgctaaa ccacggtgat      4740
     ttgctgttcg acgacaacga cgcgctccac gtgtacaccg tcgtcgacgg gcaactctac      4800
     gccggggttg cgacagcatc ctcggggtgg accgatggac gctctactgt ctccacgagg      4860
     gcccgttctg tggagtggac ggtcgaaaac acgatagata tcggtgggaa cacgagaggg      4920
     tcctgtcgat accactcgaa cggccccgcg agaacggcac ggcgttcgca ctccaggagt      4980
     actcgattgt gggaccgaac gacccgtctg caccctcact gtcggtgtct gacgagtcgt      5040
     cgtcgagtgt cgaactgtcg tggaccgagg cgcgaggcgc ggagagttac acgctccatc      5100
     ggcggccacc cgacggtgat ttcgaggtcg tcgcggacga aatcgcggaa aattcgtttt      5160
     acacccagta caccgacgcc ggcgtgagtt cgggcagtga ctacgagtac agactcacgg      5220
     ccgtcaatag cgtcggcaag accgactcga ccatcgttcg ggtcacgact ggtggttcgt      5280
     cgaacgcggc gatggactag tcatgtagat tgttcactgt gggcaaacac ttcggtgtcg      5340
     ataccgccag cgggtttcaa acggctcggt cgtgtgaccg cgatc                      5385
//


  
spacer
spacer