![]() |
EBI DbfetchID U70664; SV 3; linear; genomic DNA; STD; PRO; 5385 BP. XX AC U70664; XX DT 12-JAN-1997 (Rel. 50, Created) DT 18-APR-2005 (Rel. 83, Last updated, Version 5) XX DE Haloferax alicantei glucose-fructose oxido-reductase (gfo) gene, partial DE cds; 2-dehydro-3-deoxyphosphogluconate aldolase, 2-keto-3-deoxygluconate DE kinase, and beta-D-galactosidase (bgaH) genes, complete cds; and unknown DE gene. XX KW . XX OS Haloferax lucentense OC Archaea; Euryarchaeota; Halobacteria; Halobacteriales; Halobacteriaceae; OC Haloferax. XX RN [1] RP 2364-4355 RX DOI; 10.1016/S0167-4838(96)00174-4 RX PUBMED; 9048905. RA Holmes M.L., Scopes R.K., Moritz R.L., Simpson R.J., Englert C., RA Pfeifer F., Dyall-Smith M.L.; RT "Purification and analysis of an extremely halophilic beta-galactosidase RT from Haloferax alicantei"; RL Biochim. Biophys. Acta 1337(2):276-286(1997). XX RN [2] RP 1-5385 RX DOI; 10.1046/j.1365-2958.2000.01832.x RX PUBMED; 10760168. RA Holmes M.L., Dyall-Smith M.L.; RT "Sequence and expression of a halobacterial beta-galactosidase gene"; RL Mol. Microbiol. 36(1):114-122(2000). XX RN [3] RP 1-5385 RA Holmes M.L., Dyall-Smith M.L.; RT ; RL Submitted (12-SEP-1996) to the EMBL/GenBank/DDBJ databases. RL Microbiology, University of Melbourne, Grattan, Melbourne, Victoria 3052, RL Australia XX RN [4] RC Sequence update by submitter RP 1-5385 RA Holmes M.L., Dyall-Smith M.L.; RT ; RL Submitted (11-MAY-1999) to the EMBL/GenBank/DDBJ databases. RL Microbiology, University of Melbourne, Grattan, Melbourne, Victoria 3052, RL Australia XX RN [5] RC Sequence update by submitter RP 1-5385 RA Holmes M.L., Dyall-Smith M.L.; RT ; RL Submitted (31-AUG-2001) to the EMBL/GenBank/DDBJ databases. RL Microbiology, University of Melbourne, Grattan, Melbourne, Victoria 3052, RL Australia XX CC On Aug 31, 2001 this sequence version replaced gi:4773896. XX FH Key Location/Qualifiers FH FT source 1..5385 FT /organism="Haloferax lucentense" FT /strain="SB1" FT /mol_type="genomic DNA" FT /note="from a mutant with increased beta-galactosidase FT activity" FT /db_xref="taxon:2254" FT CDS <1..361 FT /codon_start=2 FT /transl_table=11 FT /gene="gfo" FT /product="glucose-fructose oxido-reductase" FT /db_xref="HSSP:1EVJ" FT /db_xref="UniProtKB/TrEMBL:P94801" FT /protein_id="AAB40120.1" FT /translation="VPDEHSSFMVEFDDGTYLAATVSQNAQATTSLRIVGTNGEILVEP FT AFHMETEIRVTRDDVSVTLDTPQVNQMTELFDYAADRILTDAPIGPDGEHGVLDMRLIE FT AIYDAGESGRVVTLD" FT CDS 443..1486 FT /codon_start=1 FT /transl_table=11 FT /product="2-dehydro-3-deoxyphosphogluconate aldolase" FT /note="KDPG aldolase" FT /db_xref="GOA:P94802" FT /db_xref="InterPro:IPR000887" FT /db_xref="UniProtKB/TrEMBL:P94802" FT /protein_id="AAB40121.3" FT /translation="MSHHPARRIRETGLIAIIRGTDADTAIETVEALTRGGVSTVEITA FT NTDGVLGMLRDVSASFTADEVTIGAGTVLDGETARAALLAGAEYLVTPTFDEGVIRTGN FT RYGAPSMVGIASPTEAVNAYEAGAEMVKVFPAGTLGPEFVSALGGPLGHIPTVPTGGVA FT LDTVDEFFDAGATAVGVGSAIVDNDAVFARRFRDHRDQRAFLRRSGRTRTPRLTHTHRS FT EVRRSRLSAVRAANRNVQRKWSLSVDSGVASAEDAFDFVRRHDREVAVNCPFERGGRRA FT VGERCFDWFVRDEPRQERADKGVAGPDGIDRVRLEDRLLVHRSAVSGECTVFPASDHDR FT LEIEIAG" FT CDS complement(1175..2131) FT /codon_start=1 FT /transl_table=11 FT /product="2-keto-3-deoxygluconate kinase" FT /note="similar to fructokinases" FT /db_xref="GOA:P94803" FT /db_xref="HSSP:1J5V" FT /db_xref="InterPro:IPR011611" FT /db_xref="UniProtKB/TrEMBL:P94803" FT /protein_id="AAB40122.1" FT /translation="MTAELVTFGETMIRLSPPAGERIETARSLEFRTAGAESNVAVAAS FT RLGCSAAWLSKLPDSPLGRRVTTELRTHGVEPYVRWDDDARQGAYYIEQGRAPRPTNVI FT YDRADAAVTTARPDELAVDIVEDAAAFYTSGITPALSETLRETTGELLQTATEAGTTTA FT FDLNYRSKLWSPSDARDACESLFPKVDVLVAAERDIRTVLELDGDAPTLASELAGDFDF FT ETVVVTRGEDGALARHGGTVYEQPVFETDTVDAIGTGDAFVGAFLSRLIADEPVETALA FT YGAATAALKRTVHGDLAVVTPDEVERVLRGGDAGIDR" FT CDS 2364..4355 FT /codon_start=1 FT /transl_table=11 FT /gene="bgaH" FT /product="beta-D-galactosidase" FT /db_xref="GOA:P94804" FT /db_xref="HSSP:1KWG" FT /db_xref="InterPro:IPR013781" FT /db_xref="UniProtKB/TrEMBL:P94804" FT /protein_id="AAB40123.2" FT /translation="MTVGVCYFPEHWSRERWETDISQMAEAGIEYVRMGEFAWRRIEPE FT RGTFDFAWLDEAVELIGKFGMKAVLCTPTATPPKWLVDEHPDVRQREQDGTPREWGSRR FT FTCFNSPTYRSETERIVSVLTDRYADNPHVAGWQTDNEFGCHETVTCYCEDCGEAFSEW FT LADRYESVADLNDAWGTTFWSQQYDDFESIDPPKPTPAANHPSRVLAYERFSNDSVAEY FT NRLHAALIREANDEWFVTHNFMGGFSLDAFRLAADLDFLSWDSYPTGFVQDRQPDTPTV FT DELRAGNPDQVSMNHDLQRGAKGKPFWVMEQQPGDINWPPQSPQPADGAMRLWAHHAVA FT HGADAVVYFRWRRCRQGQEQYHAGLRRQDGSPDRGYREASTAADELFDLDSVDASVALV FT HDYESLWATRSQPLSPDWDYWNHLRTYYDALRARGVQVDIVSPEATLERYDAVVAPTLY FT LVGDELSTALTDYVDSGGCLLLGARTGEKDPYNRLHESLAPGPLTALTGAQVARHETLP FT DHVETRLSYDGATYEFRTWASWLAPEVGVPRGEYRTGEAAGNTAIVRNAAGDGSVTYCG FT CWPGDDLADALVTELLDAAGVEYTERFPDGVRVMERDGYTWALNFTSDPVTLTVPDSTG FT FLLGESTVDAFDTAVLDGSIRGVGLASE" FT CDS 4827..5300 FT /codon_start=1 FT /transl_table=11 FT /product="unknown" FT /db_xref="InterPro:IPR003961" FT /db_xref="UniProtKB/TrEMBL:P94805" FT /protein_id="AAB40124.1" FT /translation="MDRWTLYCLHEGPFCGVDGRKHDRYRWEHERVLSIPLERPRENGT FT AFALQEYSIVGPNDPSAPSLSVSDESSSSVELSWTEARGAESYTLHRRPPDGDFEVVAD FT EIAENSFYTQYTDAGVSSGSDYEYRLTAVNSVGKTDSTIVRVTTGGSSNAAMD" FT misc_feature 5001..5273 FT /note="encodes fibronectin type III motif (Fn3)" XX SQ Sequence 5385 BP; 973 A; 1714 C; 1751 G; 947 T; 0 other; ggtacccgac gagcacagct cgttcatggt ggagttcgac gacggaacgt acttggccgc 60 gaccgtcagt caaaacgccc aagcgacgac gagcctccga atcgtcggga ccaacggcga 120 gattctcgtc gagcctgcct tccacatgga aaccgaaatc cgagtgacgc gcgacgacgt 180 ttcggtcacg ctcgacacgc cgcaggtcaa tcagatgacc gagttgttcg actacgccgc 240 agaccgcatc ttgaccgacg cgccgatcgg tcccgacggc gaacacggcg tcctcgatat 300 gcgactcatc gaggccatct acgacgcagg cgagtccggc cgtgtggtca cgctcgactg 360 agccgtcggc tcgacgggtc gagatggtat cgagggacgg ggcgaccacc gcgtttaaac 420 cgttcgtccg agaggagcta gtatgagcca ccacccagca cgccgcattc gggagaccgg 480 cctcatcgcc atcatccgtg gaaccgacgc cgacacggcg atcgaaaccg tcgaagcgct 540 cacgcgcggc ggcgtctcga ccgtcgagat aaccgcgaac acggacggcg tcctcggcat 600 gctccgagac gtgtccgcgt cgttcacggc ggacgaggtc accatcggcg cgggaaccgt 660 cctcgacggc gagacggctc gcgcggcgct gctcgcaggt gcggagtact tagtcacgcc 720 gacgttcgac gagggggtca tccggaccgg taacaggtac ggtgcccctt cgatggtcgg 780 catcgcgtca ccgacggagg cggtcaacgc ctacgaggcc ggtgccgaga tggtgaaggt 840 gttcccggcc ggaacgctcg gccccgagtt tgtctccgcg ctaggcggtc cgctcggaca 900 catcccgacg gttccgacgg gtggcgtggc cctcgatacc gtcgacgagt tcttcgacgc 960 cggagcgact gccgtcggtg tcggcagcgc aatcgtagac aacgacgccg tcttcgcgcg 1020 gcgatttcgc gaccatcgag accaacgcgc gttccttcgt cgaagcggtc gaacgcgcac 1080 gccgcgactg acgcacaccc accgctcgga agttcggcgc tcccgtctca gcgcggtccg 1140 cgcggcgaac cgaaacgttc agcgaaagtg gtcgttatcg gtcgattccg gcgtcgcctc 1200 cgcggaggac gcgttcgact tcgtcaggcg tcacgaccgc gaggtcgccg tgaactgtcc 1260 gtttgagcgc ggcggtcgcc gcgccgtagg cgagcgctgt ttcgactggt tcgtccgcga 1320 tgagccgcga caggaacgcg ccgacaaagg cgtcgccggt cccgatggca tcgaccgtgt 1380 ccgtctcgaa gaccggttgc tcgtacaccg ttccgccgtg tcgggcgagt gcaccgtctt 1440 ccccgcgagt gaccacgacc gtctcgaaat cgaaatcgcc ggctaactcc gatgcgagtg 1500 tcggtgcgtc gccgtcgagt tccagaacag tgcggatgtc tcgttctgcg gcgacgagga 1560 cgtcgacctt tggaaagagc gattcgcacg cgtcgcgtgc gtcggacggc gaccagagct 1620 tcgaccggta gttgaggtcg aacgcggtcg tcgtccccgc ttcggtcgcc gtttgcagaa 1680 gctcgccggt cgtctcccga agcgtctcgg agagcgccgg cgtgatgccg ctggtgtaga 1740 acgccgccgc gtcttccacg atgtcgacag cgagttcgtc cggtcgcgcg gtcgtgactg 1800 ccgcatcggc gcggtcgtat atcacgttcg tcggacgcgg cgcgcgcccc tgttcgatgt 1860 agtacgcgcc ttggcgagcg tcgtcgtccc agcggacgta cggctcgacg ccgtgtgttc 1920 ggagttccgt ggtaacgcgt cgaccgagcg gcgagtctgg aagcttcgag agccacgcgg 1980 cggagcagcc gagacgggag gcggcgaccg cgacgttgct ttccgcaccg gcggtccgga 2040 attcgaggga ccgggcggtc tcgattcgct cgcccgccgg cgggctcagc cgaatcatcg 2100 tctcgccgaa cgtgacgagt tcagccgtca tcgacgcccc tcggcggctt gcgaccgggt 2160 ctcgcgttcg atagcggagg cgttcacgcc gacagttaag ccgttcatgt gttcgcacga 2220 ctgtctctca cggtggtaca taactgcggg cggtggggcc gaaatttgtg acggcacccc 2280 cgtacgggag gacccgagta gtggatatca atcggtgctc agacaccgga aagaactata 2340 tctcaccacg ttgatcattg tgtatgacag ttggtgtctg ctatttcccg gagcactggt 2400 cgcgagagcg ctgggagacc gatatcagtc agatggccga ggctggaatc gaatacgttc 2460 gaatggggga gttcgcgtgg cgacgaatcg aaccggagcg agggacgttc gatttcgcgt 2520 ggttagacga ggccgtcgaa ctcatcggga agttcggtat gaaagcggtt ctgtgcacgc 2580 cgaccgcgac gccgccgaaa tggctcgtcg acgaacatcc cgacgttcga cagcgagagc 2640 aagacggtac gccgcgtgag tgggggagcc gtcggttcac ctgtttcaac tcacccacct 2700 accgttccga gaccgaacgc atcgttagcg tgctgaccga ccgatacgcc gacaaccccc 2760 acgtcgccgg gtggcagact gacaacgaat tcggctgtca cgagacggtt acctgctact 2820 gcgaggactg tggcgaggca tttagcgaat ggctcgccga ccgctatgag agcgttgccg 2880 acctcaacga cgcgtgggga acgacgtttt ggagccagca gtacgacgat ttcgagagca 2940 tcgacccccc aaaaccgaca ccggccgcca accacccttc gcgggtactc gcctacgaac 3000 ggtttagtaa cgacagcgtg gccgagtaca accgcctgca cgcagccctc atccgcgaag 3060 caaacgacga gtggttcgtc acgcacaact tcatgggtgg tttttcactc gacgcctttc 3120 gcctcgccgc cgacctcgat ttcctctcgt gggactccta tccgacgggg ttcgtgcagg 3180 accgacagcc ggacacgccg acggtcgacg aattacgagc ggggaacccc gaccaagtga 3240 gcatgaacca cgacctccag cgcggcgcga aaggaaagcc gttctgggtg atggagcaac 3300 agccgggaga catcaactgg ccgccgcagt cgccgcaacc ggccgacggg gcgatgcgcc 3360 tgtgggctca ccacgccgtc gcccacggcg cggacgccgt cgtctacttc cgctggcgtc 3420 gctgtcgtca agggcaagaa cagtaccacg ccgggcttcg gcggcaggac ggctctccgg 3480 accgcgggta ccgcgaagca tcgaccgccg ccgacgaact gttcgacttg gattcggtcg 3540 atgcgtcagt cgcgctcgtc cacgactacg agagcctgtg ggcgacgcgt tcgcaacctc 3600 tctcgcccga ctgggactac tggaaccact tacggacgta ctacgatgcg cttcgcgccc 3660 gcggcgtgca ggtcgacatc gtctcgccgg aagcgactct cgaacggtac gacgcggtcg 3720 tcgcaccgac tttgtatctc gtcggcgacg aactgtcgac cgcgctgacc gactacgtcg 3780 attcgggtgg ctgtctcctc ctcggtgctc gaacggggga gaaagacccg tacaaccggc 3840 ttcacgagtc gctcgcgccg ggaccactca ccgctctcac cggcgcgcag gtggcgcgtc 3900 acgaaacgct tcccgaccac gtcgagacgc gactctccta cgacggtgcg acgtacgagt 3960 tccgaacgtg ggcttcgtgg ctggctcccg aagtcggagt tccacgaggc gagtatcgga 4020 cgggtgaagc agccggaaac accgcaatcg tgcggaacgc cgccggagac gggagcgtga 4080 cgtactgtgg ctgctggccg ggggacgacc ttgccgacgc actcgtgaca gagttactgg 4140 acgccgccgg cgtcgagtac actgagcgat tcccggacgg cgtgcgcgtg atggagcgcg 4200 acggctatac gtgggcgctt aacttcacga gcgacccggt gacgttgacc gtccccgatt 4260 ccaccgggtt cctgctcggt gagtccaccg tcgacgcgtt cgataccgcg gtactcgacg 4320 ggtccatccg aggtgtcgga ctcgcgtccg agtgagtcgg cacgggacca ctcgccaacg 4380 cgactctcgc ccggcgtgtt ctcgaacgcc gaagagaata acggccggac cgtgccccga 4440 tgaggcaccg accggacgca ggttcgccgt cacggcgcgt tgaacgtcca cttctggttg 4500 tcgccgccga cttcggtgta ctggggtgtt gattcctcgc gtgtacacac gcggaagcgc 4560 gtcacaactg cgtctgtggg gcgttgaggg tgtccgtctt cgtcttgggg ccgtcccgga 4620 ctgcgcgtcg aagaggtagg ccgcctcgtc tcggaagcta attccgacgc ggtcgccggt 4680 ctcggggcga atcgacgcgt cggttcgggc gacaatctct tcttgctaaa ccacggtgat 4740 ttgctgttcg acgacaacga cgcgctccac gtgtacaccg tcgtcgacgg gcaactctac 4800 gccggggttg cgacagcatc ctcggggtgg accgatggac gctctactgt ctccacgagg 4860 gcccgttctg tggagtggac ggtcgaaaac acgatagata tcggtgggaa cacgagaggg 4920 tcctgtcgat accactcgaa cggccccgcg agaacggcac ggcgttcgca ctccaggagt 4980 actcgattgt gggaccgaac gacccgtctg caccctcact gtcggtgtct gacgagtcgt 5040 cgtcgagtgt cgaactgtcg tggaccgagg cgcgaggcgc ggagagttac acgctccatc 5100 ggcggccacc cgacggtgat ttcgaggtcg tcgcggacga aatcgcggaa aattcgtttt 5160 acacccagta caccgacgcc ggcgtgagtt cgggcagtga ctacgagtac agactcacgg 5220 ccgtcaatag cgtcggcaag accgactcga ccatcgttcg ggtcacgact ggtggttcgt 5280 cgaacgcggc gatggactag tcatgtagat tgttcactgt gggcaaacac ttcggtgtcg 5340 ataccgccag cgggtttcaa acggctcggt cgtgtgaccg cgatc 5385 // ![]() |