![]() |
EBI DbfetchID M11047; SV 1; linear; genomic DNA; STD; PRO; 4790 BP. XX AC M11047; M11045-M11046; XX DT 02-JUL-1986 (Rel. 09, Created) DT 17-APR-2005 (Rel. 83, Last updated, Version 5) XX DE S.typhimurium araBAD operon: araB, araA, and araD genes coding for DE ribulokinase, L-arabinose isomerase, and L-ribulose-5-phosphate DE 4-epimerase. XX KW araA gene; araB gene; araBAD operon; araD gene; epimerase; isomerase; KW L-arabinose isomerase; L-ribulose-5-phosphate 4-epimerase; ribulokinase. XX OS Salmonella enterica subsp. enterica serovar Typhimurium OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales; OC Enterobacteriaceae; Salmonella. XX RN [1] RP 1-1829 RX DOI; 10.1016/0378-1119(85)90301-4 RX PUBMED; 2989100. RA Lin H.-C., Lei S.-P., Wilcox G.; RT "The araBAD operon of Salmonella typhimurium LT2. I. Nucleotide sequence of RT araB and primary structure of its product, ribulokinase"; RL Gene 34(1):111-122(1985). XX RN [2] RP 1749-3342 RX DOI; 10.1016/0378-1119(85)90302-6 RX PUBMED; 3891513. RA Lin H.-C., Lei S.-P., Wilcox G.; RT "The araBAD operon of Salmonella typhimurium LT2. II. Nucleotide sequence RT of araA and primary structure of its product, L-arabinose isomerase"; RL Gene 34(1):123-128(1985). XX RN [3] RP 3271-4790 RX DOI; 10.1016/0378-1119(85)90303-8 RX PUBMED; 3891514. RA Lin H.-C., Lei S.-P., Wilcox G.; RT "The araBAD operon of Salmonella typhimurium LT2. III. Nucleotide sequence RT of araD and its flanking regions, and primary structure of its product, RT L-ribulose-5-phosphate 4-epimerase"; RL Gene 34(1):129-134(1985). XX CC The sequence preceding araB coding region is part of the CC controlling region between the araC gene and araBAD operon. A CC potential ribosome binding site for the araB gene is located at CC positions 109-112. A 10-bp intercistronic region is located between CC the araB and araA genes. A potential ribosome binding site, CC 'taagga', is located 7 bp distal from the start codon of araA. The CC site overlaps the stop codon of araB . CC A 143-bp intercistronic region exists between the araA and araD CC genes. The presumed ribosome binding site for araD is located at CC positions 3473-3475. This region contains several short CC complementary repeated sequences which can form stable stem-loop CC secondary structures. There is also a stem-loop structure 80 bp CC beyond the stop codon of araD which is followed by an A+T-rich CC sequence. XX FH Key Location/Qualifiers FH FT source 1..4790 FT /organism="Salmonella enterica subsp. enterica serovar FT Typhimurium" FT /mol_type="genomic DNA" FT /db_xref="taxon:90371" FT mRNA 93..>4229 FT /note="araBAD operon mRNA" FT CDS 120..1829 FT /codon_start=1 FT /transl_table=11 FT /gene="araB" FT /product="ribulokinase" FT /EC_number="2.7.1.16" FT /db_xref="GOA:P06188" FT /db_xref="InterPro:IPR000577" FT /db_xref="InterPro:IPR005929" FT /db_xref="InterPro:IPR018484" FT /db_xref="InterPro:IPR018485" FT /db_xref="UniProtKB/Swiss-Prot:P06188" FT /protein_id="AAA27023.1" FT /translation="MAIAIGLDFGSDSVRALAVDCATGDEIATSVEWYPRWQEGRYCDG FT PNNQFRHHPRDYMESMEAALKAVLAQLSAAQRANVVGIGVDSTGSTPAPIDADGNVLAL FT RPEFAENPNAMFVLWKDHTAVEEADEITRLCHKPGKVDYSRYIGGIYSSEWFWAKILHV FT TRQDSAVAQAAVSWIELCDWVPALLSGTTRPQDIRRGRCSAGHKTLWHESWGGLPPASF FT FDELDPCINRHLRYPLFSETFTADLPVGTLCAEWAQRLDLPESVVISGGAFDCHMGAVG FT AGAQPNTLVKVIGTSTCDILIADKQSVGDRAVKGICGQVDGSVVPNFIGLEAGQSAFGD FT IYAWFSRVLSWPLEQLAAQHPELKPQINASQKQLLPALTDAWAKNPSLDHLPVVLDWFN FT GRRTPNANQRLKGVITDLNLATDAPALFGGLVASTAFGARAIQECFTDQGIAVNNVMAL FT GGIARKNQVIMQVCCDVLNRPLQIVASDQCCALGAAIFAAVAAKVHADIPAAQQSMASA FT VERTLRPHPEQAQRFEQLYRRYQQWALSAEQHYLPTAAPAPTTPANQAILTH" FT CDS 1840..3342 FT /codon_start=1 FT /transl_table=11 FT /gene="araA" FT /product="L-arabinose isomerase" FT /EC_number="5.3.1.4" FT /db_xref="GOA:P06189" FT /db_xref="InterPro:IPR003762" FT /db_xref="UniProtKB/Swiss-Prot:P06189" FT /protein_id="AAA27024.1" FT /translation="MTIFDNYEVWFVIGSQHLYGAETLRQVTQHAEHVVNALNTEAKLP FT CKLVLKPLGTSPDEITAICRDANYDDRCAGLVVWLHTFSPAKMWINGLSILNKPLLQFH FT TQFNAALPWDSIDMDFMNLNQTAHGGREFGFIGARMRQQHAVVTGHWQDKEAHTRIGAW FT MRQAVSKQDTRQLKVCRFGDNMREVAVTDGDKVAAQIKFGFSVNTWAVGDLVQVVNSIG FT DGDINALIDEYESSYTLTPATQIHGDKRQNVREAAGIELGMKRFLEQGGFHAFTTTFED FT LHGLKQLPGLAVQRLMQQGYGFAGEGDWKTAALLRIMKVMSTGLQGGTSFMEDYTYHFE FT KGNDLVLGSHMLEVCPSIAVEEKPILDVQHLGIGGKEDPARLIFNTQTGPAIVASLIDL FT GDRYRLLVNCIDTVKTPHSLPKLPVRNALWKAQPDLPTASEAWILAGGAHHTVFSHALD FT LNDMRQFAEIHDIEIAVIDNDTHLPAFKDALRWNEVYYGFKR" FT CDS 3483..4229 FT /codon_start=1 FT /transl_table=11 FT /gene="araD" FT /product="L-ribulose-5-phosphate 4-epimerase" FT /EC_number="5.1.3.4" FT /db_xref="GOA:P06190" FT /db_xref="InterPro:IPR001303" FT /db_xref="InterPro:IPR004661" FT /db_xref="UniProtKB/Swiss-Prot:P06190" FT /protein_id="AAA27025.1" FT /translation="MLEDLKRQVLEANLALPKHNLVTLTWGNVSAVDRERGVLVIKPSG FT VDYSVMTADDMVVVSLESGEVVEGHKKPSSDTPTHRLLYQAFPTIGGIVHTHSRHATIW FT AQAGQPIPATGTTHADYFYGTIPCTRKMTEAEINGEYEWETGNVIVETFEKQGIDAAQM FT PGVLVHSHGPFAWGKNAEDAVHNAIVLEEVAYMGIFCRHLRRSCPTCSNPCWINTIYAN FT TAQKPITGSNASKNASHGGRVDESGR" XX SQ Sequence 4790 BP; 1096 A; 1345 C; 1305 G; 1044 T; 0 other; cgtcacactt tgcaaagcat tagcattttt gtccataaga ttagcggatc ctgcctgacg 60 gtttttgccg cgactctcta ctgtttctcc atacctgttt ttctggatgg agtaagacga 120 tggcaattgc aattggcctc gattttggca gtgattcagt gcgcgctctg gcagtggact 180 gcgccaccgg cgacgagatc gccaccagcg tagagtggta tccgcgctgg caagaaggcc 240 gttattgcga cggcccgaac aaccagttcc gtcatcatcc gcgcgactac atggagtcaa 300 tggaggccgc gctgaaagcc gttctggcac aattaagcgc cgcgcaacgc gcaaatgtcg 360 ttggcattgg cgttgacagc accggctcta cgccagcgcc gattgacgcc gacggtaacg 420 tcctggcgct gcgtccagag ttcgccgaga acccgaatgc gatgtttgtg ctgtggaaag 480 atcacaccgc cgtggaagag gccgacgaaa tcactcgtct gtgccataag ccaggcaagg 540 tcgactactc ccgctatatt ggcggcattt actccagcga atggttctgg gcgaagattc 600 tgcacgtcac ccggcaggat agcgccgtcg cgcaggccgc cgtctcgtgg attgagctgt 660 gcgactgggt gccggcgctg ctttccggca ccactcgccc gcaggatatc cgccgtggcc 720 gctgcagcgc cgggcacaaa acgctgtggc atgaaagctg gggcggtctg ccgcccgcga 780 gcttctttga tgaactcgat ccgtgcatta accgtcatct gcgctacccg ttatttagcg 840 aaaccttcac cgccgatctg cccgtgggca ccctgtgcgc cgaatgggcg cagcgcctcg 900 acttgccgga aagcgtagtg atttccggcg gcgcgttcga ctgtcacatg ggcgcggtcg 960 gcgcgggcgc acagcccaat acgctggtga aagtcatcgg cacgtctacc tgcgacattc 1020 tgattgcgga taaacagagc gtcggggatc gcgccgtgaa aggcatttgc ggtcaggttg 1080 acggcagcgt ggtgccgaac tttatcggtc tggaagcggg gcaatctgct ttcggcgata 1140 tctacgcctg gtttagccgc gtgttgagct ggccgctgga gcaacttgcc gcgcagcacc 1200 cggaactgaa accccagatt aacgccagcc agaagcagct actgccagcg ctcaccgacg 1260 cctgggcgaa aaatccgtcc ctggatcacc tgccggtggt gctcgactgg tttaacggtc 1320 gccgcacgcc aaacgctaat cagcgtctga aaggcgtcat taccgatctc aatctcgcca 1380 ccgacgcgcc agcgctgttt ggcggtctgg tcgcttcgac cgccttcggc gcgcgcgcca 1440 ttcaggagtg ttttaccgat cagggtatcg cggtcaataa cgtgatggcg cttggcggca 1500 tcgcccgtaa aaatcaggtc attatgcagg tctgctgcga cgtactgaat cgtccgttgc 1560 agatcgtcgc ttccgaccag tgttgcgcat taggcgccgc tatctttgcc gccgtcgctg 1620 cgaaagtcca tgccgacatt ccagccgccc agcaaagcat ggcgagcgcg gtagaacgca 1680 ctctgcgccc ccaccctgaa caggcgcaac gcttcgaaca gctttaccgc cgctaccagc 1740 agtgggcgct aagcgcagaa caacattatc ttccgactgc cgcgccggcg ccaacgaccc 1800 cggccaatca ggcaatcctg actcattaag gacacgacaa tgacgatttt tgataattat 1860 gaagtatggt ttgtgattgg cagccagcat ttgtatggcg cagaaaccct gcgtcaggtc 1920 acccaacatg ccgagcatgt ggtcaacgcg ctgaataccg aagccaaact gccatgtaaa 1980 ctggtattaa aaccgctggg cacctcgccg gatgagatta ccgccatttg tcgtgacgcc 2040 aattatgacg atcgctgcgc agggctggtg gtctggctgc acaccttctc cccggccaaa 2100 atgtggatca acgggctgag tatccttaac aaaccactac tgcaattcca tacccaattt 2160 aacgccgccc tgccgtggga cagcattgat atggacttta tgaacctgaa ccagactgcg 2220 cacggcggtc gtgagttcgg ttttatcggc gcgcggatgc gccagcagca cgcggtcgtc 2280 accggtcact ggcaggataa agaggcccat acgcgtatcg gtgcctggat gcgccaggcg 2340 gtctctaaac aggatacccg ccagctaaaa gtctgccgct tcggcgacaa tatgcgtgaa 2400 gtcgcagtga ctgacggtga taaagtggcc gcgcaaatca aatttggctt ttcggtcaat 2460 acctgggcgg tcggcgatct ggtgcaggtg gtgaattcta tcggcgacgg cgatatcaac 2520 gctctgattg acgagtatga aagcagctat accctgacgc ccgccaccca aatccacggc 2580 gataaacgcc agaacgtgcg ggaggcggcg ggtattgaac tcggtatgaa gcgtttcctg 2640 gaacagggcg gcttccacgc attcactact acctttgaag atttacacgg tctgaaacag 2700 cttccgggtc tggccgtaca gcgtctgatg cagcaaggct acggctttgc gggcgaaggc 2760 gactggaaaa ccgccgctct gcttcgcatt atgaaagtga tgtcaaccgg tctgcagggc 2820 ggcacctcat ttatggagga ttacacctac cacttcgaga aaggcaacga tctggtgctc 2880 ggctcgcaca tgctggaagt gtgtccgtcc atcgcggtgg aagagaaacc gatcctcgac 2940 gtccagcacc tcggcattgg cggcaaggaa gatccggcgc gtttgatttt caatacccaa 3000 accggcccgg cgatcgtcgc cagcctgatc gacctcggcg atcgttatcg cctgctggtc 3060 aactgcattg acaccgtaaa aacgccgcac tccctgccga aactgccggt gcgtaacgcg 3120 ctgtggaagg cgcagccgga tctgccgacc gcctccgaag cgtggattct ggctggcggc 3180 gcgcaccata ccgtcttcag ccacgcgctg gatctgaacg atatgcgcca gtttgcagaa 3240 atacacgata tcgaaatcgc ggtgattgat aacgataccc atctgccggc ctttaaggac 3300 gcgctgcgct ggaacgaggt gtattacggg ttcaaacgtt aattggtgaa acggattgcc 3360 cggtggcact gcgtttaccg ggcctacggt cctgtaggcc gaataaggca tttatgtcgc 3420 catccggcac accgtcgctc gtaggccgga taagcgaagc gccatccggc agggagaaaa 3480 caatgttaga agatctcaaa cgccaggtac tggaagctaa tctggcgctg ccaaaacaca 3540 acctggtcac ccttacctgg ggtaacgtta gcgccgtcga tcgcgaacgc ggcgtactgg 3600 tgattaagcc gtccggcgtc gattatagcg tcatgaccgc tgacgatatg gtggtggtca 3660 gcctggagag cggtgaagtc gttgaaggtc ataagaaacc gtcgtccgat acgccaaccc 3720 accgtctgtt gtaccaggca tttccgacta tcggcggcat cgtacacacc cattcgcgcc 3780 acgcgactat ctgggcgcag gcgggtcagc caattccggc gacgggaacc acccatgccg 3840 actatttcta cggtacgatt ccctgcactc gcaaaatgac cgaggcggaa attaatggcg 3900 agtatgaatg ggaaacgggc aatgtcattg ttgaaacctt tgaaaaacaa ggcattgacg 3960 ccgctcaaat gcccggcgtg cttgtccatt cgcacggccc gtttgcctgg ggtaaaaatg 4020 ccgaggatgc agtgcataac gccatcgtgc tggaagaagt ggcctatatg gggatcttct 4080 gccgccactt gcgccgcagt tgcccgacat gcagcaatcc ctgctggata aacactatct 4140 acgcaaacac ggcgcaaaag cctattacgg gcagtaatgc ctctaaaaac gcgtcccatg 4200 gggggcgcgt tgatgaatct ggtcggtgat atattcagca aatgcgcttt gatagacgta 4260 atgatcagaa ctcacatatt caataatatt gtcataatgt ccctgccacg cttttccttc 4320 cagcgcatgg aagaaaatat aatcttcgat tgttgactgc cagcgttgcc catttaacag 4380 atagttaata atggtatccc gatgtccgtt ttttctgtcg tgtccttgcc agtgaaaaaa 4440 agcattgccg ttttcaataa tctcggtacg ccaaatctgt tctgtccatg ttttatactc 4500 aaaaaatcga ctcacggttt ttatggaagg gttagcgcgt tgagtattga cgaaaagata 4560 acggtcgttc cctaccagac gcgcctgcat actcacattc ataaaagatc attcccgaat 4620 accacaaatt ttgataaaaa cacccgcacc cgaaagtcaa aataaaatta tattctaaaa 4680 taaaaattaa attatgcaga gagttcccga cgaattcgca ctgtaatcca tttttattta 4740 accatagcgg ccaattggaa tattatattt ctacctgacg gtgcggatgt 4790 // ![]() |