![]() |
EBI DbfetchID D00399; SV 1; linear; genomic DNA; STD; PRO; 2115 BP. XX AC D00399; XX DT 11-APR-1990 (Rel. 23, Created) DT 01-OCT-2007 (Rel. 93, Last updated, Version 7) XX DE Clostridium thermocellum trp(G)D, trpE genes, complete and partial cds. XX KW . XX OS Clostridium thermocellum OC Bacteria; Firmicutes; Clostridia; Clostridiales; Clostridiaceae; OC Clostridium. XX RN [1] RP 1-2115 RX PUBMED; 2732211. RA Sato S., Nakada Y., Hon-nami K., Yasui K., Shiratsuchi A.; RT "Molecular cloning and the nucleotide sequence of the Clostridium RT thermocellum trpE gene"; RL J. Biochem. 105(3):362-366(1989). XX CC Submitted in computer readable form by Sato,S. on 05-Apr-1989. CC The putative initiation codon of trp(G)D gene overlaps with the CC termination codon of trpE. The predicted amino acid sequences of CC ASIs were compared between thermophilic and mesophilic bacteria. XX FH Key Location/Qualifiers FH FT source 1..2115 FT /organism="Clostridium thermocellum" FT /mol_type="genomic DNA" FT /clone="pUTF71" FT /note="382 bp upstream of Tth111I site" FT /note="JW20" FT /db_xref="taxon:1515" FT -35_signal 151..156 FT /note="putative -35 promoter region" FT -10_signal 175..180 FT /note="putative -10 promoter region" FT misc_feature 187 FT /note="alternative mRNA start sites" FT misc_feature 188 FT /note="alternative mRNA start sites" FT repeat_region 219..227 FT /rpt_type=DIRECT FT repeat_region 225..233 FT /rpt_type=DIRECT FT repeat_region 235..243 FT /rpt_type=DIRECT FT repeat_region 245..253 FT /rpt_type=DIRECT FT repeat_region 257..265 FT /rpt_type=DIRECT FT repeat_region 270..278 FT /rpt_type=INVERTED FT repeat_region 284..291 FT /rpt_type=INVERTED FT repeat_region 298..305 FT /rpt_type=INVERTED FT repeat_region 333..341 FT /rpt_type=INVERTED FT repeat_region 346..354 FT /rpt_type=INVERTED FT RBS 347..354 FT CDS 361..1845 FT /codon_start=1 FT /transl_table=11 FT /gene="trpE" FT /db_xref="GOA:P14953" FT /db_xref="InterPro:IPR005256" FT /db_xref="InterPro:IPR005801" FT /db_xref="InterPro:IPR006805" FT /db_xref="InterPro:IPR015890" FT /db_xref="InterPro:IPR019999" FT /db_xref="UniProtKB/Swiss-Prot:P14953" FT /protein_id="BAA00300.1" FT /translation="MFYPTLDEVKIMAKDYNIIPVTMEVYADMETPISLFKRFEESSCC FT FLLESVEGGEKWARYSIIGKNPFLVVESYKNKTIIRERNGSQREVEGNPVEIIKGIMGK FT FKGANLPNLPRFNGGAVGYFGYDLIRHYENLPNVPEDDMGLPECHFMFTDEVLVYDHLK FT QKIHIIVNLHVNGNIERAYISAVDRIKTIHREILDTRWKTADNSVLSYNKKKNELAVTS FT NISKEDFCRNVLKAKQYIRDGDIFQVVLSQRLCVETNENPFNIYRALRVINPSPYMYYL FT KFGGYRIIGSSPEMLVRVENGIVETCPIAGTRKRGRTKEEDEALEKELLSDEKEIAEHV FT MLVDLGRNDIGRVSKFGTVAVKNLMHIERYSHVMHVVTNVQGEIREDKTPFDALMSILP FT AGTLSGAPKVRAMEIIDELETVKRGPYGGAIGYLSFNGNLDSCITIRTIILKDGKAYVQ FT AGAGIVADSVPEREYEECYNKAMALLKAIEEAGEIR" FT CDS 1842..>2115 FT /codon_start=1 FT /transl_table=11 FT /gene="trp(G)D" FT /note="amino end" FT /db_xref="GOA:P14952" FT /db_xref="InterPro:IPR000991" FT /db_xref="InterPro:IPR001317" FT /db_xref="InterPro:IPR006220" FT /db_xref="InterPro:IPR011702" FT /db_xref="InterPro:IPR017926" FT /db_xref="UniProtKB/Swiss-Prot:P14952" FT /protein_id="BAA00301.1" FT /translation="MIENILIIDNYDSFTYNLYQYVGEISPNIEVYRNDKITLEKIEEM FT NPTHIIISPGPGFPKDAGICIEAIRKFGRYIPILGVCLGHQAIGEA" XX SQ Sequence 2115 BP; 688 A; 367 C; 504 G; 556 T; 0 other; gttcaatgtc aaggattatt gcgaaacagc tcattaaaaa gattattgtt acggtgtcag 60 tgaaagctgt atatttggca caaaacatca tttattatat atcatttgaa aaagtttcaa 120 ataaaattta taacatttat gaaatttaac ttgcattttt cgtattttcc atgctacaat 180 aatcttaaag ataagaataa gttaagaatt ggatgagatg agtttgagtt tgaagagaag 240 agatgagaag agacgagcgt gaataaatgt atttgcgcca tatcgccgga ttcttccatc 300 cggcgttttt agtatggtca tctctcatgt cttccttctt agaaataagg aggagtcgga 360 atgttttatc caaccctgga cgaagtcaaa ataatggcaa aagattataa tatcatacct 420 gtcacaatgg aagtatatgc cgacatggaa acccctataa gcctttttaa aaggtttgag 480 gaaagcagtt gctgtttcct tttggagagc gttgagggcg gtgaaaaatg ggcccggtac 540 tccatcatcg gaaaaaatcc gtttcttgtt gtggaaagct acaaaaacaa aaccattata 600 agggagagga acggttctca aagggaagtt gaaggaaatc ctgttgaaat aataaagggc 660 attatgggga agtttaaagg tgccaacctt ccgaatcttc cgagattcaa cgggggagcg 720 gtgggatatt ttgggtatga cctcatacga cactatgaaa atcttcccaa tgtccccgaa 780 gatgacatgg gtcttccgga atgccatttc atgtttaccg acgaagtgct ggtgtatgac 840 catctaaagc agaaaattca tataattgtt aatttgcatg tcaacggcaa cattgaacgg 900 gcctatataa gcgcggttga ccggataaaa accatacaca gggagattct tgacaccagg 960 tggaaaaccg ctgacaactc tgttctaagt tacaataaaa agaaaaatga acttgcggta 1020 accagcaata tttcaaaaga ggatttctgc cggaatgtgt tgaaggcaaa gcagtatata 1080 agggacggag acatattcca ggtggttttg tcgcaacgct tgtgtgttga gacaaatgaa 1140 aatcctttta acatataccg cgccttaagg gttataaatc cttctccata tatgtattat 1200 cttaaatttg gcggctacag aataataggt tcttcccccg agatgctggt cagggttgaa 1260 aatggaattg tggaaacctg tccgattgca ggaacgcgaa agagaggcag gacaaaagaa 1320 gaggatgagg ctttggaaaa agagcttctt tccgatgaga aagaaatagc cgagcatgtg 1380 atgctggtgg acctgggcag aaacgatatc ggaagagtat cgaaatttgg taccgtagcg 1440 gtaaagaacc ttatgcacat tgagagatat tcccatgtaa tgcatgtggt aacaaacgta 1500 cagggagaga ttcgggagga taagactcct tttgacgccc ttatgtccat tcttcctgcc 1560 ggtacccttt ccggagcgcc aaaggtcagg gctatggaga taatagacga gcttgagacc 1620 gtaaaaagag gtccctacgg cggtgcgatc gggtatctta gctttaacgg caatctcgac 1680 agctgcataa ccataaggac aattatatta aaggacggaa aggcttatgt tcaggccgga 1740 gcgggcatag tcgcggattc ggtcccggaa agggagtatg aagagtgcta caacaaagca 1800 atggcacttc ttaaagccat agaagaggca ggtgaaataa gatgatagaa aacatattga 1860 ttattgataa ttatgattct tttacctaca atttgtacca gtatgtcggg gaaatcagtc 1920 ccaatattga ggtctacaga aatgacaaaa taactttgga aaagatagaa gaaatgaatc 1980 ccacgcatat aataatttct cccggccccg gttttccgaa agatgcagga atatgcatag 2040 aagctataag aaagttcggc aggtatatac ccattctggg ggtatgcctt ggacatcagg 2100 caattggaga agctt 2115 // ![]() |