Thornton Group

Future projects

For our enzyme work, our central question is whether we can predict the evolution of enzyme function – both in terms of adapting to operate on new substrates and evolving new mechanisms. Can we relate changes in function to changes in the structure of the enzyme and changes in the environment? Can we automatically predict or validate enzyme catalytic mechanisms in silico from structural data? We will further develop our data resources (M-CSA) and websites (PDBsum) and develop novel methods to predict transformations and mechanisms using knowledge-based and deep-learning approaches.

For coding variants, we will enhance our web tool (VarSite) to relate variants, 3D structure and function, helping non-experts understand the impact of coding variants and how they generate disease phenotypes. To this end, we plan to:

develop new methods to analyse the effects of mutations in ligand binding sites

explore variants in co-factor binding sites and their impact on function

apply our methods to specific genes of interest in collaboration with ‘domain’ experts

explore how the same mutations can cause many diseases, and how one disease can have many causes.

For ageing, we will develop tools to combine transcriptome data sets and analyse a small number of common diseases and the impact of ageing on their occurrence.

ArchSchema

ArchSchema is a Java Web Start application that generates dynamic plots of related Pfam domain architectures. The protein sequences having each architecture can be displayed on the plot and separately listed. Where there is 3D structural information in the PDB, the relevant PDB codes can be shown on …

Atlas of Protein Side-Chain Interactions

This atlas depicts how amino acid side-chains pack against one another within the known protein structures. This packing, which is governed by the interactions between the 20 different types of side-chains, determines the structure, function, and stability of proteins.

Catalytic Site Atlas

CSA is a resource of catalytic sites and residues that have been identified in enzymes using structural data.

Cofactor database

Organic enzyme cofactors are involved in many enzyme reactions. Therefore, the analysis of cofactors is crucial to gain a better understanding of enzyme catalysis. To aid this, we have created the CoFactor database. It provides a web interface to access hand-curated data extracted from the literatur…

CSS

Searches a protein structure for likely catalytic sites

EC-PDB

This database contains the known enzyme structures that have been deposited in the Protein Data Bank (PDB).

FunTree

FunTree provides a range of data resources to detect the evolution of enzyme function within distant structurally related clusters within domain superfamilies as determined by CATH. To access the resource, enter a specific CATH superfamily code or search for a structure / sequence / function (eithe…

LigSearch

Identifies small molecules likely to bind to given protein

MACiE

Mechanism, Annotation and Classification in Enzymes. Query for an enzyme to retrieve its mechanism.

PDBsum

This pictorial database provides an overview of macromolecular structures deposited in the Protein Data Bank archive.

PDBsum Generate

Protein structure analyses, including secondary structure determination, quality assessment, and protein-ligand, protein-protein and protein-DNA interactions.

PITA

Suggests most likely biological unit for X-ray structure of protein

PoreLogo

Generates logo showing conservation of pore-lining residues in transmembrane protein structures

PoreWalker

Detects and characterises transmembrane protein channels from their 3D structure

ProFunc

Protein function prediction

SAS

Annotation of protein sequence with structural info from similar proteins in the PDB

SAS – Sequence Annotated by Structure

SAS is a tool for applying structural information to a given protein sequence. It uses FASTA to scan a given protein sequence against all the proteins of known 3D structure in the Protein Data Bank (PDB). The resultant multiple alignment can be coloured according to different structural features an…

Scorecons

Scores residue conservation based on a given multiple sequence alignment

SurvCurv – Analyse data

Survival and other incident curves