Interesting Structures

This tutorial will cover a spectrum of protein structures available from the Protein Data Bank and the PDBe pages. In order to cover this tutorial you will need a graphical viewer like rasmol installed on your machine. If rasmol is not available, then java will be required to run java-based viewers such as AstexViewer or Jmol. The selection of structures is not exhaustive and is merely intended to provide a feel for structures as present in the PDB and their equivalent quaternary structures in the PQS or PISA servers. The first link on the legend takes you to the PDBe atlas pages for this entry, while the second link (View) will start a java-based viewer.

Example 1

The crystal structure of the Nucleosome Core Particle (PDB entry: 1AOI) (View)

The X-ray crystal structure of the nucleosome core particle of chromatin shows in atomic detail how the histone protein octamer is assembled and how 146 base pairs of DNA are organized into a superhelix around it. Both histone/histone and histone/DNA interactions depend on the histone fold domains and additional, well ordered structure elements extending from this motif.

Example 2

The crystal structure of the 20s proteasome from yeast (PDB entry: 1FNT)(View)

The crystal structure of archaebacterial 20s proteasome (PDB entry: 1YAR)(View)

The proteasome from archaebacteria to eukaryotes have the same basic architecture, which under the electron microscope appears as a cylinder shaped particle, made up of four stacked rings with dimensions of approximately 15 nm in height and 11 nm in diameter. Eukaryotic proteasomes have a more complex structure than the proteasome from the archaebacterium Thermoplasma acidophilum, which contains only two different subunits, alpha and beta of 25.8 kD and 22.3 kD respectively. However, due to gene duplication in the higher organisms there are 14 different genes which encode 28 subunits that form the eukaryotic proteasome.

Example 3

Crystal structure of the bovine F1-Atpase (PDB entry: 1BMF)(View)

(Quoted from http://www.biologie.uni-osnabrueck.de/biophysik/Feniouk/Basics.html) . Prof. J.E.Walker received the nobel prize for chemistry for the determination of the structure of this enzyme.

During catalysis a complex formed by certain subunits rotate relative to the rest of the enzyme. This feature makes ATP synthase the smallest rotary machine ever known.

This enzyme is the primary source of ATP in a vast majority of living species on Earth, including us. In human body it daily generates over 100 kg of ATP, which is subsequently used to provide energy for different biochemical reactions, including DNA and protein synthesis, muscle contraction, transport of nutrients and neural activity, to name just a few.
In plants and photosynthetic bacteria it is essential for solar energy convertion and carbon fixation. This is one of the oldest enzymes on Earth, which appeared earlier then photosynthetic or respiratory enzyme machinery.

This is a membrane enzyme. It is found in eu- and archebacteria in the plasma membrane; it is present in the thylacoid membrane in chloroplasts and in the inner mitochondrial membrane of eucariotic cells. Enzymes from different organisms show striking homology in the primary structure of subunits essential for catalysis.

As could be deduced from the name of the enzyme, it catalyses the reaction of ATP synthesis/hydrolysis. The catalytic act is coupled with vectoral transmembrane translocation of several protons. The driving force for ATP synthesis is the transmembrane electrochemical gradient of protons, while during ATP hydrolysis this gradient is built using the energy of ATP phosphodietheric bond.

Example 4

Crystal structure of P53 in complex with DNA (PDB entry: 1TUP)(View)

p53, also known as tumor protein 53 (TP53), is a transcription factor that regulates the cell cycle and hence functions as a tumor suppressor. It is very important for cells in multicellular organisms to suppress cancer. p53 has been described as "the guardian of the genome" or the "master watchman", referring to its role in conserving stability by preventing genome mutation. The name is due to its molecular mass: it runs as a 53 kilodalton (kDa) protein on SDS-PAGE. Mutations in the p53 tumor suppressor are the most frequently observed genetic alterations in human cancer. The majority of the mutations occur in the core domain which contains the sequence-specific DNA binding activity of the p53 protein (residues 102-292), and they result in loss of DNA binding. p53 has many anti-cancer mechanisms: a) It can activate DNA repair proteins when DNA has sustained damage. b) It can also hold the cell cycle at the G1/S regulation point on DNA damage recognition and c) it can initiate apoptosis, the programmed cell death, if the DNA damage proves to be irreparable.

Example 4

Similar domains, different functions. On the right (PDB entry: 1LIT), on the left (PDB entry: 1B08)

Superposed structure of Lithostathine and Lung Surfactant Protein(SP-D). Lithostathine is colored in blue and is aligned with one domain of SP-D

Human Lithostathine (HLIT) (PDB entry 1LIT) is a pancreatic glycoprotein which inhibits the growth and nucleation of calcium carbonate crystals. Structural comparison with the carbohydrate-recognition domains of rat mannose-binding protein and E-selectin indicates that the C-terminal domain of HLIT shares a common architecture with the C-type lectins. Nevertheless, HLIT does not bind carbohydrate nor does it contain the characteristic calcium-binding sites of the C-type lectins. In consequence, HLIT represents the first structurally characterized member of this superfamily which is not a lectin. Analysis of the charge distribution and calculation of its dipole moment reveal that HLIT is a strongly polarized molecule. Eight acidic residues which are separated by regular 6 angstrom spacings form a unique and continuous patch on the molecular surface. This arrangement coincides with the distribution of calcium ions on certain planes of the calcium carbonate crystal; the dipole moment of HLIT may play a role in orienting the protein on the crystal surface prior to the more specific interactions of the acidic residues.

Human lung surfactant protein D (hSP-D) (PDB entry 1B08) belongs to the collectin family of C-type lectins and participates in the innate immune surveillance against microorganisms in the lung through recognition of carbohydrate ligands present on the surface of pathogens. The involvement of this protein in innate immunity and the allergic response make it the subject of much interest. The structure comprises an alpha-helical coiled-coil and three carbohydrate-recognition domains (CRDs) which belong the the C-type lectin family represented by the the mannan-binding protein (MBP).

Example 5

The crystal structure of human placental ribonuclease inhibitor (green) in comples with human angiogenin (cyan). (PDB entry: 1A4Y)(View)

Human placental RNase inhibitor (hRI), a leucine-rich repeat protein, binds the blood vessel-inducing protein human angiogenin (Ang) with extraordinary affinity (Ki >1 fM). The hRI-Ang binding interface is large and encompasses 26 residues from hRI and 24 from Ang, recruited from multiple domains of both proteins. However, a substantial fraction of the energetically important contacts involve only a single region of each: the C-terminal segment 434-460 of hRI and the ribonucleolytic active centre of Ang, most notably the catalytic residue Lys40.

Example 6

Crystal structure of the 70s ribosome in complex with mRNA and tRNA (PDB entries: 2J00 and 2J01)

The ribosome is the large protein-RNA complex that synthesizes proteins using genetic instructions encoded in the mRNA template. Ribosomes are composed of ribosomal RNA and ribosomal proteins (known as a Ribonucleoprotein or RNP). It translates Messenger RNA (mRNA) into a polypeptide chain (e.g., a protein). It can be thought of as a factory that builds a protein from a set of genetic instructions. Ribosomes can float freely in the cytoplasm (the internal fluid of the cell) or bind to the endoplasmic reticulum, or to the nuclear envelope. These structures traverse more than one entry due to the number of protein and nucleic acid chains, as well as the number of atoms present in the complex. The picture shown above contains 58 chains and over 140000 atoms with coordinates. As a matter of fact, the experiment determined the structure of two such complexes and is divided across 4 pdb files (2j00, 2j01, 2j02 and 2j03) and contains 116 chains and over 280000 atoms, thereby making this the largest structure in the PDB.

Example 7

A part of the Rat Liver Vault determined at 3.50A resolution. (PDB entries: 2ZV4)

From the Abstract: Vaults are among the largest cytoplasmic ribonucleoprotein particlesand are found in numerous eukaryotic species. Roles in multidrugresistance and innate immunity have been suggested, but thecellular function remains unclear. The structure of the vault has been determined at 3.5 angstrom resolution andshows that the cage structure consists of a dimer of half-vaults,with each half-vault comprising 39 identical major vault protein(MVP) chains. Each MVP monomer folds into 12 domains: nine structuralrepeat domains, a shoulder domain, a cap-helix domain, and acap-ring domain. Interactions between the 42-turn-long cap-helixdomains are key to stabilizing the particle. The shoulder domainis structurally similar to a core domain of stomatin, a lipid-raftcomponent in erythrocytes and epithelial cells.

This protein structure is divided across 3 pdb files (2ZV4, 2ZV5 and 2ZUO). The 3 entries together form half of the vault that is comprised of 39 identical chains.

The complete vault can be produced from the crystal structure by applying a crystallographic symmetry operation on the 39 identical chains in order to produce a 78 chain complex (as below)