Get   for     ? 
 Site search     ? 
CAPRI: Critical Assessment of PRediction of Interactions
 
  
   Call For Targets
   Meeting Report
   Evaluation Meeting
Registration
   Evaluation Meeting
Program
   ROUND 1
   ROUND 2
 
 MSD  CAPRI: Critical Assessment of PRediction of Interactions

First community wide experiment on the comparative evaluation of protein-protein docking for structure prediction

Hosted By EMBL/EBI-MSD Group

CAPRI target 03 evaluation results

Raúl Méndez, Leonardo De Maria and Shoshana J. Wodak.
SCMBB Université Libre de Bruxelles, Cp 263, Brussels, Belgium.
Re-accessed on Monday December 16, 2002.
e-mail: raul@ucmb.ulb.ac.be
shosh@ucmb.ulb.ac.be

The evaluation results of the CAPRI TARGET 03 predictions are stored in different directories depending on the criteria that have been used. In the following the directories and their contents are briefly described.


Information.

Directory Information contains the information about target 03, that was used in the evaluation and scoring. It contains the following files (file names are given in bold):

  • capri.3.pdb: the crystal structure of the target (target 03) in PDB format.

  • Contres1: list of residue contacts in the target between FAB subunits (H, L) and Hemagglutinin subunits (A, C).
  • Contres2: list of residue contacts in the target between FAB subunits (T, U) and Hemagglutinin subunits (C, E).
  • capri.3.HL.intres: FAB - Hemagglutinin interface residues (at interface HL).
  • capri.3.TU.intres: FAB - Hemagglutinin interface residues (at interface TU).
  • cc.capri.3.HL.d: list of clashes in the target HL interface.

  • cc.capri.3.TU.d: list of clashes in the target TU interface.


    Final Summary

    File target 03 Final Summary. summarizes all the information about the target 03 evaluation in the same way as the corresponding summary file for target 01. It looks like that:

    
    PREDS			fnat		fnon-nat		 		fIR		 	INTERFACE RES.(OP)	THETA ANGLE	DISTANCE		Nclash		L_rmsd		I_rmsd
    								   FAB       	HMGL      	   FAB       	HMGL      
       
    T03_P26.4.HL 0.683 0.494 0.545 0.824 0.600 0.737 20.4 6.150 100 7.438 1.679 T03_P26.5.HL 0.016 0.984 0.364 0.147 0.400 0.179 123.7 40.716 44 47.340 16.448 T03_P26.1.HL 0.000 1.000 0.242 0.000 0.222 0.000 110.9 81.086 54 84.690 33.794 T03_P26.2.HL 0.000 1.000 0.333 0.000 0.500 0.000 74.5 83.193 62 85.180 30.837 T03_P26.3.HL 0.000 1.000 0.212 0.000 0.179 0.000 175.8 53.365 76 61.492 24.483 .

    Again T03_P26.1.HL means participant 26, prediction 1 for the target 03, FAB interface HL (chains H and L). For all the participants, except for P58, the only interface with Hemagglutinin in their predictions is made by the H and L FAB subunits (note that in capri.3.pdb we have H and T, the Fab heavy chains, and L and U, light chains, that are identical in sequence).

    In contrast, participant 58 used all three possible FAB pairs (HL, IM and JQ) for their calculations. In the evaluation these interfaces are considered to be independent, and compared separately with the reference. This was done for sake of consistency with the target 01 evaluation methodology.

    Column 2 gives the fraction of predicted contacts over native fnat. This fraction is computed as the number of contacts in the prediction that match the contacts in the target, divided by the number of contacts in the target. In fact we compute 2 different ratios, one for each of the 2 different interfaces in the target and retain the largest ratio for the summary. The contact lists for the target are Contres1 and Contres2. As mentioned above, 2 residues are considered as being in contact if at least one atom of one residue is within 5Å of an atoms of the other.

    Colum 3 gives the false positive fnon-nat contact fraction. This fraction is computed as the number of contacts in the prediction that don't match the contacts in the target, divided by the number of contacts in the prediction. This number accounts for the real efficiency of the prediction in term of contact: as bigger is the predicted interface as higher the probability of predict native contacts. The number given in this table is the one referred to the target contact list which gives the highest contact ratio over prediction.

    Columns 4 and 5 list the interface residues ratios over native fIR. Column 4 give the ratio between the residues of the FAB that are part of the interface in the prediction, over the FAB residues that are part of the interface in the target. The 5th column gives the same information for the Hemagglutinin moiety. Note that these ratios are computed considering, respectively the 2 different FAB/Hemagglutinin interfaces in the target, but again we list in the summary the results when compared to the target interface which gives the hihgest contact ratio (ON). Column 5, list the interface contact ratio for the Hemagglutinin. All the interface residues lists are generated using the BRUGEL package.

    Columns 6 and 7 lists the interface residue ratios over prediction. They are analogous to columns 4 and 5 but now dividing the number of residues in the prediction found in the target over the total number of provided residues at the predicted interface.

    Column 8 lists the rotation angle (Theta angle) necessary to fit the FAB molecule in the predicted complex to that in the target, as per capri.3.pdb. To compute this angle, we first perform a rigid-body fit (Kabsch, 1978, Acta. Cryst. A. 34, 827-828) of the Hemagglutinin subunit in the predicted complex, to the Hemagglutinin subunit in the target. The particular subunits superimposed are listed in the detailed information (see the FITTING_SUMMARY below).

    After this first fit, a second fit is performed so as to superimpose the predicted FAB molecules onto its closest counterpart in the target structure (capri.3.pdb closest). The rotation angle corresponding to this second fitting is the listed theta angle. Computed angles and distances are done considering the FAB chain fragments that are sumperimposed in the second fit (see FITTING_SUMMARY section).

    Column 9 lists the distance (in Angstroms) between geometric centers of predicted and target FAB molecules before the second fit. The distance between the geometric centers together with the Theta angle give an idea of the global position of the FAB in the prediction relative to the position in the target.

    Column 10 lists the number clashes Nclash between the FAB and the Hemagglutinins for each predicted complex. Clashes are computed between heavy atom within 3 Å . In the detailed information you can find the clash pairs classified into three categories: from 0 to 1, from 1 to 2 and from 2 to 3 Å.

    Columns 11 and 12 list the RMSD's (Root Mean Square Deviation) values in Å . Column 11 list the RMSD values calculated between the FAB's backbones once the Hemagglutinins are superimposed. Column 12 contain the rsmd's when sumperimposing the backbones of the residues at the interface (FAB + Hemagglutinin) on the prediction upon the counterpart in the target I_rmsd. Residues at the interface are re-defined here, as residues in the target having at least one atom within 10 Å of an atom of the other molecule. The equivalents for those residues in the predictions are considered as to be in the interface to sumperimpose. For all the RMSD calculations we consider the same molecular fragments as for the fits.


    Contact List

    Directory: ContactList contains one file per predicted interface, with information on the residue-residue contacts in the predicted versus the target complexes

    As an example the file T03_P26.4.HL.highlighted is illustrated in part:

    HIGHLIGTHED CONTACT LIST FOR T03_P26.4.HL.highlighted

    Number of contacts = 85 Matching list1 = 43/63 Matching List2 = 39/63


    H24  VAL - A160 THR
    H25  THR - A160 THR
    H26  GLY - A160 THR
    H26  GLY - A161 TYR
    H26  GLY - A162 PRO	1
    H26  GLY - A163 VAL	1
    H26  GLY - A197 GLN
    H27  TYR - A161 TYR
    H27  TYR - A162 PRO	1
    H27  TYR - A163 VAL	1	2
    H27  TYR - E187 THR
    
    

    Each predicted contact that matches the target contact list 1 (Contres1) is highlighted with "1" and if it matches contact list 2 (Contres2) is highlighted with "2".


    INTERFACE_RESIDUES_HIGHLIGHTED

    Directory InterfaceResidues contains one file per predicted interface, with information on the residues forming the FAB-Hemagglutinin interface in the prediction and how well they match those in the target interfaces.

    The information contained in each file is illustrated by an example, T03_P26.4.HL.highlighted

    HLHIGLIGHTED RESIDUE LIST FOR T03_P26.4.HL.highlighted

     

    N_res_FAB = 30 N_res_HMGL = 38 Match HL_FAB = 18/33 Match HL_HMGL = 28/34 Match TU_FAB = 16/32 Match TU_HMGL = 31/35


    
    FAB LIST
     
    H2   VAL   18.433	1
    H24  VAL    3.365
    H25  THR   34.322
    H26  GLY   44.707	1	2
    H27  TYR   69.739	1	2
    H28  SER   50.623	1	2
    
     
    HEMAGGLUTININ LIST
     
    A128 THR   76.441	1	2
    A129 GLY   30.649	1	2
    A157 SER   33.476	2
    A158 GLY    7.710	1	2
    A159 SER   19.594	1	2
    A160 THR   80.965	1
    

    Each time a residue of the FAB or Hemagglutinin in the predicted interface matches one of the interface residues in the target list, it is highlighted with either " 1", "2" or both to indicate which capri T03 interface residue list is matched (1 means capri.3.HL.intres and 2 capri.3.TU.intres).

    Note that interface residues list files and contact list ones are named the same (i.e. T03_P26.4.HL.highlighted) but they are in different directories and their contents are completely different.


    FITTING_SUMMARY

    Directory FittingSummary contains one file per predicted interface, with information on the results of fitting the predicted complex over the target complex. The information contained in each file is illustrated by an example, file T03_P26.4.HL.fitting.summary

    Fitting of E prediction HMGL subunit onto W capri 3 subunit
    Rotation Matrix:
       0.08318   0.95763   0.27572
       0.33307  -0.28748   0.89801
       0.93923   0.01714  -0.34287
      Translation vector 1.640    83.826   -25.483
    Fitting FAB's, HL onto YR
    Theta angle = 20.38
    Distance between geometric centres = 6.150477
    

    As for the evaluation of target 01, we give the information about the first fit (rotation matrix and translation vector including which subunits are involved), the distance between predicted FAB and Capri FAB after this first fit and the Theta angle of the second fit. Again we use the optimal fit in the sense that we scan all the possible Hemagglutinin subunit with the highest number of contact vs. any of the Hemagglutinin subunits in the target and select the transformation that leaves both the predicted and target FAB closest as a first fit.

    For this target 03 evaluation, the first fit was made using the backbones of whole Hemagglutinin large chains A, C, E (skipping residues 1-8 that appear only in P73 submissions and in orginal unbound 1HGH.pdb) as they are the only ones of the heterodimers (AB,CD,EF) forming Hemagglutinin antigen that are involved in the contacts. Also the rmsd on single unbound - bound large chains are better than the corresponding unbound - bound on small Hemagglutinin chains and than the combination bound - unbound large+small. In this context, the prediction large chain with the highest number of contacts is fitted onto capri large chain with the highest number of contacts at the interface which this prediction scores the best. While the second fit was made considering the smallest FAB fragment common to all the participants, e.g. residues 1-117 for H, I, J subunits (H, T in Capri) and residues 1 - 106 in L,M,Q subunits (L,U in Capri). In order to be consistent, the distance between geometric centres was calculated taking into account only this fragment, which in fact corresponds to the variable domains in contact with the Hemagglutinin protein.

    Note that in order to not confuse chain ID's between target and predicted coordinate sets, the chain ID's in the target (capri.3.pdb) were renamed as follows:

    A to W
    B to K
    C to X
    D to N
    E to O
    F to P
    for the Hemagglutinin subunits

    H to Y
    L to R
    T to S
    U to V
    for the FAB subunits.


    FITTED PDB

    Directory FittedPDB contains the files with the coordinates of the predicted and target complexes superimposed, following the first fit, in which the Hemagglutinin subunits have been superimposed (using the listed rotation matrix and translation vector). Fitted Pdb are now regenerated according to the new fits.


    CLOSE_CONTACTS

    Directory CloseContacts contains one file per predicted interface with information on the clashes in each predicted interface.

    For example part of file cc.T03_P26.4.HL.d looks like that:

    FAB Atom        HEMAGGLUTININ Atom    Distance
    --
    H 107 .PHE.CB 	E 189 .GLN.CG     	1.67
    H 107 .PHE.CG 	E 189 .GLN.CG     	1.68
    H 107 .PHE.CB 	E 189 .GLN.CD     	1.76
    H 107 .PHE.CD2  E 189 .GLN.CB     	1.86
    H 107 .PHE.CD2  E 189 .GLN.CG     	1.89
    --
    H 107 .PHE.O  	E 189 .GLN.NE2    	2.13
    H  31 .SER.OG 	A 163 .VAL.O      	2.15
    H 107 .PHE.CB 	E 189 .GLN.OE1    	2.17
    H 100 .TYR.CB 	E 189 .GLN.OE1    	2.19
    
      .
    
      .
    
      .
    

    As in the evaluation of target 01, the list of clashes is segregated into contacts between 0-1 (no contacts in this example), 1-2 and 2-3Å.