An iterative knowledge‐based scoring function for protein–protein recognition

4 February 2008

journal article
research article
Published by Wiley in Proteins-Structure Function and Bioinformatics

Vol. 72 (2), 557-579
https://doi.org/10.1002/prot.21949

Abstract

Using an efficient iterative method, we have developed a distance‐dependent knowledge‐based scoring function to predict protein–protein interactions. The function, referred to as ITScore‐PP, was derived using the crystal structures of a training set of 851 protein–protein dimeric complexes containing true biological interfaces. The key idea of the iterative method for deriving ITScore‐PP is to improve the interatomic pair potentials by iteration, until the pair potentials can distinguish true binding modes from decoy modes for the protein–protein complexes in the training set. The iterative method circumvents the challenging reference state problem in deriving knowledge‐based potentials. The derived scoring function was used to evaluate the ligand orientations generated by ZDOCK 2.1 and the native ligand structures on a diverse set of 91 protein–protein complexes. For the bound test cases, ITScore‐PP yielded a success rate of 98.9% if the top 10 ranked orientations were considered. For the more realistic unbound test cases, the corresponding success rate was 40.7%. Furthermore, for faster orientational sampling purpose, several residue‐level knowledge‐based scoring functions were also derived following the similar iterative procedure. Among them, the scoring function that uses the side‐chain center of mass (SCM) to represent a residue, referred to as ITScore‐PP(SCM), showed the best performance and yielded success rates of 71.4% and 30.8% for the bound and unbound cases, respectively, when the top 10 orientations were considered. ITScore‐PP was further tested using two other published protein–protein docking decoy sets, the ZDOCK decoy set and the RosettaDock decoy set. In addition to binding mode prediction, the binding scores predicted by ITScore‐PP also correlated well with the experimentally determined binding affinities, yielding a correlation coefficient of R = 0.71 on a test set of 74 protein–protein complexes with known affinities. ITScore‐PP is computationally efficient. The average run time for ITScore‐PP was about 0.03 second per orientation (including optimization) on a personal computer with 3.2 GHz Pentium IV CPU and 3.0 GB RAM. The computational speed of ITScore‐PP(SCM) is about an order of magnitude faster than that of ITScore‐PP. ITScore‐PP and/or ITScore‐PP(SCM) can be combined with efficient protein docking software to study protein–protein recognition. Proteins 2008.

This publication has 71 references indexed in Scilit:

A simple reference state makes a significant improvement in near‐native selections from structurally refined docking decoys
Proteins, 2007
Poisson-Boltzmann Calculations of Nonspecific Salt Effects on Protein-Protein Binding Free Energies
Biophysical Journal, 2007
Efficient molecular docking of NMR structures: Application to HIV‐1 protease
Protein Science, 2007
PROTCOM: searchable database of protein complexes enhanced with domain-domain structures
Nucleic Acids Research, 2006
An accurate, residue‐level, pair potential of mean force for folding and binding based on the distance‐scaled, ideal‐gas reference state
Protein Science, 2004
The Protein Data Bank
Nucleic Acids Research, 2000
BLEEP?potential of mean force describing protein-ligand interactions: I. Generating potential
Journal of Computational Chemistry, 1999
Protein docking and complementarity
Journal of Molecular Biology, 1991
A geometric approach to macromolecule-ligand interactions
Journal of Molecular Biology, 1982
Computer analysis of protein-protein interaction
Journal of Molecular Biology, 1978

Cited by 242 articles