SFCscore: Scoring functions for affinity prediction of protein–ligand complexes

3 September 2008

journal article
research article
Published by Wiley in Proteins-Structure Function and Bioinformatics

Vol. 73 (2), 395-419
https://doi.org/10.1002/prot.22058

Abstract

Empirical scoring functions to calculate binding affinities of protein–ligand complexes have been calibrated based on experimental structure and affinity data collected from public and industrial sources. Public data were taken from the AffinDB database, whereas access to industrial data was gained through the Scoring Function Consortium (SFC), a collaborative effort with various pharmaceutical companies and the Cambridge Crystallographic Data Center. More than 850 complexes were obtained by the data collection procedure and subsequently used to setup different training sets for the parameterization of new scoring functions. Over 60 different descriptors were evaluated for all complexes, including terms accounting for interactions with and among aromatic ring systems as well as many surface-dependent terms. After exploratory correlation and regression analyses, stepwise variable selection procedures and systematic searches, the most suitable descriptors were chosen as variables to calibrate regression functions by means of multiple linear regression or partial least squares analysis. Eight different functions are presented herein. Cross-validated r² (Q²) values of up to 0.72 and standard errors (s_PRESS) generally below 1.15 pK_i units suggest highly predictive functions. Extensive unbiased validation was carried out by testing the functions on large data sets from the PDBbind database as used by Wang et al. (J Chem Inf Comput Sci 2004;44:2114–2125) in a comparative analysis of other scoring functions. Superior performance of the SFCscore functions is observed in many cases, but the results also illustrate the need for further improvements. Proteins 2008.

Keywords

This publication has 53 references indexed in Scilit:

An all atom energy based computational protocol for predicting binding affinities of protein–ligand complexes
FEBS Letters, 2005
Virtual screening of chemical libraries
Nature, 2004
An Extensive Test of 14 Scoring Functions Using the PDBbind Refined Set of 800 Protein−Ligand Complexes
Journal of Chemical Information and Computer Sciences, 2004
Novel Scoring Functions Comprising QXP, SASA, and Protein Side-Chain Entropy Terms
Journal of Chemical Information and Computer Sciences, 2004
A new test set for validating predictions of protein–ligand interaction
Proteins-Structure Function and Bioinformatics, 2002
The Protein Data Bank
Nucleic Acids Research, 2000
SCORE: A New Empirical Method for Estimating the Binding Affinity of a Protein-Ligand Complex
Journal of Molecular Modeling, 1998
Development and validation of a genetic algorithm for flexible docking
Journal of Molecular Biology, 1997
Free energy calculations: Applications to chemical and biochemical phenomena
Chemical Reviews, 1993
Validation of the general purpose tripos 5.2 force field
Journal of Computational Chemistry, 1989

Cited by 102 articles