SFCscore: Scoring functions for affinity prediction of protein–ligand complexes
- 3 September 2008
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 73 (2), 395-419
- https://doi.org/10.1002/prot.22058
Abstract
Empirical scoring functions to calculate binding affinities of protein–ligand complexes have been calibrated based on experimental structure and affinity data collected from public and industrial sources. Public data were taken from the AffinDB database, whereas access to industrial data was gained through the Scoring Function Consortium (SFC), a collaborative effort with various pharmaceutical companies and the Cambridge Crystallographic Data Center. More than 850 complexes were obtained by the data collection procedure and subsequently used to setup different training sets for the parameterization of new scoring functions. Over 60 different descriptors were evaluated for all complexes, including terms accounting for interactions with and among aromatic ring systems as well as many surface-dependent terms. After exploratory correlation and regression analyses, stepwise variable selection procedures and systematic searches, the most suitable descriptors were chosen as variables to calibrate regression functions by means of multiple linear regression or partial least squares analysis. Eight different functions are presented herein. Cross-validated r2 (Q2) values of up to 0.72 and standard errors (sPRESS) generally below 1.15 pKi units suggest highly predictive functions. Extensive unbiased validation was carried out by testing the functions on large data sets from the PDBbind database as used by Wang et al. (J Chem Inf Comput Sci 2004;44:2114–2125) in a comparative analysis of other scoring functions. Superior performance of the SFCscore functions is observed in many cases, but the results also illustrate the need for further improvements. Proteins 2008.Keywords
This publication has 53 references indexed in Scilit:
- An all atom energy based computational protocol for predicting binding affinities of protein–ligand complexesFEBS Letters, 2005
- Virtual screening of chemical librariesNature, 2004
- An Extensive Test of 14 Scoring Functions Using the PDBbind Refined Set of 800 Protein−Ligand ComplexesJournal of Chemical Information and Computer Sciences, 2004
- Novel Scoring Functions Comprising QXP, SASA, and Protein Side-Chain Entropy TermsJournal of Chemical Information and Computer Sciences, 2004
- A new test set for validating predictions of protein–ligand interactionProteins-Structure Function and Bioinformatics, 2002
- The Protein Data BankNucleic Acids Research, 2000
- SCORE: A New Empirical Method for Estimating the Binding Affinity of a Protein-Ligand ComplexJournal of Molecular Modeling, 1998
- Development and validation of a genetic algorithm for flexible dockingJournal of Molecular Biology, 1997
- Free energy calculations: Applications to chemical and biochemical phenomenaChemical Reviews, 1993
- Validation of the general purpose tripos 5.2 force fieldJournal of Computational Chemistry, 1989