Comparative Assessment of Scoring Functions on a Diverse Test Set

9 April 2009

journal article
research article
Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling

Vol. 49 (4), 1079-1093
https://doi.org/10.1021/ci9000053

Abstract

Scoring functions are widely applied to the evaluation of protein-ligand binding in structure-based drug design. We have conducted a comparative assessment of 16 popular scoring functions implemented in main-stream commercial software or released by academic research groups. A set of 195 diverse protein-ligand complexes with high-resolution crystal structures and reliable binding constants were selected through a systematic nonredundant sampling of the PDBbind database and used as the primary test set in our study. All scoring functions were evaluated in three aspects, that is, "docking power", "ranking power", and "scoring power", and all evaluations were independent from the context of molecular docking or virtual screening. As for "docking power", six scoring functions, including GOLD::ASP, DS::PLP1, DrugScore(PDB), GlideScore-SP, DS::LigScore, and GOLD::ChemScore, achieved success rates over 70% when the acceptance cutoff was root-mean-square deviation < 2.0 A. Combining these scoring functions into consensus scoring schemes improved the success rates to 80% or even higher. As for "ranking power" and "scoring power", the top four scoring functions on the primary test set were X-Score, DrugScore(CSD), DS::PLP, and SYBYL::ChemScore. They were able to correctly rank the protein-ligand complexes containing the same type of protein with success rates around 50%. Correlation coefficients between the experimental binding constants and the binding scores computed by these scoring functions ranged from 0.545 to 0.644. Besides the primary test set, each scoring function was also tested on four additional test sets, each consisting of a certain number of protein-ligand complexes containing one particular type of protein. Our study serves as an updated benchmark for evaluating the general performance of today's scoring functions. Our results indicate that no single scoring function consistently outperforms others in all three aspects. Thus, it is important in practice to choose the appropriate scoring functions for different purposes.

Keywords

This publication has 84 references indexed in Scilit:

A Critical Assessment of Docking Programs and Scoring Functions
Journal of Medicinal Chemistry, 2005
The Many Roles of Computation in Drug Discovery
Science, 2004
Recent advances in structure-based rational drug design
Current Opinion in Structural Biology, 2000
Recent developments in structure-based drug design
Journal of Molecular Medicine, 2000
Structure-based drug design
Current Opinion in Biotechnology, 1998
STRUCTURE-BASED DRUG DESIGN:Computational Advances
Annual Review of Pharmacology and Toxicology, 1997
Signalling the fat controller
Nature, 1996
Structure-based drug design
Current Opinion in Structural Biology, 1994
Structure-based drug design: progress, results and challenges
Structure, 1994
Structures-based drug design ten years on
Nature Structural & Molecular Biology, 1994

Cited by 416 articles