Comparative Assessment of Scoring Functions on a Diverse Test Set
- 9 April 2009
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling
- Vol. 49 (4), 1079-1093
- https://doi.org/10.1021/ci9000053
Abstract
Scoring functions are widely applied to the evaluation of protein-ligand binding in structure-based drug design. We have conducted a comparative assessment of 16 popular scoring functions implemented in main-stream commercial software or released by academic research groups. A set of 195 diverse protein-ligand complexes with high-resolution crystal structures and reliable binding constants were selected through a systematic nonredundant sampling of the PDBbind database and used as the primary test set in our study. All scoring functions were evaluated in three aspects, that is, "docking power", "ranking power", and "scoring power", and all evaluations were independent from the context of molecular docking or virtual screening. As for "docking power", six scoring functions, including GOLD::ASP, DS::PLP1, DrugScore(PDB), GlideScore-SP, DS::LigScore, and GOLD::ChemScore, achieved success rates over 70% when the acceptance cutoff was root-mean-square deviation < 2.0 A. Combining these scoring functions into consensus scoring schemes improved the success rates to 80% or even higher. As for "ranking power" and "scoring power", the top four scoring functions on the primary test set were X-Score, DrugScore(CSD), DS::PLP, and SYBYL::ChemScore. They were able to correctly rank the protein-ligand complexes containing the same type of protein with success rates around 50%. Correlation coefficients between the experimental binding constants and the binding scores computed by these scoring functions ranged from 0.545 to 0.644. Besides the primary test set, each scoring function was also tested on four additional test sets, each consisting of a certain number of protein-ligand complexes containing one particular type of protein. Our study serves as an updated benchmark for evaluating the general performance of today's scoring functions. Our results indicate that no single scoring function consistently outperforms others in all three aspects. Thus, it is important in practice to choose the appropriate scoring functions for different purposes.Keywords
This publication has 84 references indexed in Scilit:
- A Critical Assessment of Docking Programs and Scoring FunctionsJournal of Medicinal Chemistry, 2005
- The Many Roles of Computation in Drug DiscoveryScience, 2004
- Recent advances in structure-based rational drug designCurrent Opinion in Structural Biology, 2000
- Recent developments in structure-based drug designJournal of Molecular Medicine, 2000
- Structure-based drug designCurrent Opinion in Biotechnology, 1998
- STRUCTURE-BASED DRUG DESIGN:Computational AdvancesAnnual Review of Pharmacology and Toxicology, 1997
- Signalling the fat controllerNature, 1996
- Structure-based drug designCurrent Opinion in Structural Biology, 1994
- Structure-based drug design: progress, results and challengesStructure, 1994
- Structures-based drug design ten years onNature Structural & Molecular Biology, 1994