Comprehensive evaluation of ten docking programs on a diverse set of protein–ligand complexes: the prediction accuracy of sampling power and scoring power

Top Cited Papers

7 April 2016

journal article
research article
Published by Royal Society of Chemistry (RSC) in Physical Chemistry Chemical Physics

Vol. 18 (18), 12964-12975
https://doi.org/10.1039/c6cp01555g

Abstract

As one of the most popular computational approaches in modern structure-based drug design, molecular docking can be used not only to identify the correct conformation of a ligand within the target binding pocket but also to estimate the strength of the interaction between a target and a ligand. Nowadays, as a variety of docking programs are available for the scientific community, a comprehensive understanding of the advantages and limitations of each docking program is fundamentally important to conduct more reasonable docking studies and docking-based virtual screening. In the present study, based on an extensive dataset of 2002 protein–ligand complexes from the PDBbind database (version 2014), the performance of ten docking programs, including five commercial programs (LigandFit, Glide, GOLD, MOE Dock, and Surflex-Dock) and five academic programs (AutoDock, AutoDock Vina, LeDock, rDock, and UCSF DOCK), was systematically evaluated by examining the accuracies of binding pose prediction (sampling power) and binding affinity estimation (scoring power). Our results showed that GOLD and LeDock had the best sampling power (GOLD: 59.8% accuracy for the top scored poses; LeDock: 80.8% accuracy for the best poses) and AutoDock Vina had the best scoring power (r_p/r_s of 0.564/0.580 and 0.569/0.584 for the top scored poses and best poses), suggesting that the commercial programs did not show the expected better performance than the academic ones. Overall, the ligand binding poses could be identified in most cases by the evaluated docking programs but the ranks of the binding affinities for the entire dataset could not be well predicted by most docking programs. However, for some types of protein families, relatively high linear correlations between docking scores and experimental binding affinities could be achieved. To our knowledge, this study has been the most extensive evaluation of popular molecular docking programs in the last five years. It is expected that our work can offer useful information for the successful application of these docking tools to different requirements and targets.

Keywords

This publication has 57 references indexed in Scilit:

CSAR Benchmark Exercise 2011–2012: Evaluation of Results from Docking and Relative Ranking of Blinded Congeneric Series
Journal of Chemical Information and Modeling, 2013
Latest developments in molecular docking: 2010–2011 in review
Journal of Molecular Recognition, 2013
Variability in docking success rates due to dataset preparation
Journal of Computer-Aided Molecular Design, 2012
Advances and Challenges in Protein-Ligand Docking
International Journal of Molecular Sciences, 2010
Effect of Input Differences on the Results of Docking Calculations
Journal of Chemical Information and Modeling, 2009
AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility
Journal of Computational Chemistry, 2009
Towards the development of universal, fast and highly accurate docking/scoring methods: a long way to go
British Journal of Pharmacology, 2008
UCSF Chimera?A visualization system for exploratory research and analysis
Journal of Computational Chemistry, 2004
The Many Roles of Computation in Drug Discovery
Science, 2004
Development and validation of a genetic algorithm for flexible docking
Journal of Molecular Biology, 1997

Cited by 656 articles