Comprehensive evaluation of ten docking programs on a diverse set of protein–ligand complexes: the prediction accuracy of sampling power and scoring power
Top Cited Papers
- 7 April 2016
- journal article
- research article
- Published by Royal Society of Chemistry (RSC) in Physical Chemistry Chemical Physics
- Vol. 18 (18), 12964-12975
- https://doi.org/10.1039/c6cp01555g
Abstract
As one of the most popular computational approaches in modern structure-based drug design, molecular docking can be used not only to identify the correct conformation of a ligand within the target binding pocket but also to estimate the strength of the interaction between a target and a ligand. Nowadays, as a variety of docking programs are available for the scientific community, a comprehensive understanding of the advantages and limitations of each docking program is fundamentally important to conduct more reasonable docking studies and docking-based virtual screening. In the present study, based on an extensive dataset of 2002 protein–ligand complexes from the PDBbind database (version 2014), the performance of ten docking programs, including five commercial programs (LigandFit, Glide, GOLD, MOE Dock, and Surflex-Dock) and five academic programs (AutoDock, AutoDock Vina, LeDock, rDock, and UCSF DOCK), was systematically evaluated by examining the accuracies of binding pose prediction (sampling power) and binding affinity estimation (scoring power). Our results showed that GOLD and LeDock had the best sampling power (GOLD: 59.8% accuracy for the top scored poses; LeDock: 80.8% accuracy for the best poses) and AutoDock Vina had the best scoring power (rp/rs of 0.564/0.580 and 0.569/0.584 for the top scored poses and best poses), suggesting that the commercial programs did not show the expected better performance than the academic ones. Overall, the ligand binding poses could be identified in most cases by the evaluated docking programs but the ranks of the binding affinities for the entire dataset could not be well predicted by most docking programs. However, for some types of protein families, relatively high linear correlations between docking scores and experimental binding affinities could be achieved. To our knowledge, this study has been the most extensive evaluation of popular molecular docking programs in the last five years. It is expected that our work can offer useful information for the successful application of these docking tools to different requirements and targets.Keywords
This publication has 57 references indexed in Scilit:
- CSAR Benchmark Exercise 2011–2012: Evaluation of Results from Docking and Relative Ranking of Blinded Congeneric SeriesJournal of Chemical Information and Modeling, 2013
- Latest developments in molecular docking: 2010–2011 in reviewJournal of Molecular Recognition, 2013
- Variability in docking success rates due to dataset preparationJournal of Computer-Aided Molecular Design, 2012
- Advances and Challenges in Protein-Ligand DockingInternational Journal of Molecular Sciences, 2010
- Effect of Input Differences on the Results of Docking CalculationsJournal of Chemical Information and Modeling, 2009
- AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibilityJournal of Computational Chemistry, 2009
- Towards the development of universal, fast and highly accurate docking/scoring methods: a long way to goBritish Journal of Pharmacology, 2008
- UCSF Chimera?A visualization system for exploratory research and analysisJournal of Computational Chemistry, 2004
- The Many Roles of Computation in Drug DiscoveryScience, 2004
- Development and validation of a genetic algorithm for flexible dockingJournal of Molecular Biology, 1997