Synergistic Use of Compound Properties and Docking Scores in Neural Network Modeling of CYP2D6 Binding: Predicting Affinity and Conformational Sampling
- 18 October 2006
- journal article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling
- Vol. 46 (6), 2698-2708
- https://doi.org/10.1021/ci600267k
Abstract
Cytochrome P450 2D6 (CYP2D6) is used to develop an approach for predicting affinity and relevant binding conformation(s) for highly flexible binding sites. The approach combines the use of docking scores and compound properties as attributes in building a neural network (NN) model. It begins by identifying segments of CYP2D6 that are important for binding specificity, based on structural variability among diverse CYP enzymes. A family of distinct, low-energy conformations of CYP2D6 is generated using simulated annealing (SA), and a collection of 82 compounds with known CYP2D6 affinities is docked. Interestingly, docking poses are observed on the backside of the heme as well as in the known active site. Docking scores for the active site binders, along with compound-specific attributes, are used to train a neural network model to properly bin compounds as strong binders, moderate binders, or nonbinders. Attribute selection is used to preselect the most important scores and compound-specific attributes for the model. A prediction accuracy of 85 ± 6% is achieved. Dominant attributes include docking scores for three of the 20 conformations in the ensemble as well as the compound's formal charge, number of aromatic rings, and AlogP. Although compound properties were highly predictive attributes (12% improvement over baseline) in the NN-based prediction of CYP2D6 binders, their combined use with docking score attributes is synergistic (net increase of 23% above baseline). Beyond prediction of affinity, attribute selection provides a way to identify the most relevant protein conformation(s), in terms of binding competence. In the case of CYP2D6, three out of the ensemble of 20 SA-generated structures are found to be the most predictive for binding.
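The attribute-building and attribute-selection steps described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the data are synthetic (not the 82 compounds from the study), the function names are hypothetical, the three "informative" conformation indices are arbitrary, and a simple between-class/within-class variance ratio stands in for the attribute-selection method, which the abstract does not specify. The neural network training step itself is not shown.

```python
import random
from statistics import mean, pvariance

N_CONFORMATIONS = 20  # ensemble size stated in the abstract
PROPERTY_NAMES = ["formal_charge", "n_aromatic_rings", "AlogP"]
CLASSES = ["strong", "moderate", "nonbinder"]  # the three bins in the abstract


def attribute_names():
    """One docking-score attribute per conformation, plus compound properties."""
    docking = [f"dock_conf_{i:02d}" for i in range(1, N_CONFORMATIONS + 1)]
    return docking + PROPERTY_NAMES


def make_synthetic_compounds(n=82, informative=(3, 7, 15), seed=0):
    """Build SYNTHETIC attribute vectors (assumption: not the paper's data).

    `informative` holds arbitrary zero-based conformation indices whose
    docking scores are made to depend on the class label; the abstract does
    not say which three conformations were selected.
    """
    rng = random.Random(seed)
    rows, labels = [], []
    for k in range(n):
        label = k % len(CLASSES)
        scores = [rng.gauss(0.0, 1.0) for _ in range(N_CONFORMATIONS)]
        for i in informative:
            scores[i] += 3.0 * label  # class-dependent shift
        # Properties are drawn as class-shifted Gaussians purely for illustration.
        props = [rng.gauss(2.0 * label, 1.0) for _ in PROPERTY_NAMES]
        rows.append(scores + props)
        labels.append(label)
    return rows, labels


def variance_ratio(values, labels):
    """Between-class variance divided by within-class variance for one attribute."""
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(label, []).append(value)
    grand_mean = mean(values)
    n = len(values)
    between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups.values()) / n
    within = sum(len(g) * pvariance(g) for g in groups.values()) / n
    return between / within


def rank_attributes(rows, labels):
    """Return (score, attribute name) pairs, most class-discriminating first."""
    scored = []
    for j, name in enumerate(attribute_names()):
        column = [row[j] for row in rows]
        scored.append((variance_ratio(column, labels), name))
    return sorted(scored, reverse=True)


if __name__ == "__main__":
    rows, labels = make_synthetic_compounds()
    for score, name in rank_attributes(rows, labels)[:6]:
        print(f"{name}: {score:.2f}")
```

In this toy setup the top-ranked attributes are the three class-dependent docking-score columns and the three compound properties, which mirrors the idea in the abstract that the selected docking-score attributes point to the most binding-relevant conformations. A real workflow would pass the selected attributes to a classifier and estimate accuracy by cross-validation.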