Straightforward Recursive Partitioning Model for Discarding Insoluble Compounds in the Drug Discovery Process
- 18 April 2008
- journal article
- Published by American Chemical Society (ACS) in Journal of Medicinal Chemistry
- Vol. 51 (10), 2891-2897
- https://doi.org/10.1021/jm701407x
Abstract
Poor aqueous solubility is one of the major issues in drug discovery and development, impacting negatively on all aspects of the research and development process. The pharmaceutical industry has realized that solubility issues need to be resolved at the discovery stage. We here present an innovative way to address this problem via a model designed to address the simple question, “Is the compound likely to be sufficiently soluble to provide interpretable data in biological screening assays?” A recursive partitioning (RP) method was applied to a set of 3563 molecules, with in house determined aqueous solubility values. Five models were generated on the basis of a small number of descriptors affording intuitive information regarding structural features influencing solubility. The final model was based on only two descriptors: the molecular weight (MW) and the aromatic proportion (AP). This model provided satisfactory values of accuracy (81%) and precision (75%) for a test set of 1200 compounds, suggesting that the model may add value in compound selection and library design during early drug discovery.Keywords
This publication has 24 references indexed in Scilit:
- Accurate Solubility Prediction with Error Bars for Electrolytes: A Machine Learning ApproachJournal of Chemical Information and Modeling, 2007
- Solubility: it's not just for physical chemistsDrug Discovery Today, 2006
- Global and Local Computational Models for Aqueous Solubility Prediction of Drug-Like MoleculesJournal of Chemical Information and Computer Sciences, 2004
- Prediction of Aqueous Solubility of Organic Compounds by Topological DescriptorsQSAR & Combinatorial Science, 2003
- Prediction of Aqueous Solubility and Partition Coefficient Optimized by a Genetic Algorithm Based Descriptor Selection MethodJournal of Chemical Information and Computer Sciences, 2003
- Prediction of Aqueous Solubility of Organic Compounds Based on a 3D Structure RepresentationJournal of Chemical Information and Computer Sciences, 2002
- Development of Quantitative Structure−Property Relationship Models for Early ADME Evaluation in Drug Discovery. 1. Aqueous SolubilityJournal of Chemical Information and Computer Sciences, 2001
- Search for Predictive Generic Model of Aqueous Solubility Using Bayesian Neural NetsJournal of Chemical Information and Computer Sciences, 2001
- A Fuzzy ARTMAP Based on Quantitative Structure−Property Relationships (QSPRs) for Predicting Aqueous Solubility of Organic CompoundsJournal of Chemical Information and Computer Sciences, 2001
- Estimation of Aqueous Solubility for a Diverse Set of Organic Compounds Based on Molecular TopologyJournal of Chemical Information and Computer Sciences, 2000