QSAR Applicability Domain Estimation by Projection of the Training Set in Descriptor Space: A Review
Top Cited Papers
Open Access
- 1 October 2005
- journal article
- review article
- Published by SAGE Publications in Alternatives to Laboratory Animals
- Vol. 33 (5), 445-459
- https://doi.org/10.1177/026119290503300508
Abstract
As the use of Quantitative Structure Activity Relationship (QSAR) models for chemical management increases, the reliability of the predictions from such models is a matter of growing concern. The OECD QSAR Validation Principles recommend that a model should be used within its applicability domain (AD). The Setubal Workshop report provided conceptual guidance on defining a (Q)SAR AD, but it is difficult to use directly. The practical application of the AD concept requires an operational definition that permits the design of an automatic (computerised), quantitative procedure to determine a model's AD. An attempt is made to address this need, and methods and criteria for estimating AD through training set interpolation in descriptor space are reviewed. It is proposed that response space should be included in the training set representation. Thus, training set chemicals are points in n-dimensional descriptor space and m-dimensional model response space. Four major approaches for estimating interpolation regions in a multivariate space are reviewed and compared: range, distance, geometrical, and probability density distribution.Keywords
This publication has 17 references indexed in Scilit:
- An Approach to Determining Applicability Domains for QSAR Group Contribution Models: An Analysis of SRC KOWWINAlternatives to Laboratory Animals, 2005
- Molecular similarity: a key technique in molecular informaticsOrganic & Biomolecular Chemistry, 2004
- Similarity to Molecules in the Training Set Is a Good Discriminator for Prediction Accuracy in QSARJournal of Chemical Information and Computer Sciences, 2004
- Assessment of Prediction Confidence and Domain Extrapolation of Two Structure–Activity Relationship Models for Predicting Estrogen Receptor Binding ActivityEnvironmental Health Perspectives, 2004
- Approaches to Measure Chemical Similarity – a ReviewQSAR & Combinatorial Science, 2003
- The Importance of Being Earnest: Validation is the Absolute Essential for Successful Application and Interpretation of QSPR ModelsQSAR & Combinatorial Science, 2003
- Do Structurally Similar Molecules Have Similar Biological Activity?Journal of Medicinal Chemistry, 2002
- Transformation of mutagenic aromatic amines into non-mutagenic species by alkyl substituents: Part I. Alkylation ortho to the amino functionMutation Research/Genetic Toxicology and Environmental Mutagenesis, 2001
- A QSAR investigation of the role of hydrophobicity in regulating mutagenicity in the ames test: 1. Mutagenicity of aromatic and heteroaromatic amines in Salmonella typhimurium TA98 and TA100Environmental and Molecular Mutagenesis, 1992
- Exploratory Projection PursuitJournal of the American Statistical Association, 1987