The quality of QSAR models: problems and solutions
- 1 January 2007
- journal article
- research article
- Published by Informa UK Limited in SAR and QSAR in Environmental Research
- Vol. 18 (1-2), 89-100
- https://doi.org/10.1080/10629360601053984
Abstract
The statistical quality of QSAR models is defined mainly by the assessment of goodness-of-fit and the confidence in predictivity (prediction power). This assessment has three parts: (1) measurement of goodness-of-fit, (2) validation of model stability, and (3) predictivity analysis. Currently there are no mandatory requirements for the validation methods to be used, nor rules for quantitative confidence estimates. To compare the statistical quality of QSAR models, an overall statistical quality index is needed that depends jointly on the goodness-of-fit, validation, and predictivity results. This requires defining a set of mandatory parameters for all three parts of the assessment and developing an approach for overall quality estimates based on these parameters. The overall index must also include a penalty mechanism for absent parameters. The goal of the present study is to analyse parameters for all three parts of the QSAR model statistical quality assessment and to investigate a flexible weighting approach for developing the overall statistical quality index. Because different statistical parameters are traditionally used to assess goodness-of-fit, a mechanism is needed that allows a flexible set of parameters to be used in the overall statistical quality index. The final set of mandatory parameters can be selected only after approval by the scientific community and regulatory boards.
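The abstract does not give the formula for the overall index, so the following is only a minimal sketch of one way a flexibly weighted index with a penalty for absent parameters could look. It assumes each parameter has already been scaled to the range 0 to 1, that the index is a weighted mean, and that a missing parameter is scored at a fixed penalty value instead of being skipped. The function name `overall_quality_index` and the parameter names `r2`, `q2_loo` and `r2_external` are hypothetical and are not taken from the paper.

```python
from typing import Mapping, Optional


def overall_quality_index(
    values: Mapping[str, Optional[float]],
    weights: Mapping[str, float],
    penalty: float = 0.0,
) -> float:
    """Weighted mean of quality parameters (illustrative assumption only).

    `weights` lists every parameter the index expects, with its weight.
    `values` holds the reported parameters, each assumed scaled to [0, 1].
    A parameter that is absent (or None) is scored as `penalty`, so an
    incomplete report lowers the index instead of being ignored.
    """
    if any(w < 0 for w in weights.values()):
        raise ValueError("weights must be non-negative")
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("at least one parameter needs a positive weight")

    score = 0.0
    for name, weight in weights.items():
        value = values.get(name)
        if value is None:
            # Penalty mechanism: an absent parameter gets the penalty score.
            value = penalty
        elif not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be scaled to [0, 1]")
        score += weight * value
    return score / total_weight


# Hypothetical weights: one parameter per part of the assessment
# (goodness-of-fit, stability validation, predictivity).
weights = {"r2": 1.0, "q2_loo": 1.0, "r2_external": 2.0}

complete = {"r2": 0.9, "q2_loo": 0.8, "r2_external": 0.7}
incomplete = {"r2": 0.9, "q2_loo": 0.8}  # no predictivity result reported

index_complete = overall_quality_index(complete, weights)
index_incomplete = overall_quality_index(incomplete, weights)
```

With these illustrative numbers the complete report scores about 0.775 and the report without an external predictivity result scores about 0.425, which shows how the penalty term separates two models with identical fit and cross-validation statistics. Because the weights are passed in as a mapping, the set of parameters can be changed without touching the function, which is one possible reading of the flexible parameter set the abstract calls for.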