In silico Prediction of Chemical Ames Mutagenicity
- 17 October 2012
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling
- Vol. 52 (11), 2840-2847
- https://doi.org/10.1021/ci300400a
Abstract
Mutagenicity is one of the most important end points of toxicity. Due to high cost and laboriousness in experimental tests, it is necessary to develop robust in silico methods to predict chemical mutagenicity. In this paper, a comprehensive database containing 7617 diverse compounds, including 4252 mutagens and 3365 nonmutagens, was constructed. On the basis of this data set, high predictive models were then built using five machine learning methods, namely support vector machine (SVM), C4.5 decision tree (C4.5 DT), artificial neural network (ANN), k-nearest neighbors (kNN), and naïve Bayes (NB), along with five fingerprints, namely CDK fingerprint (FP), Estate fingerprint (Estate), MACCS keys (MACCS), PubChem fingerprint (PubChem), and Substructure fingerprint (SubFP). Performances were measured by cross validation and an external test set containing 831 diverse chemicals. Information gain and substructure analysis were used to interpret the models. The accuracies of fivefold cross validation were from 0.808 to 0.841 for top five models. The range of accuracy for the external validation set was from 0.904 to 0.980, which outperformed that of Toxtree. Three models (PubChem-kNN, MACCS-kNN, and PubChem-SVM) showed high and reliable predictive accuracy for the mutagens and nonmutagens and, hence, could be used in prediction of chemical Ames mutagenicity.Keywords
This publication has 38 references indexed in Scilit:
- ADMET Evaluation in Drug Discovery. 12. Development of Binary Classification Models for Prediction of hERG Potassium Channel BlockageMolecular Pharmaceutics, 2012
- LIBSVMACM Transactions on Intelligent Systems and Technology, 2011
- PaDEL‐descriptor: An open source software to calculate molecular descriptors and fingerprintsJournal of Computational Chemistry, 2010
- An open source multistep model to predict mutagenicity from statistical analysis and relevant structural alertsChemistry Central Journal, 2010
- Trust, But Verify: On the Importance of Chemical Structure Curation in Cheminformatics and QSAR Modeling ResearchJournal of Chemical Information and Modeling, 2010
- The application of discovery toxicology and pathology towards the design of safer pharmaceutical lead candidatesNature Reviews Drug Discovery, 2007
- Progress in QSAR toxicity screening of pharmaceutical impurities and other FDA regulated productsAdvanced Drug Delivery Reviews, 2007
- Novel 2D Fingerprints for Ligand-Based Virtual ScreeningJournal of Chemical Information and Modeling, 2006
- Computer‐assisted analysis of interlaboratory Ames test variabilityJournal of Toxicology and Environmental Health, 1988
- Methods for detecting carcinogens and mutagens with the salmonella/mammalian-microsome mutagenicity testMutation Research/Environmental Mutagenesis and Related Subjects, 1975