Generative chemistry: drug discovery with deep learning generative models
- 4 February 2021
- journal article
- review article
- Published by Springer Science and Business Media LLC in Journal of Molecular Modeling
- Vol. 27 (3), 1-18
- https://doi.org/10.1007/s00894-021-04674-8
Abstract
The de novo design of molecular structures using deep learning generative models offers an encouraging approach to drug discovery in the face of the continuously increasing cost of new drug development. From the generation of original texts, images, and videos to the design of novel molecular structures from scratch, the creativity of deep learning generative models shows the heights machine intelligence can reach. The purpose of this paper is to review the latest advances in generative chemistry, which relies on generative modeling to expedite the drug discovery process. This review starts with a brief history of artificial intelligence in drug discovery to outline this emerging paradigm. Commonly used chemical databases, molecular representations, and tools in cheminformatics and machine learning are covered as the infrastructure for generative chemistry. The review then focuses on detailed discussions of cutting-edge generative architectures for compound generation, including recurrent neural networks, variational autoencoders, adversarial autoencoders, and generative adversarial networks. Challenges and future perspectives follow.
Funding Information
- National Institute on Drug Abuse (P30 DA035778A1)
- U.S. Department of Defense (W81XWH-16-1-0490)