FOG: Fragment Optimized Growth Algorithm for the de Novo Generation of Molecules Occupying Druglike Chemical Space
- 15 June 2009
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling
- Vol. 49 (7), 1630-1642
- https://doi.org/10.1021/ci9000458
Abstract
An essential feature of all practical de novo molecule generating programs is the ability to focus the potential combinatorial explosion of grown molecules on a desired chemical space. It is a daunting task to balance the generation of new molecules with limitations on growth that produce desired features such as stability in water, synthetic accessibility, or drug-likeness. We have developed an algorithm, Fragment Optimized Growth (FOG), which statistically biases the growth of molecules with desired features. At the heart of the algorithm is a Markov Chain which adds fragments to the nascent molecule in a biased manner, depending on the frequency of specific fragment-fragment connections in the database of chemicals it was trained on. We show that in addition to generating synthetically feasible molecules, it can be trained to grow new molecules that resemble desired classes of molecules such as drugs, natural products, and diversity-oriented synthetic products. In order to classify our grown molecules, we developed the Topology Classifier (TopClass) algorithm that is capable of classifying compounds, for example as drugs or nondrugs. The classification accuracies obtained with TopClass compare favorably with the literature. Furthermore, in contrast to "black-box" approaches such as Neural Networks, TopClass brings to light characteristics of drugs that distinguish them from nondrugs.Keywords
This publication has 68 references indexed in Scilit:
- Virtual Screening Using Binary Kernel Discrimination: Effect of Noisy Training Data and the Optimization of PerformanceJournal of Chemical Information and Modeling, 2006
- Making “Real” Molecules in Virtual SpaceJournal of Chemical Information and Modeling, 2006
- Computer-aided design of non-nucleoside inhibitors of HIV-1 reverse transcriptaseBioorganic & Medicinal Chemistry Letters, 2005
- What Is the Smallest Saturated Acyclic Alkane that Cannot Be Made?Journal of Chemical Information and Modeling, 2004
- BOOMSLANG: A program for combinatorial structure generationJournal of Molecular Graphics, 1996
- A program for the FORWARD generation of synthetic routesJournal of Chemical Information and Computer Sciences, 1992
- A method for automatic generation of novel chemical structures and its potential applications to drug discoveryJournal of Chemical Information and Computer Sciences, 1991
- Automated structure design in 3DTetrahedron Computer Methodology, 1990
- Approaching the logic of synthesis designAccounts of Chemical Research, 1986
- Computer-assisted synthetic analysis. Selection of protective groups for multistep organic syntheses.The Journal of Organic Chemistry, 1985