cando.py: Open Source Software for Predictive Bioanalytics of Large Scale Drug-Protein-Disease Data

28 September 2020

journal article
research article
Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling

Vol. 60 (9), 4131-4136
https://doi.org/10.1021/acs.jcim.0c00110

Abstract

Traditional drug discovery methods focus on optimizing the efficacy of a drug against a single biological target of interest for a specific disease. However, evidence supports the multitarget theory, i.e., drugs work by exerting their therapeutic effects via interaction with multiple biological targets, which have multiple phenotypic effects. Analytics of drug-protein interactions on a large proteomic scale provides insight into disease systems while also allowing for prediction of putative therapeutics against specific indications. We present a Python package for analysis of drug-proteome and drug-disease relationships implementing the Computational Analysis of Novel Drug Opportunities (CANDO) platform. The CANDO package allows for rapid drug similarity assessment, most notably via an in-house interaction scoring protocol where billions of drug-protein interactions are rapidly scored and the similarity of drug-proteome interaction signatures is calculated. The package also implements a variety of benchmarking protocols for shotgun drug discovery and repurposing, i.e., to determine how every known drug is related to every other in the context of the indications/diseases for which they are approved. Drug predictions are generated through consensus scoring of the most similar compounds to drugs known to treat a particular indication. Support for comparing and ranking novel chemical entities, as well as machine learning modules for both benchmarking and putative drug candidate prediction is also available. The CANDO Python package is available on GitHub at https://github.com/ram-compbio/CANDO, through the Conda Python package installer, and at http://compbio.org/software/.

Funding Information

National Center for Advancing Translational Sciences (UL1TR001412, UL1TR002529)
National Cancer Institute (P30CA023168)
NIH Office of the Director (DP1OD006779)
U.S. National Library of Medicine (T15LM012495)

This publication has 37 references indexed in Scilit:

Prediction of novel drug indications using network driven biological data prioritization and integration
Journal of Cheminformatics, 2014
Molecular Similarity in Medicinal Chemistry
Journal of Medicinal Chemistry, 2013
Protein–ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment
Bioinformatics, 2013
Computational Multitarget Drug Discovery
Published by Wiley ,2012
Drug Effect Prediction by Polypharmacology-Based Interaction Profiling
Journal of Chemical Information and Modeling, 2011
Ribavirin Treatment Effects on Breast Cancers Overexpressing eIF4E, a Biomarker with Prognostic Specificity for Luminal B-Type Breast Cancer
Clinical Cancer Research, 2011
Drug Discovery Using Chemical Systems Biology: Weak Inhibition of Multiple Kinases May Contribute to the Anti-Cancer Effect of Nelfinavir
PLoS Computational Biology, 2011
Molecular targeting of the oncogene eIF4E in acute myeloid leukemia (AML): a proof-of-principle clinical trial with ribavirin
Blood, 2009
The Protein Data Bank, 1999–
Published by International Union of Crystallography (IUCr) ,2006
Efficacy of Proton Pump Inhibitor Drugs against Plasmodium falciparum In Vitro and Their Probable Pharmacophores
Antimicrobial Agents and Chemotherapy, 2002

Cited by 19 articles