cando.py: Open Source Software for Predictive Bioanalytics of Large Scale Drug-Protein-Disease Data
- 28 September 2020
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling
- Vol. 60 (9), 4131-4136
- https://doi.org/10.1021/acs.jcim.0c00110
Abstract
Traditional drug discovery methods focus on optimizing the efficacy of a drug against a single biological target of interest for a specific disease. However, evidence supports the multitarget theory, i.e., drugs work by exerting their therapeutic effects via interaction with multiple biological targets, which have multiple phenotypic effects. Analytics of drug-protein interactions on a large proteomic scale provides insight into disease systems while also allowing for prediction of putative therapeutics against specific indications. We present a Python package for analysis of drug-proteome and drug-disease relationships implementing the Computational Analysis of Novel Drug Opportunities (CANDO) platform. The CANDO package allows for rapid drug similarity assessment, most notably via an in-house interaction scoring protocol where billions of drug-protein interactions are rapidly scored and the similarity of drug-proteome interaction signatures is calculated. The package also implements a variety of benchmarking protocols for shotgun drug discovery and repurposing, i.e., to determine how every known drug is related to every other in the context of the indications/diseases for which they are approved. Drug predictions are generated through consensus scoring of the most similar compounds to drugs known to treat a particular indication. Support for comparing and ranking novel chemical entities, as well as machine learning modules for both benchmarking and putative drug candidate prediction is also available. The CANDO Python package is available on GitHub at https://github.com/ram-compbio/CANDO, through the Conda Python package installer, and at http://compbio.org/software/.Funding Information
- National Center for Advancing Translational Sciences (UL1TR001412, UL1TR002529)
- National Cancer Institute (P30CA023168)
- NIH Office of the Director (DP1OD006779)
- U.S. National Library of Medicine (T15LM012495)
This publication has 37 references indexed in Scilit:
- Prediction of novel drug indications using network driven biological data prioritization and integrationJournal of Cheminformatics, 2014
- Molecular Similarity in Medicinal ChemistryJournal of Medicinal Chemistry, 2013
- Protein–ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignmentBioinformatics, 2013
- Computational Multitarget Drug DiscoveryPublished by Wiley ,2012
- Drug Effect Prediction by Polypharmacology-Based Interaction ProfilingJournal of Chemical Information and Modeling, 2011
- Ribavirin Treatment Effects on Breast Cancers Overexpressing eIF4E, a Biomarker with Prognostic Specificity for Luminal B-Type Breast CancerClinical Cancer Research, 2011
- Drug Discovery Using Chemical Systems Biology: Weak Inhibition of Multiple Kinases May Contribute to the Anti-Cancer Effect of NelfinavirPLoS Computational Biology, 2011
- Molecular targeting of the oncogene eIF4E in acute myeloid leukemia (AML): a proof-of-principle clinical trial with ribavirinBlood, 2009
- The Protein Data Bank, 1999–Published by International Union of Crystallography (IUCr) ,2006
- Efficacy of Proton Pump Inhibitor Drugs against Plasmodium falciparum In Vitro and Their Probable PharmacophoresAntimicrobial Agents and Chemotherapy, 2002