A Hadoop-Based Method to Predict Potential Effective Drug Combination
Open Access
- 23 July 2014
- journal article
- research article
- Published by Hindawi Limited in BioMed Research International
- Vol. 2014, 1-5
- https://doi.org/10.1155/2014/196858
Abstract
Combination drugs that impact multiple targets simultaneously are promising candidates for combating complex diseases due to their improved efficacy and reduced side effects. However, exhaustive screening of all possible drug combinations is extremely time-consuming and impractical. Here, we present a novel Hadoop-based approach to predict drug combinations by taking advantage of the MapReduce programming model, which leads to an improvement of scalability of the prediction algorithm. By integrating the gene expression data of multiple drugs, we constructed data preprocessing and the support vector machines and naïve Bayesian classifiers on Hadoop for prediction of drug combinations. The experimental results suggest that our Hadoop-based model achieves much higher efficiency in the big data processing steps with satisfactory performance. We believed that our proposed approach can help accelerate the prediction of potential effective drugs with the increasing of the combination number at an exponential rate in future. The source code and datasets are available upon request.Keywords
This publication has 21 references indexed in Scilit:
- Prediction of Effective Drug Combinations by Chemical Interaction, Protein Interaction and Target Enrichment of KEGG PathwaysBioMed Research International, 2013
- Prediction of Drug Combinations by Integrating Molecular and Pharmacological DataPLoS Computational Biology, 2011
- Exploiting a Reduced Set of Weighted Average Features to Improve Prediction of DNA-Binding Residues from 3D StructuresPLOS ONE, 2011
- Prediction of conformational B-cell epitopes from 3D structures by random forests with a distance-based featureBMC Bioinformatics, 2011
- An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformaticsBMC Bioinformatics, 2010
- A systems biology approach to identify effective cocktail drugsBMC Systems Biology, 2010
- APIS: accurate prediction of hot spots in protein interfaces by combining protrusion index with solvent accessibilityBMC Bioinformatics, 2010
- In silico feasibility of novel biodegradation pathways for 1,2,4-trichlorobenzeneBMC Systems Biology, 2010
- Closed-loop control of cellular functions using combinatory drugs guided by a stochastic search algorithmProceedings of the National Academy of Sciences, 2008
- Exploration, normalization, and summaries of high density oligonucleotide array probe level dataBiostatistics, 2003