Predicting Drug Side Effects Using Data Analytics and the Integration of Multiple Data Sources
Open Access
- 21 September 2017
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Access
- Vol. 5, 20449-20462
- https://doi.org/10.1109/access.2017.2755045
Abstract
Automated computational approaches that use data from publicly available drug datasets have been proposed for the prediction of drug side effects. This paper presents a hybrid machine learning approach that constructs side effect classifiers using an appropriate set of data features. The approach takes a data analytics perspective: it investigates the effect of drug distribution in the feature space, categorizes side effects into several intervals, adopts a suitable strategy for each interval, and constructs data models accordingly. To verify the applicability of the method to side effect prediction, a series of experiments was conducted. The results showed that the approach was able to take into account the characteristics of different types of side effects and thereby achieve better predictive performance. Moreover, different feature selection schemes were coupled with the modeling methods to examine the corresponding effects. In addition, analyses were performed to investigate task difficulty in terms of data distance and similarity. Examples of visualized networks of associations between drugs and side effects are also discussed to further evaluate the results.
Funding Information
- Joint Project of National Sun Yat-sen University and Kaohsiung Medical University (NSYSU-KMU-105-P-020)