Evaluation of TB Patients Characteristics Based on Predictive Data Mining Approaches
Open Access
- 1 January 2017
- journal article
- research article
- Published by Scientific Research Publishing, Inc. in Journal of Tuberculosis Research
- Vol. 05 (01), 13-22
- https://doi.org/10.4236/jtr.2017.51002
Abstract
According to the World Health Organization, tuberculosis (TB) is the leading cause of death among infectious diseases. Given the high percentage of people infected with tuberculosis and the high number of deaths among these patients, this prospective study aimed to categorize patients and find relationships between their clinical and demographic characteristics. The study was conducted on 600 patients from the Masih-e-Daneshvari tuberculosis research center during 2015-2016. K-Means clustering and decision tree data mining algorithms were used to perform the categorization and determine common indicators among patients. Two clusters were chosen as optimal according to the Dunn index. Common factors between clusters are provided in detail in the findings section. According to the clustering results, the most important factors include hemoglobin, age, sex, smoking, alcohol consumption, and creatinine. The RBF neural network tree achieved 98% accuracy. According to the results of this study, the most important factors identified are sex, smoking, alcohol consumption, WBC, and albumin.
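The abstract's cluster-selection step (run K-Means for several cluster counts and keep the count with the highest Dunn index) can be illustrated with a minimal sketch. This is not the authors' implementation: the patient data are not available here, so the example uses synthetic two-dimensional points, and the function names `kmeans`, `dunn_index`, and `best_k` are hypothetical. The Dunn index is computed in one common form (smallest between-cluster distance divided by largest within-cluster diameter); the paper may use a different variant.

```python
import math
import random


def kmeans(points, k, n_iter=50, seed=0):
    """Plain Lloyd's K-Means on a list of tuples; returns one label per point."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # initial centers are k distinct data points
    labels = [0] * len(points)
    for _ in range(n_iter):
        # Assignment step: nearest center by Euclidean distance.
        labels = [min(range(k), key=lambda c: math.dist(p, centers[c]))
                  for p in points]
        # Update step: move each center to the mean of its members.
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append(
                    tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centers.append(centers[c])  # keep an empty cluster's center
        if new_centers == centers:
            break  # converged
        centers = new_centers
    return labels


def dunn_index(points, labels):
    """Smallest between-cluster distance over largest within-cluster diameter."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    groups = list(clusters.values())
    if len(groups) < 2:
        return 0.0
    max_diameter = max(math.dist(a, b) for g in groups for a in g for b in g)
    min_separation = min(
        math.dist(a, b)
        for i, g in enumerate(groups) for h in groups[i + 1:]
        for a in g for b in h
    )
    return min_separation / max_diameter if max_diameter > 0 else float("inf")


def best_k(points, candidates=(2, 3, 4, 5)):
    """Score each candidate cluster count and return the one with the top Dunn index."""
    scores = {k: dunn_index(points, kmeans(points, k)) for k in candidates}
    return max(scores, key=scores.get), scores


# Synthetic stand-in data (assumption): two well-separated groups of 30 points.
_rng = random.Random(1)
toy = ([(_rng.gauss(0, 1), _rng.gauss(0, 1)) for _ in range(30)]
       + [(_rng.gauss(20, 1), _rng.gauss(20, 1)) for _ in range(30)])
chosen_k, dunn_scores = best_k(toy)
print(chosen_k, dunn_scores)
```

A higher Dunn index indicates compact, well-separated clusters, which is why the cluster count with the largest score is kept. With real clinical data, features such as hemoglobin, age, and creatinine would normally be standardized before clustering, since Euclidean distance is sensitive to scale.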