Experimental comparison of classifiers for breast cancer diagnosis

1 November 2012

conference paper
conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE) in 2012 Seventh International Conference on Computer Engineering & Systems (ICCES)

p. 180-185
https://doi.org/10.1109/icces.2012.6408508

Abstract

This paper presents a comparison among the different classifiers decision tree (J48), Multi-LayerPerception (MLP), Naive Bayes (NB), Sequential Minimal Optimization (SMO), and Instance Based for K-Nearest neighbor (IBK) on three different databases of breast cancer (Wisconsin Breast Cancer (WBC), Wisconsin Diagnosis Breast Cancer (WDBC) and Wisconsin Prognosis Breast Cancer (WPBC)) by using classification accuracy and confusion matrix based on 10-fold cross validation method. Also, we introduce a fusion at classification level between these classifiers to get the most suitable multi-classifier approach for each data set. The experimental results show that in the classification using fusion of MLP and J48 with the PCA is superior to the other classifiers using WBC data set. The PCA is used in WBC dataset as a features reduction transformation method in which combines a set of correlated features. The selected attributes are: Uniformity of Cell Size, Mitoses, Clump thickness, Bare Nuclei, Single Epithelial cell size, Marginal adhesion, Bland Chromatin and Class. In WDBC data set the results show that the classification using SMO only or using fusion of SMO and MLP or SMO and IBK is superior to the other classifiers. In WPBC data set the results show that the classification using fusion of MLP, J48, SMO and IBK is superior to the other classifiers. All experiments are conducted in WEKA data mining tool.

Keywords

This publication has 5 references indexed in Scilit:

Ensemble Decision Tree Classifier For Breast Cancer Data
International Journal of Information Technology Convergence and Services, 2012
Approach of Neural Network to Diagnose Breast Cancer on three different Data Set
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2009
Feature selection and classification using flexible neural tree
Neurocomputing, 2006
Supervised fuzzy clustering for the identification of fuzzy classifiers
Pattern Recognition Letters, 2003
The Nature of Statistical Learning Theory
Published by Springer Science and Business Media LLC ,1995

Cited by 36 articles