Robust Feature Selection for Microarray Data Based on Multicriterion Fusion
- 30 June 2011
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE/ACM Transactions on Computational Biology and Bioinformatics
- Vol. 8 (4), 1080-1092
- https://doi.org/10.1109/TCBB.2010.103
Abstract
Feature selection often aims to select a compact feature subset to build a pattern classifier with reduced complexity, so as to achieve improved classification performance. From the perspective of pattern analysis, producing stable or robust solution is also a desired property of a feature selection algorithm. However, the issue of robustness is often overlooked in feature selection. In this study, we analyze the robustness issue existing in feature selection for high-dimensional and small-sized gene-expression data, and propose to improve robustness of feature selection algorithm by using multiple feature selection evaluation criteria. Based on this idea, a multicriterion fusion-based recursive feature elimination (MCF-RFE) algorithm is developed with the goal of improving both classification performance and stability of feature selection results. Experimental studies on five gene-expression data sets show that the MCF-RFE algorithm outperforms the commonly used benchmark feature selection algorithm SVM-RFE.Keywords
This publication has 39 references indexed in Scilit:
- LIBSVMACM Transactions on Intelligent Systems and Technology, 2011
- A review of feature selection techniques in bioinformaticsBioinformatics, 2007
- The ties problem resulting from counting-based error estimators and its impact on gene selection algorithmsBioinformatics, 2006
- Semisupervised Learning for Molecular ProfilingIEEE/ACM Transactions on Computational Biology and Bioinformatics, 2005
- Optimal number of features as a function of sample size for various classification rulesBioinformatics, 2004
- Prediction of central nervous system embryonal tumour outcome based on gene expressionNature, 2002
- 10.1162/153244303322753616Applied Physics Letters, 2000
- Wrappers for feature subset selectionArtificial Intelligence, 1997
- The use of the area under the ROC curve in the evaluation of machine learning algorithmsPattern Recognition, 1997
- Support-vector networksMachine Learning, 1995