Gene selection and classification for cancer microarray data based on machine learning and similarity measures
Open Access
- 23 December 2011
- journal article
- Published by Springer Science and Business Media LLC in BMC Genomics
- Vol. 12 (S5), S1
- https://doi.org/10.1186/1471-2164-12-s5-s1
Abstract
Microarray data have a high dimension of variables and a small sample size. In microarray data analyses, two important issues are how to choose genes, which provide reliable and good prediction for disease status, and how to determine the final gene set that is best for classification. Associations among genetic markers mean one can exploit information redundancy to potentially reduce classification cost in terms of time and money.This publication has 31 references indexed in Scilit:
- Feature Selection and Classification of MAQC-II Breast Cancer and Multiple Myeloma Microarray Gene Expression DataPLOS ONE, 2009
- Feature mining and pattern classification for steganalysis of LSB matching steganography in grayscale imagesPattern Recognition, 2008
- A distribution free summarization method for Affymetrix GeneChip® arraysBioinformatics, 2006
- Standards for systems biologyNature Reviews Genetics, 2006
- A new algorithm for comparing and visualizing relationships between hierarchical and flat gene expression data clusteringsBioinformatics, 2005
- Bayesian neural network approaches to ovarian cancer identification from high-resolution mass spectrometry dataBioinformatics, 2005
- Systematic benchmarking of microarray data classification: assessing the role of non-linearity and dimensionality reductionBioinformatics, 2004
- Gene expression profiling predicts clinical outcome of breast cancerNature, 2002
- Computational analysis of microarray dataNature Reviews Genetics, 2001
- Withdrawing an example from the training set: An analytic estimation of its effect on a non-linear parameterised modelNeurocomputing, 2000