Gene selection and classification for cancer microarray data based on machine learning and similarity measures

Open Access

23 December 2011

journal article
Published by Springer Science and Business Media LLC in BMC Genomics

Vol. 12 (S5), S1
https://doi.org/10.1186/1471-2164-12-s5-s1

Abstract

Microarray data have a high dimension of variables and a small sample size. In microarray data analyses, two important issues are how to choose genes, which provide reliable and good prediction for disease status, and how to determine the final gene set that is best for classification. Associations among genetic markers mean one can exploit information redundancy to potentially reduce classification cost in terms of time and money.

This publication has 31 references indexed in Scilit:

Feature Selection and Classification of MAQC-II Breast Cancer and Multiple Myeloma Microarray Gene Expression Data
PLOS ONE, 2009
Feature mining and pattern classification for steganalysis of LSB matching steganography in grayscale images
Pattern Recognition, 2008
A distribution free summarization method for Affymetrix GeneChip® arrays
Bioinformatics, 2006
Standards for systems biology
Nature Reviews Genetics, 2006
A new algorithm for comparing and visualizing relationships between hierarchical and flat gene expression data clusterings
Bioinformatics, 2005
Bayesian neural network approaches to ovarian cancer identification from high-resolution mass spectrometry data
Bioinformatics, 2005
Systematic benchmarking of microarray data classification: assessing the role of non-linearity and dimensionality reduction
Bioinformatics, 2004
Gene expression profiling predicts clinical outcome of breast cancer
Nature, 2002
Computational analysis of microarray data
Nature Reviews Genetics, 2001
Withdrawing an example from the training set: An analytic estimation of its effect on a non-linear parameterised model
Neurocomputing, 2000

Cited by 71 articles