Comparative study of unsupervised dimension reduction techniques for the visualization of microarray gene expression data
Open Access
- 18 November 2010
- journal article
- research article
- Published by Springer Science and Business Media LLC in BMC Bioinformatics
- Vol. 11 (1), 567
- https://doi.org/10.1186/1471-2105-11-567
Abstract
Visualization of DNA microarray data in two or three dimensional spaces is an important exploratory analysis step in order to detect quality issues or to generate new hypotheses. Principal Component Analysis (PCA) is a widely used linear method to define the mapping between the high-dimensional data and its low-dimensional representation. During the last decade, many new nonlinear methods for dimension reduction have been proposed, but it is still unclear how well these methods capture the underlying structure of microarray gene expression data. In this study, we assessed the performance of the PCA approach and of six nonlinear dimension reduction methods, namely Kernel PCA, Locally Linear Embedding, Isomap, Diffusion Maps, Laplacian Eigenmaps and Maximum Variance Unfolding, in terms of visualization of microarray data.Keywords
This publication has 45 references indexed in Scilit:
- Quantitative comparison of microarray experiments with published leukemia related gene expression signaturesBMC Bioinformatics, 2009
- The ordering of expression among a few genes can provide simple cancer biomarkers and signal BRCA1 mutationsBMC Bioinformatics, 2009
- Integrative disease classification based on cross-platform microarray dataBMC Bioinformatics, 2009
- Identification of novel DNA repair proteins via primary sequence, secondary structure, and homologyBMC Bioinformatics, 2009
- Prediction of molecular subtypes in acute myeloid leukemia based on gene expression profilingHaematologica, 2009
- Dimension Reduction for Classification with Gene Expression Microarray DataStatistical Applications in Genetics and Molecular Biology, 2006
- Classifying Gene Expression Profiles from Pairwise mRNA ComparisonsStatistical Applications in Genetics and Molecular Biology, 2004
- PLS Dimension Reduction for Classification with Microarray DataStatistical Applications in Genetics and Molecular Biology, 2004
- Distinct types of diffuse large B-cell lymphoma identified by gene expression profilingNature, 2000
- Analysis of a complex of statistical variables into principal components.Journal of Educational Psychology, 1933