An efficient method to identify differentially expressed genes in microarray experiments
Open Access
- 3 May 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (14), 1583-1589
- https://doi.org/10.1093/bioinformatics/btn215
Abstract
Motivation: Microarray experiments typically analyze thousands to tens of thousands of genes from small numbers of biological replicates. The fact that genes are normally expressed in functionally relevant patterns suggests that gene-expression data can be stratified and clustered into relatively homogenous groups. Cluster-wise dimensionality reduction should make it feasible to improve screening power while minimizing information loss. Results: We propose a powerful and computationally simple method for finding differentially expressed genes in small microarray experiments. The method incorporates a novel stratification-based tight clustering algorithm, principal component analysis and information pooling. Comprehensive simulations show that our method is substantially more powerful than the popular SAM and eBayes approaches. We applied the method to three real microarray datasets: one from a Populus nitrogen stress experiment with 3 biological replicates; and two from public microarray datasets of human cancers with 10 to 40 biological replicates. In all three analyses, our method proved more robust than the popular alternatives for identification of differentially expressed genes. Availability: The C++ code to implement the proposed method is available upon request for academic use. Contact: shuzhang@mtu.edu Supplementary information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 29 references indexed in Scilit:
- Assessing Differential Gene Expression with Small Sample Sizes in Oligonucleotide Arrays Using a Mean‐Variance ModelBiometrics, 2007
- Complex trait analysis of gene expression uncovers polygenic and pleiotropic networks that modulate nervous system functionNature Genetics, 2005
- Randomization Tests for Small Samples: An Application for Genetic Expression DataJournal of the Royal Statistical Society Series C: Applied Statistics, 2003
- Multiple Hypothesis Testing in Microarray ExperimentsStatistical Science, 2003
- Genetic Dissection of Transcriptional Regulation in Budding YeastScience, 2002
- Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression MonitoringScience, 1999
- Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arraysProceedings of the National Academy of Sciences of the United States of America, 1999
- Ratio-based decisions and the quantitative analysis of cDNA microarray imagesJournal of Biomedical Optics, 1997
- Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple TestingJournal of the Royal Statistical Society: Series B (Methodological), 1995
- Post StratificationJournal of the Royal Statistical Society. Series A (General), 1979