A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments
Open Access
- 1 February 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (3), 374-382
- https://doi.org/10.1093/bioinformatics/btm620
Abstract
Motivation: The proliferation of public data repositories creates a need for meta-analysis methods to efficiently evaluate, integrate and validate related datasets produced by independent groups. A t-based approach has been proposed to integrate effect size from multiple studies by modeling both intra- and between-study variation. Recently, a non-parametric ‘rank product’ method, which is derived based on biological reasoning of fold-change criteria, has been applied to directly combine multiple datasets into one meta study. Fisher's Inverse χ2 method, which only depends on P-values from individual analyses of each dataset, has been used in a couple of medical studies. While these methods address the question from different angles, it is not clear how they compare with each other. Results: We comparatively evaluate the three methods; t-based hierarchical modeling, rank products and Fisher's Inverse χ2 test with P-values from either the t-based or the rank product method. A simulation study shows that the rank product method, in general, has higher sensitivity and selectivity than the t-based method in both individual and meta-analysis, especially in the setting of small sample size and/or large between-study variation. Not surprisingly, Fisher's χ2 method highly depends on the method used in the individual analysis. Application to real datasets demonstrates that meta-analysis achieves more reliable identification than an individual analysis, and rank products are more robust in gene ranking, which leads to a much higher reproducibility among independent studies. Though t-based meta-analysis greatly improves over the individual analysis, it suffers from a potentially large amount of false positives when P-values serve as threshold. We conclude that careful meta-analysis is a powerful tool for integrating multiple array studies. Contact:fxhong@jimmy.harvard.edu Supplementary information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 32 references indexed in Scilit:
- RankProd: a bioconductor package for detecting differentially expressed genes in meta-analysisBioinformatics, 2006
- Combining Results of Microarray Experiments: A Rank Aggregation ApproachStatistical Applications in Genetics and Molecular Biology, 2006
- RANK-BASED METHODS AS A NON-PARAMETRIC ALTERNATIVE OF THE T-STATISTIC FOR THE ANALYSIS OF BIOLOGICAL MICROARRAY DATAJournal of Bioinformatics and Computational Biology, 2005
- Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experimentsFEBS Letters, 2004
- Bioconductor: open software development for computational biology and bioinformaticsGenome Biology, 2004
- Statistical issues and methods for meta-analysis of microarray data: a case study in prostate cancerFunctional & Integrative Genomics, 2003
- Combining multiple microarray studies and modeling interstudy variationBioinformatics, 2003
- Empirical Bayes Analysis of a Microarray ExperimentJournal of the American Statistical Association, 2001
- Meta-analysis in clinical trialsControlled Clinical Trials, 1986
- The Combination of Estimates from Different ExperimentsBiometrics, 1954