Two-stage designs for experiments with a large number of hypotheses
Open Access
- 9 August 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (19), 3771-3777
- https://doi.org/10.1093/bioinformatics/bti604
Abstract
Motivation: When a large number of hypotheses are investigated the false discovery rate (FDR) is commonly applied in gene expression analysis or gene association studies. Conventional single-stage designs may lack power due to low sample sizes for the individual hypotheses. We propose two-stage designs where the first stage is used to screen the ‘promising’ hypotheses which are further investigated at the second stage with an increased sample size. A multiple test procedure based on sequential individual P-values is proposed to control the FDR for the case of independent normal distributions with known variance. Results: The power of optimal two-stage designs is impressively larger than the power of the corresponding single–stage design with equal costs. Extensions to the case of unknown variances and correlated test statistics are investigated by simulations. Moreover, it is shown that the simple multiple test procedure using first stage data for screening purposes and deriving the test decisions only from second stage data is a very powerful option. Availability: An R-program is available at http://www.meduniwien.ac.at/medstat/research/fdr/application.R Contact:Martin.Posch@meduniwien.ac.at Supplementary information: Supplementary data for this paper is available at Bioinformatics online.Keywords
This publication has 8 references indexed in Scilit:
- Two‐Stage Designs for Gene–Disease Association StudiesBiometrics, 2004
- Strong Control, Conservative Point Estimation and Simultaneous Conservative Consistency of False Discovery Rates: A Unified ApproachJournal of the Royal Statistical Society Series B: Statistical Methodology, 2003
- Optimal two‐stage genotyping in population‐based association studiesGenetic Epidemiology, 2003
- Identifying differentially expressed genes using false discovery rate controlling proceduresBioinformatics, 2003
- Multiple Hypothesis Testing in Microarray ExperimentsStatistical Science, 2003
- A Direct Approach to False Discovery RatesJournal of the Royal Statistical Society Series B: Statistical Methodology, 2002
- Interpretation, Design, and Analysis of Gene Array Expression ExperimentsThe Journals of Gerontology: Series A, 2001
- Group sequential methods in the design and analysis of clinical trialsBiometrika, 1977