Power and sample-size estimation for microbiome studies using pairwise distances and PERMANOVA
Open Access
- 24 April 2015
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 31 (15), 2461-2468
- https://doi.org/10.1093/bioinformatics/btv183
Abstract
Motivation: The variation in community composition between microbiome samples, termed beta diversity, can be measured by pairwise distance based on either presence–absence or quantitative species abundance data. PERMANOVA, a permutation-based extension of multivariate analysis of variance to a matrix of pairwise distances, partitions within-group and between-group distances to permit assessment of the effect of an exposure or intervention (grouping factor) upon the sampled microbiome. Within-group distance and exposure/intervention effect size must be accurately modeled to estimate statistical power for a microbiome study that will be analyzed with pairwise distances and PERMANOVA. Results: We present a framework for PERMANOVA power estimation tailored to marker-gene microbiome studies that will be analyzed by pairwise distances, which includes: (i) a novel method for distance matrix simulation that permits modeling of within-group pairwise distances according to pre-specified population parameters; (ii) a method to incorporate effects of different sizes within the simulated distance matrix; (iii) a simulation-based method for estimating PERMANOVA power from simulated distance matrices; and (iv) an R statistical software package that implements the above. Matrices of pairwise distances can be efficiently simulated to satisfy the triangle inequality and incorporate group-level effects, which are quantified by the adjusted coefficient of determination, omega-squared (). From simulated distance matrices, available PERMANOVA power or necessary sample size can be estimated for a planned microbiome study. Availability and implementation: http://github.com/brendankelly/micropower. Contact: brendank@mail.med.upenn.edu or hongzhe@upenn.edu
This publication has 24 references indexed in Scilit:
- phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census DataPLOS ONE, 2013
- Structure, function and diversity of the healthy human microbiomeNature, 2012
- A framework for human microbiome researchNature, 2012
- Linking Long-Term Dietary Patterns with Gut Microbial EnterotypesScience, 2011
- Disordered Microbial Communities in the Upper Respiratory Tract of Cigarette SmokersPLOS ONE, 2010
- UniFrac: an effective distance metric for microbial community comparisonThe ISME Journal, 2010
- QIIME allows analysis of high-throughput community sequencing dataNature Methods, 2010
- Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial CommunitiesApplied and Environmental Microbiology, 2009
- Quantitative and Qualitative β Diversity Measures Lead to Different Insights into Factors That Structure Microbial CommunitiesApplied and Environmental Microbiology, 2007
- UniFrac: a New Phylogenetic Method for Comparing Microbial CommunitiesApplied and Environmental Microbiology, 2005