A Distribution-Free Model for Longitudinal Metagenomic Count Data
Open Access
- 1 July 2022
- Vol. 13 (7), 1183
- https://doi.org/10.3390/genes13071183
Abstract
Longitudinal metagenomics has been widely studied over the past decade and offers valuable insight into microbial dynamics. Repeated measurements from the same subject are typically correlated. However, methods that assume independence among repeated measurements may yield incorrect inferences, and methods that do account for intra-sample correlation may not be applicable to count data. We propose a distribution-free approach, CorrZIDF, which extends the current method to model correlated zero-inflated metagenomic count data, offering a powerful and accurate solution for detecting significant features. The method can handle different working correlation structures without specifying the marginal distribution of the count data. Through simulation studies, we show the robustness of CorrZIDF when a working correlation structure is selected for repeated-measures studies to enhance the efficiency of estimation. We also compared four methods using two real datasets, and the newly proposed method identified more unique features that had previously been reported in the relevant research.
Funding Information
- National Institute of General Medical Sciences (1R01GM139829-01)
- National Institutes of Health (1P01AI148104-01A1)
- National Institute on Aging (U19AG065169)
- United States Department of Agriculture (ARZT-1361620-H22-149)