Fast Algorithms for Conducting Large-Scale GWAS of Age-at-Onset Traits Using Cox Mixed-Effects Models
- 1 May 2020
- journal article
- research article
- Published by Oxford University Press (OUP) in Genetics
- Vol. 215 (1), 41-58
- https://doi.org/10.1534/genetics.119.302940
Abstract
Age-at-onset (AAO) is one of the critical traits in cohort studies of age-related diseases. Large-scale genome-wide association studies (GWAS) of AAO traits can provide more insights into genetic effects on disease progression and transitions between stages. Moreover, proportional hazards (or Cox) regression models can achieve higher statistical power in a cohort study than a case-control trait using logistic regression. Although mixed-effects models are widely used in GWAS to correct for sample dependence, application of Cox mixed-effects models (CMEMs) to large-scale GWAS is so far hindered by intractable computational cost. In this work, we propose COXMEG, an efficient R package for conducting GWAS of AAO traits using CMEMs. COXMEG introduces fast estimation algorithms for general sparse relatedness matrices including but not limited to block-diagonal pedigree-based matrices. COXMEG also introduces a fast and powerful score test for dense relatedness matrices, accounting for both population stratification and family structure. In addition, COXMEG generalizes existing algorithms to support positive semidefinite relatedness matrices, which are common in twin and family studies. Our simulation studies suggest that COXMEG, depending on the structure of the relatedness matrix, is orders of magnitude computationally more efficient than coxme and coxph with frailty for GWAS. We found that using sparse approximation of relatedness matrices yielded highly comparable results in controlling false positive rate and retaining statistical power for an ethnically homogeneous family-based sample. By applying COXMEG to a study of Alzheimer's disease (AD) with an NIA-LOADFS sample comprising 3456 non-Hispanic whites and 287 African Americans, we identified the APOE ε4 variant with strong statistical power (p=1e-101), far more significant than that reported in a previous study using a transformed variable and a marginal Cox model. Furthermore, we identified a novel SNP rs36051450 (p=2e-9) near GRAMD1B, the minor allele of which significantly reduced the hazards of AD in both genders. These results demonstrated that COXMEG greatly facilitates the application of CMEMs in GWAS of AAO traits.Keywords
This publication has 71 references indexed in Scilit:
- Genome-wide efficient mixed-model analysis for association studiesNature Genetics, 2012
- Frailty modelling for survival data from multi‐centre clinical trialsStatistics in Medicine, 2011
- GCTA: A Tool for Genome-wide Complex Trait AnalysisAmerican Journal of Human Genetics, 2011
- Genome-wide association study of follicular lymphoma identifies a risk locus at 6p21.32Nature Genetics, 2010
- Variance component model to account for sample structure in genome-wide association studiesNature Genetics, 2010
- Analyses of the National Institute on Aging Late-Onset Alzheimer's Disease Family StudyArchives of Neurology, 2008
- Sparse inverse covariance estimation with the graphical lassoBiostatistics, 2007
- Correlated individual frailty: An advantageous approach to survival analysis of bivariate dataMathematical Population Studies, 1995
- Regression Analysis of Multivariate Incomplete Failure Time Data by Modeling Marginal DistributionsJournal of the American Statistical Association, 1989
- Longitudinal data analysis using generalized linear modelsBiometrika, 1986