Comparison of association methods for dense marker data
- 12 June 2008
- journal article
- research article
- Published by Wiley in Genetic Epidemiology
- Vol. 32 (8), 791-799
- https://doi.org/10.1002/gepi.20347
Abstract
While data sets based on dense genome scans are becoming increasingly common, there are many theoretical questions that remain unanswered. How can a large number of markers in high linkage disequilibrium (LD) and rare disease variants be simulated efficiently? How should markers in high LD be analyzed: individually or jointly? Are there fast and simple methods to adjust for correlation of tests? What is the power penalty for conservative Bonferroni adjustments? Assuming that association scans are adequately powered, we attempt to answer these questions. Performance of single‐point and multipoint tests, and their hybrids, is investigated using two simulation designs. The first simulation design uses theoretically derived LD patterns. The second design uses LD patterns based on real data. For the theoretical simulations we used polychoric correlation as a measure of LD to facilitate simulation of markers in LD and rare disease variants. Based on the simulation results of the two studies, we conclude that statistical tests assuming only additive genotype effects (i.e. Armitage and especially multipoint T2) should be used cautiously due to their suboptimal power in certain settings. A false discovery rate (FDR)‐adjusted combination of tests for additive, dominant and recessive effects had close to optimal power. However, the common genotypic χ2 test performed adequately and could be used in lieu of the FDR combination. While some hybrid methods yield (sometimes spectacularly) higher power they are computationally intensive. We also propose an “exact” method to adjust for multiple testing, which yields nominally higher power than the Bonferroni correction. Genet. Epidemiol. 2008.Keywords
This publication has 24 references indexed in Scilit:
- Genetic model testing and statistical power in population‐based association studies of quantitative traitsGenetic Epidemiology, 2007
- A Fast Method for Computing High-Significance Disease Association in Large Population-Based StudiesAmerican Journal of Human Genetics, 2006
- High-Resolution Association Mapping of Quantitative Trait Loci: A Population-Based ApproachGenetics, 2006
- Complement Factor H Polymorphism in Age-Related Macular DegenerationScience, 2005
- Sibship T2 association tests of complex diseases for tightly linked markersHuman Genomics, 2005
- An efficient Monte Carlo approach to assessing statistical significance in genomic studiesBioinformatics, 2004
- Efficient Computation of Significance Levels for Multiple Associations in Large Studies of Correlated Data, Including Genomewide Association StudiesAmerican Journal of Human Genetics, 2004
- Detecting Disease Associations due to Linkage Disequilibrium Using Haplotype Tags: A Class of Tests and the Determinants of Statistical PowerHuman Heredity, 2003
- Genome Association Studies of Complex Diseases by Case-Control DesignsAmerican Journal of Human Genetics, 2003
- Numerical computation of multivariatet-probabilities with application to power calculation of multiple contrastsJournal of Statistical Computation and Simulation, 1999