Methods for testing association between uncertain genotypes and quantitative traits
Open Access
- 11 June 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Biostatistics
- Vol. 12 (1), 1-17
- https://doi.org/10.1093/biostatistics/kxq039
Abstract
Interpretability and power of genome-wide association studies can be increased by imputing unobserved genotypes, using a reference panel of individuals genotyped at higher marker density. For many markers, genotypes cannot be imputed with complete certainty, and the uncertainty needs to be taken into account when testing for association with a given phenotype. In this paper, we compare currently available methods for testing association between uncertain genotypes and quantitative traits. We show that some previously described methods offer poor control of the false-positive rate (FPR), and that satisfactory performance of these methods is obtained only by using ad hoc filtering rules or by using a harsh transformation of the trait under study. We propose new methods that are based on exact maximum likelihood estimation and use a mixture model to accommodate nonnormal trait distributions when necessary. The new methods adequately control the FPR and also have equal or better power compared to all previously described methods. We provide a fast software implementation of all the methods studied here; our new method requires computation time of less than one computer-day for a typical genome-wide scan, with 2.5 M single nucleotide polymorphisms and 5000 individuals.
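The abstract gives no formulas, so the following is only a minimal sketch of one way a maximum likelihood test with uncertain genotypes can be set up. It is not the authors' software. It assumes an additive genotype coding (0, 1, 2), a normally distributed trait, and a per-individual vector of imputed genotype probabilities. Each individual's likelihood is then a mixture of three normal densities weighted by those probabilities, fitted here with an EM algorithm. The function names (`normal_pdf`, `mixture_loglik`, `fit_em`, `lrt_statistic`) are hypothetical, and the sketch does not cover the mixture model for nonnormal traits mentioned above.

```python
import numpy as np


def normal_pdf(y, mean, sd):
    """Density of a normal distribution, evaluated elementwise."""
    return np.exp(-0.5 * ((y - mean) / sd) ** 2) / (sd * np.sqrt(2 * np.pi))


def mixture_loglik(y, probs, mu, beta, sigma):
    """Log-likelihood where each individual is a mixture over genotypes 0/1/2.

    y     : (n,) trait values
    probs : (n, 3) imputed genotype probabilities (assumed input format)
    """
    g = np.arange(3.0)
    dens = normal_pdf(y[:, None], mu + beta * g[None, :], sigma)
    return np.sum(np.log(np.sum(probs * dens, axis=1)))


def fit_em(y, probs, n_iter=200):
    """EM estimates of (mu, beta, sigma) under the additive model."""
    g = np.arange(3.0)
    # Start at the null fit (beta = 0), so the likelihood can only increase.
    mu, beta, sigma = y.mean(), 0.0, y.std()
    n = len(y)
    for _ in range(n_iter):
        # E-step: posterior genotype weights given trait and current parameters.
        w = probs * normal_pdf(y[:, None], mu + beta * g[None, :], sigma)
        w /= w.sum(axis=1, keepdims=True)
        # M-step: weighted least squares over (individual, genotype) pairs.
        gbar = (w * g).sum() / n
        ybar = (w * y[:, None]).sum() / n
        sgg = (w * (g - gbar) ** 2).sum()
        sgy = (w * (g - gbar) * (y[:, None] - ybar)).sum()
        beta = sgy / sgg if sgg > 0 else 0.0
        mu = ybar - beta * gbar
        resid = y[:, None] - mu - beta * g
        sigma = np.sqrt((w * resid ** 2).sum() / n)
    return mu, beta, sigma


def lrt_statistic(y, probs):
    """Likelihood-ratio statistic for beta = 0, plus the estimated effect."""
    ll0 = mixture_loglik(y, probs, y.mean(), 0.0, y.std())
    mu, beta, sigma = fit_em(y, probs)
    ll1 = mixture_loglik(y, probs, mu, beta, sigma)
    return 2.0 * (ll1 - ll0), beta
```

Under the usual regularity conditions, the statistic returned by `lrt_statistic` would be compared with a chi-square distribution with one degree of freedom. When every genotype is known with certainty (one-hot rows in `probs`), the EM estimate of `beta` reduces to the ordinary least-squares slope.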