How to deal with the early GWAS data when imputing and combining different arrays is necessary
Open Access
- 21 December 2011
- journal article
- research article
- Published by Springer Science and Business Media LLC in European Journal of Human Genetics
- Vol. 20 (5), 572-576
- https://doi.org/10.1038/ejhg.2011.231
Abstract
Genotype imputation has become an essential tool in the analysis of genome-wide association scans. This technique allows investigators to test association at ungenotyped genetic markers, and to combine results across studies that rely on different genotyping platforms. In addition, imputation is used within long-running studies to reuse genotypes produced across generations of platforms. Typically, genotypes of controls are reused and cases are genotyped on more novel platforms yielding a case–control study that is not matched for genotyping platforms. In this study, we scrutinize such a situation and validate GWAS results by actually retyping top-ranking SNPs with the Sequenom MassArray platform. We discuss the needed quality controls (QCs). In doing so, we report a considerable discrepancy between the results from imputed and retyped data when applying recommended QCs from the literature. These discrepancies appear to be caused by extrapolating differences between arrays by the process of imputation. To avoid false positive results, we recommend that more stringent QCs should be applied. We also advocate reporting the imputation quality measure (RT2) for the post-imputation QCs in publications.Keywords
This publication has 17 references indexed in Scilit:
- Genome‐wide association study identifies a single major locus contributing to survival into old age; the APOE locus revisitedAging Cell, 2011
- Genome-wide association analysis identifies three psoriasis susceptibility lociNature Genetics, 2010
- Data quality control in genetic case-control association studiesNature Protocols, 2010
- Genotype imputation for genome-wide association studiesNature Reviews Genetics, 2010
- Common variants in KCNN3 are associated with lone atrial fibrillationNature Genetics, 2010
- Assessment of global phase uncertainty in case-control studiesBMC Genetics, 2009
- Genotype ImputationAnnual Review of Genomics and Human Genetics, 2009
- Genome-wide association study identifies new multiple sclerosis susceptibility loci on chromosomes 12 and 20Nature Genetics, 2009
- A new multipoint method for genome-wide association studies by imputation of genotypesNature Genetics, 2007
- Genomic Control for Association StudiesBiometrics, 1999