Ascertainment Biases in SNP Chips Affect Measures of Population Divergence
Open Access
- 17 June 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 27 (11), 2534-2547
- https://doi.org/10.1093/molbev/msq148
Abstract
Chip-based high-throughput genotyping has facilitated genome-wide studies of genetic diversity. Many studies have utilized these large data sets to make inferences about the demographic history of human populations using measures of genetic differentiation such as FST or principal component analyses. However, the single nucleotide polymorphism (SNP) chip data suffer from ascertainment biases caused by the SNP discovery process in which a small number of individuals from selected populations are used as discovery panels. In this study, we investigate the effect of the ascertainment bias on inferences regarding genetic differentiation among populations in one of the common genome-wide genotyping platforms. We generate SNP genotyping data for individuals that previously have been subject to partial genome-wide Sanger sequencing and compare inferences based on genotyping data to inferences based on direct sequencing. In addition, we also analyze publicly available genome-wide data. We demonstrate that the ascertainment biases will distort measures of human diversity and possibly change conclusions drawn from these measures in some times unexpected ways. We also show that details of the genotyping calling algorithms can have a surprisingly large effect on population genetic inferences. We not only present a correction of the spectrum for the widely used Affymetrix SNP chips but also show that such corrections are difficult to generalize among studies.Keywords
This publication has 29 references indexed in Scilit:
- Darwinian and demographic forces affecting human protein coding genesGenome Research, 2009
- The Population Reference Sample, POPRES: A Resource for Population, Disease, and Pharmacological Genetics ResearchAmerican Journal of Human Genetics, 2008
- Genes mirror geography within EuropeNature, 2008
- Correlation between Genetic and Geographic Structure in EuropeCurrent Biology, 2008
- Assessing the Evolutionary Impact of Amino Acid Mutations in the Human GenomePLoS Genetics, 2008
- Genome-wide association studies: progress and potential for drug discovery and developmentNature Reviews Drug Discovery, 2008
- Worldwide Human Relationships Inferred from Genome-Wide Patterns of VariationScience, 2008
- Proportionally more deleterious genetic variation in European than in African populationsNature, 2008
- A second generation human haplotype map of over 3.1 million SNPsNature, 2007
- PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage AnalysesAmerican Journal of Human Genetics, 2007