Correcting for ascertainment bias in the inference of population structure
Open Access
- 9 January 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 25 (4), 552-554
- https://doi.org/10.1093/bioinformatics/btn665
Abstract
Background: The ascertainment process of molecular markers amounts to disregard loci carrying alleles with low frequencies. This can result in strong biases in inferences under population genetics models if not properly taken into account by the inference algorithm. Attempting to model this censoring process in view of making inference of population structure (i.e.identifying clusters of individuals) brings up challenging numerical difficulties. Method: These difficulties are related to the presence of intractable normalizing constants in Metropolis–Hastings acceptance ratios. This can be solved via an Markov chain Monte Carlo (MCMC) algorithm known as single variable exchange algorithm (SVEA). Result: We show how this general solution can be implemented for a class of clustering models of broad interest in population genetics that includes the models underlying the computer programs STRUCTURE, GENELAND and GESTE. We also implement the method proposed for a simple example and show that it allows us to reduce the bias substantially. Availability: Further details and a computer program implementing the method are available from http://folk.uio.no/gillesg/AscB/ Contact:gilles.guillot@bio.uio.noKeywords
This publication has 17 references indexed in Scilit:
- Bayesian computation for statistical models with intractable normalizing constantsBrazilian Journal of Probability and Statistics, 2013
- Inference of structure in subdivided populations at low levels of genetic differentiation—the correlated allele frequencies model revisitedBioinformatics, 2008
- An Approximate Bayesian Computation Approach to Overcome Biases That Arise When Using Amplified Fragment Length Polymorphism Markers to Study Population StructureGenetics, 2008
- Ascertainment Bias in Spatially Structured Populations: A Case Study in the Eastern Fence LizardJournal of Heredity, 2007
- Identifying the Environmental Factors That Determine the Genetic Structure of PopulationsGenetics, 2006
- An efficient Markov chain Monte Carlo method for distributions with intractable normalising constantsBiometrika, 2006
- A Spatial Statistical Model for Landscape GeneticsGenetics, 2005
- Correcting for ascertainment biases when analyzing SNP data: applications to the estimation of linkage disequilibriumTheoretical Population Biology, 2003
- Assessing Population Differentiation and Isolation from Single-Nucleotide Polymorphism DataJournal of the Royal Statistical Society Series B: Statistical Methodology, 2002
- The Discovery of Single-Nucleotide Polymorphisms—and Inferences about Human Demographic HistoryAmerican Journal of Human Genetics, 2001