Enhanced Bayesian modelling in BAPS software for learning genetic structures of populations
Open Access
- 16 December 2008
- journal article
- research article
- Published by Springer Science and Business Media LLC in BMC Bioinformatics
- Vol. 9 (1), 539
- https://doi.org/10.1186/1471-2105-9-539
Abstract
Background: During the past decade, many Bayesian statistical models and software packages for answering questions about the genetic structure underlying population samples have appeared in the scientific literature. Most of these methods use molecular markers for inference, while some can also handle DNA sequence data. In a number of earlier works, we introduced an array of statistical methods for population genetic inference that are implemented in the software BAPS. However, the complexity of biological problems related to genetic structure analysis keeps increasing, such that in many cases the current methods may provide either inappropriate or insufficient solutions.
Results: We discuss the necessity of enhancing the statistical approaches to face the challenges posed by the ever-increasing amounts of molecular data generated by scientists over a wide range of research areas, and we introduce an array of new statistical tools implemented in the most recent version of BAPS. With these methods it is possible, e.g., to fit genetic mixture models using user-specified numbers of clusters and to estimate levels of admixture under a genetic linkage model. Also, alleles representing a different ancestry compared to the average observed genomic positions can be tracked for the sampled individuals, and a priori specified hypotheses about genetic population structure can be directly compared using Bayes' theorem. In general, we have further improved the computational characteristics of the algorithms behind the methods implemented in BAPS, facilitating the analysis of large and complex datasets. In particular, analysis of a single dataset can now be spread over multiple computers using a script interface to the software.
Conclusion: The Bayesian modelling methods introduced in this article represent an array of enhanced tools for learning the genetic structure of populations. Their implementations in the BAPS software are designed to meet the increasing need for analyzing large-scale population genetics data. The software is freely downloadable for Windows, Linux and Mac OS X systems at http://web.abo.fi/fak/mnf//mate/jc/software/baps.html.
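The comparison of a priori hypotheses about population structure via Bayes' theorem, mentioned in the Results, can be illustrated with a small sketch. It is not the BAPS implementation. It assumes haploid data, unlinked loci, and a symmetric Dirichlet prior on allele frequencies within each cluster, none of which the abstract specifies. All function names and the toy data are hypothetical.

```python
import math


def log_marginal_cluster(allele_counts, alpha=1.0):
    """Log marginal likelihood of the allele counts at one locus in one cluster.

    Assumption: allele frequencies have a symmetric Dirichlet(alpha) prior,
    which gives a closed-form Dirichlet-multinomial marginal.
    """
    k = len(allele_counts)
    n = sum(allele_counts)
    result = math.lgamma(k * alpha) - math.lgamma(k * alpha + n)
    for count in allele_counts:
        result += math.lgamma(alpha + count) - math.lgamma(alpha)
    return result


def log_marginal_partition(partition, genotypes, n_alleles, alpha=1.0):
    """Log marginal likelihood of the data given a partition of individuals.

    partition: list of clusters, each a list of individual indices.
    genotypes: genotypes[i][j] is the allele index of individual i at locus j.
    Clusters and loci are treated as independent (an assumption of this sketch).
    """
    n_loci = len(genotypes[0])
    total = 0.0
    for cluster in partition:
        for locus in range(n_loci):
            counts = [0] * n_alleles
            for individual in cluster:
                counts[genotypes[individual][locus]] += 1
            total += log_marginal_cluster(counts, alpha)
    return total


def posterior_probabilities(log_marginals, log_priors=None):
    """Posterior probabilities of competing hypotheses by Bayes' theorem.

    Works in log space and subtracts the maximum before exponentiating,
    so very small marginal likelihoods do not underflow.
    """
    if log_priors is None:
        log_priors = [0.0] * len(log_marginals)  # equal prior weights
    scores = [lm + lp for lm, lp in zip(log_marginals, log_priors)]
    top = max(scores)
    weights = [math.exp(s - top) for s in scores]
    norm = sum(weights)
    return [w / norm for w in weights]


# Toy data (hypothetical): 4 haploid individuals, 3 biallelic loci.
genotypes = [
    [0, 0, 0],
    [0, 0, 0],
    [1, 1, 1],
    [1, 1, 1],
]

# Two a priori hypotheses about the population structure.
two_clusters = [[0, 1], [2, 3]]
one_cluster = [[0, 1, 2, 3]]

log_marginals = [
    log_marginal_partition(h, genotypes, n_alleles=2)
    for h in (two_clusters, one_cluster)
]
posteriors = posterior_probabilities(log_marginals)
print(posteriors)
```

With equal prior weights, the posterior probabilities are proportional to the marginal likelihoods, so their ratio equals the Bayes factor between the two hypotheses. For this toy data the two-cluster hypothesis receives most of the posterior mass, as expected when the two groups share no alleles.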