Genetics in geographically structured populations: defining, estimating and interpreting FST
Top Cited Papers
- 1 September 2009
- journal article
- review article
- Published by Springer Science and Business Media LLC in Nature Reviews Genetics
- Vol. 10 (9), 639-650
- https://doi.org/10.1038/nrg2611
Abstract
Wright's F-statistics, and especially FST, provide important insights into the evolutionary processes that influence the structure of genetic variation within and among populations, and they are among the most widely used descriptive statistics in population and evolutionary genetics. FST is a property of the distribution of allele frequencies among populations. It reflects the joint effects of drift, migration, mutation and selection on the distribution of genetic variation among populations. FST has a central role in population and evolutionary genetics and has wide applications in fields from disease association mapping to forensic science. FST can be used to describe the distribution of genetic variation among any set of samples, but it is most usefully applied when the samples represent discrete units rather than arbitrary divisions along a continuous distribution. Statistics related to FST can be useful for haplotype or microsatellite data if an appropriate measure of evolutionary distance among alleles is available. Comparison of an estimate of FST from marker data with an estimate of QST from continuously varying trait data can be used to detect selection, but the estimate of FST may depend on the choice of marker and the estimate of QST may differ from neutral expectations if there is a non-additive component of genetic variance. Although the simple relationship between FST and migration rates in Wright's island model makes it tempting to infer migration rates from FST, caution is needed if such an approach is to be used. If estimates of FST from many loci are available, it may be possible to identify certain loci as 'outliers' that may have been subject to different patterns of selection or to different demographic processes. Case–control studies for association-mapping studies must account for the possibility that population substructure accounts for an observed association between a marker and a disease. The genomic control method uses background estimates of FST to control for such substructure. In forensic applications, the probabilities of obtaining a match are sometimes calculated for subpopulations that lack specific allele frequency data. A θ correction, in which θ is FST, is used to calculate the probability of a match using allele frequency information from a broader population that the subpopulation is part of. The massive amount of data that is being generated by population genomics projects can be understood fundamentally as allelic variation at individual loci. We therefore expect F-statistics to be at least as useful in understanding these data sets as they have been in population and evolutionary genetics for most of the last century.Keywords
This publication has 116 references indexed in Scilit:
- Drawing inferences about the coancestry coefficientTheoretical Population Biology, 2009
- A Bayesian Hierarchical Model for Analysis of Single-Nucleotide Polymorphisms Diversity in Multilocus, Multipopulation SamplesJournal of the American Statistical Association, 2009
- Accelerated genetic drift on chromosome X during the human dispersal out of AfricaNature Genetics, 2008
- Worldwide Human Relationships Inferred from Genome-Wide Patterns of VariationScience, 2008
- A second generation human haplotype map of over 3.1 million SNPsNature, 2007
- A haplotype map of the human genomeNature, 2005
- The power and promise of population genomics: from genotyping to genome typingNature Reviews Genetics, 2003
- Adaptive population divergence: markers, QTL and traitsTrends in Ecology & Evolution, 2002
- Effects of life history traits on genetic diversity in plant speciesPhilosophical Transactions B, 1996
- Analysis of Gene Diversity in Subdivided PopulationsProceedings of the National Academy of Sciences of the United States of America, 1973