Genetics in geographically structured populations: defining, estimating and interpreting F_ST

1 September 2009

journal article
review article
Published by Springer Science and Business Media LLC in Nature Reviews Genetics

Vol. 10 (9), 639-650
https://doi.org/10.1038/nrg2611

Abstract

Wright's F-statistics, and especially F_ST, provide important insights into the evolutionary processes that influence the structure of genetic variation within and among populations, and they are among the most widely used descriptive statistics in population and evolutionary genetics. F_ST is a property of the distribution of allele frequencies among populations. It reflects the joint effects of drift, migration, mutation and selection on the distribution of genetic variation among populations. F_ST has a central role in population and evolutionary genetics and has wide applications in fields from disease association mapping to forensic science.

F_ST can be used to describe the distribution of genetic variation among any set of samples, but it is most usefully applied when the samples represent discrete units rather than arbitrary divisions along a continuous distribution. Statistics related to F_ST can be useful for haplotype or microsatellite data if an appropriate measure of evolutionary distance among alleles is available. Comparison of an estimate of F_ST from marker data with an estimate of Q_ST from continuously varying trait data can be used to detect selection, but the estimate of F_ST may depend on the choice of marker and the estimate of Q_ST may differ from neutral expectations if there is a non-additive component of genetic variance. Although the simple relationship between F_ST and migration rates in Wright's island model makes it tempting to infer migration rates from F_ST, caution is needed if such an approach is to be used. If estimates of F_ST from many loci are available, it may be possible to identify certain loci as 'outliers' that may have been subject to different patterns of selection or to different demographic processes.

Case–control studies used in association mapping must account for the possibility that population substructure accounts for an observed association between a marker and a disease. The genomic control method uses background estimates of F_ST to control for such substructure. In forensic applications, the probabilities of obtaining a match are sometimes calculated for subpopulations that lack specific allele frequency data. A θ correction, in which θ is F_ST, is used to calculate the probability of a match using allele frequency information from a broader population that the subpopulation is part of.

The massive amount of data that is being generated by population genomics projects can be understood fundamentally as allelic variation at individual loci. We therefore expect F-statistics to be at least as useful in understanding these data sets as they have been in population and evolutionary genetics for most of the last century.
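
Three of the quantities named in the abstract can be illustrated with a minimal Python sketch. None of this code comes from the article; the function names are hypothetical and the formulas are the standard textbook forms, assumed here to be the ones the abstract refers to: F_ST as the among-population variance in allele frequency divided by p̄(1 − p̄) at a biallelic locus, the island-model approximation F_ST ≈ 1/(4Nm + 1), and the Balding–Nichols form of the θ correction. The F_ST function treats the supplied frequencies as exact and equally weighted, so it ignores sampling error and is not a substitute for a proper estimator.

```python
from statistics import fmean, pvariance


def fst_from_frequencies(freqs):
    """Wright's F_ST at one biallelic locus (hypothetical helper).

    Assumes `freqs` are true allele frequencies in equally weighted
    populations; no correction for finite sample size is applied.
    """
    p_bar = fmean(freqs)
    if not 0.0 < p_bar < 1.0:
        raise ValueError("locus is fixed; F_ST is undefined")
    # Variance among populations divided by its maximum, p_bar * (1 - p_bar).
    return pvariance(freqs, mu=p_bar) / (p_bar * (1.0 - p_bar))


def island_model_nm(fst):
    """Invert F_ST ~ 1 / (4Nm + 1) to get Nm under Wright's island model.

    Assumption: the island model's equilibrium conditions hold, which is
    exactly the step the abstract says requires caution.
    """
    if not 0.0 < fst < 1.0:
        raise ValueError("F_ST must lie strictly between 0 and 1")
    return (1.0 / fst - 1.0) / 4.0


def theta_match_probability(theta, p_i, p_j=None):
    """Genotype match probability with a theta correction.

    Assumption: Balding-Nichols form. Pass only `p_i` for a homozygote,
    or both `p_i` and `p_j` for a heterozygote.
    """
    denom = (1.0 + theta) * (1.0 + 2.0 * theta)
    if p_j is None:
        # Homozygote: both alleles are copies of allele i.
        return ((2.0 * theta + (1.0 - theta) * p_i)
                * (3.0 * theta + (1.0 - theta) * p_i)) / denom
    # Heterozygote: one copy each of alleles i and j.
    return (2.0 * (theta + (1.0 - theta) * p_i)
            * (theta + (1.0 - theta) * p_j)) / denom
```

For two populations with allele frequencies 0.2 and 0.8, `fst_from_frequencies([0.2, 0.8])` gives about 0.36, and `island_model_nm` applied to that value gives about 0.44 migrants per generation. With θ = 0 the match probabilities reduce to the Hardy–Weinberg values p² and 2p_i p_j, and any θ > 0 raises them.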

This publication has 116 references indexed in Scilit.

Cited by 1013 articles