Sampling for Microsatellite-Based Population Genetic Studies: 25 to 30 Individuals per Population Is Enough to Accurately Estimate Allele Frequencies
Open Access
- 12 September 2012
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 7 (9), e45170
- https://doi.org/10.1371/journal.pone.0045170
Abstract
One of the most common questions asked before starting a new population genetic study using microsatellite allele frequencies is “how many individuals do I need to sample from each population?” This question has previously been answered by addressing how many individuals are needed to detect all of the alleles present in a population (i.e. rarefaction based analyses). However, we argue that obtaining accurate allele frequencies and accurate estimates of diversity are much more important than detecting all of the alleles, given that very rare alleles (i.e. new mutations) are not very informative for assessing genetic diversity within a population or genetic structure among populations. Here we present a comparison of allele frequencies, expected heterozygosities and genetic distances between real and simulated populations by randomly subsampling 5–100 individuals from four empirical microsatellite genotype datasets (Formica lugubris, Sciurus vulgaris, Thalassarche melanophris, and Himantopus novaezelandia) to create 100 replicate datasets at each sample size. Despite differences in taxon (two birds, one mammal, one insect), population size, number of loci and polymorphism across loci, the degree of differences between simulated and empirical dataset allele frequencies, expected heterozygosities and pairwise FST values were almost identical among the four datasets at each sample size. Variability in allele frequency and expected heterozygosity among replicates decreased with increasing sample size, but these decreases were minimal above sample sizes of 25 to 30. Therefore, there appears to be little benefit in sampling more than 25 to 30 individuals per population for population genetic studies based on microsatellite allele frequencies.Keywords
This publication has 11 references indexed in Scilit:
- GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research—an updateBioinformatics, 2012
- Effects of sample size, number of markers, and allelic richness on the detection of spatial genetic patternMolecular Ecology Resources, 2011
- Genetic analyses reveal hybridization but no hybrid swarm in one of the world’s rarest birdsMolecular Ecology, 2010
- A simple method for estimating genetic diversity in large populations from finite sample sizesBMC Genomic Data, 2009
- Development of polymorphic microsatellite markers for the New Zealand black stilt (Himantopus novaezelandiae) and cross-amplification in the pied stilt (Himantopus himantopus leucocephalus)Molecular Ecology Resources, 2008
- POWSIM: a computer program for assessing statistical power when testing for genetic differentiationMolecular Ecology Notes, 2006
- Do polymorphic loci require large sample sizes to estimate genetic distances?Heredity, 2004
- Global relationships amongst black‐browed and grey‐headed albatrosses: analysis of population structure using mitochondrial DNA and microsatellitesMolecular Ecology, 2001
- Impact of Landscape Management on the Genetic Structure of Red Squirrel PopulationsScience, 2001
- Characterization of microsatellite loci in Formica lugubris B and their variability in other ant speciesMolecular Ecology, 1996