Delimiting Species Using Single-Locus Data and the Generalized Mixed Yule Coalescent Approach: A Revised Method and Evaluation on Simulated Data Sets
Open Access
- 14 June 2013
- journal article
- research article
- Published by Oxford University Press (OUP) in Systematic Biology
- Vol. 62 (5), 707-724
- https://doi.org/10.1093/sysbio/syt033
Abstract
DNA barcoding-type studies assemble single-locus data from large samples of individuals and species, and have provided new kinds of data for evolutionary surveys of diversity. An important goal of many such studies is to delimit evolutionarily significant species units, especially in biodiversity surveys from environmental DNA samples. The Generalized Mixed Yule Coalescent (GMYC) method is a likelihood method for delimiting species by fitting within- and between-species branching models to reconstructed gene trees. Although the method has been widely used, it has not previously been described in detail or evaluated fully against simulations of alternative scenarios of true patterns of population variation and divergence between species. Here, we present important reformulations to the GMYC method as originally specified, and demonstrate its robustness to a range of departures from its simplifying assumptions. The main factor affecting the accuracy of delimitation is the mean population size of species relative to divergence times between them. Other departures from the model assumptions, such as varying population sizes among species, alternative scenarios for speciation and extinction, and population growth or subdivision within species, have relatively smaller effects. Our simulations demonstrate that support measures derived from the likelihood function provide a robust indication of when the model performs well and when it leads to inaccurate delimitations. Finally, the so-called single-threshold version of the method outperforms the multiple-threshold version of the method on simulated data: we argue that this might represent a fundamental limit due to the nature of evidence used to delimit species in this approach. Together with other studies comparing its performance relative to other methods, our findings support the robustness of GMYC as a tool for delimiting species when only single-locus information is available. 
Keywords: Clusters; coalescent; DNA; genealogical; neutral; speciation; species.
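The abstract describes GMYC as a likelihood method that fits between-species (Yule) and within-species (coalescent) branching models to a gene tree, but it does not give the likelihood itself. The sketch below is a simplified illustration of that idea, not the revised formulation presented in the paper. It assumes exponentially distributed waiting times between branching events, with a rate that sums a Yule term and a coalescent term. That rate form, the function names (`mixed_rate`, `log_likelihood`), the parameter names, and the interval encoding are all assumptions made for illustration.

```python
import math
from typing import Iterable, Sequence, Tuple

# One waiting interval: (duration, number of between-species lineages,
# lineage counts inside each putative species cluster).
# This encoding is hypothetical and chosen only for the example.
Interval = Tuple[float, int, Sequence[int]]


def mixed_rate(n_between: int, n_within: Sequence[int],
               lam_yule: float, lam_coal: float,
               p_yule: float = 1.0, p_coal: float = 1.0) -> float:
    """Total branching rate in one interval (assumed form).

    Yule term:       lam_yule * n_between ** p_yule
    Coalescent term: lam_coal * sum over clusters of (n * (n - 1)) ** p_coal
    """
    yule = lam_yule * n_between ** p_yule
    coal = lam_coal * sum((n * (n - 1)) ** p_coal for n in n_within)
    return yule + coal


def log_likelihood(intervals: Iterable[Interval],
                   lam_yule: float, lam_coal: float,
                   p_yule: float = 1.0, p_coal: float = 1.0) -> float:
    """Sum of log(b) - b * x over intervals, i.e. exponential waiting times."""
    total = 0.0
    for duration, n_between, n_within in intervals:
        b = mixed_rate(n_between, n_within, lam_yule, lam_coal, p_yule, p_coal)
        total += math.log(b) - b * duration
    return total


if __name__ == "__main__":
    # Toy intervals for one candidate threshold: two older intervals with
    # only between-species branching, then two with coalescence in clusters.
    toy = [
        (0.50, 2, []),
        (0.30, 3, []),
        (0.05, 3, [3, 2, 1]),
        (0.02, 3, [2, 2, 1]),
    ]
    print(log_likelihood(toy, lam_yule=1.0, lam_coal=5.0))
```

In a single-threshold analysis of the kind the abstract mentions, a function like this would be evaluated for each candidate threshold time, with the intervals derived from the tree for that threshold, and the threshold with the highest likelihood would be retained. Deriving the intervals from a real ultrametric tree and optimizing the rate parameters are left out of this sketch.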