New Algorithms and Methods to Estimate Maximum-Likelihood Phylogenies: Assessing the Performance of PhyML 3.0

Open Access

29 March 2010

journal article
research article
Published by Oxford University Press (OUP) in Systematic Biology

Vol. 59 (3), 307-321
https://doi.org/10.1093/sysbio/syq010

Abstract

PhyML is a phylogeny software package based on the maximum-likelihood principle. Early PhyML versions used a fast algorithm that performs nearest-neighbor interchanges to improve a reasonable starting tree topology. Since the original publication (Guindon S., Gascuel O. 2003. A simple, fast and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52:696–704), PhyML has been widely used (>2500 citations in ISI Web of Science) because of its simplicity and its fair compromise between accuracy and speed. In the meantime, research around PhyML has continued, and this article describes the new algorithms and methods implemented in the program. First, we introduce a new algorithm to search the tree space with user-defined intensity using subtree pruning and regrafting topological moves. The parsimony criterion is used here to filter out the least promising topology modifications with respect to the likelihood function. The analysis of a large collection of real nucleotide and amino acid data sets of various sizes demonstrates the good performance of this method. Second, we describe a new test to assess the support of the data for internal branches of a phylogeny. This approach extends the recently proposed approximate likelihood-ratio test and relies on a nonparametric, Shimodaira–Hasegawa–like procedure. A detailed analysis of real alignments sheds light on the links between this new approach and the more classical nonparametric bootstrap method. Overall, our tests show that the latest version (3.0) of PhyML is fast, accurate, stable, and ready to use. A Web server and binary files are available from http://www.atgc-montpellier.fr/phyml/.
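
The idea of using parsimony as a cheap filter before the more expensive likelihood evaluation can be illustrated with a minimal sketch. This is not PhyML's code (PhyML is a separate program, and the abstract does not give implementation details); it only shows the general technique of scoring candidate topologies with Fitch parsimony and keeping the most promising ones. The nested-tuple tree representation and the names `fitch_site`, `parsimony_score`, and `filter_candidates` are assumptions made for this example.

```python
from typing import Dict, List, Set, Tuple, Union

# Hypothetical tree representation: a leaf is a taxon name,
# an internal node is a pair of subtrees.
Tree = Union[str, Tuple["Tree", "Tree"]]


def fitch_site(tree: Tree, states: Dict[str, str]) -> Tuple[Set[str], int]:
    """Fitch pass for one alignment column: (state set, number of changes)."""
    if isinstance(tree, str):
        return {states[tree]}, 0
    left_set, left_cost = fitch_site(tree[0], states)
    right_set, right_cost = fitch_site(tree[1], states)
    common = left_set & right_set
    if common:
        # Children agree on at least one state: no extra change needed.
        return common, left_cost + right_cost
    # Children disagree: take the union and count one change.
    return left_set | right_set, left_cost + right_cost + 1


def parsimony_score(tree: Tree, alignment: Dict[str, str]) -> int:
    """Sum of Fitch changes over all alignment columns."""
    n_sites = len(next(iter(alignment.values())))
    total = 0
    for i in range(n_sites):
        column = {taxon: seq[i] for taxon, seq in alignment.items()}
        total += fitch_site(tree, column)[1]
    return total


def filter_candidates(candidates: List[Tree],
                      alignment: Dict[str, str],
                      keep: int) -> List[Tree]:
    """Keep the `keep` candidates with the lowest parsimony score.

    In a real search, only these survivors would be passed on to the
    costly likelihood evaluation (an assumption of this sketch).
    """
    ranked = sorted(candidates, key=lambda t: parsimony_score(t, alignment))
    return ranked[:keep]


if __name__ == "__main__":
    alignment = {"A": "AAC", "B": "AAC", "C": "GGT", "D": "GGT"}
    candidates = [
        (("A", "C"), ("B", "D")),
        (("A", "B"), ("C", "D")),
        (("A", "D"), ("B", "C")),
    ]
    for tree in candidates:
        print(tree, parsimony_score(tree, alignment))
    print("kept:", filter_candidates(candidates, alignment, keep=1))
```

In this toy alignment the topology grouping A with B and C with D needs the fewest changes, so it is the one retained for further evaluation.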

This publication has 32 references indexed in Scilit:

Approximate Likelihood-Ratio Test for Branches: A Fast, Accurate, and Powerful Alternative
Systematic Biology, 2006
Improving the efficiency of SPR moves in phylogenetic tree search methods based on maximum likelihood
Bioinformatics, 2005
TREEFINDER: a powerful graphical analysis environment for molecular phylogenetics
BMC Evolutionary Biology, 2004
A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood
Systematic Biology, 2003
Accuracy and Power of the Likelihood Ratio Test in Detecting Adaptive Molecular Evolution
Molecular Biology and Evolution, 2001
Likelihood-Based Tests of Topologies in Phylogenetics
Systematic Biology, 2000
BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data
Molecular Biology and Evolution, 1997
PHYLOGENIES FROM MOLECULAR SEQUENCES: INFERENCE AND RELIABILITY
Annual Review of Genetics, 1988
CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP
Evolution, 1985
Evolution of Protein Molecules
Published by Elsevier BV, 1969

Cited by 15492 articles