CodonPhyML: Fast Maximum Likelihood Phylogeny Estimation under Codon Substitution Models
Open Access
- 23 February 2013
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 30 (6), 1270-1280
- https://doi.org/10.1093/molbev/mst034
Abstract
Markov models of codon substitution naturally incorporate the structure of the genetic code and the selection intensity at the protein level, providing a more realistic representation of protein-coding sequences compared with nucleotide or amino acid models. Thus, for protein-coding genes, phylogenetic inference is expected to be more accurate under codon models. So far, phylogeny reconstruction under codon models has been elusive due to computational difficulties of dealing with high dimension matrices. Here, we present a fast maximum likelihood (ML) package for phylogenetic inference, CodonPhyML offering hundreds of different codon models, the largest variety to date, for phylogeny inference by ML. CodonPhyML is tested on simulated and real data and is shown to offer excellent speed and convergence properties. In addition, CodonPhyML includes most recent fast methods for estimating phylogenetic branch supports and provides an integral framework for models selection, including amino acid and DNA models.Keywords
This publication has 57 references indexed in Scilit:
- MrBayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice Across a Large Model SpaceSystematic Biology, 2012
- The Pfam protein families databaseNucleic Acids Research, 2011
- Survey of Branch Support Methods Demonstrates Accuracy, Power, and Robustness of Fast Likelihood-based Approximation SchemesSystematic Biology, 2011
- OMA 2011: orthology inference among 1000 complete genomesNucleic Acids Research, 2010
- New Algorithms and Methods to Estimate Maximum-Likelihood Phylogenies: Assessing the Performance of PhyML 3.0Systematic Biology, 2010
- Estimates of the Effect of Natural Selection on Protein-Coding ContentMolecular Biology and Evolution, 2009
- Pathological rate matrices: from primates to pathogensBMC Bioinformatics, 2008
- PAML 4: Phylogenetic Analysis by Maximum LikelihoodMolecular Biology and Evolution, 2007
- Multiple Comparisons of Log-Likelihoods with Applications to Phylogenetic InferenceMolecular Biology and Evolution, 1999
- A new look at the statistical model identificationIEEE Transactions on Automatic Control, 1974