FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments
Top Cited Papers
Open Access
- 10 March 2010
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 5 (3), e9490
- https://doi.org/10.1371/journal.pone.0009490
Abstract
We recently described FastTree, a tool for inferring phylogenies for alignments with up to hundreds of thousands of sequences. Here, we describe improvements to FastTree that improve its accuracy without sacrificing scalability. Where FastTree 1 used nearest-neighbor interchanges (NNIs) and the minimum-evolution criterion to improve the tree, FastTree 2 adds minimum-evolution subtree-pruning-regrafting (SPRs) and maximum-likelihood NNIs. FastTree 2 uses heuristics to restrict the search for better trees and estimates a rate of evolution for each site (the “CAT” approximation). Nevertheless, for both simulated and genuine alignments, FastTree 2 is slightly more accurate than a standard implementation of maximum-likelihood NNIs (PhyML 3 with default settings). Although FastTree 2 is not quite as accurate as methods that use maximum-likelihood SPRs, most of the splits that disagree are poorly supported, and for large alignments, FastTree 2 is 100–1,000 times faster. FastTree 2 inferred a topology and likelihood-based local support values for 237,882 distinct 16S ribosomal RNAs on a desktop computer in 22 hours and 5.8 gigabytes of memory. FastTree 2 allows the inference of maximum-likelihood phylogenies for huge alignments. FastTree 2 is freely available at http://www.microbesonline.org/fasttree.This publication has 34 references indexed in Scilit:
- New Algorithms and Methods to Estimate Maximum-Likelihood Phylogenies: Assessing the Performance of PhyML 3.0Systematic Biology, 2010
- Fast Statistical AlignmentPLoS Computational Biology, 2009
- FastTree: Computing Large Minimum Evolution Trees with Profiles instead of a Distance MatrixMolecular Biology and Evolution, 2009
- Infernal 1.0: inference of RNA alignmentsBioinformatics, 2009
- A Rapid Bootstrap Algorithm for the RAxML Web ServersSystematic Biology, 2008
- PartTree: an algorithm to build an approximate tree from a large number of unaligned sequencesBioinformatics, 2006
- RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed modelsBioinformatics, 2006
- Greengenes, a Chimera-Checked 16S rRNA Gene Database and Workbench Compatible with ARBApplied and Environmental Microbiology, 2006
- Multiple Comparisons of Log-Likelihoods with Applications to Phylogenetic InferenceMolecular Biology and Evolution, 1999
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981