Weighted Neighbor Joining: A Likelihood-Based Approach to Distance-Based Phylogeny Reconstruction
Open Access
- 1 January 2000
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 17 (1), 189-197
- https://doi.org/10.1093/oxfordjournals.molbev.a026231
Abstract
We introduce a distance-based phylogeny reconstruction method called “weighted neighbor joining,” or “Weighbor” for short. As in neighbor joining, two taxa are joined in each iteration; however, the Weighbor criterion for choosing a pair of taxa to join takes into account that errors in distance estimates are exponentially larger for longer distances. The criterion embodies a likelihood function on the distances, which are modeled as correlated Gaussian random variables with different means and variances, computed under a probabilistic model for sequence evolution. The Weighbor criterion consists of two terms, an additivity term and a positivity term, that quantify the implications of joining the pair. The first term evaluates deviations from additivity of the implied external branches, while the second term evaluates confidence that the implied internal branch has a positive branch length. Compared with maximum-likelihood phylogeny reconstruction, Weighbor is much faster, while building trees that are qualitatively and quantitatively similar. Weighbor appears to be relatively immune to the “long branches attract” and “long branch distracts” drawbacks observed with neighbor joining, BIONJ, and parsimony.Keywords
This publication has 13 references indexed in Scilit:
- Topological bias and inconsistency of maximum likelihood using wrong models.Molecular Biology and Evolution, 1999
- Evolutionary distances for protein-coding sequences: modeling site- specific residue frequenciesMolecular Biology and Evolution, 1998
- The performance of the neighbor-joining method of phylogeny reconstructionPublished by American Mathematical Society (AMS) ,1997
- BIONJ: an improved version of the NJ algorithm based on a simple model of sequence dataMolecular Biology and Evolution, 1997
- Least Squares Estimation of Molecular Distance - Noise Abatement in Phylogenetic ReconstructionTheoretical Population Biology, 1994
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981
- A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequencesJournal of Molecular Evolution, 1980
- Cases in which Parsimony or Compatibility Methods Will be Positively MisleadingSystematic Zoology, 1978
- Evolution of Protein MoleculesPublished by Elsevier BV ,1969
- Construction of Phylogenetic TreesScience, 1967