MultiPhyl: a high-throughput phylogenomics webserver using distributed computing
Open Access
- 8 May 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 35 (Web Server), W33-W37
- https://doi.org/10.1093/nar/gkm359
Abstract
With the number of fully sequenced genomes increasing steadily, there is greater interest in performing large-scale phylogenomic analyses from large numbers of individual gene families. Maximum likelihood (ML) has been shown repeatedly to be one of the most accurate methods for phylogenetic construction. Recently, there have been a number of algorithmic improvements in maximum-likelihood-based tree search methods. However, it can still take a long time to analyse the evolutionary history of many gene families using a single computer. Distributed computing refers to a method of combining the computing power of multiple computers in order to perform some larger overall calculation. In this article, we present the first high-throughput implementation of a distributed phylogenetics platform, MultiPhyl, capable of using the idle computational resources of many heterogeneous non-dedicated machines to form a phylogenetics supercomputer. MultiPhyl allows a user to upload hundreds or thousands of amino acid or nucleotide alignments simultaneously and perform computationally intensive tasks such as model selection, tree searching and bootstrapping of each of the alignments using many desktop machines. The program implements a set of 88 amino acid models and 56 nucleotide maximum likelihood models and a variety of statistical methods for choosing between alternative models. A MultiPhyl webserver is available for public use at: http://www.cs.nuim.ie/distributed/multiphyl.php.
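The per-alignment model selection described in the abstract can be illustrated with a minimal sketch. It is not MultiPhyl's code: the abstract does not say which statistical criteria are used, so the Akaike information criterion (AIC) is assumed here as one common choice. The function names, the thread pool standing in for remote desktop machines, the three model names and all log-likelihood values are hypothetical and chosen only for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def aic(log_likelihood: float, num_params: int) -> float:
    """Akaike information criterion; a lower value indicates a preferred model."""
    return 2 * num_params - 2 * log_likelihood


def select_model(fits: dict) -> str:
    """Return the model name with the lowest AIC.

    `fits` maps a model name to (maximised log-likelihood, free parameter count).
    """
    return min(fits, key=lambda name: aic(*fits[name]))


def select_models_for_all(alignment_fits: dict, max_workers: int = 4) -> dict:
    """Choose a model for every alignment, one independent task per alignment.

    Assumption: local threads stand in for the non-dedicated machines that a
    distributed platform would use; each alignment is an independent work unit.
    """
    names = list(alignment_fits)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        chosen = list(pool.map(lambda n: select_model(alignment_fits[n]), names))
    return dict(zip(names, chosen))


# Hypothetical inputs: log-likelihoods are invented for illustration, and the
# parameter counts cover substitution-model parameters only (branch lengths
# are left out to keep the example short).
EXAMPLE_FITS = {
    "gene_family_1": {"JC69": (-1050.0, 0), "HKY85": (-1000.0, 4), "GTR": (-998.0, 8)},
    "gene_family_2": {"JC69": (-500.0, 0), "HKY85": (-499.5, 4), "GTR": (-499.0, 8)},
}

if __name__ == "__main__":
    print(select_models_for_all(EXAMPLE_FITS))
```

Because each alignment is scored independently, the work divides naturally into units that need no communication with one another, which is the property that makes this kind of analysis suitable for many loosely coupled machines.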