AIR: A batch-oriented web program package for construction of supermatrices ready for phylogenomic analyses
Open Access
- 28 October 2009
- journal article
- Published by Springer Science and Business Media LLC in BMC Bioinformatics
- Vol. 10 (1), 357
- https://doi.org/10.1186/1471-2105-10-357
Abstract
Large multigene sequence alignments have in recent years been increasingly employed for phylogenomic reconstruction of the eukaryote tree of life. Such supermatrices of sequence data are preferred over single-gene alignments because they contain vastly more information about ancient sequence characteristics and are thus more suitable for resolving deeply diverging relationships. However, as alignments are expanded, increasing numbers of sites with misleading phylogenetic information are also added. A major goal in phylogenomic analyses is therefore to maximize the ratio of information to noise; this can be achieved by removing fast-evolving sites. Here we present a batch-oriented, web-based program package named AIR that allows 1) transformation of several single-gene alignments into one multigene alignment, 2) identification of evolutionary rates in multigene alignments and 3) removal of fast-evolving sites. These three processes are carried out by the programs AIR-Appender, AIR-Identifier, and AIR-Remover (AIR), which can be used independently or in a semi-automated pipeline. AIR produces user-friendly output files with filtered and non-filtered alignments in which residues are colored according to their evolutionary rates. Other bioinformatics applications linked to the AIR package are available at the Bioportal http://www.bioportal.uio.no, University of Oslo; together these greatly improve the flexibility, efficiency and quality of phylogenomic analyses. The AIR program package allows for efficient creation of multigene alignments and better assessment of evolutionary rates in sequence alignments. Removal of fast-evolving sites with the AIR programs has been employed in several recent phylogenomic analyses, resulting in improved phylogenetic resolution and increased statistical support for branching patterns among the early diverging eukaryotes.
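As a rough illustration of the first and third steps described in the abstract (appending single-gene alignments into a supermatrix, then removing fast-evolving sites), the following Python sketch may help. It is not the AIR implementation: the function names `append_alignments` and `remove_fast_sites`, the dictionary representation of an alignment, the gap-padding of missing taxa, and the use of integer rate categories with a `max_rate` cutoff are all assumptions made for this example. The per-site rates are taken as given input, since the abstract does not describe how they are estimated.

```python
from typing import Dict, List

# Hypothetical representation: taxon name -> aligned sequence.
Alignment = Dict[str, str]


def append_alignments(genes: List[Alignment], gap: str = "-") -> Alignment:
    """Concatenate single-gene alignments into one supermatrix.

    Taxa that are missing from a gene are padded with gap characters
    (an assumption of this sketch) so that all rows keep the same length.
    """
    taxa = sorted({taxon for gene in genes for taxon in gene})
    supermatrix = {taxon: "" for taxon in taxa}
    for gene in genes:
        lengths = {len(seq) for seq in gene.values()}
        if len(lengths) != 1:
            raise ValueError("sequences within one gene must have equal length")
        width = lengths.pop()
        for taxon in taxa:
            supermatrix[taxon] += gene.get(taxon, gap * width)
    return supermatrix


def remove_fast_sites(
    alignment: Alignment, site_rates: List[int], max_rate: int
) -> Alignment:
    """Keep only the columns whose rate category is at most max_rate."""
    n_sites = len(site_rates)
    if any(len(seq) != n_sites for seq in alignment.values()):
        raise ValueError("need exactly one rate value per alignment column")
    keep = [i for i, rate in enumerate(site_rates) if rate <= max_rate]
    return {
        taxon: "".join(seq[i] for i in keep)
        for taxon, seq in alignment.items()
    }


# Toy data, invented for this example.
gene_a = {"Homo": "MKV", "Mus": "MRV"}
gene_b = {"Homo": "LL", "Danio": "LI"}

supermatrix = append_alignments([gene_a, gene_b])

# Invented rate categories, one per supermatrix column (higher = faster).
rates = [1, 8, 2, 1, 7]
filtered = remove_fast_sites(supermatrix, rates, max_rate=6)
```

In this toy case the supermatrix has five columns, and the two columns with rate categories 8 and 7 are dropped, leaving three columns per taxon. The cutoff is left as a parameter because a suitable threshold depends on the dataset.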