MetaPhOrs: orthology and paralogy predictions from multiple phylogenetic evidence using a consistency-based confidence score
Open Access
- 11 December 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 39 (5), e32
- https://doi.org/10.1093/nar/gkq953
Abstract
Reliable prediction of orthology is central to comparative genomics. Approaches based on phylogenetic analyses closely resemble the original definition of orthology and paralogy and are known to be highly accurate. However, the large computational cost associated to these analyses is a limiting factor that often prevents its use at genomic scales. Recently, several projects have addressed the reconstruction of large collections of high-quality phylogenetic trees from which orthology and paralogy relationships can be inferred. This provides us with the opportunity to infer the evolutionary relationships of genes from multiple, independent, phylogenetic trees. Using such strategy, we combine phylogenetic information derived from different databases, to predict orthology and paralogy relationships for 4.1 million proteins in 829 fully sequenced genomes. We show that the number of independent sources from which a prediction is made, as well as the level of consistency across predictions, can be used as reliable confidence scores. A webserver has been developed to easily access these data ( http://orthology.phylomedb.org ), which provides users with a global repository of phylogeny-based orthology and paralogy predictions.Keywords
This publication has 24 references indexed in Scilit:
- ETE: a python Environment for Tree ExplorationBMC Bioinformatics, 2010
- eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotationsNucleic Acids Research, 2009
- trimAl: a tool for automated alignment trimming in large-scale phylogenetic analysesBioinformatics, 2009
- Berkeley PHOG: PhyloFacts orthology group prediction web serverNucleic Acids Research, 2009
- EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebratesGenome Research, 2008
- Large-scale assignment of orthology: back to phylogenetics?Genome Biology, 2008
- TreeFam: 2008 UpdateNucleic Acids Research, 2007
- PhylomeDB: a database for genome-wide collections of gene phylogeniesNucleic Acids Research, 2007
- Natural history and evolutionary principles of gene duplication in fungiNature, 2007
- Automatic genome-wide reconstruction of phylogenetic gene treesBioinformatics, 2007