Phylogenetic distances are encoded in networks of interacting pathways
Open Access
- 26 September 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (22), 2579-2585
- https://doi.org/10.1093/bioinformatics/btn503
Abstract
Motivation: Although metabolic reactions are unquestionably shaped by evolutionary processes, the degree to which the overall structure and complexity of their interconnections are linked to the phylogeny of species has not been evaluated in depth. Here, we apply an original metabolome representation, termed Network of Interacting Pathways or NIP, with a combination of graph theoretical and machine learning strategies, to address this question. NIPs compress the information of the metabolic network exhibited by a species into much smaller networks of overlapping metabolic pathways, where nodes are pathways and links are the metabolites they exchange.

Results: Our analysis shows that a small set of descriptors of the structure and complexity of the NIPs combined into regression models reproduce very accurately reference phylogenetic distances derived from 16S rRNA sequences (10-fold cross-validation correlation coefficient higher than 0.9). Our method also showed better scores than previous work on metabolism-based phylogenetic reconstructions, as assessed by branch distances score, topological similarity and second cousins score. Thus, our metabolome representation as network of overlapping metabolic pathways captures sufficient information about the underlying evolutionary events leading to the formation of metabolic networks and species phylogeny. It is important to note that precise knowledge of all of the reactions in these pathways is not required for these reconstructions. These observations underscore the potential for the use of abstract, modular representations of metabolic reactions as tools in studying the evolution of species.

Contact: aurelien.mazurie@pasteur.fr

Supplementary information: Supplementary data are available at Bioinformatics online.
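To make the NIP idea concrete, the following is a minimal sketch of how a pathway-level network could be built and summarized. It rests on several assumptions that the abstract does not confirm: two pathways are linked whenever they share at least one metabolite, the descriptors shown (node count, edge count, density, mean degree) are generic graph statistics rather than the descriptors used in the paper, and the function names `build_nip` and `describe` as well as the toy pathway contents are hypothetical and purely illustrative.

```python
from itertools import combinations


def build_nip(pathways):
    """Link every pair of pathways that shares at least one metabolite.

    `pathways` maps a pathway name to the set of metabolites it involves.
    Returns a dict mapping (pathway_a, pathway_b) to the shared metabolites.
    Assumption: "shared metabolite" stands in for "exchanged metabolite".
    """
    edges = {}
    for p, q in combinations(sorted(pathways), 2):
        shared = pathways[p] & pathways[q]
        if shared:
            edges[(p, q)] = shared
    return edges


def describe(pathways, edges):
    """Compute a few generic structural descriptors of the pathway network."""
    n = len(pathways)
    m = len(edges)
    max_edges = n * (n - 1) / 2
    degree = {p: 0 for p in pathways}
    for p, q in edges:
        degree[p] += 1
        degree[q] += 1
    return {
        "nodes": n,
        "edges": m,
        "density": m / max_edges if max_edges else 0.0,
        "mean_degree": sum(degree.values()) / n if n else 0.0,
    }


# Toy input, not curated biology: three pathways with made-up metabolite sets.
toy = {
    "glycolysis": {"glucose", "pyruvate", "ATP"},
    "tca_cycle": {"pyruvate", "citrate", "ATP"},
    "urea_cycle": {"ornithine", "urea"},
}
edges = build_nip(toy)
summary = describe(toy, edges)
```

In a workflow like the one the abstract outlines, a descriptor vector of this kind would be computed per species and then fed to a regression model evaluated by cross-validation against 16S rRNA-derived distances; that step is not shown here because the abstract does not specify the model or the features.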