Putative regulatory sites unraveled by network‐embedded thermodynamic analysis of metabolome data

Abstract
As one of the most recent members of the omics family, large‐scale quantitative metabolomics data are currently complementing our systems biology data pool and offer the chance to integrate the metabolite level into the functional analysis of cellular networks. Network‐embedded thermodynamic analysis (NET analysis) is presented as a framework for mechanistic and model‐based analysis of these data. By coupling the data to an operating metabolic network via the second law of thermodynamics and the metabolites' Gibbs energies of formation, NET analysis allows inferring functional principles from quantitative metabolite data; for example it identifies reactions that are subject to active allosteric or genetic regulation as exemplified with quantitative metabolite data from Escherichia coli and Saccharomyces cerevisiae . Moreover, the optimization framework of NET analysis was demonstrated to be a valuable tool to systematically investigate data sets for consistency, for the extension of sub‐omic metabolome data sets and for resolving intracompartmental concentrations from cell‐averaged metabolome data. Without requiring any kind of kinetic modeling, NET analysis represents a perfectly scalable and unbiased approach to uncover insights from quantitative metabolome data. ### Synopsis Systems biology strives to gain a quantitative genome‐scale understanding of the complex and highly interrelated cellular processes and phenomena. Such in‐depth understanding will ultimately be achieved by a tight interplay between the two prominent pillars of systems biology: mathematical models and omics data. In the context of the latter, owing to the recent development of affordable and powerful mass spectrometers, large‐scale sets of quantitative metabolome data are currently complementing our data pool ([Goodacre et al , 2004][1]; [Nielsen and Oliver, 2005][2]). In order to fully exploit the wealth of information contained in large‐scale data sets and to convert data into a body of knowledge, integration into mathematical models is required. For quantitative metabolome data, kinetic models describing enzyme reaction rates would represent the natural way for computational analysis. However, because of the lack of comprehensive knowledge about in vivo reaction mechanisms and parameters, and the still existing challenges on the measurement side as well as on the computational analysis side, it is very unlikely that large‐scale kinetic models will become available in the near future. Until today, large‐scale sets of quantitative metabolome data cannot be assimilated into mathematical models ([Nielsen and Oliver, 2005][2]) and thus, insight, for instance into underlying regulatory mechanisms, can hardly be inferred. In this work, we present a computational thermodynamics‐based framework for the analysis of quantitative metabolome data, whereby the mapping onto a stoichiometric reaction network and a coupling to fluxome data allow for extraction of novel insight from the data without requiring any kind of kinetic modeling. More specifically, in the developed network‐embedded thermodynamic analysis (NET analysis), experimentally determined intracellular fluxes and metabolite concentrations are coupled to each other via the second law of thermodynamics and the metabolites’ Gibbs energies of formation, whereas an optimization algorithm is employed to resolve network‐constrained, feasible ranges of Gibbs energies of reaction along with feasible ranges of unmeasured concentrations ([Figure 1][3]). We first examined a small set of measured metabolite concentrations obtained from an Escherichia coli chemostat culture to illustrate the concept and the application of NET analysis and to demonstrate its ability to extract insight from even limited metabolite data. First, we showed that NET analysis could serve as a tool to check thermodynamic consistency of a data set. Thermodynamic consistency was approved for the analyzed data set, although several other published data sets were found to be infeasible, which emphasizes the need for quality analyses of metabolome data before they enter databases or are used in modeling efforts. In a next step, we investigated whether NET analysis could also be used for prediction of unmeasured metabolite concentrations. In the analyzed data set, besides a few measured metabolites, the measurement provided only pooled concentrations for several isobaric molecules. With NET analysis, it was possible to resolve narrow concentration ranges for the individual metabolites. Moreover, concentration ranges were also predicted for some unmeasured metabolites. It can be envisioned that this predictive capability of NET analysis will support the development of more efficient analytical methods, as computable concentrations do not need to be determined experimentally. Measured metabolite concentrations hardly provide any insights into the organization of metabolism, that is, the regulatory structure responsible for routing of matter via the different metabolic pathways, the result of which is a certain intracellular flux distribution. A flux distribution is established by the fact that in comparison with the neighboring reactions, the rates of some reactions, are limited by the available catalytic activity, so that at branch points, mass flux is accordingly distributed into the possible pathways. A limited catalytic activity of a reaction manifests itself in a large Gibbs energy of reaction. Reactions operating far from equilibrium are more likely to impose flux control ([Wang et al , 2004][4]), and it is assumed that such reactions are more likely to be regulated by the cell ([Crabtree et al , 1997][5]). With NET analysis, reactions under putative active genetic or allosteric regulation can be identified from (even incomplete) metabolome data. For the data considered here, the respective results are provided...