StanDep: Capturing transcriptomic variability improves context-specific metabolic models
Open Access
- 1 May 2020
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 16 (5), e1007764
- https://doi.org/10.1371/journal.pcbi.1007764
Abstract
Diverse algorithms can integrate transcriptomics with genome-scale metabolic models (GEMs) to build context-specific metabolic models. These algorithms require identification of a list of high confidence (core) reactions from transcriptomics, but parameters related to identification of core reactions, such as thresholding of expression profiles, can significantly change model content. Importantly, current thresholding approaches are burdened with setting singular arbitrary thresholds for all genes; thus, resulting in removal of enzymes needed in small amounts and even many housekeeping genes. Here, we describe StanDep, a novel heuristic method for using transcriptomics to identify core reactions prior to building context-specific metabolic models. StanDep clusters gene expression data based on their expression pattern across different contexts and determines thresholds for each cluster using data-dependent statistics, specifically standard deviation and mean. To demonstrate the use of StanDep, we built hundreds of models for the NCI-60 cancer cell lines. These models successfully increased the inclusion of housekeeping reactions, which are often lost in models built using standard thresholding approaches. Further, StanDep also provided a transcriptomic explanation for inclusion of lowly expressed reactions that were otherwise only supported by model extraction methods. Our study also provides novel insights into how cells may deal with context-specific and ubiquitous functions. StanDep, as a MATLAB toolbox, is available at https://github.com/LewisLabUCSD/StanDep Author summary Integration of transcriptomics data with genome-scale metabolic models is appealing but challenging due to the number of parametric decisions required to be made to by the user. This is further exacerbated by models failing to capture functionalities which are important for cellular maintenance. In this study, we propose a thresholding method for functionally qualifying a metabolic reaction to be active. We used our method to extract models of NCI-60 cancer cell lines, human tissues, and C. elegans cell types. We show that our thresholding method improves the coverage of functions required for cellular maintenance. We also validated and compared models built with our approach against those with existing approaches using CRISPR-Cas9 essentiality screens. Overall, our study provides novel insights into how cells may deal with context-specific and ubiquitous functions.This publication has 60 references indexed in Scilit:
- Metabolite Profiling Identifies a Key Role for Glycine in Rapid Cancer Cell ProliferationScience, 2012
- Large-scale in silico modeling of metabolic interactions between cell types in the human brainNature Biotechnology, 2010
- Reactive oxygen species, oxidative stress, and cell death correlate with level of CoQ 10 deficiencyThe FASEB Journal, 2010
- Blueprint for antimicrobial hit discovery targeting metabolic networksProceedings of the National Academy of Sciences of the United States of America, 2010
- Computational reconstruction of tissue‐specific metabolic models: application to human liver metabolismMolecular Systems Biology, 2010
- GTP Cyclohydrolase I Expression, Protein, and Activity Determine Intracellular Tetrahydrobiopterin Levels, Independent of GTP Cyclohydrolase Feedback Regulatory Protein ExpressionOnline Journal of Public Health Informatics, 2009
- Modeling Hybridoma Cell Metabolism Using a Generic Genome-Scale Metabolic Model of Mus musculusBiotechnology Progress, 2008
- Regulation of Glycan Structures in Animal TissuesOnline Journal of Public Health Informatics, 2008
- Profiling the expression pattern of GPI transamidase complex subunits in human cancerLaboratory Investigation, 2008
- Systematic functional analysis of the Caenorhabditis elegans genome using RNAiNature, 2003