A dual-parameter identification approach for data-based predictive modeling of hybrid gene regulatory network-growth kinetics in Pseudomonas putida mt-2
- 6 May 2020
- journal article
- research article
- Published by Springer Science and Business Media LLC in Bioprocess and Biosystems Engineering
- Vol. 43 (9), 1671-1688
- https://doi.org/10.1007/s00449-020-02360-2
Abstract
Data integration to model-based description of biological systems incorporating gene dynamics improves the performance of microbial systems. Bioprocess performance, typically predicted using empirical Monod-type models, is essential for a sustainable bioeconomy. To replace empirical models, we updated a hybrid gene regulatory network-growth kinetic model, predicting aromatic pollutants degradation and biomass growth in Pseudomonas putida mt-2. We modeled a complex biological system including extensive information to understand the role of the regulatory elements in toluene biodegradation and biomass growth. The updated model exhibited extra complications such as the existence of oscillations and discontinuities. As parameter estimation of complex biological models remains a key challenge, we used the updated model to present a dual-parameter identification approach (the ‘dual approach’) combining two independent methodologies. Approach I handled the complexity by incorporation of demonstrated biological knowledge in the model-development process and combination of global sensitivity analysis and optimisation. Approach II complemented Approach I handling multimodality, ill-conditioning and overfitting through regularisation estimation, global optimisation, and identifiability analysis. To systematically quantify the biological system, we used a vast amount of high-quality time-course data. The dual approach resulted in an accurately calibrated kinetic model (NRMSE: 0.17055) efficiently handling the additional model complexity. We tested model validation using three independent experimental data sets, achieving greater predictive power (NRMSE: 0.18776) than the individual approaches (NRMSE I: 0.25322, II: 0.25227) and increasing model robustness. These results demonstrated data-driven predictive modeling potentially leading to bioprocess’ model-based control and optimisation.Keywords
Funding Information
- Marie Skłodowska-Curie ITN “SymBioSys” (675585)
- Spanish Ministry of Science, Innovation and Universities and the European Union FEDER 'SYNBIOCONTROL'(). (DPI2017-82896-C2-2-R)
This publication has 60 references indexed in Scilit:
- Linking genes to microbial growth kinetics—An integrated biochemical systems engineering approachMetabolic Engineering, 2011
- Classic and contemporary approaches to modeling biochemical reactionsGenes & Development, 2010
- An iterative identification procedure for dynamic modeling of biochemical networksBMC Systems Biology, 2010
- Genome-wide investigation and functional characterization of the β-ketoadipate pathway in the nitrogen-fixing and root-associated bacterium Pseudomonas stutzeriA1501BMC Microbiology, 2010
- Roles of Effectors in XylS-Dependent Transcription Activation: Intramolecular Domain Derepression and DNA BindingJournal of Bacteriology, 2008
- Genomotyping of Pseudomonas putida strains using P. putida KT2440-based high-density DNA microarrays: implications for transcriptomics studiesApplied Microbiology and Biotechnology, 2007
- Toward Global Parametric Estimability of a Large-Scale Kinetic Single-Cell Model for Mammalian Cell CulturesIndustrial & Engineering Chemistry Research, 2005
- Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimatesMathematics and Computers in Simulation, 2001
- Nonlinear RegressionJournal of Marketing Research, 1990
- Algorithm 573: NL2SOL—An Adaptive Nonlinear Least-Squares Algorithm [E4]ACM Transactions on Mathematical Software, 1981