Common and distinct variation in data fusion of designed experimental data
Open Access
- 3 December 2019
- journal article
- research article
- Published by Springer Science and Business Media LLC in Metabolomics
- Vol. 16 (1), 1-11
- https://doi.org/10.1007/s11306-019-1622-2
Abstract
Introduction Integrative analysis of multiple data sets can provide complementary information about the studied biological system. However, data fusion of multiple biological data sets can be complicated as data sets might contain different sources of variation due to underlying experimental factors. Therefore, taking the experimental design of data sets into account could be of importance in data fusion concept. Objectives In the present work, we aim to incorporate the experimental design information in the integrative analysis of multiple designed data sets. Methods Here we describe penalized exponential ANOVA simultaneous component analysis (PE-ASCA), a new method for integrative analysis of data sets from multiple compartments or analytical platforms with the same underlying experimental design. Results Using two simulated cases, the result of simultaneous component analysis (SCA), penalized exponential simultaneous component analysis (P-ESCA) and ANOVA-simultaneous component analysis (ASCA) are compared with the proposed method. Furthermore, real metabolomics data obtained from NMR analysis of two different brains tissues (hypothalamus and midbrain) from the same piglets with an underlying experimental design is investigated by PE-ASCA. Conclusions This method provides an improved understanding of the common and distinct variation in response to different experimental factors.This publication has 30 references indexed in Scilit:
- Joint and individual variation explained (JIVE) for integrated analysis of multiple data typesThe Annals of Applied Statistics, 2013
- DISCO-SCA and Properly Applied GSVD as Swinging Methods to Find Common and Distinctive ProcessesPLOS ONE, 2012
- Choline metabolism provides novel insights into nonalcoholic fatty liver disease and its progressionCurrent Opinion in Gastroenterology, 2012
- Bovine colostrum is superior to enriched formulas in stimulating intestinal function and necrotising enterocolitis resistance in preterm pigsBritish Journal of Nutrition, 2010
- Statistical validation of megavariate effects in ASCABMC Bioinformatics, 2007
- Fusion of Mass Spectrometry-Based Metabolomics DataAnalytical Chemistry, 2005
- ANOVA-simultaneous component analysis (ASCA): a new tool for analyzing designed metabolomics dataBioinformatics, 2005
- Multilevel component analysis of time-resolved metabolic fingerprinting dataAnalytica Chimica Acta, 2005
- O2‐PLS, a two‐block (X–Y) latent variable regression (LVR) method with an integral OSC filterJournal of Chemometrics, 2003
- Linear ModelsTechnometrics, 1999