A Customizable Analysis Flow in Integrative Multi-Omics
Open Access
- 27 November 2020
- journal article
- review article
- Published by MDPI AG in Biomolecules
- Vol. 10 (12), 1606
- https://doi.org/10.3390/biom10121606
Abstract
The number of researchers using multi-omics is growing. Though still expensive, multi-omic studies become cheaper every year, often exponentially so. In addition to its increasing accessibility, multi-omics reveals a view of systems biology at an unprecedented depth. Thus, multi-omics can be used to answer a broad range of biological questions at finer resolution than previous methods. We used six omic measurements, four based on nucleic acids (genomics, epigenomics, transcriptomics, and metagenomics) and two based on mass spectrometry (proteomics and metabolomics), to highlight an analysis workflow for this type of data, which is often vast. This workflow is not exhaustive of all omic measurements or analysis methods, but it will provide an experienced or even a novice multi-omic researcher with the tools necessary to analyze their data. This review begins with the analysis of a single ome and with study design, and then synthesizes best practices in data integration techniques, including machine learning. Furthermore, we delineate methods to validate findings from multi-omic integration. Ultimately, multi-omic integration offers a window into the complexity of molecular interactions and a comprehensive view of systems biology.
This publication has 58 references indexed in Scilit:
- STAR: ultrafast universal RNA-seq aligner. Bioinformatics, 2012
- An integrated encyclopedia of DNA elements in the human genome. Nature, 2012
- Metagenomic microbial community profiling using unique clade-specific marker genes. Nature Methods, 2012
- Personal Omics Profiling Reveals Dynamic Molecular and Medical Phenotypes. Cell, 2012
- Fast gapped-read alignment with Bowtie 2. Nature Methods, 2012
- The variant call format and VCFtools. Bioinformatics, 2011
- Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics, 2010
- edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics, 2009
- Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature Protocols, 2008
- Bacteriophage genomics. Current Opinion in Microbiology, 2008