Deciphering programs of transcriptional regulation by combined deconvolution of multiple omics layers
Preprint
- 8 October 2017
- research article
- Published by Cold Spring Harbor Laboratory
- p. 199547
- https://doi.org/10.1101/199547
Abstract
Metazoans depend crucially on multiple layers of gene regulatory mechanisms, which allow them to control gene expression across developmental stages, tissues and cell types. Several recent research consortia have aimed to generate comprehensive datasets that profile the activity of these cell type- and condition-specific regulatory landscapes across many different cell lines and primary cells. However, extracting the genes or regulatory elements specific to certain entities from these datasets remains challenging. Here we propose a novel method based on non-negative matrix factorization (NMF) for disentangling and associating huge multi-assay datasets, including chromatin accessibility and gene expression data. Taking advantage of implementations of NMF algorithms in the GPU CUDA environment, full datasets composed of tens of thousands of genes and hundreds of samples can be processed without prior feature selection to reduce the input size. Applying this framework to multiple layers of genomic data derived from human blood cells, we unravel mechanisms that regulate cell type-specific expression in T-cells and monocytes.
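For readers unfamiliar with the core technique, the following is a minimal sketch of plain non-negative matrix factorization. It is not the authors' implementation. It assumes the Frobenius-norm objective with the classic multiplicative update rules, runs on the CPU with NumPy rather than in a GPU CUDA environment, and uses small synthetic data in place of real omics matrices. The function name `nmf`, its parameters, and the toy dimensions are all hypothetical, and the coupling of several omics layers described in the abstract is not shown.

```python
import numpy as np


def nmf(V, k, n_iter=200, seed=0, eps=1e-9):
    """Approximate a non-negative matrix V (features x samples) as W @ H.

    Uses multiplicative updates for the Frobenius-norm objective.
    W (features x k) holds per-feature loadings for each signature;
    H (k x samples) holds the exposure of each sample to each signature.
    """
    rng = np.random.default_rng(seed)
    n_features, n_samples = V.shape
    # Random strictly positive initialisation (an illustrative choice).
    W = rng.random((n_features, k)) + eps
    H = rng.random((k, n_samples)) + eps
    for _ in range(n_iter):
        # Element-wise updates keep both factors non-negative.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


# Synthetic stand-in for an expression matrix: 50 "genes" x 12 "samples",
# generated from 3 hidden non-negative signatures (hypothetical toy data).
rng = np.random.default_rng(1)
true_W = rng.random((50, 3))
true_H = rng.random((3, 12))
V = true_W @ true_H

W, H = nmf(V, k=3)
rel_error = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
print(f"relative reconstruction error: {rel_error:.4f}")
```

In this sketch the rows of `H` can be read as sample-level signatures and the columns of `W` as the features that contribute to each signature. The number of signatures `k` is an input that has to be chosen by the user.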