Discovery of directional and nondirectional pioneer transcription factors by modeling DNase profile magnitude and shape
Open Access
- 19 January 2014
- journal article
- research article
- Published by Springer Science and Business Media LLC in Nature Biotechnology
- Vol. 32 (2), 171-178
- https://doi.org/10.1038/nbt.2798
Abstract
A new algorithm learns features of chromatin profiles left by transcription factors, shedding light on when DNA sequence motifs are bound. We describe protein interaction quantitation (PIQ), a computational method for modeling the magnitude and shape of genome-wide DNase I hypersensitivity profiles to identify transcription factor (TF) binding sites. Through the use of machine-learning techniques, PIQ identified binding sites for >700 TFs from one DNase I hypersensitivity analysis followed by sequencing (DNase-seq) experiment with accuracy comparable to that of chromatin immunoprecipitation followed by sequencing (ChIP-seq). We applied PIQ to analyze DNase-seq data from mouse embryonic stem cells differentiating into prepancreatic and intestinal endoderm. We identified 120 and experimentally validated eight 'pioneer' TF families that dynamically open chromatin. Four pioneer TF families only opened chromatin in one direction from their motifs. Furthermore, we identified 'settler' TFs whose genomic binding is principally governed by proximity to open chromatin. Our results support a model of hierarchical TF binding in which directional and nondirectional pioneer activity shapes the chromatin landscape for population by settler TFs.This publication has 48 references indexed in Scilit:
- Pioneer transcription factors: establishing competence for gene expressionGenes & Development, 2011
- Lineage Regulators Direct BMP and Wnt Pathways to Cell-Specific Programs during Differentiation and RegenerationCell, 2011
- Master Transcription Factors Determine Cell-Type-Specific Responses to TGF-β SignalingCell, 2011
- Study of FoxA Pioneer Factor at Silent Genes Reveals Rfx-Repressed Enhancer at Cdx2 and a Potential Indicator of Esophageal Adenocarcinoma DevelopmentPLoS Genetics, 2011
- The nucleosome map of the mammalian liverNature Structural & Molecular Biology, 2011
- G4 motifs correlate with promoter-proximal transcriptional pausing in human genesNucleic Acids Research, 2011
- Control of the Embryonic Stem Cell StateCell, 2011
- Chromatin accessibility pre-determines glucocorticoid receptor binding patternsNature Genetics, 2011
- Accurate inference of transcription factor binding from DNA sequence and chromatin accessibility dataGenome Research, 2010
- Pluripotency and Cellular Reprogramming: Facts, Hypotheses, Unresolved IssuesCell, 2010