Sequence and chromatin determinants of cell-type–specific transcription factor binding
Open Access
- 5 September 2012
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 22 (9), 1723-1734
- https://doi.org/10.1101/gr.127712.111
Abstract
Gene regulatory programs in distinct cell types are maintained in large part through the cell-type–specific binding of transcription factors (TFs). The determinants of TF binding include direct DNA sequence preferences, DNA sequence preferences of cofactors, and the local cell-dependent chromatin context. To explore the contribution of DNA sequence signal, histone modifications, and DNase accessibility to cell-type–specific binding, we analyzed 286 ChIP-seq experiments performed by the ENCODE Consortium. This analysis included experiments for 67 transcriptional regulators, 15 of which were profiled in both the GM12878 (lymphoblastoid) and K562 (erythroleukemic) human hematopoietic cell lines. To model TF-bound regions, we trained support vector machines (SVMs) that use flexible k-mer patterns to capture DNA sequence signals more accurately than traditional motif approaches. In addition, we trained SVM spatial chromatin signatures to model local histone modifications and DNase accessibility, obtaining significantly more accurate TF occupancy predictions than simpler approaches. Consistent with previous studies, we find that DNase accessibility can explain cell-line–specific binding for many factors. However, we also find that of the 10 factors with prominent cell-type–specific binding patterns, four display distinct cell-type–specific DNA sequence preferences according to our models. Moreover, for two factors we identify cell-specific binding sites that are accessible in both cell types but bound only in one. For these sites, cell-type–specific sequence models, rather than DNase accessibility, are better able to explain differential binding. Our results suggest that using a single motif for each TF and filtering for chromatin accessible loci is not always sufficient to accurately account for cell-type–specific binding profiles.Keywords
This publication has 47 references indexed in Scilit:
- The accessible chromatin landscape of the human genomeNature, 2012
- Master Transcription Factors Determine Cell-Type-Specific Responses to TGF-β SignalingCell, 2011
- DNA methylation status predicts cell type-specific enhancer activityThe EMBO Journal, 2011
- Mapping and analysis of chromatin state dynamics in nine human cell typesNature, 2011
- Discovery and characterization of chromatin states for systematic annotation of the human genomeNature Biotechnology, 2010
- Simple Combinations of Lineage-Determining Transcription Factors Prime cis-Regulatory Elements Required for Macrophage and B Cell IdentitiesMolecular Cell, 2010
- YY1's longer DNA-binding motifsGenomics, 2009
- Design and analysis of ChIP-seq experiments for DNA-binding proteinsNature Biotechnology, 2008
- Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genomeNature Genetics, 2007
- An algorithm for finding protein–DNA binding sites with applications to chromatin- immunoprecipitation microarray experimentsNature Biotechnology, 2002