Using deep sequencing to characterize the biophysical mechanism of a transcriptional regulatory sequence
- 3 May 2010
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences of the United States of America
- Vol. 107 (20), 9158-9163
- https://doi.org/10.1073/pnas.1004290107
Abstract
Cells use protein-DNA and protein-protein interactions to regulate transcription. A biophysical understanding of this process has, however, been limited by the lack of methods for quantitatively characterizing the interactions that occur at specific promoters and enhancers in living cells. Here we show how such biophysical information can be revealed by a simple experiment in which a library of partially mutated regulatory sequences are partitioned according to their in vivo transcriptional activities and then sequenced en masse. Computational analysis of the sequence data produced by this experiment can provide precise quantitative information about how the regulatory proteins at a specific arrangement of binding sites work together to regulate transcription. This ability to reliably extract precise information about regulatory biophysics in the face of experimental noise is made possible by a recently identified relationship between likelihood and mutual information. Applying our experimental and computational techniques to the Escherichia coli lac promoter, we demonstrate the ability to identify regulatory protein binding sites de novo, determine the sequence-dependent binding energy of the proteins that bind these sites, and, importantly, measure the in vivo interaction energy between RNA polymerase and a DNA-bound transcription factor. Our approach provides a generally applicable method for characterizing the biophysical basis of transcriptional regulation by a specified regulatory sequence. The principles of our method can also be applied to a wide range of other problems in molecular biology.This publication has 30 references indexed in Scilit:
- Deciphering a transcriptional regulatory code: modeling short‐range repression in the Drosophila embryoMolecular Systems Biology, 2010
- High-resolution analysis of DNA regulatory elements by synthetic saturation mutagenesisNature Biotechnology, 2009
- Analysis of combinatorial cis-regulation in synthetic and genomic promotersNature, 2008
- Predicting expression patterns from regulatory sequence in Drosophila segmentationNature, 2008
- Combinatorial transcriptional control of the lactose operon of Escherichia coliProceedings of the National Academy of Sciences of the United States of America, 2007
- Precise physical models of protein–DNA interaction from high-throughput dataProceedings of the National Academy of Sciences of the United States of America, 2007
- Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificitiesNature Biotechnology, 2006
- A comprehensive library of fluorescent transcriptional reporters for Escherichia coliNature Methods, 2006
- Genome-Wide Location and Function of DNA Binding ProteinsScience, 2000
- Selection of DNA binding sites by regulatory proteins: II. The binding specificity of cyclic AMP receptor protein to recognition sitesJournal of Molecular Biology, 1988