Genotype to Phenotype Mapping and the Fitness Landscape of the E. coli lac Promoter
Open Access
- 1 May 2013
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 8 (5), e61570
- https://doi.org/10.1371/journal.pone.0061570
Abstract
Genotype-to-phenotype maps and the related fitness landscapes that include epistatic interactions are difficult to measure because of their high dimensional structure. Here we construct such a map using the recently collected corpora of high-throughput sequence data from the 75 base pairs long mutagenized E. coli lac promoter region, where each sequence is associated with its phenotype, the induced transcriptional activity measured by a fluorescent reporter. We find that the additive (non-epistatic) contributions of individual mutations account for about two-thirds of the explainable phenotype variance, while pairwise epistasis explains about 7% of the variance for the full mutagenized sequence and about 15% for the subsequence associated with protein binding sites. Surprisingly, there is no evidence for third order epistatic contributions, and our inferred fitness landscape is essentially single peaked, with a small amount of antagonistic epistasis. There is a significant selective pressure on the wild type, which we deduce to be multi-objective optimal for gene expression in environments with different nutrient sources. We identify transcription factor (CRP) and RNA polymerase binding sites in the promotor region and their interactions without difficult optimization steps. In particular, we observe evidence for previously unexplored genetic regulatory mechanisms, possibly kinetic in nature. We conclude with a cautionary note that inferred properties of fitness landscapes may be severely influenced by biases in the sequence data.Other Versions
This publication has 51 references indexed in Scilit:
- Operator Sequence Alters Gene Expression Independently of Transcription Factor Occupancy in BacteriaCell Reports, 2012
- Reciprocal sign epistasis is a necessary condition for multi-peaked fitness landscapesJournal of Theoretical Biology, 2011
- Quantitative analysis of fitness and genetic interactions in yeast on a genome scaleNature Methods, 2010
- Sparse coding and high-order correlations in fine-scale cortical networksNature, 2010
- Genome-wide identification of post-translational modulators of transcription factor activity in human B cellsNature Biotechnology, 2009
- Epistasis — the essential role of gene interactions in the structure and evolution of genetic systemsNature Reviews Genetics, 2008
- Next-generation DNA sequencingNature Biotechnology, 2008
- Energy-dependent fitness: A quantitative model for the evolution of yeast transcription factor binding sitesProceedings of the National Academy of Sciences of the United States of America, 2008
- Combinatorial transcriptional control of the lactose operon of Escherichia coliProceedings of the National Academy of Sciences of the United States of America, 2007
- Weak pairwise correlations imply strongly correlated network states in a neural populationNature, 2006