Discovery of Fur binding site clusters in Escherichia coli by information theory models

Abstract
Fur is a DNA binding protein that represses bacterial iron uptake systems. Eleven footprinted Escherichia coli Fur binding sites were used to create an initial information theory model of Fur binding, which was then refined by adding 13 experimentally confirmed sites. When the refined model was scanned across all available footprinted sequences, sequence walkers, which are visual depictions of predicted binding sites, frequently appeared in clusters that fit the footprints ( ∼ 83% coverage). This indicated that the model can accurately predict Fur binding. Within the clusters, individual walkers were separated from their neighbors by exactly 3 or 6 bases, consistent with models in which Fur dimers bind on different faces of the DNA helix. When the E. coli genome was scanned, we found 363 unique clusters, which includes all known Fur-repressed genes that are involved in iron metabolism. In contrast, only a few of the known Fur-activated genes have predicted Fur binding sites at their promoters. These observations suggest that Fur is either a direct repressor or an indirect activator. The Pseudomonas aeruginosa and Bacillus subtilis Fur models are highly similar to the E. coli Fur model, suggesting that the Fur–DNA recognition mechanism may be conserved for even distantly related bacteria.