Rigorous pattern-recognition methods for DNA sequences: Analysis of promoter sequences from Escherichia coli
- 5 November 1985
- journal article
- Published by Elsevier BV in Journal of Molecular Biology
- Vol. 186 (1), 117-128
- https://doi.org/10.1016/0022-2836(85)90262-1
Abstract
The basic nature of the sequence features that define a promoter sequence for Escherichia coli RNA polymerase has been established by a variety of biochemical and genetic methods. We have developed rigorous analytical methods for finding unknown patterns that occur imperfectly in a set of several sequences, and have used them to examine a set of bacterial promoters. The algorithm easily discovers the “consensus” sequences for the −10 and −35 regions, which are essentially identical to the results of previous analyses, but requires no prior assumptions about the common patterns. By explicitly specifying the nature of the search for consensus sequences, we give a rigorous definition to this concept that should be widely applicable. We have also provided estimates for the statistical significance of common patterns discovered in sets of sequences.
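The idea of searching for an unknown pattern that occurs imperfectly across several sequences can be illustrated with a minimal sketch. This is not the algorithm from the paper, whose details the abstract does not give; it is a generic brute-force search that assumes a fixed word length `k` and a fixed mismatch allowance `max_mismatches`. The function names and the toy sequences are hypothetical.

```python
from itertools import product


def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def occurs_within(word, seq, max_mismatches):
    """True if some window of seq matches word with at most max_mismatches differences."""
    k = len(word)
    return any(
        hamming(word, seq[i:i + k]) <= max_mismatches
        for i in range(len(seq) - k + 1)
    )


def best_consensus(seqs, k, max_mismatches):
    """Try every DNA word of length k and return the one found (imperfectly)
    in the most sequences, with that count. Ties go to the first word in
    alphabetical order. Assumed scoring rule, chosen only for illustration."""
    best_word, best_count = None, -1
    for letters in product("ACGT", repeat=k):
        word = "".join(letters)
        count = sum(occurs_within(word, s, max_mismatches) for s in seqs)
        if count > best_count:
            best_word, best_count = word, count
    return best_word, best_count


# Toy sequences (made up), each carrying a variant of a -10-like hexamer.
toy_seqs = ["GGCTATAATGCC", "CCTATACTGGAA", "AATAAAATCCGG"]
word, count = best_consensus(toy_seqs, k=6, max_mismatches=1)
print(word, count)
```

Exhaustive enumeration costs 4^k candidate words, so it is only practical for short patterns. The sketch also does not estimate statistical significance, which the abstract names as a separate contribution.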