Defining the DNA uptake specificity of naturally competent Haemophilus influenzae cells
Open Access
- 29 June 2012
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 40 (17), 8536-8549
- https://doi.org/10.1093/nar/gks640
Abstract
Some naturally competent bacteria exhibit both a strong preference for DNA fragments containing specific ‘uptake sequences’ and dramatic overrepresentation of these sequences in their genomes. Uptake sequences are often assumed to directly reflect the specificity of the DNA uptake machinery, but the actual specificity has not been well characterized for any bacterium. We produced a detailed analysis of Haemophilus influenzae’s uptake specificity, using Illumina sequencing of degenerate uptake sequences in fragments recovered from competent cells. This identified an uptake motif with the same consensus as the motif overrepresented in the genome, with a 9 bp core (AAGTGCGGT) and two short flanking T-rich tracts. Only four core bases (GCGG) were critical for uptake, suggesting that these make strong specific contacts with the uptake machinery. Other core bases had weaker roles when considered individually, as did the T-tracts, but interaction effects between these were also determinants of uptake. The properties of genomic uptake sequences (USSs) are also constrained by mutational biases and selective forces acting on USSs with coding and termination functions. Our findings define constraints on gene transfer by natural transformation and suggest how the DNA uptake machinery overcomes the physical constraints imposed by stiff, highly charged DNA molecules.
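As an illustration of the motif described in the abstract, the following minimal Python sketch scans a DNA fragment on both strands for the window closest to the 9 bp core (AAGTGCGGT) and reports whether the four critical bases (GCGG) are intact. This is not the authors' analysis, which derived a quantitative motif from sequencing data. The function names, the exact-match scoring, and the ranking rule (critical bases first, then fewest mismatches) are assumptions made for illustration only, and the flanking T-rich tracts and interaction effects are ignored.

```python
# Illustrative sketch only: simple mismatch counting against the core
# consensus reported in the abstract. Names and ranking rule are assumptions.

CORE = "AAGTGCGGT"          # 9 bp core from the abstract
CRITICAL = range(4, 8)      # positions of GCGG within the core (0-based)

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an A/C/G/T sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def best_core_match(seq):
    """Find the 9 bp window most similar to the core on either strand.

    Windows with all four critical bases intact are preferred; ties are
    broken by the number of mismatches to the full core. Returns None if
    the sequence is shorter than the core. On the '-' strand, 'start' is
    an offset into the reverse-complemented sequence.
    """
    candidates = []
    for strand, s in (("+", seq.upper()), ("-", reverse_complement(seq))):
        for start in range(len(s) - len(CORE) + 1):
            window = s[start:start + len(CORE)]
            mismatches = sum(a != b for a, b in zip(window, CORE))
            critical_intact = all(window[j] == CORE[j] for j in CRITICAL)
            candidates.append({
                "strand": strand,
                "start": start,
                "window": window,
                "mismatches": mismatches,
                "critical_intact": critical_intact,
            })
    if not candidates:
        return None
    return min(candidates, key=lambda c: (not c["critical_intact"], c["mismatches"]))


if __name__ == "__main__":
    print(best_core_match("TTTAAGTGCGGTCCC"))
```

A fragment whose best window has `critical_intact` set to `False` would, under the abstract's findings, be expected to be taken up poorly, but this sketch makes no quantitative prediction.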