Abstract
Enterobacterial repetitive intergenic consensus (ERIC) sequences are 127-bp imperfect palindromes that occur in multiple copies in the genomes of enteric bacteria and vibrios. Here we investigate the distribution of these elements in the complete genome sequences of nine Escherichia coli (including Shigella species) strains. There is a significant tendency for copies to be adjacent to more highly expressed genes. There is considerable variation among strains with respect to the presence of an element in any particular intergenic region, but some copies appear to have been conserved since before the divergence of E. coli and Salmonella enterica. In comparisons of orthologous copies between these species, ERIC sequences are surprisingly conserved, implying that they have acquired some function, perhaps related to mRNA stability. The relationships among copies within E. coli are consistent with a master copy mode of generation. Insertion of new copies seems to occur at, and involve duplication of, the dinucleotide TA. Two classes of inserts of about 70 bp each occur at different specific sites within ERIC sequences; these inserts evolve independently of the ERIC sequences. The small number of ERIC sequences in E. coli genomes indicates that a widely used bacterial fingerprinting method using primers based on ERIC sequences (ERIC-PCR) does not rely on the presence of ERIC sequences.