Using Genomic Sequencing for Classical Genetics in E. coli K12
Open Access
- 25 February 2011
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 6 (2), e16717
- https://doi.org/10.1371/journal.pone.0016717
Abstract
We here develop computational methods to facilitate use of 454 whole genome shotgun sequencing to identify mutations in Escherichia coli K12. We had Roche sequence eight related strains derived as spontaneous mutants in a background without a whole genome sequence. They provided difference tables based on assembling each genome to reference strain E. coli MG1655 (NC_000913). Due to the evolutionary distance to MG1655, these contained a large number of both false negatives and positives. By manual analysis of the dataset, we detected all the known mutations (24 at nine locations) and identified and genetically confirmed new mutations necessary and sufficient for the phenotypes we had selected in four strains. We then had Roche assemble contigs de novo, which we further assembled to full-length pseudomolecules based on synteny with MG1655. This hybrid method facilitated detection of insertion mutations and allowed annotation from MG1655. After removing one genome with less than the optimal 20- to 30-fold sequence coverage, we identified 544 putative polymorphisms that included all of the known and selected mutations apart from insertions. Finally, we detected seven new mutations in a total of only 41 candidates by comparing single genomes to composite data for the remaining six and using a ranking system to penalize homopolymer sequencing and misassembly errors. An additional benefit of the analysis is a table of differences between MG1655 and a physiologically robust E. coli wild-type strain NCM3722. Both projects were greatly facilitated by use of comparative genomics tools in the CoGe software package (http://genomevolution.org/).Keywords
This publication has 46 references indexed in Scilit:
- Assembly of large genomes using second-generation sequencingGenome Research, 2010
- Understanding the Differences between Genome Sequences of Escherichia coli B Strains REL606 and BL21(DE3) and Comparison of the E. coli B and K-12 GenomesJournal of Molecular Biology, 2009
- Genome evolution and adaptation in a long-term experiment with Escherichia coliNature, 2009
- Increased expression of Mg 2+ transport proteins enhances the survival of Salmonella enterica at high temperatureProceedings of the National Academy of Sciences of the United States of America, 2009
- Genetic Suppressors and Recovery of Repressed Biochemical MemoryOnline Journal of Public Health Informatics, 2009
- De novo assembly using low-coverage short read sequence data from the rice pathogen Pseudomonas syringae pv. oryzaeGenome Research, 2008
- Comprehensive mutation identification in an evolved bacterial cooperator and its cheating ancestorProceedings of the National Academy of Sciences of the United States of America, 2006
- Construction of Escherichia coli K‐12 in‐frame, single‐gene knockout mutants: the Keio collectionMolecular Systems Biology, 2006
- The Complete Genome Sequence of Escherichia coli K-12Science, 1997
- tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic SequenceNucleic Acids Research, 1997