AlbaTraDIS: Comparative analysis of large datasets from parallel transposon mutagenesis experiments
Open Access
- 17 July 2020
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 16 (7), e1007980
- https://doi.org/10.1371/journal.pcbi.1007980
Abstract
Bacteria need to survive in a wide range of environments. Currently, there is an incomplete understanding of the genetic basis for mechanisms underpinning survival in stressful conditions, such as the presence of anti-microbials. Transposon directed insertion-site sequencing (TraDIS) is a powerful tool to identify genes and networks which are involved in survival and fitness under a given condition by simultaneously assaying the fitness of millions of mutants, thereby relating genotype to phenotype and contributing to an understanding of bacterial cell biology. A recent refinement of this approach allows the roles of essential genes in conditional stress survival to be inferred by altering their expression. These advancements combined with the rapidly falling costs of sequencing now allows comparisons between multiple experiments to identify commonalities in stress responses to different conditions. This capacity however poses a new challenge for analysis of multiple data sets in conjunction. To address this analysis need, we have developed ‘AlbaTraDIS’; a software application for rapid large-scale comparative analysis of TraDIS experiments that predicts the impact of transposon insertions on nearby genes. AlbaTraDIS can identify genes which are up or down regulated, or inactivated, between multiple conditions, producing a filtered list of genes for further experimental validation as well as several accompanying data visualisations. We demonstrate the utility of our new approach by applying it to identify genes used by Escherichia coli to survive in a wide range of different concentrations of the biocide Triclosan. AlbaTraDIS identified all well characterised Triclosan resistance genes, including the primary target, fabI. A number of new loci were also implicated in Triclosan resistance and the predicted phenotypes for a selection of these were validated experimentally with results being consistent with predictions. AlbaTraDIS provides a simple and rapid method to analyse multiple transposon mutagenesis data sets allowing this technology to be used at large scale. To our knowledge this is the only tool currently available that can perform these tasks. AlbaTraDIS is written in Python 3 and is available under the open source licence GNU GPL 3 from https://github.com/quadram-institute-bioscience/albatradis.Keywords
Funding Information
- BBSRC Core Capability Grant (BB/CCG1860/1)
- BBSRC Institute Strategic Programme Microbes in the Food Chain (BB/R012504/1 and BBS/E/F/000PR10349)
This publication has 30 references indexed in Scilit:
- Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variationNucleic Acids Research, 2012
- Artemis: an integrated platform for visualization and analysis of high-throughput sequence-based experimental dataBioinformatics, 2011
- A scaling normalization method for differential expression analysis of RNA-seq dataGenome Biology, 2010
- Simultaneous assay of every Salmonella Typhi gene using one million transposon mutantsGenome Research, 2009
- Tracking insertion mutants within libraries by deep sequencing and a genome-wide screen for Haemophilus genes required in the lungProceedings of the National Academy of Sciences of the United States of America, 2009
- Tn-seq: high-throughput parallel sequencing for fitness and genetic interaction studies in microorganismsNature Methods, 2009
- Identifying Genetic Determinants Needed to Establish a Human Gut Symbiont in Its HabitatCell Host & Microbe, 2009
- Exposure of Escherichia coli and Salmonella enterica serovar Typhimurium to triclosan induces a species-specific response, including drug detoxificationJournal of Antimicrobial Chemotherapy, 2009
- An array of Escherichia coli clones over-expressing essential proteins: A new strategy of identifying cellular targets of potent antibacterial compoundsBiochemical and Biophysical Research Communications, 2006
- Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction NetworksGenome Research, 2003