Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins
Top Cited Papers
Open Access
- 20 November 2014
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 43 (3), e15
- https://doi.org/10.1093/nar/gku1196
Abstract
The emergence of new sequencing technologies has facilitated the use of bacterial whole genome alignments for evolutionary studies and outbreak analyses. These datasets, of increasing size, often include examples of multiple different mechanisms of horizontal sequence transfer resulting in substantial alterations to prokaryotic chromosomes. The impact of these processes demands rapid and flexible approaches able to account for recombination when reconstructing isolates’ recent diversification. Gubbins is an iterative algorithm that uses spatial scanning statistics to identify loci containing elevated densities of base substitutions suggestive of horizontal sequence transfer while concurrently constructing a maximum likelihood phylogeny based on the putative point mutations outside these regions of high sequence diversity. Simulations demonstrate the algorithm generates highly accurate reconstructions under realistically parameterized models of bacterial evolution, and achieves convergence in only a few hours on alignments of hundreds of bacterial genome sequences. Gubbins is appropriate for reconstructing the recent evolutionary history of a variety of haploid genotype alignments, as it makes no assumptions about the underlying mechanism of recombination. The software is freely available for download at github.com/sanger-pathogens/Gubbins, implemented in Python and C and supported on Linux and Mac OS X.Keywords
This publication has 57 references indexed in Scilit:
- Genomic Characterisation of Invasive Non-Typhoidal Salmonella enterica Subspecies enterica Serovar Bovismorbificans Isolates from MalawiPLoS Neglected Tropical Diseases, 2013
- Whole-genome sequencing to identify transmission of Mycobacterium abscessus between patients with cystic fibrosis: a retrospective cohort studyThe Lancet, 2013
- Chromosome Painting In Silico in a Bacterial Species Reveals Fine Population StructureMolecular Biology and Evolution, 2013
- Intracontinental spread of human invasive Salmonella Typhimurium pathovariants in sub-Saharan AfricaNature Genetics, 2012
- RAxML-Light: a tool for computing terabyte phylogeniesBioinformatics, 2012
- Whole-genome analysis of diverse Chlamydia trachomatis strains identifies phylogenetic relationships masked by current clinical typingNature Genetics, 2012
- Detection of recombination events in bacterial genomes from large population samplesNucleic Acids Research, 2011
- RDP3: a flexible and fast computer program for analyzing recombinationBioinformatics, 2010
- High-throughput sequencing provides insights into genome variation and evolution in Salmonella TyphiNature Genetics, 2008
- How clonal are bacteria?Proceedings of the National Academy of Sciences of the United States of America, 1993