Interpreting whole genome sequencing for investigating tuberculosis transmission: a systematic review
Open Access
- 22 March 2016
- journal article
- review article
- Published by Springer Science and Business Media LLC in BMC Medicine
- Vol. 14 (1), 1-13
- https://doi.org/10.1186/s12916-016-0566-x
Abstract
Background: Whole genome sequencing (WGS) is becoming an important part of epidemiological investigations of infectious diseases due to greater resolution and cost reductions compared to traditional typing approaches. Many public health and clinical teams will increasingly use WGS to investigate clusters of potential pathogen transmission, making it crucial to understand the benefits and assumptions of the analytical methods for investigating the data. We aimed to understand how different approaches affect inferences of transmission dynamics and outline limitations of the methods. Methods: We comprehensively searched electronic databases for studies that presented methods used to interpret WGS data for investigating tuberculosis (TB) transmission. Two authors independently selected studies for inclusion and extracted data. Due to considerable methodological heterogeneity between studies, we present summary data with accompanying narrative synthesis rather than pooled analyses. Results: Twenty-five studies met our inclusion criteria. Despite the range of interpretation tools, the usefulness of WGS data in understanding TB transmission often depends on the amount of genetic diversity in the setting. Where diversity is small, distinguishing re-infections from relapses may be impossible; interpretation may be aided by the use of epidemiological data, examining minor variants and deep sequencing. Conversely, when within-host diversity is large, due to genetic hitchhiking or co-infection of two dissimilar strains, it is critical to understand how it arose. Greater understanding of microevolution and mixed infection will enhance interpretation of WGS data. Conclusions: As sequencing studies have sampled more intensely and integrated multiple sources of information, the understanding of TB transmission and diversity has grown, but there is still much to be learnt about the origins of diversity that will affect inferences from these data. Public health teams and researchers should combine epidemiological, clinical and WGS data to strengthen investigations of transmission.Funding Information
- Engineering and Physical Sciences Research Council (EP/K026003/1)
- National Institute for Health Research (PDF-2014-07-008)
- Medical Research Council
- Engineering and Physical Sciences Research Council
- University College London
- Public Health England
This publication has 75 references indexed in Scilit:
- Multidrug-Resistant Tuberculosis in Panama Is Driven by Clonal Expansion of a Multidrug-Resistant Mycobacterium tuberculosis Strain Related to the KZN Extensively Drug-Resistant M. tuberculosis Strain from South AfricaJournal of Clinical Microbiology, 2013
- Whole-genome sequencing to delineate Mycobacterium tuberculosis outbreaks: a retrospective observational studyThe Lancet Infectious Diseases, 2013
- Mycobacterium tuberculosis – Heterogeneity revealed through whole genome sequencingTuberculosis, 2012
- A tale of three next generation sequencing platforms: comparison of Ion torrent, pacific biosciences and illumina MiSeq sequencersBMC Genomics, 2012
- Use of whole genome sequencing to estimate the mutation rate of Mycobacterium tuberculosis during latent infectionNature Genetics, 2011
- The impact of next-generation sequencing on genomicsJournal of Genetics and Genomics, 2011
- Microevolution of Mycobacterium tuberculosis in a Tuberculosis PatientJournal of Clinical Microbiology, 2010
- High-Resolution Typing by Integration of Genome Sequencing Data in a Large Tuberculosis ClusterJournal of Clinical Microbiology, 2010
- Reconstructing disease outbreaks from genetic data: a graph approachHeredity, 2010
- Evidence for Recombination inMycobacterium tuberculosisJournal of Bacteriology, 2006