A Single-Step Sequencing Method for the Identification of Mycobacterium tuberculosis Complex Species

Abstract
The Mycobacterium tuberculosis complex (MTC) comprises closely related species responsible for strictly human and zoonotic tuberculosis. Accurate species determination is useful for the identification of outbreaks and epidemiological links. Mycobacterium africanum and Mycobacterium canettii are typically restricted to Africa and M. bovis is a re-emerging pathogen. Identification of these species is difficult and expensive. The Exact Tandem Repeat D (ETR-D; alias Mycobacterial Interspersed Repetitive Unit 4) was sequenced in MTC species type strains and 110 clinical isolates, in parallel to reference polyphasic identification based on phenotype profiling and sequencing of pncA, oxyR, hsp65, gyrB genes and the major polymorphism tandem repeat. Inclusion of M. tuberculosis isolates in the expanding, antibiotic-resistant Beijing clone was determined by Rv0927c gene sequencing. The ETR-D (780-bp) sequence unambiguously identified MTC species type strain except M. pinnipedii and M. microti thanks to six single nucleotide polymorphisms, variable numbers (1–7 copies) of the tandem repeat and two deletions/insertions. The ETR-D sequencing agreed with phenotypic identification in 107/110 clinical isolates and with reference polyphasic molecular identification in all isolates, comprising 98 M. tuberculosis, 5 M. bovis BCG type, 5 M. canettii, and 2 M. africanum. For M. tuberculosis isolates, the ETR-D sequence was not significantly associated with the Beijing clone. ETR-D sequencing allowed accurate, single-step identification of the MTC at the species level. It circumvented the current expensive, time-consuming polyphasic approach. It could be used to depict epidemiology of zoonotic and human tuberculosis, especially in African countries where several MTC species are emerging. The Mycobacterium tuberculosis complex (MTC) comprises several closely related species responsible for strictly human and zoonotic tuberculosis. Some of the species are restricted to Africa and were responsible for the high prevalence of tuberculosis. However, their identification at species level is difficult and expansive. Accurate species identification of all members is warranted in order to distinguish between strict human and zoonotic tuberculosis, to trace source exposure during epidemiological studies, and for the appropriate treatment of patients. In this paper, the Exact Tandem Repeat D (ETR-D) intergenic region was investigated in order to distinguish MTC species. The ETR-D sequencing unambiguously identified MTC species type strain except M. pinnipedii and M. microti, and the results agreed with phenotypic and molecular identification. This finding offers a new tool for the rapid and accurate identification of MTC species in a single sequencing reaction, replacing the current time-consuming polyphasic approach. Its use could assist public health interventions and aid in the control of zoonotic transmission in African countries, and could be of particular interest with the current emergence of multidrug-resistant and extended-resistance isolates.