Tuberculosis (TB) is the number one cause of human death due to an infectious disease. The causative agents of TB are a group of closely related bacteria known as the Mycobacterium tuberculosis ...complex (MTBC). As the MTBC exhibits a clonal population structure with low DNA sequence diversity, methods (such as multilocus sequence typing) that are applied to more genetically diverse bacteria are uninformative, and much of the ecology and evolution of the MTBC has therefore remained unknown. Owing to recent advances in whole-genome sequencing and analyses of large collections of MTBC clinical isolates from around the world, many new insights have been gained, including a better understanding of the origin of the MTBC as an obligate pathogen and its molecular evolution and population genetic characteristics both within and between hosts, as well as many aspects related to antibiotic resistance. The purpose of this Review is to summarize these recent discoveries and discuss their relevance for developing better tools and strategies to control TB.
Summary
The causative agent of human tuberculosis (TB), Mycobacterium tuberculosis, is an obligate pathogen that evolved to exclusively persist in human populations. For M. tuberculosis to transmit ...from person to person, it has to cause pulmonary disease. Therefore, M. tuberculosis virulence has likely been a significant determinant of the association between M. tuberculosis and humans. Indeed, the evolutionary success of some M. tuberculosis genotypes seems at least partially attributable to their increased virulence. The latter possibly evolved as a consequence of human demographic expansions. If co‐evolution occurred, humans would have counteracted to minimize the deleterious effects of M. tuberculosis virulence. The fact that human resistance to infection has a strong genetic basis is a likely consequence of such a counter‐response. The genetic architecture underlying human resistance to M. tuberculosis remains largely elusive. However, interactions between human genetic polymorphisms and M. tuberculosis genotypes have been reported. Such interactions are consistent with local adaptation and allow for a better understanding of protective immunity in TB. Future ‘genome‐to‐genome’ studies, in which locally associated human and M. tuberculosis genotypes are interrogated in conjunction, will help identify new protective antigens for the development of better TB vaccines.
Video podcast available
Go to www.immunologicalreviews.com to watch an interview with Guest Editor Carl Nathan.
Summary New tools for controlling tuberculosis are urgently needed. Despite our emerging understanding of the biogeography of Mycobacterium tuberculosis , the implications for development of new ...diagnostics, drugs, and vaccines is unknown. M tuberculosis has a clonal genetic population structure that is geographically constrained. Evidence suggests strain-specific differences in virulence and immunogenicity in light of this global phylogeography. We propose a strain selection framework, based on robust phylogenetic markers, which will allow for systematic and comprehensive evaluation of new tools for tuberculosis control.
The molecular clock and its phylogenetic applications to genomic data have changed how we study and understand one of the major human pathogens, Mycobacterium tuberculosis (MTB), the etiologic agent ...of tuberculosis. Genome sequences of MTB strains sampled at different times are increasingly used to infer when a particular outbreak begun, when a drug-resistant clone appeared and expanded, or when a strain was introduced into a specific region. Despite the growing importance of the molecular clock in tuberculosis research, there is a lack of consensus as to whether MTB displays a clocklike behavior and about its rate of evolution. Here we performed a systematic study of the molecular clock of MTB on a large genomic data set (6,285 strains), covering different epidemiological settings and most of the known global diversity. We found that sampling times below 15-20 years were often insufficient to calibrate the clock of MTB. For data sets where such calibration was possible, we obtained a clock rate between 1x10-8 and 5x10-7 nucleotide changes per-site-per-year (0.04-2.2 SNPs per-genome-per-year), with substantial differences between clades. These estimates were not strongly dependent on the time of the calibration points as they changed only marginally when we used epidemiological isolates (sampled in the last 40 years) or three ancient DNA samples (about 1,000 years old) to calibrate the tree. Additionally, the uncertainty and the discrepancies in the results of different methods were sometimes large, highlighting the importance of using different methods, and of considering carefully their assumptions and limitations.
Bacterial factors favoring the unprecedented multidrug-resistant tuberculosis (MDR-TB) epidemic in the former Soviet Union remain unclear. We utilized whole genome sequencing and Bayesian statistics ...to analyze the evolutionary history, temporal emergence of resistance and transmission networks of MDR
complex isolates from Karakalpakstan, Uzbekistan (2001-2006). One clade (termed Central Asian outbreak, CAO) dating back to 1974 (95% HPD 1969-1982) subsequently acquired resistance mediating mutations to eight anti-TB drugs. Introduction of standardized WHO-endorsed directly observed treatment, short-course in Karakalpakstan in 1998 likely selected for CAO-strains, comprising 75% of sampled MDR-TB isolates in 2005/2006. CAO-isolates were also identified in a published cohort from Russia (2008-2010). Similarly, the presence of mutations supposed to compensate bacterial fitness deficits was associated with transmission success and higher drug resistance rates. The genetic make-up of these MDR-strains threatens the success of both empirical and standardized MDR-TB therapies, including the newly WHO-endorsed short MDR-TB regimen in Uzbekistan.
The Mycobacterium tuberculosis Complex (MTBC) consists of a clonal group of several mycobacterial lineages pathogenic to a range of different mammalian hosts. In this chapter, we discuss the origins ...and the evolutionary forces shaping the genomic diversity of the human-adapted MTBC. Advances in whole-genome sequencing have brought invaluable insights into the macro-evolution of the MTBC, and the biogeographical distribution of the different MTBC lineages, the phylogenetic relationships between these lineages. Moreover, micro-evolutionary processes start to be better understood, including those influencing bacterial mutation rates and those governing the fate of new mutations emerging within patients during treatment. Current genomic and epidemiological evidence reflect the fact that, through ecological specialization, the MTBC affecting humans became an obligate and extremely well-adapted human pathogen. Identifying the adaptive traits of human-adapted MTBC and unraveling the bacterial loci that interact with human genomic variation might help identify new targets for developing better vaccines and designing more effective treatments.
Renewed efforts in tuberculosis (TB) research have led to important new insights into the biology and epidemiology of this devastating disease. Yet, in the face of the modern epidemics of HIV/AIDS, ...diabetes, and multidrug resistance--all of which contribute to susceptibility to TB--global control of the disease will remain a formidable challenge for years to come. New high-throughput genomics technologies are already contributing to studies of TB's epidemiology, comparative genomics, evolution, and host-pathogen interaction. We argue here, however, that new multidisciplinary approaches--especially the integration of epidemiology with systems biology in what we call "systems epidemiology"--will be required to eliminate TB.
Epidemics of drug-resistant bacteria emerge worldwide, even as resistant strains frequently have reduced fitness compared to their drug-susceptible counterparts. Data from model systems suggest that ...the fitness cost of antimicrobial resistance can be reduced by compensatory mutations; however, there is limited evidence that compensatory evolution has any significant role in the success of drug-resistant bacteria in human populations. Here we describe a set of compensatory mutations in the RNA polymerase genes of rifampicin-resistant M. tuberculosis, the etiologic agent of human tuberculosis (TB). M. tuberculosis strains harboring these compensatory mutations showed a high competitive fitness in vitro. Moreover, these mutations were associated with high fitness in vivo, as determined by examining their relative clinical frequency across patient populations. Of note, in countries with the world's highest incidence of multidrug-resistant (MDR) TB, more than 30% of MDR clinical isolates had this form of mutation. Our findings support a role for compensatory evolution in the global epidemics of MDR TB.
High-throughput DNA sequencing produces vast amounts of data, with millions of short reads that usually have to be mapped to a reference genome or newly assembled. Both reference-based mapping and de ...novo assembly are computationally intensive, generating large intermediary data files, and thus require bioinformatics skills that are often lacking in the laboratories producing the data. Moreover, many research and practical applications in microbiology require only a small fraction of the whole genome data.
We developed KvarQ, a new tool that directly scans fastq files of bacterial genome sequences for known variants, such as single nucleotide polymorphisms (SNP), bypassing the need of mapping all sequencing reads to a reference genome and de novo assembly. Instead, KvarQ loads "testsuites" that define specific SNPs or short regions of interest in a reference genome, and directly synthesizes the relevant results based on the occurrence of these markers in the fastq files. KvarQ has a versatile command line interface and a graphical user interface. KvarQ currently ships with two "testsuites" for Mycobacterium tuberculosis, but new "testsuites" for other organisms can easily be created and distributed. In this article, we demonstrate how KvarQ can be used to successfully detect all main drug resistance mutations and phylogenetic markers in 880 bacterial whole genome sequences. The average scanning time per genome sequence was two minutes. The variant calls of a subset of these genomes were validated with a standard bioinformatics pipeline and revealed >99% congruency.
KvarQ is a user-friendly tool that directly extracts relevant information from fastq files. This enables researchers and laboratory technicians with limited bioinformatics expertise to scan and analyze raw sequencing data in a matter of minutes. KvarQ is open-source, and pre-compiled packages with a graphical user interface are available at http://www.swisstph.ch/kvarq.