Significance
The Mycobacterium tuberculosis Beijing family is a group of globally emerging bacterial strains responsible for more than a quarter of the global tuberculosis epidemic. Here, we combine whole-genome sequencing and large-scale genotyping to map the temporal and spatial changes in genetic diversity within this strain family. We reveal an origin in southern East Asia and a parallel evolution of this bacterial genotype with modern humans in East Asia during the last 30,000 years. The Beijing strains that recently emerged globally mainly belong to a hypervirulent subtype that was most likely initially selected for adaptation to increased population densities during the agricultural transition in northern China.
The Beijing family is the most successful genotype of Mycobacterium tuberculosis and is responsible for more than a quarter of the global tuberculosis epidemic. As the predominant genotype in East Asia, the Beijing family has been emerging in various areas of the world and is often associated with disease outbreaks and antibiotic resistance. Revealing the origin and historical dissemination of this strain family is important for understanding its current global success. Here we characterized the global diversity of this family based on whole-genome sequences of 358 Beijing strains. We show that the Beijing strains endemic in East Asia are genetically diverse, whereas the globally emerging strains mostly belong to a more homogeneous subtype known as "modern" Beijing. Phylogeographic and coalescent analyses indicate that the Beijing family most likely emerged around 30,000 years ago in southern East Asia, and accompanied the early colonization by modern humans in this area. By combining the genomic data and genotyping results of 1,793 strains from across China, we found that the "modern" Beijing sublineage experienced massive expansions in northern China during the Neolithic era and subsequently spread to other regions following the migration of Han Chinese. Our results support a parallel evolution of the Beijing family and modern humans in East Asia. The dominance of the "modern" Beijing sublineage in East Asia and its recent global emergence are most likely driven by its hypervirulence, which might reflect adaptation to increased human population densities linked to the agricultural transition in northern China.
Tuberculosis caused 20% of all human deaths in the Western world between the seventeenth and nineteenth centuries and remains a cause of high mortality in developing countries. In analogy to other crowd diseases, the origin of human tuberculosis has been associated with the Neolithic Demographic Transition, but recent studies point to a much earlier origin. We analyzed the whole genomes of 259 M. tuberculosis complex (MTBC) strains and used this data set to characterize global diversity and to reconstruct the evolutionary history of this pathogen. Coalescent analyses indicate that MTBC emerged about 70,000 years ago, accompanied migrations of anatomically modern humans out of Africa and expanded as a consequence of increases in human population density during the Neolithic period. This long coevolutionary history is consistent with MTBC displaying characteristics indicative of adaptation to both low and high host densities.
Wastewater-based epidemiology (WBE) has proven to be an effective tool for epidemiological surveillance of SARS-CoV-2 during the current COVID-19 pandemic. Furthermore, combining WBE with high-throughput sequencing techniques can be useful for analyzing the SARS-CoV-2 viral diversity present in a given sample. The present study focuses on the genomic analysis of SARS-CoV-2 in 76 sewage samples collected during the three epidemiological waves that occurred in Spain, from 14 wastewater treatment plants distributed throughout the country. The results demonstrate that metagenomic analysis of SARS-CoV-2 in wastewater allows the detection of mutations that define the B.1.1.7 lineage, and that the technique can detect certain mutations before they are found in clinical samples. The study demonstrates the usefulness of sewage sequencing for tracking Variants of Concern, which can complement clinical testing to help in decision-making and in the analysis of the evolution of the pandemic.
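As a rough illustration of how lineage-defining mutations might be screened in a sequenced wastewater sample, the sketch below computes the share of a lineage's signature substitutions that are detected above a frequency cutoff. The mutation subset, the 5% cutoff, the sample allele frequencies, and the function name lineage_support are all assumptions made for illustration; none of them is taken from the study.

```python
# Spike substitutions commonly listed as B.1.1.7-defining (illustrative subset only).
B117_SPIKE = {"N501Y", "A570D", "P681H", "T716I", "S982A", "D1118H"}


def lineage_support(observed, signature, min_freq=0.05):
    """Return (fraction of signature mutations detected, sorted detected list).

    observed maps mutation name -> allele frequency in the sample.
    min_freq is an assumed cutoff below which a mutation counts as not detected.
    """
    detected = sorted(m for m in signature if observed.get(m, 0.0) >= min_freq)
    return len(detected) / len(signature), detected


# Hypothetical allele frequencies from one sewage sample.
sample = {"N501Y": 0.42, "A570D": 0.38, "P681H": 0.40, "D614G": 0.97, "T716I": 0.02}

fraction, detected = lineage_support(sample, B117_SPIKE)
print(f"{fraction:.2f} of signature mutations detected: {detected}")
```

Because wastewater contains a mixture of viral genomes, a partial match like this one only suggests that a lineage may be present; it does not show that all mutations sit on the same genome.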
• Spatial and temporal analysis of SARS-CoV-2 sequences from Spanish wastewaters.
• Presence of amino acid substitutions in the spike protein not previously described in Spain.
• Detection of amino acid substitutions in the spike protein even months before their detection in clinical samples.
• SARS-CoV-2 genomics in wastewater as a complementary tool for WBE.
Molecular typing of 964 specimens from patients in Ethiopia with lymph node or pulmonary tuberculosis showed a similar distribution of Mycobacterium tuberculosis strains between the 2 disease manifestations and a minimal role for M. bovis. We report a novel phylogenetic lineage of M. tuberculosis strongly associated with the Horn of Africa.
Modern strains of Mycobacterium tuberculosis from the Americas are closely related to those from Europe, supporting the assumption that human tuberculosis was introduced post-contact. This notion, however, is incompatible with archaeological evidence of pre-contact tuberculosis in the New World. Comparative genomics of modern isolates suggests that M. tuberculosis attained its worldwide distribution following human dispersals out of Africa during the Pleistocene epoch, although this has yet to be confirmed with ancient calibration points. Here we present three 1,000-year-old mycobacterial genomes from Peruvian human skeletons, revealing that a member of the M. tuberculosis complex caused human disease before contact. The ancient strains are distinct from known human-adapted forms and are most closely related to those adapted to seals and sea lions. Two independent dating approaches suggest a most recent common ancestor for the M. tuberculosis complex less than 6,000 years ago, which supports a Holocene dispersal of the disease. Our results implicate sea mammals as having played a role in transmitting the disease to humans across the ocean.
RNA sequencing provides a new perspective on the genome of Mycobacterium tuberculosis by revealing an extensive presence of non-coding RNA, including long 5' and 3' untranslated regions, antisense transcripts, and intergenic small RNA (sRNA) molecules. More than a quarter of all sequence reads mapping outside of ribosomal RNA genes represent non-coding RNA, and the density of reads mapping to intergenic regions was more than two-fold higher than that mapping to annotated coding sequences. Selected sRNAs were found at increased abundance in stationary phase cultures and accumulated to remarkably high levels in the lungs of chronically infected mice, indicating a potential contribution to pathogenesis. The ability of tubercle bacilli to adapt to changing environments within the host is critical to their ability to cause disease and to persist during drug treatment; it is likely that novel post-transcriptional regulatory networks will play an important role in these adaptive responses.
Abstract
Motivation
Tuberculosis (TB) remains one of the main causes of death worldwide. The long and cumbersome process of culturing Mycobacterium tuberculosis complex (MTBC) bacteria has encouraged the development of specific molecular tools for detecting the pathogen. Most of these tools aim to become novel TB diagnostics, and considerable effort and resources are invested in their development, with the aim of securing the endorsement of the main public health agencies. Surprisingly, no study has been conducted in which the vast amount of available genomic data is used to identify the best MTBC diagnostic markers.
Results
In this work, we used large-scale comparative genomics to identify 40 MTBC-specific loci. We assessed their genetic diversity and physiological features to select 30 that are good targets for diagnostic purposes. Some of these markers could be used to assess the physiological status of the bacilli. Remarkably, none of the most used MTBC markers is in our catalog. Illustrating the translational potential of our work, we develop a specific qPCR assay for quantification and identification of MTBC DNA. Our rational design of targeted molecular assays for TB could be used in many other fields of clinical and basic research.
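The idea of a lineage-specific locus can be sketched with plain set operations: keep the loci found in every target genome and discard any locus seen in an outgroup genome. This is only a toy presence/absence model under stated assumptions. The genome names, the locus labels, and the function name specific_loci are hypothetical, and the abstract does not describe the actual comparative-genomics pipeline, which may well work differently.

```python
def specific_loci(target_genomes, outgroup_genomes):
    """Loci present in every target genome and absent from every outgroup genome.

    Both arguments map a genome name to the set of loci detected in it.
    At least one target genome is required.
    """
    core = set.intersection(*map(set, target_genomes.values()))
    seen_outside = set().union(*map(set, outgroup_genomes.values()))
    return core - seen_outside


# Hypothetical presence/absence data (labels are placeholders, not real loci).
mtbc = {
    "mtbc_1": {"locusA", "locusB", "locusC"},
    "mtbc_2": {"locusA", "locusB", "locusD"},
}
ntm = {
    "ntm_1": {"locusB", "locusE"},
    "ntm_2": {"locusE", "locusF"},
}

candidates = specific_loci(mtbc, ntm)
print(sorted(candidates))
```

In this toy data, locusB is shared by both MTBC genomes but also occurs in an outgroup genome, so only locusA remains as a candidate.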
Availability and implementation
The database of non-tuberculous mycobacteria assemblies can be accessed at: 10.5281/zenodo.3374377.
Supplementary information
Supplementary data are available at Bioinformatics online.
Whole genome sequencing (WGS) has been proposed as a tool for diagnosing drug resistance in tuberculosis. However, reports of its effectiveness in endemic countries with a high burden of drug resistance are scarce. The goal of this study was to evaluate the effectiveness of this procedure in isolates from a tuberculosis endemic region in Mexico.
WGS analysis was performed in 81 tuberculosis positive clinical isolates with a known phenotypic profile of resistance against first-line drugs (isoniazid, rifampin, ethambutol, pyrazinamide and streptomycin). Mutations related to drug resistance were identified for each isolate; drug resistant genotypes were predicted and compared with the phenotypic profile. Genotypes and transmission clusters based on genetic distances were also characterized.
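One common way to derive transmission clusters from genetic distances is single-linkage clustering with a SNP cutoff: two isolates are linked when their pairwise distance is at or below the cutoff, and linked isolates end up in the same cluster. The sketch below shows that idea with a small union-find structure. The 12-SNP threshold, the isolate labels, and the distances are assumptions for illustration and are not the values used in the study.

```python
def snp_clusters(isolates, distances, threshold=12):
    """Single-linkage clustering on pairwise SNP distances.

    distances maps (isolate_a, isolate_b) -> SNP distance.
    threshold is an assumed cutoff; pairs at or below it are linked.
    Returns sorted clusters containing at least two isolates.
    """
    parent = {i: i for i in isolates}

    def find(x):
        # Follow parent links to the root, shortening the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (a, b), d in distances.items():
        if d <= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for i in isolates:
        groups.setdefault(find(i), []).append(i)
    # Singletons are not transmission clusters.
    return sorted(sorted(g) for g in groups.values() if len(g) > 1)


# Hypothetical isolates and pairwise SNP distances.
isolates = ["A", "B", "C", "D", "E", "F"]
distances = {
    ("A", "B"): 3, ("B", "C"): 10, ("A", "C"): 14,
    ("C", "D"): 40, ("D", "E"): 5, ("E", "F"): 60,
}

clusters = snp_clusters(isolates, distances)
print(clusters)
```

Note that A and C end up in the same cluster even though their direct distance exceeds the cutoff, because both are linked to B; this chaining is a known property of single linkage.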
Prediction by WGS analysis of resistance against isoniazid, rifampicin, ethambutol, pyrazinamide and streptomycin showed sensitivity values of 84%, 96%, 71%, 75% and 29%, while specificity values were 100%, 94%, 90%, 90% and 98%, respectively. Prediction of multidrug resistance showed a sensitivity of 89% and specificity of 97%. Moreover, WGS analysis revealed polymorphisms related to second-line drug resistance, enabling classification of eight and two clinical isolates as pre-extensively and extensively drug-resistant cases, respectively. Lastly, four lineages were identified in the population (L1, L2, L3 and L4). The most frequent of these was L4, which included 90% (77) of the isolates. Six transmission clusters were identified; the largest was TC6, which included 13 isolates of sublineage L4.1.1 that were predominantly multidrug-resistant.
The results illustrate the utility of WGS for predicting resistance against first- and second-line drugs in tuberculosis isolates from the region. They also demonstrate the feasibility of this procedure for use as a tool to support the epidemiological surveillance of drug- and multidrug-resistant tuberculosis.
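Sensitivity and specificity figures of this kind come from comparing the genotypic prediction with the phenotypic result for each isolate. The following minimal sketch shows the calculation for one drug. The ten isolates in the example are invented, and the function name is hypothetical; the study's own per-isolate data are not given in the abstract.

```python
def sensitivity_specificity(predicted, phenotype):
    """Compare genotypic predictions with phenotypic results for one drug.

    Both arguments are parallel lists of booleans (True = resistant).
    Assumes at least one resistant and one susceptible isolate by phenotype.
    """
    pairs = list(zip(predicted, phenotype))
    tp = sum(p and t for p, t in pairs)              # predicted R, truly R
    fn = sum((not p) and t for p, t in pairs)        # predicted S, truly R
    tn = sum((not p) and (not t) for p, t in pairs)  # predicted S, truly S
    fp = sum(p and (not t) for p, t in pairs)        # predicted R, truly S
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical results for ten isolates.
phenotype = [True, True, True, True, False, False, False, False, False, False]
predicted = [True, True, True, False, False, False, False, False, False, True]

sens, spec = sensitivity_specificity(predicted, phenotype)
print(f"sensitivity={sens:.0%}, specificity={spec:.0%}")
```

Sensitivity is the share of phenotypically resistant isolates that WGS also calls resistant, and specificity is the share of phenotypically susceptible isolates that WGS calls susceptible.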
Whole genome sequencing provides better delineation of transmission clusters in Mycobacterium tuberculosis than traditional methods. However, its ability to reveal individual transmission links within clusters is limited. Here, we used a 2-step approach based on Bayesian transmission reconstruction to (1) identify likely index and missing cases, (2) determine risk factors associated with transmitters, and (3) estimate when transmission happened.
We developed our transmission reconstruction method using genomic and epidemiological data from a population-based study from Valencia Region, Spain. Tuberculosis (TB) incidence during the study period was 8.4 cases per 100,000 people. While the study is ongoing, the sampling frame for this work includes notified TB cases between 1 January 2014 and 31 December 2016. We identified a total of 21 transmission clusters that fulfilled the criteria for analysis. These contained a total of 117 individuals diagnosed with active TB (109 with epidemiological data). Demographic characteristics of the study population were as follows: 80/109 (73%) individuals were Spanish-born, 76/109 (70%) individuals were men, and the mean age was 42.51 years (SD 18.46). We found that 66/109 (61%) TB patients were sputum positive at diagnosis, and 10/109 (9%) were HIV positive. We used the data to reveal individual transmission links, and to identify index cases, missing cases, likely transmitters, and associated transmission risk factors. Our Bayesian inference approach suggests that at least 60% of index cases are likely misidentified by local public health. Our data also suggest that factors associated with likely transmitters are different to those of simply being in a transmission cluster, highlighting the importance of differentiating between these 2 phenomena. Our data suggest that type 2 diabetes mellitus is a risk factor associated with being a transmitter (odds ratio 0.19, 95% CI 0.02-1.10, p < 0.003). Finally, we used the most likely timing for transmission events to study when TB transmission occurred; we identified that 5/14 (35.7%) cases likely transmitted TB well before symptom onset, and these were largely sputum negative at diagnosis. Limited within-cluster diversity does not allow us to extrapolate our findings to the whole TB population in Valencia Region.
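The percentages quoted above follow directly from the reported counts. The short check below recomputes them; it uses only the fractions stated in the text, and the helper name percent is a hypothetical choice.

```python
def percent(numerator, denominator, digits=0):
    """Percentage of numerator over denominator, rounded to the given digits."""
    return round(100 * numerator / denominator, digits)


# Counts as reported in the text (out of 109 individuals with epidemiological data).
reported = {
    "Spanish-born": (80, 109),
    "men": (76, 109),
    "sputum positive": (66, 109),
    "HIV positive": (10, 109),
}

for label, (k, n) in reported.items():
    print(f"{label}: {k}/{n} = {percent(k, n):.0f}%")

# Cases that likely transmitted before symptom onset, reported to one decimal place.
early = percent(5, 14, digits=1)
print(f"transmitted before symptom onset: 5/14 = {early}%")
```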
In this study, we found that index cases are often misidentified, with downstream consequences for epidemiological investigations because likely transmitters can be missed. Our findings regarding inferred transmission timing suggest that TB transmission can occur before patient symptom onset, suggesting also that TB transmits during sub-clinical disease. This result has direct implications for diagnosing TB and reducing transmission. Overall, we show that a transition to individual-based genomic epidemiology will likely close some of the knowledge gaps in TB transmission and may redirect efforts towards cost-effective contact investigations for improved TB control.
Tuberculosis (TB) is caused by gram-positive bacteria known as the Mycobacterium tuberculosis complex (MTBC). MTBC include several human-associated lineages and several variants adapted to domestic and, more rarely, wild animal species. We report an M. tuberculosis strain isolated from a wild chimpanzee in Côte d'Ivoire that was shown by comparative genomic and phylogenomic analyses to belong to a new lineage of MTBC, closer to the human-associated lineage 6 (also known as M. africanum West Africa 2) than to the other classical animal-associated MTBC strains. These results show that the general view of the genetic diversity of MTBC is limited and support the possibility that other MTBC variants exist, particularly in wild mammals in Africa. Exploring this diversity is crucial to the understanding of the biology and evolutionary history of this widespread infectious disease.