Abstract
Motivation
Microbial sequences generated from clinical samples are often contaminated with human host sequences that must be removed for ethical and legal reasons. Care must be taken to ...excise host sequences without inadvertently removing target microbial sequences to the detriment of downstream analyses such as variant calling and de novo assembly.
Results
To facilitate accurate host decontamination of both short and long sequencing reads, we developed Hostile, a tool capable of accurate host read removal using a laptop. We demonstrate that our approach removes at least 99.6% of real human reads and retains at least 99.989% of simulated bacterial reads. Using Hostile with a masked reference genome further increases bacterial read retention (≥99.997%) with negligible (≤0.001%) reduction in human read removal performance. Compared with an existing tool, Hostile removes 21%–23% more human short reads and 21–43 times fewer bacterial reads, typically in less time.
Availability and implementation
Hostile is implemented as an MIT-licensed Python package available from https://github.com/bede/hostile together with supplementary material.
Whole-genome sequencing has opened the way for investigating the dynamics and genomic evolution of bacterial pathogens during the colonization and infection of humans. The application of this ...technology to the longitudinal study of adaptation in an infected host--in particular, the evolution of drug resistance and host adaptation in patients who are chronically infected with opportunistic pathogens--has revealed remarkable patterns of convergent evolution, suggestive of an inherent repeatability of evolution. In this Review, we describe how these studies have advanced our understanding of the mechanisms and principles of within-host genome evolution, and we consider the consequences of findings such as a potent adaptive potential for pathogenicity. Finally, we discuss the possibility that genomics may be used in the future to predict the clinical progression of bacterial infections and to suggest the best option for treatment.
Summary Background Diagnosis of Clostridium difficile infection is controversial because of many laboratory methods, compounded by two reference methods. Cytotoxigenic culture detects toxigenic C ...difficile and gives a positive result more frequently (eg, because of colonisation, which means that individuals can have the bacterium but no free toxin) than does the cytotoxin assay, which detects preformed toxin in faeces. We aimed to validate the reference methods according to clinical outcomes and to derive an optimum laboratory diagnostic algorithm for C difficile infection. Methods In this prospective, multicentre study, we did cytotoxigenic culture and cytotoxin assays on 12 420 faecal samples in four UK laboratories. We also performed tests that represent the three main targets for C difficile detection: bacterium (glutamate dehydrogenase), toxins, or toxin genes. We used routine blood test results, length of hospital stay, and 30-day mortality to clinically validate the reference methods. Data were categorised by reference method result: group 1, cytotoxin assay positive; group 2, cytotoxigenic culture positive and cytotoxin assay negative; and group 3, both reference methods negative. Findings Clinical and reference assay data were available for 6522 inpatient episodes. On univariate analysis, mortality was significantly higher in group 1 than in group 2 (72/435 16·6% vs 20/207 9·7%, p=0·044) and in group 3 (503/5880 8·6%, p<0·001), but not in group 2 compared with group 3 (p=0·4). A multivariate analysis accounting for potential confounders confirmed the mortality differences between groups 1 and 3 (OR 1·61, 95% CI 1·12–2·31). Multistage algorithms performed better than did standalone assays. Interpretation We noted no increase in mortality when toxigenic C difficile alone was present. Toxin (cytotoxin assay) positivity correlated with clinical outcome, and so this reference method best defines true cases of C difficile infection. A new diagnostic category of potential C difficile excretor (cytotoxigenic culture positive but cytotoxin assay negative) could be used to characterise patients with diarrhoea that is probably not due to C difficile infection, but who can cause cross-infection. Funding Department of Health and Health Protection Agency, UK.
Summary Background Tuberculosis incidence in the UK has risen in the past decade. Disease control depends on epidemiological data, which can be difficult to obtain. Whole-genome sequencing can detect ...microevolution within Mycobacterium tuberculosis strains. We aimed to estimate the genetic diversity of related M tuberculosis strains in the UK Midlands and to investigate how this measurement might be used to investigate community outbreaks. Methods In a retrospective observational study, we used Illumina technology to sequence M tuberculosis genomes from an archive of frozen cultures. We characterised isolates into four groups: cross-sectional, longitudinal, household, and community. We measured pairwise nucleotide differences within hosts and between hosts in household outbreaks and estimated the rate of change in DNA sequences. We used the findings to interpret network diagrams constructed from 11 community clusters derived from mycobacterial interspersed repetitive-unit–variable-number tandem-repeat data. Findings We sequenced 390 separate isolates from 254 patients, including representatives from all five major lineages of M tuberculosis . The estimated rate of change in DNA sequences was 0·5 single nucleotide polymorphisms (SNPs) per genome per year (95% CI 0·3–0·7) in longitudinal isolates from 30 individuals and 25 families. Divergence is rarely higher than five SNPs in 3 years. 109 (96%) of 114 paired isolates from individuals and households differed by five or fewer SNPs. More than five SNPs separated isolates from none of 69 epidemiologically linked patients, two (15%) of 13 possibly linked patients, and 13 (17%) of 75 epidemiologically unlinked patients (three-way comparison exact p<0·0001). Genetic trees and clinical and epidemiological data suggest that super-spreaders were present in two community clusters. Interpretation Whole-genome sequencing can delineate outbreaks of tuberculosis and allows inference about direction of transmission between cases. The technique could identify super-spreaders and predict the existence of undiagnosed cases, potentially leading to early treatment of infectious patients and their contacts. Funding Medical Research Council, Wellcome Trust, National Institute for Health Research, and the Health Protection Agency.
Summary Background Infection with Clostridium difficile is the primary infective cause of antibiotic-associated diarrhoea. We aimed to compare efficacy and safety of fidaxomicin and vancomycin to ...treat patients with C difficile infection in Europe, Canada, and the USA. Methods In this multicentre, double-blind, randomised, non-inferiority trial, we enrolled patients from 45 sites in Europe and 41 sites in the USA and Canada between April 19, 2007, and Dec 11, 2009. Eligible patients were aged 16 years or older with acute, toxin-positive C difficile infection. Patients were randomly allocated (1:1) to receive oral fidaxomicin (200 mg every 12 h) or oral vancomycin (125 mg every 6 h) for 10 days. The primary endpoint was clinical cure, defined as resolution of diarrhoea and no further need for treatment. An interactive voice-response system and computer-generated randomisation schedule gave a randomisation number and medication kit number for each patient. Participants and investigators were masked to treatment allocation. Non-inferiority was prespecified with a margin of 10%. Modified intention-to-treat and per-protocol populations were analysed. This study is registered with ClinicalTrials.gov , number NCT00468728. Findings Of 535 patients enrolled, 270 were assigned fidaxomicin and 265 vancomycin. After 26 patients were excluded, 509 were included in the modified intention-to-treat (mITT) population. 198 (91·7%) of 216 patients in the per-protocol population given fidaxomicin achieved clinical cure, compared with 213 (90·6%) of 235 given vancomycin, meeting the criterion for non-inferiority (one-sided 97·5% CI −4·3%). Non-inferiority was also shown for clinical cure in the mITT population, with 221 (87·7%) of 252 patients given fidaxomicin and 223 (86·8%) of 257 given vancomycin cured (one-sided 97·5% CI −4·9%). In most subgroup analyses of the primary endpoint in the mITT population, outcomes in the two treatment groups did not differ significantly; although patients receiving concomitant antibiotics for other infections had a higher cure rate with fidaxomicin (46 90·2% of 51) than with vancomycin (33 73·3% of 45; p=0·031). Occurrence of treatment-emergent adverse events did not differ between groups. 20 (7·6%) of 264 patients given at least one dose of fidaxomicin and 17 (6·5%) of 260 given vancomycin died. Interpretation Fidaxomicin could be an alternative treatment for infection with C difficile , with similar efficacy and safety to vancomycin. Funding Optimer Pharmaceuticals.
Bacterial virulence is a multifaceted trait where the interactions between pathogen and host factors affect the severity and outcome of the infection. Toxin secretion is central to the biology of ...many bacterial pathogens and is widely accepted as playing a crucial role in disease pathology. To understand the relationship between toxicity and bacterial virulence in greater depth, we studied two sequenced collections of the major human pathogen Staphylococcus aureus and found an unexpected inverse correlation between bacterial toxicity and disease severity. By applying a functional genomics approach, we identified several novel toxicity-affecting loci responsible for the wide range in toxic phenotypes observed within these collections. To understand the apparent higher propensity of low toxicity isolates to cause bacteraemia, we performed several functional assays, and our findings suggest that within-host fitness differences between high- and low-toxicity isolates in human serum is a contributing factor. As invasive infections, such as bacteraemia, limit the opportunities for onward transmission, highly toxic strains could gain an additional between-host fitness advantage, potentially contributing to the maintenance of toxicity at the population level. Our results clearly demonstrate how evolutionary trade-offs between toxicity, relative fitness, and transmissibility are critical for understanding the multifaceted nature of bacterial virulence.
The heptavalent pneumococcal conjugate vaccine (PCV7) was introduced in the United States (US) in 2000 and has significantly reduced invasive pneumococcal disease; however, the incidence of ...nonvaccine serotype invasive disease, particularly due to serotype 19A, has increased. The serotype 19A increase can be explained in part by expansion of a genotype that has been circulating in the US prior to vaccine implementation (and other countries since at least 1990), but also by the emergence of a novel "vaccine escape recombinant" pneumococcal strain. This strain has a genotype that previously was only associated with vaccine serotype 4, but now expresses a nonvaccine serotype 19A capsule. Based on prior evidence for capsular switching by recombination at the capsular locus, the genetic event that resulted in this novel serotype/genotype combination might be identifiable from the DNA sequence of individual pneumococcal strains. Therefore, the aim of this study was to characterise the putative recombinational event(s) at the capsular locus that resulted in the change from a vaccine to a nonvaccine capsular type. Sequencing the capsular locus flanking regions of 51 vaccine escape (progeny), recipient, and putative donor pneumococci revealed a 39 kb recombinational fragment, which included the capsular locus, flanking regions, and two adjacent penicillin-binding proteins, and thus resulted in a capsular switch and penicillin nonsusceptibility in a single genetic event. Since 2003, 37 such vaccine escape strains have been detected, some of which had evolved further. Furthermore, two new types of serotype 19A vaccine escape strains emerged in 2005. To our knowledge, this is the first time a single recombinational event has been documented in vivo that resulted in both a change of serotype and penicillin nonsusceptibility. Vaccine escape by genetic recombination at the capsular locus has the potential to reduce PCV7 effectiveness in the longer term.
Routine full characterization of
is culture based, taking many weeks. Whole-genome sequencing (WGS) can generate antibiotic susceptibility profiles to inform treatment, augmented with strain ...information for global surveillance; such data could be transformative if provided at or near the point of care. We demonstrate a low-cost method of DNA extraction directly from patient samples for
WGS. We initially evaluated the method by using the Illumina MiSeq sequencer (40 smear-positive respiratory samples obtained after routine clinical testing and 27 matched liquid cultures).
was identified in all 39 samples from which DNA was successfully extracted. Sufficient data for antibiotic susceptibility prediction were obtained from 24 (62%) samples; all results were concordant with reference laboratory phenotypes. Phylogenetic placement was concordant between direct and cultured samples. With Illumina MiSeq/MiniSeq, the workflow from patient sample to results can be completed in 44/16 h at a reagent cost of £96/£198 per sample. We then employed a nonspecific PCR-based library preparation method for sequencing on an Oxford Nanopore Technologies MinION sequencer. We applied this to cultured
strain BCG DNA and to combined culture-negative sputum DNA and BCG DNA. For flow cell version R9.4, the estimated turnaround time from patient to identification of BCG, detection of pyrazinamide resistance, and phylogenetic placement was 7.5 h, with full susceptibility results 5 h later. Antibiotic susceptibility predictions were fully concordant. A critical advantage of MinION is the ability to continue sequencing until sufficient coverage is obtained, providing a potential solution to the problem of variable amounts of
DNA in direct samples.
The race to produce vaccines against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) began when the first sequence was published, and this forms the basis for vaccines currently deployed ...globally. Independent lineages of SARS-CoV-2 have recently been reported: UK, B.1.1.7; South Africa, B.1.351; and Brazil, P.1. These variants have multiple changes in the immunodominant spike protein that facilitates viral cell entry via the angiotensin-converting enzyme-2 (ACE2) receptor. Mutations in the receptor recognition site on the spike are of great concern for their potential for immune escape. Here, we describe a structure-function analysis of B.1.351 using a large cohort of convalescent and vaccinee serum samples. The receptor-binding domain mutations provide tighter ACE2 binding and widespread escape from monoclonal antibody neutralization largely driven by E484K, although K417N and N501Y act together against some important antibody classes. In a number of cases, it would appear that convalescent and some vaccine serum offers limited protection against this variant.
Display omitted
•Reduced B.1.351 neutralization by mAbs and sera induced by early SARS-CoV-2 isolates•B.1.351 neutralization titer reduced 8- to 9-fold for Pfizer and AstraZeneca vaccinees•E484K, K417N, and N501Y cause widespread escape from mAbs•NTD deletion in B.1.351 abrogates neutralization by a potent neutralizing human mAb
Structure-function analysis of the SARS-CoV-2 variant B.1.351 using serum samples from convalescent and vaccinated individuals reveals how mutations in the viral spike protein result in tighter binding to the receptor ACE2 and allow escape from monoclonal antibody neutralization.
Combined genotyping/whole genome sequencing and epidemiological data suggest that in endemic settings only a minority of Clostridium difficile infection, CDI, is acquired from other cases. ...Asymptomatic patients are a potential source for many unexplained cases.
We prospectively screened a cohort of medical inpatients in a UK teaching hospital for asymptomatic C. difficile carriage using stool culture. Electronic and questionnaire data were used to determine risk factors for asymptomatic carriage by logistic regression. Carriage isolates were compared with all hospital/community CDI cases from the same geographic region, from 12 months before the study to 3 months after, using whole genome sequencing and hospital admission data, assessing particularly for evidence of onward transmission from asymptomatic cases.
Of 227 participants recruited, 132 provided ≥1 stool samples for testing. 18 participants were culture-positive for C. difficile, 14/132(11%) on their first sample. Independent risk factors for asymptomatic carriage were patient reported loose/frequent stool (but not meeting CDI criteria of ≥3 unformed stools in 24 hours), previous overnight hospital stay within 6 months, and steroid/immunosuppressant medication in the last 6 months (all p≤0.02). Surprisingly antibiotic exposure in the last 6 months was independently associated with decreased risk of carriage (p = 0.005). The same risk factors were identified excluding participants reporting frequent/loose stool. 13/18(72%) asymptomatically colonised patients carried toxigenic strains from common disease-causing lineages found in cases. Several plausible transmission events to asymptomatic carriers were identified, but in this relatively small study no clear evidence of onward transmission from an asymptomatic case was seen.
Transmission events from any one asymptomatic carrier are likely to be relatively rare, but as asymptomatic carriage is common, it may still be an important source of CDI, which could be quantified in larger studies. Risk factors established for asymptomatic carriage may help identify patients for inclusion in such studies.