Advances in modern sequencing technologies allow us to generate sufficient data to analyze hundreds of bacterial genomes from a single machine in a single day. This potential for sequencing massive ...numbers of genomes calls for fully automated methods to produce high-quality assemblies and variant calls. We introduce Pilon, a fully automated, all-in-one tool for correcting draft assemblies and calling sequence variants of multiple sizes, including very large insertions and deletions. Pilon works with many types of sequence data, but is particularly strong when supplied with paired end data from two Illumina libraries with small e.g., 180 bp and large e.g., 3-5 Kb inserts. Pilon significantly improves draft genome assemblies by correcting bases, fixing mis-assemblies and filling gaps. For both haploid and diploid genomes, Pilon produces more contiguous genomes with fewer errors, enabling identification of more biologically relevant genes. Furthermore, Pilon identifies small variants with high accuracy as compared to state-of-the-art tools and is unique in its ability to accurately identify large sequence variants including duplications and resolve large insertions. Pilon is being used to improve the assemblies of thousands of new genomes and to identify variants from thousands of clinically relevant bacterial strains. Pilon is freely available as open source software.
The major clinical challenge in the treatment of high-grade serous ovarian cancer (HGSOC) is the development of progressive resistance to platinum-based chemotherapy. The objective of this study was ...to determine whether intra-tumour genetic heterogeneity resulting from clonal evolution and the emergence of subclonal tumour populations in HGSOC was associated with the development of resistant disease.
Evolutionary inference and phylogenetic quantification of heterogeneity was performed using the MEDICC algorithm on high-resolution whole genome copy number profiles and selected genome-wide sequencing of 135 spatially and temporally separated samples from 14 patients with HGSOC who received platinum-based chemotherapy. Samples were obtained from the clinical CTCR-OV03/04 studies, and patients were enrolled between 20 July 2007 and 22 October 2009. Median follow-up of the cohort was 31 mo (interquartile range 22-46 mo), censored after 26 October 2013. Outcome measures were overall survival (OS) and progression-free survival (PFS). There were marked differences in the degree of clonal expansion (CE) between patients (median 0.74, interquartile range 0.66-1.15), and dichotimization by median CE showed worse survival in CE-high cases (PFS 12.7 versus 10.1 mo, p = 0.009; OS 42.6 versus 23.5 mo, p = 0.003). Bootstrap analysis with resampling showed that the 95% confidence intervals for the hazard ratios for PFS and OS in the CE-high group were greater than 1.0. These data support a relationship between heterogeneity and survival but do not precisely determine its effect size. Relapsed tissue was available for two patients in the CE-high group, and phylogenetic analysis showed that the prevalent clonal population at clinical recurrence arose from early divergence events. A subclonal population marked by a NF1 deletion showed a progressive increase in tumour allele fraction during chemotherapy.
This study demonstrates that quantitative measures of intra-tumour heterogeneity may have predictive value for survival after chemotherapy treatment in HGSOC. Subclonal tumour populations are present in pre-treatment biopsies in HGSOC and can undergo expansion during chemotherapy, causing clinical relapse.
The immune system responds vigorously to microbial infection while permitting lifelong colonization by the microbiome. Mechanisms that facilitate the establishment and stability of the gut microbiota ...remain poorly described. We found that a regulatory system in the prominent human commensal
modulates its surface architecture to invite binding of immunoglobulin A (IgA) in mice. Specific immune recognition facilitated bacterial adherence to cultured intestinal epithelial cells and intimate association with the gut mucosal surface in vivo. The IgA response was required for
(and other commensal species) to occupy a defined mucosal niche that mediates stable colonization of the gut through exclusion of exogenous competitors. Therefore, in addition to its role in pathogen clearance, we propose that IgA responses can be co-opted by the microbiome to engender robust host-microbial symbiosis.
Breast cancers are complex ecosystems of malignant cells and the tumour microenvironment
. The composition of these tumour ecosystems and interactions within them contribute to responses to cytotoxic ...therapy
. Efforts to build response predictors have not incorporated this knowledge. We collected clinical, digital pathology, genomic and transcriptomic profiles of pre-treatment biopsies of breast tumours from 168 patients treated with chemotherapy with or without HER2 (encoded by ERBB2)-targeted therapy before surgery. Pathology end points (complete response or residual disease) at surgery
were then correlated with multi-omic features in these diagnostic biopsies. Here we show that response to treatment is modulated by the pre-treated tumour ecosystem, and its multi-omics landscape can be integrated in predictive models using machine learning. The degree of residual disease following therapy is monotonically associated with pre-therapy features, including tumour mutational and copy number landscapes, tumour proliferation, immune infiltration and T cell dysfunction and exclusion. Combining these features into a multi-omic machine learning model predicted a pathological complete response in an external validation cohort (75 patients) with an area under the curve of 0.87. In conclusion, response to therapy is determined by the baseline characteristics of the totality of the tumour ecosystem captured through data integration and machine learning. This approach could be used to develop predictors for other cancers.
Fusobacterium nucleatum is associated with colorectal cancer and promotes colonic tumor formation in preclinical models. However, fusobacteria are core members of the human oral microbiome and less ...prevalent in the healthy gut, raising questions about how fusobacteria localize to CRC. We identify a host polysaccharide and fusobacterial lectin that explicates fusobacteria abundance in CRC. Gal-GalNAc, which is overexpressed in CRC, is recognized by fusobacterial Fap2, which functions as a Gal-GalNAc lectin. F. nucleatum binding to clinical adenocarcinomas correlates with Gal-GalNAc expression and is reduced upon O-glycanase treatment. Clinical fusobacteria strains naturally lacking Fap2 or inactivated Fap2 mutants show reduced binding to Gal-GalNAc-expressing CRC cells and established CRCs in mice. Additionally, intravenously injected F. nucleatum localizes to mouse tumor tissues in a Fap2-dependent manner, suggesting that fusobacteria use a hematogenous route to reach colon adenocarcinomas. Thus, targeting F. nucleatum Fap2 or host epithelial Gal-GalNAc may reduce fusobacteria potentiation of CRC.
Display omitted
•Gal-GalNAc is highly expressed in human CRC, metastases, and a preclinical CRC model•Fap2 is a fusobacterial Gal-GalNAc-binding lectin•Fap2 mediates F. nucleatum binding to Gal-GalNAc overexpressed in CRC•Blood-borne Fap2-expressing F. nucleatum localizes to orthotopic mouse colon tumors
F. nucleatum is associated with CRC and promotes colonic tumor formation in mice. Abed et al. identify a host polysaccharide, Gal-GalNAc, and fusobacterial lectin (Fap2) that explicates fusobacteria abundance in CRC. Targeting Fap2 or host Gal-GalNAc may provide a means to reduce F. nucleatum potentiation of CRC.
T-cell infiltration in estrogen receptor (ER)-negative breast tumours has been associated with longer survival. To investigate this association and the potential of tumour T-cell infiltration as a ...prognostic and predictive marker, we have conducted the largest study of T cells in breast cancer to date.
Four studies totalling 12 439 patients were used for this work. Cytotoxic (CD8+) and regulatory (forkhead box protein 3, FOXP3+) T cells were quantified using immunohistochemistry (IHC). IHC for CD8 was conducted using available material from all four studies (8978 samples) and for FOXP3 from three studies (5239 samples)-multiple imputation was used to resolve missing data from the remaining patients. Cox regression was used to test for associations with breast cancer-specific survival.
In ER-negative tumours triple-negative breast cancer and human epidermal growth factor receptor 2 (human epidermal growth factor receptor 2 (HER2) positive), presence of CD8+ T cells within the tumour was associated with a 28% 95% confidence interval (CI) 16% to 38% reduction in the hazard of breast cancer-specific mortality, and CD8+ T cells within the stroma with a 21% (95% CI 7% to 33%) reduction in hazard. In ER-positive HER2-positive tumours, CD8+ T cells within the tumour were associated with a 27% (95% CI 4% to 44%) reduction in hazard. In ER-negative disease, there was evidence for greater benefit from anthracyclines in the National Epirubicin Adjuvant Trial in patients with CD8+ tumours hazard ratio (HR) = 0.54; 95% CI 0.37-0.79 versus CD8-negative tumours (HR = 0.87; 95% CI 0.55–1.38). The difference in effect between these subgroups was significant when limited to cases with complete data (Pheterogeneity = 0.04) and approached significance in imputed data (Pheterogeneity = 0.1).
The presence of CD8+ T cells in breast cancer is associated with a significant reduction in the relative risk of death from disease in both the ER-negative supplementary Figure S1, available at Annals of Oncology online and the ER-positive HER2-positive subtypes. Tumour lymphocytic infiltration may improve risk stratification in breast cancer patients classified into these subtypes.
NCT00003577.
Efficient identification of drug mechanisms of action remains a challenge. Computational docking approaches have been widely used to predict drug binding targets; yet, such approaches depend on ...existing protein structures, and accurate structural predictions have only recently become available from AlphaFold2. Here, we combine AlphaFold2 with molecular docking simulations to predict protein‐ligand interactions between 296 proteins spanning Escherichia coli's essential proteome, and 218 active antibacterial compounds and 100 inactive compounds, respectively, pointing to widespread compound and protein promiscuity. We benchmark model performance by measuring enzymatic activity for 12 essential proteins treated with each antibacterial compound. We confirm extensive promiscuity, but find that the average area under the receiver operating characteristic curve (auROC) is 0.48, indicating weak model performance. We demonstrate that rescoring of docking poses using machine learning‐based approaches improves model performance, resulting in average auROCs as large as 0.63, and that ensembles of rescoring functions improve prediction accuracy and the ratio of true‐positive rate to false‐positive rate. This work indicates that advances in modeling protein‐ligand interactions, particularly using machine learning‐based approaches, are needed to better harness AlphaFold2 for drug discovery.
Synopsis
Assessing molecular docking simulations based on AlphaFold2‐predicted structures with high‐throughput measurements of protein‐ligand interactions reveals weak model performance. Machine learning‐based approaches improve performance and better harness AlphaFold2 for drug discovery.
AlphaFold2‐based molecular docking predictions for 296 Escherichia coli proteins, 218 active antibacterial compounds and 100 inactive compounds predict widespread promiscuity and similar distributions of binding affinities between active and inactive compounds.
Quantitative enzymatic inhibition assays for 12 essential E. coli proteins treated with each of the 218 antibacterial compounds confirm extensive promiscuity.
The enzymatic inhibition dataset reveals that the performance of the molecular docking model is weak.
Rescoring of docking poses using machine learning‐based scoring functions improves model performance.
Assessing molecular docking simulations based on AlphaFold2‐predicted structures with high‐throughput measurements of protein‐ligand interactions reveals weak model performance. Machine learning‐based approaches improve performance and better harness AlphaFold2 for drug discovery.
Circulating tumour DNA (ctDNA) carrying tumour-specific sequence alterations may provide a minimally invasive means to dynamically assess tumour burden and response to treatment in cancer patients. ...Somatic TP53 mutations are a defining feature of high-grade serous ovarian carcinoma (HGSOC). We tested whether these mutations could be used as personalised markers to monitor tumour burden and early changes as a predictor of response and time to progression (TTP).
We performed a retrospective analysis of serial plasma samples collected during routine clinical visits from 40 patients with HGSOC undergoing heterogeneous standard of care treatment. Patient-specific TP53 assays were developed for 31 unique mutations identified in formalin-fixed paraffin-embedded tumour DNA from these patients. These assays were used to quantify ctDNA in 318 plasma samples using microfluidic digital PCR. The TP53 mutant allele fraction (TP53MAF) was compared to serum CA-125, the current gold-standard response marker for HGSOC in blood, as well as to disease volume on computed tomography scans by volumetric analysis. Changes after one cycle of treatment were compared with TTP. The median TP53MAF prior to treatment in 51 relapsed treatment courses was 8% (interquartile range IQR 1.2%-22%) compared to 0.7% (IQR 0.3%-2.0%) for seven untreated newly diagnosed stage IIIC/IV patients. TP53MAF correlated with volumetric measurements (Pearson r = 0.59, p < 0.001), and this correlation improved when patients with ascites were excluded (r = 0.82). The ratio of TP53MAF to volume of disease was higher in relapsed patients (0.04% per cm3) than in untreated patients (0.0008% per cm3, p = 0.004). In nearly all relapsed patients with disease volume > 32 cm3, ctDNA was detected at ≥20 amplifiable copies per millilitre of plasma. In 49 treatment courses for relapsed disease, pre-treatment TP53MAF concentration, but not CA-125, was associated with TTP. Response to chemotherapy was seen earlier with ctDNA, with a median time to nadir of 37 d (IQR 28-54) compared with a median time to nadir of 84 d (IQR 42-116) for CA-125. In 32 relapsed treatment courses evaluable for response after one cycle of chemotherapy, a decrease in TP53MAF of >60% was an independent predictor of TTP in multivariable analysis (hazard ratio 0.22, 95% CI 0.07-0.67, p = 0.008). Conversely, a decrease in TP53MAF of ≤60% was associated with poor response and identified cases with TTP < 6 mo with 71% sensitivity (95% CI 42%-92%) and 88% specificity (95% CI 64%-99%). Specificity was improved when patients with recent drainage of ascites were excluded. Ascites drainage led to a reduction of TP53MAF concentration. The limitations of this study include retrospective design, small sample size, and heterogeneity of treatment within the cohort.
In this retrospective study, we demonstrated that ctDNA is correlated with volume of disease at the start of treatment in women with HGSOC and that a decrease of ≤60% in TP53MAF after one cycle of chemotherapy was associated with shorter TTP. These results provide evidence that ctDNA has the potential to be a highly specific early molecular response marker in HGSOC and warrants further investigation in larger cohorts receiving uniform treatment.
This paper draws on primary and secondary data to propose a taxonomy of strategies, or "schools," for knowledge management. The primary purpose of this framework is to guide executives on choices to ...initiate knowledge management projects according to goals, organizational character, and technological, behavioral, or economic biases. It may also be useful to teachers in demonstrating the scope of knowledge management and to researchers in generating propositions for further study.
In a context where schools are held more and more accountable for the education they provide, data-based decision making has become increasingly important. This book brings together scholars from ...several countries to examine data-based decision making. Data-based decision making in this book refers to making decisions based on a broad range of evidence, such as scores on students' assessments, classroom observations etc. The following chapters are included: (1) Introduction (Kim, Schildkamp, et al.); (2) Data-based Decision Making: An Overview (Mei Kuin Lai, et al.); (3) Analysis and Discussion of Classroom and Achievement Data to Raise Student Achievement (Mei Kuin Lai, et al.); (4) From "Intuition"--to "Data"-based Decision Making in Dutch Secondary Schools? (Kim Schildkamp, et al.); (5) Professional Attitudes to the Use of Data in England (Christopher Downey, et al.); (6) Approaches to Effective Data Use: Does One Size Fit All? (Elizabeth Archer, et al.); (7) Improving Data Literacy in Schools: Lessons from the School Feedback Project (Jan Vanhoof, et al.); (8) Implementation of a Data Initiative in the NCLB Era (Jeffrey C. Wayman, et al.); (9) Towards Data-Informed Decisions: From Ministry Policy to School Practice (Robert Dunn, et al.); (10) Conclusions and a Data Use Framework (Kim Schildkamp, (et al.); and (11) Data Use: Where to from Here? (Lorna, Earl, et al.)