Visual inspection of histopathology slides is one of the main methods used by pathologists to assess the stage, type and subtype of lung tumors. Adenocarcinoma (LUAD) and squamous cell carcinoma ...(LUSC) are the most prevalent subtypes of lung cancer, and their distinction requires visual inspection by an experienced pathologist. In this study, we trained a deep convolutional neural network (inception v3) on whole-slide images obtained from The Cancer Genome Atlas to accurately and automatically classify them into LUAD, LUSC or normal lung tissue. The performance of our method is comparable to that of pathologists, with an average area under the curve (AUC) of 0.97. Our model was validated on independent datasets of frozen tissues, formalin-fixed paraffin-embedded tissues and biopsies. Furthermore, we trained the network to predict the ten most commonly mutated genes in LUAD. We found that six of them-STK11, EGFR, FAT1, SETBP1, KRAS and TP53-can be predicted from pathology images, with AUCs from 0.733 to 0.856 as measured on a held-out population. These findings suggest that deep-learning models can assist pathologists in the detection of cancer subtype or gene mutations. Our approach can be applied to any cancer type, and the code is available at https://github.com/ncoudray/DeepPATH .
The metazoan genome is compartmentalized in areas of highly interacting chromatin known as topologically associating domains (TADs). TADs are demarcated by boundaries mostly conserved across cell ...types and even across species. However, a genome-wide characterization of TAD boundary strength in mammals is still lacking. In this study, we first use fused two-dimensional lasso as a machine learning method to improve Hi-C contact matrix reproducibility, and, subsequently, we categorize TAD boundaries based on their insulation score. We demonstrate that higher TAD boundary insulation scores are associated with elevated CTCF levels and that they may differ across cell types. Intriguingly, we observe that super-enhancers are preferentially insulated by strong boundaries. Furthermore, we demonstrate that strong TAD boundaries and super-enhancer elements are frequently co-duplicated in cancer patients. Taken together, our findings suggest that super-enhancers insulated by strong TAD boundaries may be exploited, as a functional unit, by cancer cells to promote oncogenesis.
A major challenge in cancer treatment is predicting clinical response to anti-cancer drugs on a personalized basis. Using a pharmacogenomics database of 1,001 cancer cell lines, we trained deep ...neural networks for prediction of drug response and assessed their performance on multiple clinical cohorts. We demonstrate that deep neural networks outperform the current state in machine learning frameworks. We provide a proof of concept for the use of deep neural network-based frameworks to aid precision oncology strategies.
Display omitted
•A machine learning (ML) workflow is designed to predict drug response in cancer patients•Deep neural networks (DNNs) surpass current ML algorithms in drug response prediction•DNNs predict drug response and survival in various large clinical cohorts•DNNs capture intricate biological interactions linked to specific drug response pathways
Sakellaropoulos et al. designed a machine learning workflow to predict drug response and survival of cancer patients. All pipelines are trained on a large panel of cancer cell lines and tested in clinical cohorts. DNN outperforms other machine learning algorithms by capturing pathways that link gene expression with drug response.
During self-renewal, cell-type-defining features are drastically perturbed in mitosis and must be faithfully reestablished upon G1 entry, a process that remains largely elusive. Here, we ...characterized at a genome-wide scale the dynamic transcriptional and architectural resetting of mouse pluripotent stem cells (PSCs) upon mitotic exit. We captured distinct waves of transcriptional reactivation with rapid induction of stem cell genes and transient activation of lineage-specific genes. Topological reorganization at different hierarchical levels also occurred in an asynchronous manner and showed partial coordination with transcriptional resetting. Globally, rapid transcriptional and architectural resetting associated with mitotic retention of H3K27 acetylation, supporting a bookmarking function. Indeed, mitotic depletion of H3K27ac impaired the early reactivation of bookmarked, stem-cell-associated genes. However, 3D chromatin reorganization remained largely unaffected, suggesting that these processes are driven by distinct forces upon mitotic exit. This study uncovers principles and mediators of PSC molecular resetting during self-renewal.
Display omitted
•Rapid resetting of the stem cell program and transient activation of lineage genes•Chromatin contacts around stem cells enhancers reform faster than structural loops•Chromatin reorganization partially associates with transcriptional levels and kinetics•Loss of mitotic H3K27ac perturbs transcriptional but not architectural resetting
Pelham-Webb et al. characterize the kinetics of transcriptional reactivation and 3D chromatin reorganization in pluripotent stem cells after cell division and examine the degree of coordination between these processes. Furthermore, they interrogate the role of mitotic bookmarking factors in the rapid molecular resetting of stem cell identity.
CTCF and cohesin play a key role in organizing chromatin into topologically associating domain (TAD) structures. Disruption of a single CTCF binding site is sufficient to change chromosomal ...interactions leading to alterations in chromatin modifications and gene regulation. However, the extent to which alterations in chromatin modifications can disrupt 3D chromosome organization leading to transcriptional changes is unknown. In multiple myeloma, a 4;14 translocation induces overexpression of the histone methyltransferase, NSD2, resulting in expansion of H3K36me2 and shrinkage of antagonistic H3K27me3 domains. Using isogenic cell lines producing high and low levels of NSD2, here we find oncogene activation is linked to alterations in H3K27ac and CTCF within H3K36me2 enriched chromatin. A logistic regression model reveals that differentially expressed genes are significantly enriched within the same insulated domain as altered H3K27ac and CTCF peaks. These results identify a bidirectional relationship between 2D chromatin and 3D genome organization in gene regulation.
Abstract Aberrations in the capacity of DNA/chromatin modifiers and transcription factors to bind non-coding regions can lead to changes in gene regulation and impact disease phenotypes. However, ...identifying distal regulatory elements and connecting them with their target genes remains challenging. Here, we present MethNet, a pipeline that integrates large-scale DNA methylation and gene expression data across multiple cancers, to uncover cis regulatory elements (CREs) in a 1 Mb region around every promoter in the genome. MethNet identifies clusters of highly ranked CREs, referred to as ‘hubs’, which contribute to the regulation of multiple genes and significantly affect patient survival. Promoter-capture Hi-C confirmed that highly ranked associations involve physical interactions between CREs and their gene targets, and CRISPR interference based single-cell RNA Perturb-seq validated the functional impact of CREs. Thus, MethNet-identified CREs represent a valuable resource for unraveling complex mechanisms underlying gene expression, and for prioritizing the verification of predicted non-coding disease hotspots.
Chronic kidney disease (CKD) refers to a spectrum of diseases defined by renal fibrosis, permanent alterations in kidney structure, and low glomerular-filtration rate. Prolonged epithelial-tubular ...damage involves a series of changes that eventually lead to CKD, highlighting the importance of tubular epithelial cells in this process. Lysophosphatidic acid (LPA) is a bioactive lipid that signals mainly through its six cognate LPA receptors and is implicated in several chronic inflammatory pathological conditions. In this report, we have stimulated human proximal tubular epithelial cells (HKC-8) with LPA and 175 other possibly pathological stimuli, and simultaneously detected the levels of 27 intracellular phosphoproteins and 32 extracellular secreted molecules with multiplex ELISA. This quantification revealed a large amount of information concerning the signaling and the physiology of HKC-8 cells that can be extrapolated to other proximal tubular epithelial cells. LPA responses clustered with pro-inflammatory stimuli such as TNF and IL-1, promoting the phosphorylation of important inflammatory signaling hubs, including CREB1, ERK1, JUN, IκΒα, and MEK1, as well as the secretion of inflammatory factors of clinical relevance, including CCL2, CCL3, CXCL10, ICAM1, IL-6, and IL-8, most of them shown for the first time in proximal tubular epithelial cells. The identified LPA-induced signal-transduction pathways, which were pharmacologically validated, and the secretion of the inflammatory factors offer novel insights into the possible role of LPA in CKD pathogenesis.
Activation of regulatory elements is thought to be inversely correlated with DNA methylation levels. However, it is difficult to determine whether DNA methylation is compatible with chromatin ...accessibility or transcription factor (TF) binding if assays are performed separately. We developed a fast, low-input, low sequencing depth method, EpiMethylTag, that combines ATAC-seq or ChIP-seq (M-ATAC or M-ChIP) with bisulfite conversion, to simultaneously examine accessibility/TF binding and methylation on the same DNA. Here we demonstrate that EpiMethylTag can be used to study the functional interplay between chromatin accessibility and TF binding (CTCF and KLF4) at methylated sites.
Multiple sclerosis (MS) is a major health problem, leading to a significant disability and patient suffering. Although chronic activation of the immune system is a hallmark of the disease, its ...pathogenesis is poorly understood, while current treatments only ameliorate the disease and may produce severe side effects.
Here, we applied a network-based modeling approach based on phosphoproteomic data to uncover the differential activation in signaling wiring between healthy donors, untreated patients, and those under different treatments. Based in the patient-specific networks, we aimed to create a new approach to identify drug combinations that revert signaling to a healthy-like state. We performed ex vivo multiplexed phosphoproteomic assays upon perturbations with multiple drugs and ligands in primary immune cells from 169 subjects (MS patients, n=129 and matched healthy controls, n=40). Patients were either untreated or treated with fingolimod, natalizumab, interferon-β, glatiramer acetate, or the experimental therapy epigallocatechin gallate (EGCG). We generated for each donor a dynamic logic model by fitting a bespoke literature-derived network of MS-related pathways to the perturbation data. Last, we developed an approach based on network topology to identify deregulated interactions whose activity could be reverted to a "healthy-like" status by combination therapy. The experimental autoimmune encephalomyelitis (EAE) mouse model of MS was used to validate the prediction of combination therapies.
Analysis of the models uncovered features of healthy-, disease-, and drug-specific signaling networks. We predicted several combinations with approved MS drugs that could revert signaling to a healthy-like state. Specifically, TGF-β activated kinase 1 (TAK1) kinase, involved in Transforming growth factor β-1 proprotein (TGF-β), Toll-like receptor, B cell receptor, and response to inflammation pathways, was found to be highly deregulated and co-druggable with all MS drugs studied. One of these predicted combinations, fingolimod with a TAK1 inhibitor, was validated in an animal model of MS.
Our approach based on donor-specific signaling networks enables prediction of targets for combination therapy for MS and other complex diseases.