The epithelial to mesenchymal transition (EMT) is a key cellular process underlying cancer progression, with multiple intermediate states whose molecular hallmarks remain poorly characterised. To ...fill this gap, we present a method to robustly evaluate EMT transformation in individual tumours based on transcriptomic signals. We apply this approach to explore EMT trajectories in 7180 tumours of epithelial origin and identify three macro-states with prognostic and therapeutic value, attributable to epithelial, hybrid E/M and mesenchymal phenotypes. We show that the hybrid state is relatively stable and linked with increased aneuploidy. We further employ spatial transcriptomics and single cell datasets to explore the spatial heterogeneity of EMT transformation and distinct interaction patterns with cytotoxic, NK cells and fibroblasts in the tumour microenvironment. Additionally, we provide a catalogue of genomic events underlying distinct evolutionary constraints on EMT transformation. This study sheds light on the aetiology of distinct stages along the EMT trajectory, and highlights broader genomic and environmental hallmarks shaping the mesenchymal transformation of primary tumours.
Insects entombed in copal, the sub-fossilized resin precursor of amber, represent a potential source of genetic data for extinct and extant, but endangered or elusive, species. Despite several ...studies demonstrated that it is not possible to recover endogenous DNA from insect inclusions, the preservation of biomolecules in fossilized resins samples is still under debate. In this study, we tested the possibility of obtaining endogenous ancient DNA (aDNA) molecules from insects preserved in copal, applying experimental protocols specifically designed for aDNA recovery. We were able to extract endogenous DNA molecules from one of the two samples analyzed, and to identify the taxonomic status of the specimen. Even if the sample was found well protected from external contaminants, the recovered DNA was low concentrated and extremely degraded, compared to the sample age. We conclude that it is possible to obtain genomic data from resin-entombed organisms, although we discourage aDNA analysis because of the destructive method of extraction protocols and the non-reproducibility of the results.
The equivalence of human induced pluripotent stem cells (hiPSCs) and human embryonic stem cells (hESCs) remains controversial. Here we use genetically matched hESC and hiPSC lines to assess the ...contribution of cellular origin (hESC vs. hiPSC), the Sendai virus (SeV) reprogramming method and genetic background to transcriptional and DNA methylation patterns while controlling for cell line clonality and sex. We find that transcriptional and epigenetic variation originating from genetic background dominates over variation due to cellular origin or SeV infection. Moreover, the 49 differentially expressed genes we detect between genetically matched hESCs and hiPSCs neither predict functional outcome nor distinguish an independently derived, larger set of unmatched hESC and hiPSC lines. We conclude that hESCs and hiPSCs are molecularly and functionally equivalent and cannot be distinguished by a consistent gene expression signature. Our data further imply that genetic background variation is a major confounding factor for transcriptional and epigenetic comparisons of pluripotent cell lines, explaining some of the previously observed differences between genetically unmatched hESCs and hiPSCs.
Human pluripotent stem cells (hPSCs) have the capacity to give rise to all differentiated cells of the adult. TGF-beta is used routinely for expansion of conventional hPSCs as flat epithelial ...colonies expressing the transcription factors POU5F1/OCT4, NANOG, SOX2. Here we report a global analysis of the transcriptional programme controlled by TGF-beta followed by an unbiased gain-of-function screening in multiple hPSC lines to identify factors mediating TGF-beta activity. We identify a quartet of transcriptional regulators promoting hPSC self-renewal including ZNF398, a human-specific mediator of pluripotency and epithelial character in hPSCs. Mechanistically, ZNF398 binds active promoters and enhancers together with SMAD3 and the histone acetyltransferase EP300, enabling transcription of TGF-beta targets. In the context of somatic cell reprogramming, inhibition of ZNF398 abolishes activation of pluripotency and epithelial genes and colony formation. Our findings have clear implications for the generation of bona fide hPSCs for regenerative medicine.
Topoisomerase I-DNA-cleavage complexes (Top1cc) stabilized by camptothecin (CPT) have specific effects at transcriptional levels. We recently reported that Top1cc increase antisense transcript ...(aRNAs) levels at divergent CpG-island promoters and, transiently, DNA/RNA hybrids (R-loop) in nuclear and mitochondrial genomes of colon cancer HCT116 cells. However, the relationship between R-loops and aRNAs was not established. Here, we show that aRNAs can form R-loops in N-TERA-2 cells under physiological conditions, and that promoter-associated R-loops are somewhat increased and extended in length immediately upon cell exposure to CPT. In contrast, persistent Top1ccs reduce the majority of R-loops suggesting that CPT-accumulated aRNAs are not commonly involved in R-loops. The enhancement of aRNAs by Top1ccs is present both in human colon cancer HCT116 cells and WI38 fibroblasts suggesting a common response of cancer and normal cells. Although Top1ccs lead to DSB and DDR kinases activation, we do not detect a dependence of aRNA accumulation on ATM or DNA-PK activation. However, we showed that the cell response to persistent Top1ccs can involve an impairment of aRNA turnover rather than a higher synthesis rate. Finally, a genome-wide analysis shows that persistent Top1ccs also determine an accumulation of sense transcripts at 5'-end gene regions suggesting an increased occurrence of truncated transcripts. Taken together, the results indicate that Top1 may regulate transcription initiation by modulating RNA polymerase-generated negative supercoils, which can in turn favor R-loop formation at promoters, and that transcript accumulation at TSS is a response to persistent transcriptional stress by Top1 poisoning.
Although clopidogrel is still frequently used in patients with acute coronary syndromes (ACS), its efficacy is hampered by interpatient response variability caused by genetic polymorphisms associated ...with clopidogrel's metabolism.
The goal of this study was to evaluate whether selecting antiplatelet therapy (clopidogrel, prasugrel, or ticagrelor) on the basis of a patient's genetic and clinical characteristics leads to better clinical outcomes compared with the standard of care, which bases the selection on clinical characteristics alone.
Patients hospitalized for ACS were randomly assigned to standard of care or the pharmacogenomic arm, which included the genotyping of ABCB1, CYP2C19*2, and CYP2C19*17 using an ST Q3 system that provides data within 70 min at each patient's bedside. The patients were followed up for 12 ± 1 month for the primary composite endpoint of cardiovascular death and the first occurrence of nonfatal myocardial infarction, nonfatal stroke, and major bleeding defined according to Bleeding Academic Research Consortium type 3 to 5 criteria.
After enrolling 888 patients, the study was prematurely stopped. Clopidogrel was used more frequently in the standard-of-care arm (50.7% vs. 43.3%), ticagrelor in the pharmacogenomic arm (42.6% vs. 32.7%; p = 0.02), and prasugrel was equally used in both arms. The primary endpoint occurred in 71 patients (15.9%) in the pharmacogenomic arm and in 114 (25.9%) in the standard-of-care arm (hazard ratio: 0.58; 95% confidence interval: 0.43 to 0.78; p < 0.001).
A personalized approach to selecting antiplatelet therapy for patients with ACS may reduce ischemic and bleeding events. (Pharmacogenetics of Clopidogrel in Patients With Acute Coronary Syndromes PHARMCLO; NCT03347435).
Precision medicine is a medical approach that takes into account individual genetic variability and often requires Next Generation Sequencing data in order to predict new treatments. Here we present ...GMIEC, Genomic Modules Identification et Characterization for genomics medicine, an application that is able to identify specific drugs at the level of single patient integrating multi-omics data such as RNA-sequencing, copy-number variation, methylation, Chromatin Immuno-Precipitation and Exome/Whole Genome sequencing. It is also possible to include clinical data related to each patient. GMIEC has been developed as a web-based R-Shiny platform and gives as output a table easy to use and explore.
We present GMIEC, a Shiny application for genomics medicine. The tool allows the users the integration of two or more multiple omics datasets (e.g. gene-expression, copy-number), at sample level, to identify groups of genes that share common genomic and corresponding drugs. We demonstrate the characteristics of our application by using it to analyze a prostate cancer data set.
GMIEC provides a simple interface for genomics medicine. GMIEC was develop with Shiny to provide an application that does not require advanced programming skills. GMIEC consists of three sub-application for the analysis (GMIEC-AN), the visualization (GMIEC-VIS) and the exploration of results (GMIEC-RES). GMIEC is an open source software and is available at https://github.com/guidmt/GMIEC-shiny.
Human skin is maintained by the differentiation and maturation of interfollicular stem and progenitors cells. We used DeepCAGE, genome-wide profiling of histone modifications and retroviral ...integration analysis, to map transcripts, promoters, enhancers, and super-enhancers (SEs) in prospectively isolated keratinocytes and transit-amplifying progenitors, and retrospectively defined keratinocyte stem cells. We show that >95% of the active promoters are in common and differentially regulated in progenitors and differentiated keratinocytes, while approximately half of the enhancers and SEs are stage specific and account for most of the epigenetic changes occurring during differentiation. Transcription factor (TF) motif identification and correlation with TF binding site maps allowed the identification of TF circuitries acting on enhancers and SEs during differentiation. Overall, our study provides a broad, genome-wide description of chromatin dynamics and differential enhancer and promoter usage during epithelial differentiation, and describes a novel approach to identify active regulatory elements in rare stem cell populations.
•Differentiation of epidermal progenitors is accompanied by enhancer remodeling•A TF circuitry operating on super-enhancers regulates epidermal differentiation•TP63 is a key master regulator of stage-specific super-enhancers•MLV integration marks enhancers in retrospectively defined epidermal stem cells
Epithelia are maintained by the continuous commitment and differentiation of somatic stem cells. Mavilio and colleagues utilized high-throughput approaches to identify the stage-specific enhancers and super-enhancers and the circuitry of transcription factors controlling the differential expression of promoters and, ultimately, the cell-specific gene-expression programs.
Therapy resistance in cancer is often driven by a subpopulation of cells that are temporarily arrested in a non-proliferative G0 state, which is difficult to capture and whose mutational drivers ...remain largely unknown.
We develop methodology to robustly identify this state from transcriptomic signals and characterise its prevalence and genomic constraints in solid primary tumours. We show that G0 arrest preferentially emerges in the context of more stable, less mutated genomes which maintain TP53 integrity and lack the hallmarks of DNA damage repair deficiency, while presenting increased APOBEC mutagenesis. We employ machine learning to uncover novel genomic dependencies of this process and validate the role of the centrosomal gene CEP89 as a modulator of proliferation and G0 arrest capacity. Lastly, we demonstrate that G0 arrest underlies unfavourable responses to various therapies exploiting cell cycle, kinase signalling and epigenetic mechanisms in single-cell data.
We propose a G0 arrest transcriptional signature that is linked with therapeutic resistance and can be used to further study and clinically track this state.
Acute myocardial infarction is primarily due to coronary atherosclerotic plaque rupture and subsequent thrombus formation. Platelets play a key role in the genesis and progression of both ...atherosclerosis and thrombosis. Since platelets are anuclear cells that inherit their mRNA from megakaryocyte precursors and maintain it unchanged during their life span, gene expression profiling at the time of an acute myocardial infarction provides information concerning the platelet gene expression preceding the coronary event. In ST-segment elevation myocardial infarction (STEMI), a gene-by-gene analysis of the platelet gene expression identified five differentially expressed genes: FKBP5, S100P, SAMSN1, CLEC4E and S100A12. The logistic regression model used to combine the gene expression in a STEMI vs healthy donors score showed an AUC of 0.95. The same five differentially expressed genes were externally validated using platelet gene expression data from patients with coronary atherosclerosis but without thrombosis. Platelet gene expression profile highlights five genes able to identify STEMI patients and to discriminate them in the background of atherosclerosis. Consequently, early signals of an imminent acute myocardial infarction are likely to be found by platelet gene expression profiling before the infarction occurs.