The majority of newly diagnosed prostate cancers are slow growing, with a long natural life history. Yet a subset can metastasize with lethal consequences. We reconstructed the phylogenies of 293 ...localized prostate tumors linked to clinical outcome data. Multiple subclones were detected in 59% of patients, and specific subclonal architectures associate with adverse clinicopathological features. Early tumor development is characterized by point mutations and deletions followed by later subclonal amplifications and changes in trinucleotide mutational signatures. Specific genes are selectively mutated prior to or following subclonal diversification, including MTOR, NKX3-1, and RB1. Patients with low-risk monoclonal tumors rarely relapse after primary therapy (7%), while those with high-risk polyclonal tumors frequently do (61%). The presence of multiple subclones in an index biopsy may be necessary, but not sufficient, for relapse of localized prostate cancer, suggesting that evolution-aware biomarkers should be studied in prospective studies of low-risk tumors suitable for active surveillance.
Display omitted
•The phylogenies of 293 localized prostate cancers were reconstructed•Multiple subclones were detected in 59% of patients•Specific genes are selectively mutated early or late in tumor evolution•Subclonal architecture adds prognostic ability to previously developed biomarkers
Tumors evolve during their natural life history. We studied the evolution of newly diagnosed prostate tumors and identified specific genes mutated early or late in a tumor’s life history. Considering subclonality improved predictions of disease aggressivity, identifying those patients who might be good candidates for receiving less treatment.
Global transcriptomic imbalance is a ubiquitous feature associated with cancer, including hepatocellular carcinoma (HCC). Analyses of 1,225 clinical HCC samples revealed that a large numbers of RNA ...binding proteins (RBPs) are dysregulated and that RBP dysregulation is associated with poor prognosis. We further identified that oncogenic activation of a top candidate RBP, negative elongation factor E (NELFE), via somatic copy-number alterations enhanced MYC signaling and promoted HCC progression. Interestingly, NELFE induces a unique tumor transcriptome by selectively regulating MYC-associated genes. Thus, our results revealed NELFE as an oncogenic protein that may contribute to transcriptome imbalance in HCC through the regulation of MYC signaling.
Display omitted
•mRNA binding proteins (RBPs) are globally dysregulated in HCC•Tumor-associated RBPs are predictive of HCC patient survival•NELFE is oncogenic and enhances MYC signaling•NELFE regulates MYC signaling by binding directly to MYC or its targets
Dang et al. show that a large numbers of RNA binding proteins (RBPs) are dysregulated in hepatocellular carcinoma (HCC) and that NELFE, an RBP, enhances MYC-induced HCC development by regulating the binding of MYC to target promoters and the mRNA stability of several MYC-regulated genes.
Alternative polyadenylation (APA) affects most mammalian genes. The genome-wide investigation of APA has been hampered by an inability to reliably profile it using conventional RNA-seq. We describe ...'Quantification of APA' (QAPA), a method that infers APA from conventional RNA-seq data. QAPA is faster and more sensitive than other methods. Application of QAPA reveals discrete, temporally coordinated APA programs during neurogenesis and that there is little overlap between genes regulated by alternative splicing and those by APA. Modeling of these data uncovers an APA sequence code. QAPA thus enables the discovery and characterization of programs of regulated APA using conventional RNA-seq.
High-throughput sequencing allows the detection and quantification of frequencies of somatic single nucleotide variants (SNV) in heterogeneous tumor cell populations. In some cases, the evolutionary ...history and population frequency of the subclonal lineages of tumor cells present in the sample can be reconstructed from these SNV frequency measurements. But automated methods to do this reconstruction are not available and the conditions under which reconstruction is possible have not been described.
We describe the conditions under which the evolutionary history can be uniquely reconstructed from SNV frequencies from single or multiple samples from the tumor population and we introduce a new statistical model, PhyloSub, that infers the phylogeny and genotype of the major subclonal lineages represented in the population of cancer cells. It uses a Bayesian nonparametric prior over trees that groups SNVs into major subclonal lineages and automatically estimates the number of lineages and their ancestry. We sample from the joint posterior distribution over trees to identify evolutionary histories and cell population frequencies that have the highest probability of generating the observed SNV frequency data. When multiple phylogenies are consistent with a given set of SNV frequencies, PhyloSub represents the uncertainty in the tumor phylogeny using a "partial order plot". Experiments on a simulated dataset and two real datasets comprising tumor samples from acute myeloid leukemia and chronic lymphocytic leukemia patients demonstrate that PhyloSub can infer both linear (or chain) and branching lineages and its inferences are in good agreement with ground truth, where it is available.
PhyloSub can be applied to frequencies of any "binary" somatic mutation, including SNVs as well as small insertions and deletions. The PhyloSub and partial order plot software is available from https://github.com/morrislab/phylosub/.
mRNA processing, transport, translation, and ultimately degradation involve a series of dedicated protein complexes that often assemble into large membraneless structures such as stress granules ...(SGs) and processing bodies (PBs). Here, systematic in vivo proximity-dependent biotinylation (BioID) analysis of 119 human proteins associated with different aspects of mRNA biology uncovers 7424 unique proximity interactions with 1,792 proteins. Classical bait-prey analysis reveals connections of hundreds of proteins to distinct mRNA-associated processes or complexes, including the splicing and transcriptional elongation machineries (protein phosphatase 4) and the CCR4-NOT deadenylase complex (CEP85, RNF219, and KIAA0355). Analysis of correlated patterns between endogenous preys uncovers the spatial organization of RNA regulatory structures and enables the definition of 144 core components of SGs and PBs. We report preexisting contacts between most core SG proteins under normal growth conditions and demonstrate that several core SG proteins (UBAP2L, CSDE1, and PRRC2C) are critical for the formation of microscopically visible SGs.
Display omitted
•We performed BioID on 119 human proteins involved in various facets of mRNA biology•Proximal relationships reveal the spatial organization of RNA regulatory structures•Prey-based analysis identifies 144 protein components of cytosolic RNA granules•UBAP2L, CSDE1, and PRRC2C are required for efficient formation of stress granules
Youn et al. performed proximity-based proteomics on 119 human proteins involved in the mRNA life cycle, focusing on cytosolic RNA granule components that are important for mRNA regulation. Systematic analysis of the proximal interactome revealed 144 core components of cytosolic RNA granules and illuminated the spatial organization of RNA regulatory structures.
Genomic analyses often involve scanning for potential transcription factor (TF) binding sites using models of the sequence specificity of DNA binding proteins. Many approaches have been developed to ...model and learn a protein's DNA-binding specificity, but these methods have not been systematically compared. Here we applied 26 such approaches to in vitro protein binding microarray data for 66 mouse TFs belonging to various families. For nine TFs, we also scored the resulting motif models on in vivo data, and found that the best in vitro-derived motifs performed similarly to motifs derived from the in vivo data. Our results indicate that simple models based on mononucleotide position weight matrices trained by the best methods perform similarly to more complex models for most TFs examined, but fall short in specific cases (<10% of the TFs examined here). In addition, the best-performing motifs typically have relatively low information content, consistent with widespread degeneracy in eukaryotic TF sequence preferences.
Cancer cells enter a reversible drug-tolerant persister (DTP) state to evade death from chemotherapy and targeted agents. It is increasingly appreciated that DTPs are important drivers of therapy ...failure and tumor relapse. We combined cellular barcoding and mathematical modeling in patient-derived colorectal cancer models to identify and characterize DTPs in response to chemotherapy. Barcode analysis revealed no loss of clonal complexity of tumors that entered the DTP state and recurred following treatment cessation. Our data fit a mathematical model where all cancer cells, and not a small subpopulation, possess an equipotent capacity to become DTPs. Mechanistically, we determined that DTPs display remarkable transcriptional and functional similarities to diapause, a reversible state of suspended embryonic development triggered by unfavorable environmental conditions. Our study provides insight into how cancer cells use a developmentally conserved mechanism to drive the DTP state, pointing to novel therapeutic opportunities to target DTPs.
Display omitted
•CRC cells possess an equipotent capacity to enter a drug-tolerant persister state•Tumors that recur following a DTP state maintain clonal complexity•DTP-state tumors are similar to diapause, an embryonic survival program•Similar to diapause, DTPs are dependent on autophagy for survival
Any cancer cell has the ability to enter a drug-tolerant persister state in response to chemotherapy regimens by acquiring a reversible functional state akin to diapause.
Subclonal reconstruction from bulk tumor DNA sequencing has become a pillar of cancer evolution studies, providing insight into the clonality and relative ordering of mutations and mutational ...processes. We provide an outline of the complex computational approaches used for subclonal reconstruction from single and multiple tumor samples. We identify the underlying assumptions and uncertainties in each step and suggest best practices for analysis and quality assessment. This guide provides a pragmatic resource for the growing user community of subclonal reconstruction methods.
Cytoscape Web is a web-based network visualization tool–modeled after Cytoscape–which is open source, interactive, customizable and easily integrated into web sites. Multiple file exchange formats ...can be used to load data into Cytoscape Web, including GraphML, XGMML and SIF. Availability and Implementation: Cytoscape Web is implemented in Flex/ActionScript with a JavaScript API and is freely available at http://cytoscapeweb.cytoscape.org/ Contact: gary.bader@utoronto.ca Supplementary information: Supplementary data are available at Bioinformatics online.
Tumors often contain multiple subpopulations of cancerous cells defined by distinct somatic mutations. We describe a new method, PhyloWGS, which can be applied to whole-genome sequencing data from ...one or more tumor samples to reconstruct complete genotypes of these subpopulations based on variant allele frequencies (VAFs) of point mutations and population frequencies of structural variations. We introduce a principled phylogenic correction for VAFs in loci affected by copy number alterations and we show that this correction greatly improves subclonal reconstruction compared to existing methods. PhyloWGS is free, open-source software, available at https://github.com/morrislab/phylowgs.