Computational efforts to identify functional elements within genomes leverage comparative sequence information by looking for regions that exhibit evidence of selective constraint. One way of detecting constrained elements is to follow a bottom-up approach by computing constraint scores for individual positions of a multiple alignment and then defining constrained elements as segments of contiguous, highly scoring nucleotide positions. Here we present GERP++, a new tool that uses maximum likelihood evolutionary rate estimation for position-specific scoring and, in contrast to previous bottom-up methods, a novel dynamic programming approach to subsequently define constrained elements. GERP++ evaluates a richer set of candidate element breakpoints and ranks them based on statistical significance, eliminating the need for biased heuristic extension techniques. Using GERP++ we identify over 1.3 million constrained elements spanning over 7% of the human genome. We predict a higher fraction than earlier estimates largely due to the annotation of longer constrained elements, which improves the one-to-one correspondence between predicted elements and known functional sequences. GERP++ is an efficient and effective tool to provide both nucleotide- and element-level constraint scores within deep multiple sequence alignments.
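The generic bottom-up idea described above (score each position, then group contiguous high-scoring positions into elements) can be illustrated with a minimal sketch. This is a plain threshold-based grouping, not the dynamic programming approach of GERP++; the function name `constrained_segments` and the parameters `threshold` and `min_length` are hypothetical and chosen only for illustration.

```python
from typing import List, Tuple


def constrained_segments(
    scores: List[float],
    threshold: float = 2.0,   # hypothetical per-position score cutoff
    min_length: int = 3,      # hypothetical minimum element length
) -> List[Tuple[int, int, float]]:
    """Group contiguous positions scoring >= threshold into candidate elements.

    Returns a list of (start, end_exclusive, summed_score) tuples.
    This is simple thresholding, NOT the GERP++ algorithm.
    """
    segments = []
    start = None  # index where the current high-scoring run began
    for i, score in enumerate(scores):
        if score >= threshold:
            if start is None:
                start = i
        elif start is not None:
            # The run ended at position i - 1; keep it if it is long enough.
            if i - start >= min_length:
                segments.append((start, i, sum(scores[start:i])))
            start = None
    # Handle a run that extends to the end of the alignment.
    if start is not None and len(scores) - start >= min_length:
        segments.append((start, len(scores), sum(scores[start:])))
    return segments
```

A simple grouping like this fixes element boundaries wherever a single position falls below the cutoff, which is the kind of limitation that motivates evaluating a richer set of candidate breakpoints.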
Non-ribosomal peptide synthetases are important enzymes for the assembly of complex peptide natural products. Within these multi-modular assembly lines, condensation domains perform the central function of chain assembly, typically by forming a peptide bond between two peptidyl carrier protein (PCP)-bound substrates. In this work, we report structural snapshots of a condensation domain in complex with an aminoacyl-PCP acceptor substrate. These structures allow the identification of a mechanism that controls access of acceptor substrates to the active site in condensation domains. The structures of this complex also allow us to demonstrate that condensation domain active sites do not contain a distinct pocket to select the side chain of the acceptor substrate during peptide assembly but that residues within the active site motif can instead serve to tune the selectivity of these central biosynthetic domains.
Drugs targeting cyclin-dependent kinases 4 and 6 (CDK4/6) are promising new treatments for melanoma and other solid malignancies. In studies on CDK4/6 inhibitor resistance, protein arginine methyltransferase 5 (PRMT5) regulation of alternative splicing was shown to be an important downstream component of the CDK4/6 pathway. However, the full effects of inhibition of CDK4/6 on splicing events in melanoma and the extent to which they are dependent on PRMT5 have not been established. We performed full-length mRNA sequencing on CHL1 and A375 melanoma cell lines treated with the CDK4/6 inhibitor palbociclib and the PRMT5 inhibitor GSK3326595 and analysed data for differential gene expression and differential pre-mRNA splicing induced by these agents. Changes in gene expression and RNA splicing were more extensive under PRMT5 inhibition than under CDK4/6 inhibition. Although PRMT5 inhibition and CDK4/6 inhibition induced common RNA splicing events and gene expression profiles, the majority of events induced by CDK4/6 inhibition were distinct. Our findings indicate CDK4/6 has the ability to regulate alternative splicing in a manner that is distinct from PRMT5 inhibition, resulting in divergent changes in gene expression under each therapy.
The genetic programs that maintain leukemia stem cell (LSC) self-renewal and oncogenic potential have been well defined; however, the comprehensive epigenetic landscape that sustains LSC cellular identity and functionality is less well established. We report that LSCs in MLL-associated leukemia reside in an epigenetic state of relative genome-wide high-level H3K4me3 and low-level H3K79me2. LSC differentiation is associated with reversal of these broad epigenetic profiles, with concomitant downregulation of crucial MLL target genes and the LSC maintenance transcriptional program that is driven by the loss of H3K4me3, but not H3K79me2. The H3K4-specific demethylase KDM5B negatively regulates leukemogenesis in murine and human MLL-rearranged AML cells, demonstrating a crucial role for the H3K4 global methylome in determining LSC fate.
• MLL LSCs are maintained in a hyper-H3K4me3 and hypo-H3K79me2 epigenomic state
• LSC differentiation is associated with global inversion of the histone methylome
• H3K4me3 serves a crucial mechanistic role in LSC maintenance
• Histone demethylase KDM5B negatively regulates LSC potential
Wong et al. show that leukemia stem cells (LSCs) in MLL-rearranged leukemia are in a relative genome-wide high-H3K4me3 and low-H3K79me2 state. Differentiation of LSCs reverses this pattern, but the reduced expression of key MLL target genes is driven mainly by the loss of H3K4me3, which is regulated by KDM5B.
Tumors of distinct tissues of origin and genetic makeup display common hallmark cellular phenotypes, including sustained proliferation, suppression of cell death, and altered metabolism. These phenotypic commonalities have been proposed to stem from disruption of conserved regulatory mechanisms evolved during the transition to multicellularity to control fundamental cellular processes such as growth and replication. Dating the evolutionary emergence of human genes through phylostratigraphy uncovered close association between gene age and expression level in RNA sequencing data from The Cancer Genome Atlas for seven solid cancers. Genes conserved with unicellular organisms were strongly up-regulated, whereas genes of metazoan origin were primarily inactivated. These patterns were most consistent for processes known to be important in cancer, implicating both selection and active regulation during malignant transformation. The coordinated expression of strongly interacting multicellularity and unicellularity processes was lost in tumors. This separation of unicellular and multicellular functions appeared to be mediated by 12 highly connected genes, marking them as important general drivers of tumorigenesis. Our findings suggest that common principles closely tied to the evolutionary history of genes underlie convergent changes at the cellular process level across a range of solid cancers. We propose that altered activity of genes at the interfaces between multicellular and unicellular regions of human gene regulatory networks activates primitive transcriptional programs, driving common hallmark features of cancer. Manipulation of cross-talk between biological processes of different evolutionary origins may thus present powerful and broadly applicable treatment strategies for cancer.
Spatial proteomics technologies have revealed an underappreciated link between the location of cells in tissue microenvironments and the underlying biology and clinical features, but there is significant lag in the development of downstream analysis methods and benchmarking tools. Here we present SPIAT (spatial image analysis of tissues), a spatial-platform agnostic toolkit with a suite of spatial analysis algorithms, and spaSim (spatial simulator), a simulator of tissue spatial data. SPIAT includes multiple colocalization, neighborhood and spatial heterogeneity metrics to characterize the spatial patterns of cells. Ten spatial metrics of SPIAT are benchmarked using simulated data generated with spaSim. We show how SPIAT can uncover cancer immune subtypes correlated with prognosis in cancer and characterize cell dysfunction in diabetes. Our results suggest SPIAT and spaSim as useful tools for quantifying spatial patterns, identifying and validating correlates of clinical outcomes and supporting method development.
Neoplastic growth and many of the hallmark properties of cancer are driven by the disruption of molecular networks established during the emergence of multicellularity. Regulatory pathways and molecules that evolved to impose regulatory constraints upon networks established in earlier unicellular organisms enabled greater communication and coordination between the diverse cell types required for multicellularity, but also created liabilities in the form of points of vulnerability in the network that when mutated or dysregulated facilitate the development of cancer. These factors are usually overlooked in genomic analyses of cancer, but understanding where vulnerabilities to cancer lie in the networks of multicellular species would provide important new insights into how core molecular processes and gene regulation change during tumourigenesis. We describe how the evolutionary origins of genes influence their roles in cancer, and how connections formed between unicellular and multicellular genes that act as key regulatory hubs for normal tissue homeostasis can also contribute to malignant transformation when disrupted. Tumours in general are characterised by increased dependence on unicellular processes for survival, and major dysregulation of the control structures imposed on these processes during the evolution of multicellularity. Mounting molecular evidence suggests altered interactions at the interface between unicellular and multicellular genes play key roles in the initiation and progression of cancer. Furthermore, unicellular network regions activated in cancer show high degrees of robustness and plasticity, conferring increased adaptability to tumour cells by supporting effective responses to environmental pressures such as drug exposure.
Examining how the links between multicellular and unicellular regions get disrupted in tumours has great potential to identify novel drivers of cancer, and to guide improvements to cancer treatment by identifying more effective therapeutic strategies. Recent successes in targeting unicellular processes by novel compounds underscore the logic of such approaches. Further gains could come from identifying genes at the interface between unicellular and multicellular processes and manipulating the communication between network regions of different evolutionary ages.
Extensive transcriptional alterations are observed in cancer, many of which activate core biological processes established in unicellular organisms or suppress differentiation pathways formed in metazoans. Through rigorous, integrative analysis of genomics data from a range of solid tumors, we show many transcriptional changes in tumors are tied to mutations disrupting regulatory interactions between unicellular and multicellular genes within human gene regulatory networks (GRNs). Recurrent point mutations were enriched in regulator genes linking unicellular and multicellular subnetworks, while copy-number alterations affected downstream target genes in distinctly unicellular and multicellular regions of the GRN. Our results depict drivers of tumourigenesis as genes that created key regulatory links during the evolution of early multicellular life, whose dysfunction creates widespread dysregulation of primitive elements of the GRN. Several genes we identified as important in this process were associated with drug response, demonstrating the potential clinical value of our approach.
Metazoans inherited genes from unicellular ancestors that perform essential biological processes such as cell division, metabolism, and protein translation. Multicellularity requires careful control and coordination of these unicellular genes to maintain tissue integrity and homeostasis. Gene regulatory networks (GRNs) that arose during metazoan evolution are frequently altered in cancer, resulting in over-expression of unicellular genes. We propose that an imbalance in co-expression of unicellular (UC) and multicellular (MC) genes is a driving force in cancer.
We combine gene co-expression analysis to infer changes to GRNs in cancer with protein sequence conservation data to distinguish genes with UC and MC origins. Co-expression networks created using RNA sequencing data from 31 tumor types and normal tissue samples are divided into modules enriched for UC genes, MC genes, or mixed UC-MC modules. The greatest differences between tumor and normal tissue co-expression networks occur within mixed UC-MC modules. MC and UC genes not commonly co-expressed in normal tissues form distinct co-expression modules seen only in tumors. The degree of rewiring of genes within mixed UC-MC modules increases with tumor grade and stage. Mixed UC-MC modules are enriched for somatic mutations in cancer genes, particularly amplifications, suggesting an important driver of the rewiring observed in tumors is copy number changes.
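The division of co-expression modules into UC, MC, and mixed UC-MC classes can be sketched as follows. The abstract does not state how enrichment was tested, so this sketch substitutes a plain fraction-based rule; the function name `classify_modules` and the cutoffs `uc_cut` and `mc_cut` are hypothetical and are not taken from the study.

```python
from typing import Dict, Set


def classify_modules(
    modules: Dict[str, Set[str]],
    uc_genes: Set[str],
    uc_cut: float = 0.7,  # hypothetical: at least this UC fraction -> "UC"
    mc_cut: float = 0.3,  # hypothetical: at most this UC fraction -> "MC"
) -> Dict[str, str]:
    """Label each module by the fraction of its genes with unicellular origin.

    Genes not in `uc_genes` are treated as multicellular (MC) for simplicity.
    This fraction rule is an illustrative stand-in for a real enrichment test.
    """
    labels = {}
    for name, genes in modules.items():
        if not genes:
            labels[name] = "empty"
            continue
        frac_uc = len(genes & uc_genes) / len(genes)
        if frac_uc >= uc_cut:
            labels[name] = "UC"
        elif frac_uc <= mc_cut:
            labels[name] = "MC"
        else:
            labels[name] = "mixed UC-MC"
    return labels
```

In practice an enrichment test against the genome-wide UC/MC background would be more appropriate than fixed cutoffs, since module sizes vary widely.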
Our study shows the greatest changes to gene co-expression patterns during tumor progression occur between genes of MC and UC origins, implicating the breakdown and rewiring of metazoan gene regulatory networks in cancer development and progression.
Sarcomas are a key feature of Li-Fraumeni and related syndromes (LFS/LFL), associated with germline TP53 mutations. Current penetrance estimates for TP53 mutations are subject to significant ascertainment bias. The International Sarcoma Kindred Study is a clinic-based, prospective cohort of adult-onset sarcoma cases, without regard to family history. The entire cohort was screened for mutations in TP53 using high-resolution melting analysis and Sanger sequencing, and multiplex-ligation-dependent probe amplification and targeted massively parallel sequencing for copy number changes. Pathogenic TP53 mutations were detected in blood DNA of 20/559 sarcoma probands (3.6%); 17 were germline and 3 appeared to be somatically acquired. Of the germline carriers, one appeared to be mosaic, detectable in the tumor and blood, but not epithelial tissues. Germline mutation carriers were more likely to have multiple cancers (47% vs 15% for non-carriers, P = 3.0×10⁻³), and earlier cancer onset (33 vs 48 years, P = 1.19×10⁻³). The median survival of mutation carriers following first cancer diagnosis was not significantly different from non-carriers. Only 10/17 (59%) pedigrees met classical or Chompret criteria for LFS. In summary, germline TP53 mutations are not rare in adult patients with sarcoma, with implications for screening, surveillance, treatment and genetic counselling of carriers and family members.