Cancers are caused by the accumulation of genomic alterations. Therefore, analyses of cancer genome sequences and structures provide insights for understanding cancer biology, diagnosis and therapy. ...The application of second-generation DNA sequencing technologies (also known as next-generation sequencing) - through whole-genome, whole-exome and whole-transcriptome approaches - is allowing substantial advances in cancer genomics. These methods are facilitating an increase in the efficiency and resolution of detection of each of the principal types of somatic cancer genome alterations, including nucleotide substitutions, small insertions and deletions, copy number alterations, chromosomal rearrangements and microbial infections. This Review focuses on the methodological considerations for characterizing somatic genome alterations in cancer and the future prospects for these approaches.
How somatic mutations accumulate in normal cells is poorly understood. A comprehensive analysis of RNA sequencing data from ~6700 samples across 29 normal tissues revealed multiple somatic variants, ...demonstrating that macroscopic clones can be found in many normal tissues. We found that sun-exposed skin, esophagus, and lung have a higher mutation burden than other tested tissues, which suggests that environmental factors can promote somatic mosaicism. Mutation burden was associated with both age and tissue-specific cell proliferation rate, highlighting that mutations accumulate over both time and number of cell divisions. Finally, normal tissues were found to harbor mutations in known cancer genes and hotspots. This study provides a broad view of macroscopic clonal expansion in human tissues, thus serving as a foundation for associating clonal expansion with environmental factors, aging, and risk of disease.
Multiple myeloma (MM) is a genetically complex disease that evolves from pre-malignant stages, such as monoclonal gammaopathy of undetermined significance and smouldering multiple myeloma, and ...progresses to symptomatic MM; this continuum provides a unique framework to study the sequential genomic evolution of MM. In the past 5 years, results from large-scale whole-exome sequencing studies have provided new insights into the clonal heterogeneity and evolution of the disease. Moreover, the recurrent co-occurrence of genomic events helps to dissect the genomic complexity underlying tumour progression. According to the primary genetic events involved in tumorigenesis, MM tumours are hierarchically subdivided into hyperdiploid and non-hyperdiploid subtypes; subsequently, secondary genetic events lead to tumour progression. In this Review, we describe the 'driver' gene alterations involved in the development and progression of MM, with a focus on the sequential acquisition of the main genomic aberrations. We also provide valuable insight into the clonal heterogeneity and clonal evolution of the disease, as well as into the therapeutic implications of a comprehensive understanding of the genomic complexity of MM.
Detection of somatic point substitutions is a key step in characterizing the cancer genome. However, existing methods typically miss low-allelic-fraction mutations that occur in only a subset of the ...sequenced cells owing to either tumor heterogeneity or contamination by normal cells. Here we present MuTect, a method that applies a Bayesian classifier to detect somatic mutations with very low allele fractions, requiring only a few supporting reads, followed by carefully tuned filters that ensure high specificity. We also describe benchmarking approaches that use real, rather than simulated, sequencing data to evaluate the sensitivity and specificity as a function of sequencing depth, base quality and allelic fraction. Compared with other methods, MuTect has higher sensitivity with similar specificity, especially for mutations with allelic fractions as low as 0.1 and below, making MuTect particularly useful for studying cancer subclones and their evolution in standard exome and genome sequencing data.
Post-sequencing quality control is a crucial component of RNA sequencing (RNA-seq) data generation and analysis, as sample quality can be affected by sample storage, extraction and sequencing ...protocols. RNA-seq is increasingly applied to cohorts ranging from hundreds to tens of thousands of samples in size, but existing tools do not readily scale to these sizes, and were not designed for a wide range of sample types and qualities. Here, we describe RNA-SeQC 2, an efficient reimplementation of RNA-SeQC (DeLuca et al., 2012) that adds multiple metrics designed to characterize sample quality across a wide range of RNA-seq protocols.
The command-line tool, documentation and C++ source code are available at the GitHub repository https://github.com/getzlab/rnaseqc. Code and data for reproducing the figures in this paper are available at https://github.com/getzlab/rnaseqc2-paper.
Supplementary data are available at Bioinformatics online.
Multiple myeloma (MM) is a haematological malignancy of plasma cells characterized by substantial intraclonal genetic heterogeneity. Although therapeutic advances made in the past few years have led ...to improved outcomes and longer survival, MM remains largely incurable. Over the past decade, genomic analyses of patient samples have demonstrated that MM is not a single disease but rather a spectrum of haematological entities that all share similar clinical symptoms. Moreover, analyses of samples from monoclonal gammopathy of undetermined significance and smouldering MM have also shown the existence of genetic heterogeneity in precursor stages, in some cases remarkably similar to that of MM. This heterogeneity highlights the need for a greater dissection of underlying disease biology, especially the clonal diversity and molecular events underpinning MM at each stage to enable the stratification of individuals with a high risk of progression. Emerging single-cell sequencing technologies present a superlative solution to delineate the complexity of monoclonal gammopathy of undetermined significance, smouldering MM and MM. In this Review, we discuss how genomics has revealed novel insights into clonal evolution patterns of MM and provide examples from single-cell studies that are beginning to unravel the mutational and phenotypic characteristics of individual cells within the bone marrow tumour, immune microenvironment and peripheral blood. We also address future perspectives on clinical application, proposing that multi-omics single-cell profiling can guide early patient diagnosis, risk stratification and treatment strategies.
Abstract
Identifying robust, patient-specific, and predictive biomarkers presents a major obstacle in precision oncology. To optimize patient-specific therapeutic strategies, here we couple pathway ...knowledge with large-scale drug sensitivity, RNAi, and CRISPR-Cas9 screening data from 460 cell lines. Pathway activity levels are found to be strong predictive biomarkers for the essentiality of 15 proteins, including the essentiality of MAD2L1 in breast cancer patients with high BRCA-pathway activity. We also find strong predictive biomarkers for the sensitivity to 31 compounds, including BCL2 and microtubule inhibitors (MTIs). Lastly, we show that Bcl-xL inhibition can modulate the activity of a predictive biomarker pathway and re-sensitize lung cancer cells and tumors to MTI therapy. Overall, our results support the use of pathways in helping to achieve the goal of precision medicine by uncovering dozens of predictive biomarkers.
Type 2 diabetes (T2D) is a heterogeneous disease for which (1) disease-causing pathways are incompletely understood and (2) subclassification may improve patient management. Unlike other biomarkers, ...germline genetic markers do not change with disease progression or treatment. In this paper, we test whether a germline genetic approach informed by physiology can be used to deconstruct T2D heterogeneity. First, we aimed to categorize genetic loci into groups representing likely disease mechanistic pathways. Second, we asked whether the novel clusters of genetic loci we identified have any broad clinical consequence, as assessed in four separate subsets of individuals with T2D.
In an effort to identify mechanistic pathways driven by established T2D genetic loci, we applied Bayesian nonnegative matrix factorization (bNMF) clustering to genome-wide association study (GWAS) results for 94 independent T2D genetic variants and 47 diabetes-related traits. We identified five robust clusters of T2D loci and traits, each with distinct tissue-specific enhancer enrichment based on analysis of epigenomic data from 28 cell types. Two clusters contained variant-trait associations indicative of reduced beta cell function, differing from each other by high versus low proinsulin levels. The three other clusters displayed features of insulin resistance: obesity mediated (high body mass index BMI and waist circumference WC), "lipodystrophy-like" fat distribution (low BMI, adiponectin, and high-density lipoprotein HDL cholesterol, and high triglycerides), and disrupted liver lipid metabolism (low triglycerides). Increased cluster genetic risk scores were associated with distinct clinical outcomes, including increased blood pressure, coronary artery disease (CAD), and stroke. We evaluated the potential for clinical impact of these clusters in four studies containing individuals with T2D (Metabolic Syndrome in Men Study METSIM, N = 487; Ashkenazi, N = 509; Partners Biobank, N = 2,065; UK Biobank UKBB, N = 14,813). Individuals with T2D in the top genetic risk score decile for each cluster reproducibly exhibited the predicted cluster-associated phenotypes, with approximately 30% of all individuals assigned to just one cluster top decile. Limitations of this study include that the genetic variants used in the cluster analysis were restricted to those associated with T2D in populations of European ancestry.
Our approach identifies salient T2D genetically anchored and physiologically informed pathways, and supports the use of genetics to deconstruct T2D heterogeneity. Classification of patients by these genetic pathways may offer a step toward genetically informed T2D patient management.
Everolimus, an inhibitor of the mammalian target of rapamycin (mTOR), is effective in treating tumors harboring alterations in the mTOR pathway. Mechanisms of resistance to everolimus remain ...undefined. Resistance developed in a patient with metastatic anaplastic thyroid carcinoma after an extraordinary 18-month response. Whole-exome sequencing of pretreatment and drug-resistant tumors revealed a nonsense mutation in TSC2, a negative regulator of mTOR, suggesting a mechanism for exquisite sensitivity to everolimus. The resistant tumor also harbored a mutation in MTOR that confers resistance to allosteric mTOR inhibition. The mutation remains sensitive to mTOR kinase inhibitors.