Highlights • We describe insights into mutation rate from high-throughput genome sequencing of families. • A paternal bias and agebeffect in mutation has been quantified at the genome-wide level. • ...Copy number variants arise less frequently than do point mutations, but affect more bases. • Future research will yield insights into the mutation rate of other forms of variation.
Copy number variants (CNVs) play an important role in human disease and population diversity. Advancements in technology have allowed for the analysis of CNVs in thousands of individuals with disease ...in addition to thousands of controls. These studies have identified rare CNVs associated with neuropsychiatric diseases such as autism, schizophrenia, and intellectual disability. In addition, copy number polymorphisms (CNPs) are present at higher frequencies in the population, show high diversity in copy number, sequence, and structure, and have been associated with multiple phenotypes, primarily related to immune or environmental response. However, the landscape of copy number variation still remains largely unexplored, especially for smaller CNVs and those embedded within complex regions of the human genome. An integrated approach including characterization of single nucleotide variants and CNVs in a large number of individuals with disease and normal genomes holds the promise of thoroughly elucidating the genetic basis of human disease and diversity.
Rare copy-number variants (CNVs) have been implicated in autism and intellectual disability. These variants are large and affect many genes but lack clear specificity toward autism as opposed to ...developmental-delay phenotypes. We exploited the repeat architecture of the genome to target segmental duplication-mediated rearrangement hotspots (n = 120, median size 1.78 Mbp, range 240 kbp to 13 Mbp) and smaller hotspots flanked by repetitive sequence (n = 1,247, median size 79 kbp, range 3–96 kbp) in 2,588 autistic individuals from simplex and multiplex families and in 580 controls. Our analysis identified several recurrent large hotspot events, including association with 1q21 duplications, which are more likely to be identified in individuals with autism than in those with developmental delay (p = 0.01; OR = 2.7). Within larger hotspots, we also identified smaller atypical CNVs that implicated CHD1L and ACACA for the 1q21 and 17q12 deletions, respectively. Our analysis, however, suggested no overall increase in the burden of smaller hotspots in autistic individuals as compared to controls. By focusing on gene-disruptive events, we identified recurrent CNVs, including DPP10, PLCB1, TRPM1, NRXN1, FHIT, and HYDIN, that are enriched in autism. We found that as the size of deletions increases, nonverbal IQ significantly decreases, but there is no impact on autism severity; and as the size of duplications increases, autism severity significantly increases but nonverbal IQ is not affected. The absence of an increased burden of smaller CNVs in individuals with autism and the failure of most large hotspots to refine to single genes is consistent with a model where imbalance of multiple genes contributes to a disease state.
Abstract Background CRISPR-Cas9 dropout screens are formidable tools for investigating biology with unprecedented precision and scale. However, biases in data lead to potential confounding effects on ...interpretation and compromise overall quality. The activity of Cas9 is influenced by structural features of the target site, including copy number amplifications (CN bias). More worryingly, proximal targeted loci tend to generate similar gene-independent responses to CRISPR-Cas9 targeting (proximity bias), possibly due to Cas9-induced whole chromosome-arm truncations or other genomic structural features and different chromatin accessibility levels. Results We benchmarked eight computational methods, rigorously evaluating their ability to reduce both CN and proximity bias in the two largest publicly available cell-line-based CRISPR-Cas9 screens to date. We also evaluated the capability of each method to preserve data quality and heterogeneity by assessing the extent to which the processed data allows accurate detection of true positive essential genes, established oncogenetic addictions, and known/novel biomarkers of cancer dependency. Our analysis sheds light on the ability of each method to correct biases under different scenarios. AC-Chronos outperforms other methods in correcting both CN and proximity biases when jointly processing multiple screens of models with available CN information, whereas CRISPRcleanR is the top performing method for individual screens or when CN information is not available. In addition, Chronos and AC-Chronos yield a final dataset better able to recapitulate known sets of essential and non-essential genes. Conclusions Overall, our investigation provides guidance for the selection of the most appropriate bias-correction method, based on its strengths, weaknesses and experimental settings.
Immune and targeted therapies achieve long-term survival in metastatic melanoma; however, new treatment strategies are needed to improve patients' outcomes
. We report on the efficacy, safety and ...biomarker analysis from the single-arm safety run-in (part 1; n = 9) and biomarker (part 2; n = 27) cohorts of the randomized, placebo-controlled, phase 3 COMBI-i trial (NCT02967692) of the anti-PD-1 antibody spartalizumab, in combination with the BRAF inhibitor dabrafenib and MEK inhibitor trametinib. Patients (n = 36) had previously untreated BRAF V600-mutant unresectable or metastatic melanoma. In part 1, the recommended phase 3 regimen was identified based on the incidence of dose-limiting toxicities (DLTs; primary endpoint): 400 mg of spartalizumab every 4 weeks plus 150 mg of dabrafenib twice daily plus 2 mg of trametinib once daily. Part 2 characterized changes in PD-L1 levels and CD8
cells following treatment (primary endpoint), and analyzed additional biomarkers. Assessments of efficacy and safety were key secondary endpoints (median follow-up, 24.3 months). Spartalizumab plus dabrafenib and trametinib led to an objective response rate (ORR) of 78%, including 44% complete responses (CRs). Grade ≥3 treatment-related adverse events (TRAEs) were experienced by 72% of patients. All patients had temporary dose modifications, and 17% permanently discontinued all three study drugs due to TRAEs. Early progression-free survival (PFS) events were associated with low tumor mutational burden/T cell-inflamed gene expression signature (GES) or high immunosuppressive tumor microenvironment (TME) GES levels at baseline; an immunosuppressive TME may also preclude CR. Overall, the efficacy, safety and on-treatment biomarker modulations associated with spartalizumab plus dabrafenib and trametinib are promising, and biomarkers that may predict long-term benefit were identified.
In patients with metastatic cancer, spatial heterogeneity of somatic alterations may lead to incomplete assessment of a cancer's mutational profile when analyzing a single tumor biopsy. In this ...study, we perform sequencing of cell-free DNA (cfDNA) and distinct metastatic tissue samples from ten rapid autopsy cases with pre-treated metastatic cancer. We show that levels of heterogeneity in genetic biomarkers vary between patients but that gene expression signatures representative of the tumor microenvironment are more consistent. Across nine patients with plasma samples available, we are able to detect 62/62 truncal and 47/121 non-truncal point mutations in cfDNA. We observe that mutation clonality in cfDNA is correlated with the number of metastatic lesions in which the mutation is detected and use this result to derive a clonality threshold to classify truncal and non-truncal driver alterations with reasonable specificity. In contrast, mutation truncality is more often incorrectly assigned when studying single tissue samples. Our results demonstrate the utility of a single cfDNA sample relative to that of single tissue samples when treating patients with metastatic cancer.
Clonal hematopoiesis of indeterminate potential (CHIP) is associated with increased risk of atherosclerotic cardiovascular disease, and mouse experiments suggest that CHIP related to Tet2 loss of ...function in myeloid cells accelerates atherosclerosis via augmented interleukin (IL) 1β signaling.
To assess whether individuals with CHIP have greater cardiovascular event reduction in response to IL-1β neutralization in the Canankinumab Anti-inflammatory Thrombosis Outcomes Trial (CANTOS).
This randomized clinical trial took place from April 2011 to June 2017 at more than 1000 clinical sites in 39 countries. Targeted deep sequencing of genes previously associated with CHIP in a subset of trial participants using genomic DNA prepared from baseline peripheral blood samples were analyzed. All participants had prior myocardial infarction and elevated high-sensitivity C-reactive protein level above 0.20 mg/dL. Analysis took place between June 2017 and December 2021.
Canakinumab, an anti-IL-1β antibody, given at doses of 50, 150, and 300 mg once every 3 months.
Major adverse cardiovascular events (MACE).
A total of 338 patients (8.6%) were identified in this subset with evidence for clonal hematopoiesis. As expected, the incidence of CHIP increased with age; the mean (SD) age of patients with CHIP was 66.3 (9.2) years and 61.5 (9.6) years in patients without CHIP. Unlike other populations that were not preselected for elevated C-reactive protein, in the CANTOS population variants in TET2 were more common than DNMT3A (119 variants in 103 patients vs 86 variants in 85 patients). Placebo-treated patients with CHIP showed a nonsignificant increase in the rate of MACE compared with patients without CHIP using a Cox proportional hazard model (hazard ratio, 1.32 95% CI, 0.86-2.04; P = .21). Exploratory analyses of placebo-treated patients with a somatic variant in either TET2 or DNMT3A (n = 58) showed an equivocal risk for MACE (hazard ratio, 1.65 95% CI, 0.97-2.80; P = .06). Patients with CHIP due to somatic variants in TET2 also had reduced risk for MACE while taking canakinumab (hazard ratio, 0.38 95% CI, 0.15-0.96) with equivocal difference compared with others (P for interaction = .14).
These results are consistent with observations of increased risk for cardiovascular events in patients with CHIP and raise the possibility that those with TET2 variants may respond better to canakinumab than those without CHIP. Future studies are required to further substantiate this hypothesis.
ClinicalTrials.gov Identifier: NCT01327846.
Background Somatic alterations in the cancer genome, some of which are associated with changes in gene expression, have been characterized in multiple studies across diverse cancer types. However, ...less is known about germline variants that influence tumor biology by shaping the cancer transcriptome. Methods We performed expression quantitative trait loci (eQTL) analyses using multi-dimensional data from The Cancer Genome Atlas to explore the role of germline variation in mediating the cancer transcriptome. After accounting for associations between somatic alterations and gene expression, we determined the contribution of inherited variants to the cancer transcriptome relative to that of somatic variants. Finally, we performed an interaction analysis using estimates of tumor cellularity to identify cell type-restricted eQTLs. Results The proportion of genes with at least one eQTL varied between cancer types, ranging between 0.8% in melanoma to 28.5% in thyroid cancer and was correlated more strongly with intratumor heterogeneity than with somatic alteration rates. Although contributions to variance in gene expression was low for most genes, some eQTLs accounted for more than 30% of expression of proximal genes. We identified cell type-restricted eQTLs in genes known to be cancer drivers including LPP and EZH2 that were associated with disease-specific mortality in TCGA but not associated with disease risk in published GWAS. Together, our results highlight the need to consider germline variation in interpreting cancer biology beyond risk prediction. Keywords: eQTL, TCGA, Cancer genomics, Germline variants
Focal cortical dysplasia (FCD), hemimegalencephaly, and megalencephaly constitute a spectrum of malformations of cortical development with shared neuropathologic features. These disorders are ...associated with significant childhood morbidity and mortality.
To identify the underlying molecular cause of FCD, hemimegalencephaly, and diffuse megalencephaly.
Patients with FCD, hemimegalencephaly, or megalencephaly (mean age, 11.7 years; range, 2-32 years) were recruited from Pediatric Hospital A. Meyer, the University of Hong Kong, and Seattle Children's Research Institute from June 2012 to June 2014. Whole-exome sequencing (WES) was performed on 8 children with FCD or hemimegalencephaly using standard-depth (50-60X) sequencing in peripheral samples (blood, saliva, or skin) from the affected child and their parents and deep (150-180X) sequencing in affected brain tissue. Targeted sequencing and WES were used to screen 93 children with molecularly unexplained diffuse or focal brain overgrowth. Histopathologic and functional assays of phosphatidylinositol 3-kinase-AKT (serine/threonine kinase)-mammalian target of rapamycin (mTOR) pathway activity in resected brain tissue and cultured neurons were performed to validate mutations.
Whole-exome sequencing and targeted sequencing identified variants associated with this spectrum of developmental brain disorders.
Low-level mosaic mutations of MTOR were identified in brain tissue in 4 children with FCD type 2a with alternative allele fractions ranging from 0.012 to 0.086. Intermediate-level mosaic mutation of MTOR (p.Thr1977Ile) was also identified in 3 unrelated children with diffuse megalencephaly and pigmentary mosaicism in skin. Finally, a constitutional de novo mutation of MTOR (p.Glu1799Lys) was identified in 3 unrelated children with diffuse megalencephaly and intellectual disability. Molecular and functional analysis in 2 children with FCD2a from whom multiple affected brain tissue samples were available revealed a mutation gradient with an epicenter in the most epileptogenic area. When expressed in cultured neurons, all MTOR mutations identified here drive constitutive activation of mTOR complex 1 and enlarged neuronal size.
In this study, mutations of MTOR were associated with a spectrum of brain overgrowth phenotypes extending from FCD type 2a to diffuse megalencephaly, distinguished by different mutations and levels of mosaicism. These mutations may be sufficient to cause cellular hypertrophy in cultured neurons and may provide a demonstration of the pattern of mosaicism in brain and substantiate the link between mosaic mutations of MTOR and pigmentary mosaicism in skin.
Matched sequencing of both tumor and normal tissue is routinely used to classify variants of uncertain significance (VUS) into somatic vs. germline. However, assays used in molecular diagnostics ...focus on known somatic alterations in cancer genes and often only sequence tumors. Therefore, an algorithm that reliably classifies variants would be helpful for retrospective exploratory analyses. Contamination of tumor samples with normal cells results in differences in expected allelic fractions of germline and somatic variants, which can be exploited to accurately infer genotypes after adjusting for local copy number. However, existing algorithms for determining tumor purity, ploidy and copy number are not designed for unmatched short read sequencing data.
We describe a methodology and corresponding open source software for estimating tumor purity, copy number, loss of heterozygosity (LOH), and contamination, and for classification of single nucleotide variants (SNVs) by somatic status and clonality. This R package, PureCN, is optimized for targeted short read sequencing data, integrates well with standard somatic variant detection pipelines, and has support for matched and unmatched tumor samples. Accuracy is demonstrated on simulated data and on real whole exome sequencing data.
Our algorithm provides accurate estimates of tumor purity and ploidy, even if matched normal samples are not available. This in turn allows accurate classification of SNVs. The software is provided as open source (Artistic License 2.0) R/Bioconductor package PureCN (http://bioconductor.org/packages/PureCN/).