Germline structural variants (SVs) are challenging to resolve by conventional genetic testing assays. Long-read sequencing has improved the global characterization of SVs, but its sensitivity at ...cancer susceptibility loci has not been reported. Nanopore long-read genome sequencing was performed for nineteen individuals with pathogenic copy number alterations in BRCA1, BRCA2, CHEK2 and PALB2 identified by prior clinical testing. Fourteen variants, which spanned single exons to whole genes and included a tandem duplication, were accurately represented. Defining the precise breakpoints of SVs in BRCA1 and CHEK2 revealed unforeseen allelic heterogeneity and informed the mechanisms underlying the formation of recurrent deletions. Integrating read-based and statistical phasing further helped define extended haplotypes associated with founder alleles. Long-read sequencing is a sensitive method for characterizing private, recurrent and founder SVs underlying breast cancer susceptibility. Our findings demonstrate the potential for nanopore sequencing as a powerful genetic testing assay in the hereditary cancer setting.
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Abstract
Summary
Reliably identifying genomic rearrangements and interpreting their impact is a key step in understanding their role in human cancers and inherited genetic diseases. Many short read ...algorithmic approaches exist but all have appreciable false negative rates. A common approach is to evaluate the union of multiple tools increasing sensitivity, followed by filtering to retain specificity. Here we describe an application framework for the rapid generation of structural variant consensus, unique in its ability to visualize the genetic impact and context as well as process both genome and transcriptome data.
Availability and implementation
http://mavis.bcgsc.ca
Supplementary information
Supplementary data are available at Bioinformatics online.
Immune checkpoint inhibitors (ICI) have revolutionized the treatment of solid tumors with dramatic and durable responses seen across multiple tumor types. However, identifying patients who will ...respond to these drugs remains challenging, particularly in the context of advanced and previously treated cancers.
We characterized fresh tumor biopsies from a heterogeneous pan-cancer cohort of 98 patients with metastatic predominantly pretreated disease through the Personalized OncoGenomics program at BC Cancer (Vancouver, Canada) using whole genome and transcriptome analysis (WGTA). Baseline characteristics and follow-up data were collected retrospectively.
We found that tumor mutation burden, independent of mismatch repair status, was the most predictive marker of time to progression (
= 0.007), but immune-related CD8
T-cell and M1-M2 macrophage ratio scores were more predictive for overall survival (OS;
= 0.0014 and 0.0012, respectively). While
programmed death-ligand 1 (PD-L1) gene expression is comparable with protein levels detected by IHC, we did not observe a clinical benefit for patients with this marker. We demonstrate that a combination of markers based on WGTA provides the best stratification of patients (
= 0.00071, OS), and also present a case study of possible acquired resistance to pembrolizumab in a patient with non-small cell lung cancer.
Interpreting the tumor-immune interface to predict ICI efficacy remains challenging. WGTA allows for identification of multiple biomarkers simultaneously that in combination may help to identify responders, particularly in the context of a heterogeneous population of advanced and previously treated cancers, thus precluding tumor type-specific testing.
Cervical cancer is the most common cancer affecting sub-Saharan African women and is prevalent among HIV-positive (HIV
) individuals. No comprehensive profiling of cancer genomes, transcriptomes or ...epigenomes has been performed in this population thus far. We characterized 118 tumors from Ugandan patients, of whom 72 were HIV
, and performed extended mutation analysis on an additional 89 tumors. We detected human papillomavirus (HPV)-clade-specific differences in tumor DNA methylation, promoter- and enhancer-associated histone marks, gene expression and pathway dysregulation. Changes in histone modification at HPV integration events were correlated with upregulation of nearby genes and endogenous retroviruses.
Full text
Available for:
FZAB, GEOZS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Somatic hypermutation (SHM) in the variable region of immunoglobulin genes (IGV) naturally occurs in a narrow window of B cell development to provide high-affinity antibodies. However, SHM can also ...aberrantly target proto-oncogenes and cause genome instability. The role of aberrant SHM (aSHM) has been widely studied in various non-Hodgkin's lymphoma particularly in diffuse large B-cell lymphoma (DLBCL). Although, it has been speculated that aSHM targets a wide range of genome loci so far only twelve genes have been identified as targets of aSHM through the targeted sequencing of selected genes. A genome-wide study aiming at identifying a comprehensive set of aSHM targets recurrently occurring in DLBCL has not been previously undertaken. Here, we present a comprehensive assessment of the somatic hypermutated genes in DLBCL identified through an analysis of genomic and transcriptome data derived from 40 DLBCL patients. Our analysis verifies that there are indeed many genes that are recurrently affected by aSHM. In particular, we have identified 32 novel targets that show same or higher level of aSHM activity than genes previously reported. Amongst these novel targets, 22 genes showed a significant correlation between mRNA abundance and aSHM.
High-throughput total nucleic acid (TNA) purification methods based on solid-phase reversible immobilization (SPRI) beads produce TNA suitable for both genomic and transcriptomic applications. Even ...so, small RNA species, including miRNA, bind weakly to SPRI beads under standard TNA purification conditions, necessitating a separate workflow using column-based methods that are difficult to automate. Here, an SPRI-based high-throughput TNA purification protocol that recovers DNA, RNA and small RNA, called GSC-modified RLT+ Aline bead-based protocol (GRAB-ALL), which incorporates modifications to enhance small RNA recovery is presented. GRAB-ALL was benchmarked against existing nucleic acid purification workflows and GRAB-ALL efficiently purifies TNA, including small RNA, for next-generation sequencing applications in a plate-based format suitable for automated high-throughput sample preparation.
Recently, the landscape of single base mutations in diffuse large B-cell lymphoma (DLBCL) was described. Here we report the discovery of a gene fusion between TBL1XR1 and TP63, the only recurrent ...somatic novel gene fusion identified in our analysis of transcriptome data from 96 DLBCL cases. Based on this cohort and a further 157 DLBCL cases analyzed by FISH, the incidence in de novo germinal center B cell–like (GCB) DLBCL is 5% (6 of 115). The fusion appears exclusive to GCB and was not seen in 138 non-GCB cases examined (P = .008, Fisher exact test) but was present at low incidence in follicular lymphoma (1 of 81). In all 7 cases identified, the 3′ end of the fusion consists of exons 4 and onwards of TP63. The recurrence, subtype enrichment, and the remarkably conserved nature of the TP63 portion of the fusion suggest an important functional role in the lymphomas that harbor this event.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
Population sequencing often requires collaboration across a distributed network of sequencing centers for the timely processing of thousands of samples. In such massive efforts, it is important that ...participating scientists can be confident that the accuracy of the sequence data produced is not affected by which center generates the data. A study was conducted across three established sequencing centers, located in Montreal, Toronto, and Vancouver, constituting Canada's Genomics Enterprise (www.cgen.ca). Whole genome sequencing was performed at each center, on three genomic DNA replicates from three well-characterized cell lines. Secondary analysis pipelines employed by each site were applied to sequence data from each of the sites, resulting in three datasets for each of four variables (cell line, replicate, sequencing center, and analysis pipeline), for a total of 81 datasets. These datasets were each assessed according to multiple quality metrics including concordance with benchmark variant truth sets to assess consistent quality across all three conditions for each variable. Three-way concordance analysis of variants across conditions for each variable was performed. Our results showed that the variant concordance between datasets differing only by sequencing center was similar to the concordance for datasets differing only by replicate, using the same analysis pipeline. We also showed that the statistically significant differences between datasets result from the analysis pipeline used, which can be unified and updated as new approaches become available. We conclude that genome sequencing projects can rely on the quality and reproducibility of aggregate data generated across a network of distributed sites.