Extra-cranial malignant rhabdoid tumors (MRTs) and cranial atypical teratoid RTs (ATRTs) are heterogeneous pediatric cancers driven primarily by SMARCB1 loss. To understand the genome-wide molecular ...relationships between MRTs and ATRTs, we analyze multi-omics data from 140 MRTs and 161 ATRTs. We detect similarities between the MYC subgroup of ATRTs (ATRT-MYC) and extra-cranial MRTs, including global DNA hypomethylation and overexpression of HOX genes and genes involved in mesenchymal development, distinguishing them from other ATRT subgroups that express neural-like features. We identify five DNA methylation subgroups associated with anatomical sites and SMARCB1 mutation patterns. Groups 1, 3, and 4 exhibit cytotoxic T cell infiltration and expression of immune checkpoint regulators, consistent with a potential role for immunotherapy in rhabdoid tumor patients.
Display omitted
•MYC subgroup of cranial RTs (ATRT-MYC) is molecularly similar to extra-cranial RTs•Five DNA methylation subgroups are identified in RTs across multiple organ sites•Groups 1, 3, and 4 exhibit cytotoxic T cell infiltration and PD1 and PD-L1 expression
Chun et al. report similarities between the MYC subgroup of cranial and extra-cranial rhabdoid tumors (RTs) at genetic, gene-expression, and epigenetic levels. They identify five DNA methylation subgroups of RTs across multiple organ sites, and some subgroups exhibit increased levels of immune cell infiltration and immune checkpoint expression.
Malignant rhabdoid tumors (MRTs) are rare lethal tumors of childhood that most commonly occur in the kidney and brain. MRTs are driven by SMARCB1 loss, but the molecular consequences of SMARCB1 loss ...in extra-cranial tumors have not been comprehensively described and genomic resources for analyses of extra-cranial MRT are limited. To provide such data, we used whole-genome sequencing, whole-genome bisulfite sequencing, whole transcriptome (RNA-seq) and microRNA sequencing (miRNA-seq), and histone modification profiling to characterize extra-cranial MRTs. Our analyses revealed gene expression and methylation subgroups and focused on dysregulated pathways, including those involved in neural crest development.
•Extra-cranial malignant rhabdoid tumors (MRTs) exhibit molecular heterogeneity•Evidence is presented for epigenetic reprogramming of HOX genes•MRTs exhibit dysregulated expression of genes involved in neural crest development•Dysregulation of oncogenes and tumor suppressor genes is reported
Chun et al. perform integrated molecular analyses of extra-cranial malignant rhabdoid tumors (MRTs) and show that, although SMARCB1 loss drives nearly all MRTs, there are two subgroups of MRTs that are associated with patient age and differentially expressed genes.
Structural variants (SVs) may be an underestimated cause of hereditary cancer syndromes given the current limitations of short-read next-generation sequencing. Here we investigated the utility of ...long-read sequencing in resolving germline SVs in cancer susceptibility genes detected through short-read genome sequencing.
Known or suspected deleterious germline SVs were identified using Illumina genome sequencing across a cohort of 669 advanced cancer patients with paired tumor genome and transcriptome sequencing. Candidate SVs were subsequently assessed by Oxford Nanopore long-read sequencing.
Nanopore sequencing confirmed eight simple pathogenic or likely pathogenic SVs, resolving three additional variants whose impact could not be fully elucidated through short-read sequencing. A recurrent sequencing artifact on chromosome 16p13 and one complex rearrangement on chromosome 5q35 were subsequently classified as likely benign, obviating the need for further clinical assessment. Variant configuration was further resolved in one case with a complex pathogenic rearrangement affecting TSC2.
Our findings demonstrate that long-read sequencing can improve the validation, resolution, and classification of germline SVs. This has important implications for return of results, cascade carrier testing, cancer screening, and prophylactic interventions.
The role for routine whole genome and transcriptome analysis (WGTA) for poor prognosis pediatric cancers remains undetermined. Here, we characterize somatic mutations, structural rearrangements, copy ...number variants, gene expression, immuno-profiles and germline cancer predisposition variants in children and adolescents with relapsed, refractory or poor prognosis malignancies who underwent somatic WGTA and matched germline sequencing. Seventy-nine participants with a median age at enrollment of 8.8 y (range 6 months to 21.2 y) are included. Germline pathogenic/likely pathogenic variants are identified in 12% of participants, of which 60% were not known prior. Therapeutically actionable variants are identified by targeted gene report and whole genome in 32% and 62% of participants, respectively, and increase to 96% after integrating transcriptome analyses. Thirty-two molecularly informed therapies are pursued in 28 participants with 54% achieving a clinical benefit rate; objective response or stable disease ≥6 months. Integrated WGTA identifies therapeutically actionable variants in almost all tumors and are directly translatable to clinical care of children with poor prognosis cancers.
The grizzly bear (
ssp.
) represents the largest population of brown bears in North America. Its genome was sequenced using a microfluidic partitioning library construction technique, and these data ...were supplemented with sequencing from a nanopore-based long read platform. The final assembly was 2.33 Gb with a scaffold N50 of 36.7 Mb, and the genome is of comparable size to that of its close relative the polar bear (2.30 Gb). An analysis using 4104 highly conserved mammalian genes indicated that 96.1% were found to be complete within the assembly. An automated annotation of the genome identified 19,848 protein coding genes. Our study shows that the combination of the two sequencing modalities that we used is sufficient for the construction of highly contiguous reference quality mammalian genomes. The assembled genome sequence and the supporting raw sequence reads are available from the NCBI (National Center for Biotechnology Information) under the bioproject identifier PRJNA493656, and the assembly described in this paper is version QXTK01000000.
Population sequencing often requires collaboration across a distributed network of sequencing centers for the timely processing of thousands of samples. In such massive efforts, it is important that ...participating scientists can be confident that the accuracy of the sequence data produced is not affected by which center generates the data. A study was conducted across three established sequencing centers, located in Montreal, Toronto, and Vancouver, constituting Canada's Genomics Enterprise (www.cgen.ca). Whole genome sequencing was performed at each center, on three genomic DNA replicates from three well-characterized cell lines. Secondary analysis pipelines employed by each site were applied to sequence data from each of the sites, resulting in three datasets for each of four variables (cell line, replicate, sequencing center, and analysis pipeline), for a total of 81 datasets. These datasets were each assessed according to multiple quality metrics including concordance with benchmark variant truth sets to assess consistent quality across all three conditions for each variable. Three-way concordance analysis of variants across conditions for each variable was performed. Our results showed that the variant concordance between datasets differing only by sequencing center was similar to the concordance for datasets differing only by replicate, using the same analysis pipeline. We also showed that the statistically significant differences between datasets result from the analysis pipeline used, which can be unified and updated as new approaches become available. We conclude that genome sequencing projects can rely on the quality and reproducibility of aggregate data generated across a network of distributed sites.
We present a genome assembly of
Caretta caretta (the Loggerhead sea turtle; Chordata, Testudines, Cheloniidae), generated from genomic data from two unrelated females. The genome sequence is 2.13 ...gigabases in size. The majority of the assembly is scaffolded into 28 chromosomal representations with a remaining 2% of the assembly being excluded from these.
RNA sequencing (RNAseq) has been widely used to generate bulk gene expression measurements collected from pools of cells. Only relatively recently have single-cell RNAseq (scRNAseq) methods provided ...opportunities for gene expression analyses at the single-cell level, allowing researchers to study heterogeneous mixtures of cells at unprecedented resolution. Tumors tend to be composed of heterogeneous cellular mixtures and are frequently the subjects of such analyses. Extensive method developments have led to several protocols for scRNAseq but, owing to the small amounts of RNA in single cells, technical constraints have required compromises. For example, the majority of scRNAseq methods are limited to sequencing only the 3′ or 5′ termini of transcripts. Other protocols that facilitate full-length transcript profiling tend to capture only polyadenylated mRNAs and are generally limited to processing only 96 cells at a time. Here, we address these limitations and present a novel protocol that allows for the high-throughput sequencing of full-length, total RNA at single-cell resolution. We demonstrate that our method produced strand-specific sequencing data for both polyadenylated and non-polyadenylated transcripts, enabled the profiling of transcript regions beyond only transcript termini, and yielded data rich enough to allow identification of cell types from heterogeneous biological samples.