Structural chromosomal rearrangements that can lead to in-frame gene-fusions are a leading source of information for diagnosis, risk stratification, and prognosis in pediatric acute lymphoblastic ...leukemia (ALL). Traditional methods such as karyotyping and FISH struggle to accurately identify and phase such large-scale chromosomal aberrations in ALL genomes. We therefore evaluated linked-read WGS for detecting chromosomal rearrangements in primary samples of from 12 patients diagnosed with ALL. We assessed the effect of input DNA quality on phased haplotype block size and the detectability of copy number aberrations and structural variants in the ALL genomes. We found that biobanked DNA isolated by standard column-based extraction methods was sufficient to detect chromosomal rearrangements even at low 10x sequencing coverage. Linked-read WGS enabled precise, allele-specific, digital karyotyping at a base-pair resolution for a wide range of structural variants including complex rearrangements and aneuploidy assessment. With use of haplotype information from the linked-reads, we also identified previously unknown structural variants, such as a compound heterozygous deletion of ERG in a patient with the DUX4-IGH fusion gene. We conclude that linked-read WGS allows detection of important pathogenic variants in ALL genomes at a resolution beyond that of traditional karyotyping and FISH.
Acute lymphoblastic leukemia is marked by aberrant transcriptional features that alter cell differentiation, self-renewal, and proliferative features. We sought to identify the transcription factors ...exhibiting altered and subtype-specific expression patterns in B-ALL and report here that SOX11, a developmental and neuronal transcription factor, is aberrantly expressed in the ETV6-RUNX1 and TCF3-PBX1 subtypes of acute B-cell leukemias. We show that a high expression of SOX11 leads to alterations of gene expression that are typically associated with cell adhesion, migration, and differentiation. A high expression is associated with DNA hypomethylation at the SOX11 locus and a favorable outcome. The results indicate that SOX11 expression marks a group of patients with good outcomes and thereby prompts further study of its use as a biomarker.
DNA methylation is the most studied epigenetic mark involved in regulation of gene expression. For low input samples, a limited number of methods for quantifying DNA methylation genome-wide has been ...evaluated. Here, we compared a series of input DNA amounts (1-10ng) from two methylome library preparation protocols, enzymatic methyl-seq (EM-seq) and post-bisulfite adaptor tagging (PBAT) adapted from single-cell PBAT. EM-seq takes advantage of enzymatic activity while PBAT relies on conventional bisulfite conversion for detection of DNA methylation. We found that both methods accurately quantified DNA methylation genome-wide. They produced expected distribution patterns around genomic features, high C-T transition efficiency at non-CpG sites and high correlation between input amounts. However, EM-seq performed better in regard to library and sequencing quality, i.e. EM-seq produced larger insert sizes, higher alignment rates and higher library complexity with lower duplication rate compared to PBAT. Moreover, EM-seq demonstrated higher CpG coverage, better CpG site overlap and higher consistency between input series. In summary, our data suggests that EM-seq overall performed better than PBAT in whole-genome methylation quantification of low input samples.
B‐cell precursor acute lymphoblastic leukemia (BCP‐ALL) can be classified into subtypes according to the genetic aberrations they display. For instance, the translocation t(12;21)(p13;q22), ...representing the ETV6‐RUNX1 fusion gene (ER), is present in a quarter of BCP‐ALL cases. However, around 10% of the cases lack classifying chromosomal abnormalities (B‐other). In pediatric ER BCP‐ALL, rearrangement mediated by RAG (recombination‐activating genes) has been proposed as the predominant driver of oncogenic rearrangement. Herein we analyzed almost 1600 pediatric BCP‐ALL samples to determine which subtypes express RAG. We demonstrate that RAG1 mRNA levels are especially high in the ETV6‐RUNX1 (ER) subtype and in a subset of B‐other samples. We also define 31 genes that are co‐expressed with RAG1 (RAG1‐signature) in the ER subtype, a signature that also identifies this subset of B‐other samples. Moreover, this subset also shares leukemia and pro‐B gene expression signatures as well as high levels of the ETV6 target genes (BIRC7, WBP1L, CLIC5, ANGPTL2) with the ER subtype, indicating that these B‐other cases are the recently identified ER‐like subtype. We validated our results in a cohort where ER‐like has been defined, which confirmed expression of the RAG1‐signature in this recently described subtype. Taken together, our results demonstrate that the RAG1‐signature identifies the ER‐like subtype. As there are no definitive genetic markers to identify this novel subtype, the RAG1‐signature represents a means to screen for this leukemia in children.
The recombination‐activating genes (RAG) may play a role in leukemogenesis. We determined, therefore, in around 1600 BCP‐ALL samples which subtypes express RAG and defined a set of co‐expressed genes, the RAG1‐signature, which was thereafter used to identify potential new subtypes. We demonstrate that the RAG1‐signature identifies the recently described ETV6‐RUNX1‐like BCP‐ALL.
Structural chromosomal rearrangements that lead to expressed fusion genes are a hallmark of acute lymphoblastic leukemia (ALL). In this study, we performed transcriptome sequencing of 134 primary ALL ...patient samples to comprehensively detect fusion transcripts.
We combined fusion gene detection with genome-wide DNA methylation analysis, gene expression profiling, and targeted sequencing to determine molecular signatures of emerging ALL subtypes.
We identified 64 unique fusion events distributed among 80 individual patients, of which over 50% have not previously been reported in ALL. Although the majority of the fusion genes were found only in a single patient, we identified several recurrent fusion gene families defined by promiscuous fusion gene partners, such as ETV6, RUNX1, PAX5, and ZNF384, or recurrent fusion genes, such as DUX4-IGH. Our data show that patients harboring these fusion genes displayed characteristic genome-wide DNA methylation and gene expression signatures in addition to distinct patterns in single nucleotide variants and recurrent copy number alterations.
Our study delineates the fusion gene landscape in pediatric ALL, including both known and novel fusion genes, and highlights fusion gene families with shared molecular etiologies, which may provide additional information for prognosis and therapeutic options in the future.
The aim of this data paper is to describe a collection of 33 genomic, transcriptomic and epigenomic sequencing datasets of the B-cell acute lymphoblastic leukemia (ALL) cell line REH. REH is one of ...the most frequently used cell lines for functional studies of pediatric ALL, and these data provide a multi-faceted characterization of its molecular features. The datasets described herein, generated with short- and long-read sequencing technologies, can both provide insights into the complex aberrant karyotype of REH, and be used as reference datasets for sequencing data quality assessment or for methods development.
Abstract Multiply-primed rolling circle amplification (RCA) is a suitable technique for amplification of circular templates and has been used to identify novel human papillomaviruses (HPV). In this ...study we develop an efficient RCA for whole genome amplification of HPV using HPV-specific primers in clinical samples and establish a protocol for whole genome sequencing using the Sanger method. Amplification of cloned HPV-genomes by RCA was compared using specific primers against random hexamers. Using HPV-specific primers increased the effectiveness on average 15.2 times and the enrichment of HPV relative to human gDNA on average 62.2 times, as compared to using random hexamer. RCA products were sequenced without need for cloning, even when using low-input amounts. The technique was successfully used on 4 patient samples from FTA cards, to generate whole HPV-genome sequences. Degenerated HPV-specific primers for RCA produce DNA of sufficient quality and quantity suitable for sequencing and other potential downstream analyses.
Abstract
The mechanisms driving clonal heterogeneity and evolution in relapsed pediatric acute lymphoblastic leukemia (ALL) are not fully understood. We performed whole genome sequencing of samples ...collected at diagnosis, relapse(s) and remission from 29 Nordic patients. Somatic point mutations and large-scale structural variants were called using individually matched remission samples as controls, and allelic expression of the mutations was assessed in ALL cells using RNA-sequencing. We observed an increased burden of somatic mutations at relapse, compared to diagnosis, and at second relapse compared to first relapse. In addition to 29 known ALL driver genes, of which nine genes carried recurrent protein-coding mutations in our sample set, we identified putative non-protein coding mutations in regulatory regions of seven additional genes that have not previously been described in ALL. Cluster analysis of hundreds of somatic mutations per sample revealed three distinct evolutionary trajectories during ALL progression from diagnosis to relapse. The evolutionary trajectories provide insight into the mutational mechanisms leading relapse in ALL and could offer biomarkers for improved risk prediction in individual patients.