Numerous large-scale genomic studies of matched tumor-normal samples have established the somatic landscapes of most cancer types. However, the downstream analysis of data from somatic mutations ...entails a number of computational and statistical approaches, requiring usage of independent software and numerous tools. Here, we describe an R Bioconductor package, Maftools, which offers a multitude of analysis and visualization modules that are commonly used in cancer genomic studies, including driver gene identification, pathway, signature, enrichment, and association analyses. Maftools only requires somatic variants in Mutation Annotation Format (MAF) and is independent of larger alignment files. With the implementation of well-established statistical and computational methods, Maftools facilitates data-driven research and comparative analysis to discover novel results from publicly available data sets. In the present study, using three of the well-annotated cohorts from The Cancer Genome Atlas (TCGA), we describe the application of Maftools to reproduce known results. More importantly, we show that Maftools can also be used to uncover novel findings through integrative analysis.
DNA methylation is a widely investigated epigenetic mark with important roles in development and disease. High-throughput assays enable genome-scale DNA methylation analysis in large numbers of ...samples. Here, we describe a new version of our RnBeads software - an R/Bioconductor package that implements start-to-finish analysis workflows for Infinium microarrays and various types of bisulfite sequencing. RnBeads 2.0 ( https://rnbeads.org/ ) provides additional data types and analysis methods, new functionality for interpreting DNA methylation differences, improved usability with a novel graphical user interface, and better use of computational resources. We demonstrate RnBeads 2.0 in four re-runnable use cases focusing on cell differentiation and cancer.
Computational analysis and interactive visualization of biological networks and protein structures are common tasks for gaining insight into biological processes. This protocol describes three ...workflows based on the NetworkAnalyzer and RINalyzer plug-ins for Cytoscape, a popular software platform for networks. NetworkAnalyzer has become a standard Cytoscape tool for comprehensive network topology analysis. In addition, RINalyzer provides methods for exploring residue interaction networks derived from protein structures. The first workflow uses NetworkAnalyzer to perform a topological analysis of biological networks. The second workflow applies RINalyzer to study protein structure and function and to compute network centrality measures. The third workflow combines NetworkAnalyzer and RINalyzer to compare residue networks. The full protocol can be completed in ∼2 h.
Rapidly increasing amounts of molecular interaction data are being produced by various experimental techniques and computational prediction methods. In order to gain insight into the organization and ...structure of the resultant large complex networks formed by the interacting molecules, we have developed the versatile Cytoscape plugin NetworkAnalyzer. It computes and displays a comprehensive set of topological parameters, which includes the number of nodes, edges, and connected components, the network diameter, radius, density, centralization, heterogeneity, and clustering coefficient, the characteristic path length, and the distributions of node degrees, neighborhood connectivities, average clustering coefficients, and shortest path lengths. NetworkAnalyzer can be applied to both directed and undirected networks and also contains extra functionality to construct the intersection or union of two networks. It is an interactive and highly customizable application that requires no expert knowledge in graph theory from the user. Availability: NetworkAnalyzer can be downloaded via the Cytoscape web site: http://www.cytoscape.org Contact: mario.albrecht@mpi-inf.mpg.de Supplementary information: Supplementary data are available at Bioinformatics online.
•Presentation of all the necessary steps of downstream analysis for bisulfite sequencing experiments starting from read alignment and quality check.•Comparison of differential methylation ...methods.•Comparison of methylome segmentation methods.•Suggestions for dealing with large data sets using on-disk data structures.•Review of guided user interfaces for methylation analysis.
DNA methylation is one of the main epigenetic modifications in the eukaryotic genome; it has been shown to play a role in cell-type specific regulation of gene expression, and therefore cell-type identity. Bisulfite sequencing is the gold-standard for measuring methylation over the genomes of interest. Here, we review several techniques used for the analysis of high-throughput bisulfite sequencing. We introduce specialized short-read alignment techniques as well as pre/post-alignment quality check methods to ensure data quality. Furthermore, we discuss subsequent analysis steps after alignment. We introduce various differential methylation methods and compare their performance using simulated and real bisulfite sequencing datasets. We also discuss the methods used to segment methylomes in order to pinpoint regulatory regions. We introduce annotation methods that can be used for further classification of regions returned by segmentation and differential methylation methods. Finally, we review software packages that implement strategies to efficiently deal with large bisulfite sequencing datasets locally and we discuss online analysis workflows that do not require any prior programming skills. The analysis strategies described in this review will guide researchers at any level to the best practices of bisulfite sequencing analysis.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UL, UM, UPCLJ, UPUK, ZRSKP
Charting differences between tumors and normal tissue is a mainstay of cancer research. However, clonal tumor expansion from complex normal tissue architectures potentially obscures cancer-specific ...events, including divergent epigenetic patterns. Using whole-genome bisulfite sequencing of normal B cell subsets, we observed broad epigenetic programming of selective transcription factor binding sites coincident with the degree of B cell maturation. By comparing normal B cells to malignant B cells from 268 patients with chronic lymphocytic leukemia (CLL), we showed that tumors derive largely from a continuum of maturation states reflected in normal developmental stages. Epigenetic maturation in CLL was associated with an indolent gene expression pattern and increasingly favorable clinical outcomes. We further uncovered that most previously reported tumor-specific methylation events are normally present in non-malignant B cells. Instead, we identified a potential pathogenic role for transcription factor dysregulation in CLL, where excess programming by EGR and NFAT with reduced EBF and AP-1 programming imbalances the normal B cell epigenetic program.
Full text
Available for:
IJS, NUK, SBMB, UL, UM, UPUK
Several mechanisms of action have been proposed for DNA methyltransferase and histone deacetylase inhibitors (DNMTi and HDACi), primarily based on candidate-gene approaches. However, less is known ...about their genome-wide transcriptional and epigenomic consequences. By mapping global transcription start site (TSS) and chromatin dynamics, we observed the cryptic transcription of thousands of treatment-induced non-annotated TSSs (TINATs) following DNMTi and HDACi treatment. The resulting transcripts frequently splice into protein-coding exons and encode truncated or chimeric ORFs translated into products with predicted abnormal or immunogenic functions. TINAT transcription after DNMTi treatment coincided with DNA hypomethylation and gain of classical promoter histone marks, while HDACi specifically induced a subset of TINATs in association with H2AK9ac, H3K14ac, and H3K23ac. Despite this mechanistic difference, both inhibitors convergently induced transcription from identical sites, as we found TINATs to be encoded in solitary long terminal repeats of the ERV9/LTR12 family, which are epigenetically repressed in virtually all normal cells.
Full text
Available for:
IJS, NUK, SBMB, UL, UM, UPUK
DNA methylation patterns are altered in numerous diseases and often correlate with clinically relevant information such as disease subtypes, prognosis and drug response. With suitable assays and ...after validation in large cohorts, such associations can be exploited for clinical diagnostics and personalized treatment decisions. Here we describe the results of a community-wide benchmarking study comparing the performance of all widely used methods for DNA methylation analysis that are compatible with routine clinical use. We shipped 32 reference samples to 18 laboratories in seven different countries. Researchers in those laboratories collectively contributed 21 locus-specific assays for an average of 27 predefined genomic regions, as well as six global assays. We evaluated assay sensitivity on low-input samples and assessed the assays' ability to discriminate between cell types. Good agreement was observed across all tested methods, with amplicon bisulfite sequencing and bisulfite pyrosequencing showing the best all-round performance. Our technology comparison can inform the selection, optimization and use of DNA methylation assays in large-scale validation studies, biomarker development and clinical diagnostics.
Full text
Available for:
IJS, NUK, SBMB, UL, UM, UPUK
Intrahepatic cholangiocarcinoma (iCCA) is the second most common primary liver cancer. It is defined by cholangiocytic differentiation and has poor prognosis. Recently, epigenetic processes have been ...shown to play an important role in cholangiocarcinogenesis. We performed an integrative analysis on 52 iCCAs using both genetic and epigenetic data with a specific focus on DNA methylation components. We found recurrent isocitrate dehydrogenase 1 (IDH1) and IDH2 (28%) gene mutations, recurrent arm‐length copy number alterations (CNAs), and focal alterations such as deletion of 3p21 or amplification of 12q15, which affect BRCA1 Associated Protein 1, polybromo 1, and mouse double minute 2 homolog. DNA methylome analysis revealed excessive hypermethylation of iCCA, affecting primarily the bivalent genomic regions marked with both active and repressive histone modifications. Integrative clustering of genetic and epigenetic data identified four iCCA subgroups with prognostic relevance further designated as IDH, high (H), medium (M), and low (L) alteration groups. The IDH group consisted of all samples with IDH1 or IDH2 mutations and showed, together with the H group, a highly disrupted genome, characterized by frequent deletions of chromosome arms 3p and 6q. Both groups showed excessive hypermethylation with distinct patterns. The M group showed intermediate characteristics regarding both genetic and epigenetic marks, whereas the L group exhibited few methylation changes and mutations and a lack of CNAs. Methylation‐based latent component analysis of cell‐type composition identified differences among these four groups. Prognosis of the H and M groups was significantly worse than that of the L group. Conclusion: Using an integrative genomic and epigenomic analysis approach, we identified four major iCCA subgroups with widespread genomic and epigenomic differences and prognostic implications. Furthermore, our data suggest differences in the cell‐of‐origin of the iCCA subtypes.
Full text
Available for:
BFBNIB, FZAB, GIS, IJS, KILJ, NLZOH, NUK, OILJ, SBCE, SBMB, UL, UM, UPUK
Neuroblastoma is a malignancy of the developing sympathetic nervous system that is often lethal when relapse occurs. We here used whole-exome sequencing, mRNA expression profiling, array CGH and DNA ...methylation analysis to characterize 16 paired samples at diagnosis and relapse from individuals with neuroblastoma. The mutational burden significantly increased in relapsing tumors, accompanied by altered mutational signatures and reduced subclonal heterogeneity. Global allele frequencies at relapse indicated clonal mutation selection during disease progression. Promoter methylation patterns were consistent over disease course and were patient specific. Recurrent alterations at relapse included mutations in the putative CHD5 neuroblastoma tumor suppressor, chromosome 9p losses, DOCK8 mutations, inactivating mutations in PTPN14 and a relapse-specific activity pattern for the PTPN14 target YAP. Recurrent new mutations in HRAS, KRAS and genes mediating cell-cell interaction in 13 of 16 relapse tumors indicate disturbances in signaling pathways mediating mesenchymal transition. Our data shed light on genetic alteration frequency, identity and evolution in neuroblastoma.
Full text
Available for:
IJS, NUK, SBMB, UL, UM, UPUK