Understanding the molecular events controlling melanoma progression is of paramount importance for the development of alternative treatment options for this devastating disease. Here we report a ...mechanism regulated by the oncogenic SOX2-GLI1 transcriptional complex driving melanoma invasion through the induction of the sialyltransferase ST3GAL1. Using in vitro and in vivo studies, we demonstrate that ST3GAL1 drives melanoma metastasis. Silencing of this enzyme suppresses melanoma invasion and significantly reduces the ability of aggressive melanoma cells to enter the bloodstream, colonize distal organs, and seed and survive in the metastatic environment. Analysis of glycosylated proteins reveals that the receptor tyrosine kinase AXL is a major effector of ST3GAL1 pro-invasive function. ST3GAL1 induces AXL dimerization and activation that, in turn, promotes melanoma invasion. Our data support a key role of the ST3GAL1-AXL axis as a driver of melanoma metastasis, and highlight the therapeutic potential of targeting this axis to treat metastatic melanoma.
The mitochondrial genome of higher plants is unusually dynamic, with recombination and nonhomologous end-joining (NHEJ) activities producing variability in size and organization. Plant mitochondrial ...DNA also generally displays much lower nucleotide substitution rates than mammalian or yeast systems. Arabidopsis displays these features and expedites characterization of the mitochondrial recombination surveillance gene MSH1 (MutS 1 homolog), lending itself to detailed study of de novo mitochondrial genome activity. In the present study, we investigated the underlying basis for unusual plant features as they contribute to rapid mitochondrial genome evolution.
We obtained evidence of double-strand break (DSB) repair, including NHEJ, sequence deletions and mitochondrial asymmetric recombination activity in Arabidopsis wild-type and msh1 mutants on the basis of data generated by Illumina deep sequencing and confirmed by DNA gel blot analysis. On a larger scale, with mitochondrial comparisons across 72 Arabidopsis ecotypes, similar evidence of DSB repair activity differentiated ecotypes. Forty-seven repeat pairs were active in DNA exchange in the msh1 mutant. Recombination sites showed asymmetrical DNA exchange within lengths of 50- to 556-bp sharing sequence identity as low as 85%. De novo asymmetrical recombination involved heteroduplex formation, gene conversion and mismatch repair activities. Substoichiometric shifting by asymmetrical exchange created the appearance of rapid sequence gain and loss in association with particular repeat classes.
Extensive mitochondrial genomic variation within a single plant species derives largely from DSB activity and its repair. Observed gene conversion and mismatch repair activity contribute to the low nucleotide substitution rates seen in these genomes. On a phenotypic level, these patterns of rearrangement likely contribute to the reproductive versatility of higher plants.
Although the costs of next-generation sequencing technology have decreased over the past years, there is still a lack of simple-to-use applications for comprehensive analysis of RNA sequencing ...data. There is no one-stop shop for transcriptomic genomics. We have developed MAP-RSeq, a comprehensive computational workflow that can be used to obtain genomic features from transcriptomic sequencing data for any genome.
For optimization of tools and parameters, MAP-RSeq was validated using both simulated and real datasets. The MAP-RSeq workflow consists of six major modules: alignment of reads, quality assessment of reads, gene expression assessment and exon read counting, identification of expressed single nucleotide variants (SNVs), detection of fusion transcripts, and summarization of transcriptomics data with a final report. This workflow is available for human transcriptome analysis and can be easily adapted and used for other genomes. Several clinical and research projects at the Mayo Clinic have applied the MAP-RSeq workflow for RNA-Seq studies. The results from MAP-RSeq have thus far enabled clinicians and researchers to understand the transcriptomic landscape of diseases for better diagnosis and treatment of patients.
Our software provides gene counts, exon counts, fusion candidates, expressed single nucleotide variants, mapping statistics, visualizations, and a detailed research data report for RNA-Seq. The workflow can be executed on a standalone virtual machine or on a parallel Sun Grid Engine cluster. The software can be downloaded from http://bioinformaticstools.mayo.edu/research/maprseq/.
Uterine inflammatory myofibroblastic tumors (IMTs) have been reported in association with pregnancy and, in some instances, secondarily involve the placenta. The clinicopathological spectrum of these ...tumors in the setting of pregnancy is not well defined. We investigated the clinical, morphologic, immunohistochemical, molecular cytogenetic, and genetic features of 6 uterine IMTs occurring in pregnant women. Each tumor was discovered at parturition, and none was identified by prenatal ultrasound. Patient age ranged from 25 to 41 years (mean 31.5). Tumor size ranged from 1.5 to 9 cm (mean 4.7). Four of 6 had usual IMT features, with at least focal deciduoid change in 3. Necrosis was identified in 3 tumors; and multinucleated cells, in 3 tumors. Sex hormone receptor expression was consistent with estrogen receptor negative or focally weakly positive and progesterone receptor diffusely moderately or moderately to strongly positive in all 6 tumors. ALK immunohistochemistry was strongly positive in 5 tumors, and all of these had an ALK rearrangement detected by break-apart fluorescence in situ hybridization. Subsequent RNA sequencing of these 5 tumors identified a TIMP3-ALK fusion in 4 and a THBS1-ALK in 1. In the ALK-negative tumor, RNA sequencing detected a novel TIMP3-RET fusion that was confirmed by RET break-apart fluorescence in situ hybridization. Follow-up was available for 2 of 6 patients 5 and 19 months after diagnosis. Neither patient developed recurrence. ALK immunohistochemistry will distinguish most uterine IMTs, but if ALK expression and gene studies are negative, in the appropriate morphologic context, evaluation of other tyrosine kinase genes known to be more commonly altered in extrauterine IMTs such as ROS1, NTRK3, PDGFRβ, and RET may be necessary for diagnostic confirmation.
• Uterine inflammatory myofibroblastic tumors (IMTs) can secondarily involve the placenta
• Most uterine IMTs have an ALK rearrangement and concordantly positive immunohistochemistry
• Absent expression of ALK by immunohistochemistry and/or lack of ALK gene rearrangement do not exclude a diagnosis of IMT
• In this series, uterine IMTs associated with pregnancy frequently partner with TIMP3 in their gene rearrangement
Archived formalin-fixed paraffin-embedded (FFPE) samples are valuable clinical resources for examining clinically relevant morphological features and for studying genetic changes. However, DNA quality and ...quantity of FFPE samples are often sub-optimal, and the resulting NGS-based genetic variant detection is prone to false positives. Evaluations of wet-lab and bioinformatics approaches are needed to optimize variant detection from FFPE samples.
As a pilot study, we designed within-subject triplicate samples of DNA derived from paired FFPE and fresh frozen breast tissues to highlight FFPE-specific artifacts. For FFPE samples, we tested two FFPE DNA extraction methods to determine the impact of wet-lab procedures on variant calling: QIAGEN QIAamp DNA Mini Kit ("QA") and QIAGEN GeneRead DNA FFPE Kit ("QGR"). We also used negative control (NA12891) and positive control (Horizon Discovery Reference Standard FFPE) samples. All DNA sample libraries were prepared for NGS according to the QIAseq Human Breast Cancer Targeted DNA Panel protocol and sequenced on the HiSeq 4000. Variant calling and filtering were performed using QIAGEN Gene Globe Data Portal. Detailed variant concordance comparisons and mutational signature analysis were performed to investigate the effects of FFPE samples compared to paired fresh frozen samples, along with different DNA extraction methods. In this study, we found that at least five times as many variants were called with FFPE samples as with their paired fresh frozen tissue samples, even after applying molecular barcoding error-correction and the default bioinformatics filtering recommended by the vendor. We also found that QGR, as an optimized FFPE-DNA extraction approach, leads to far fewer discordant variants between paired fresh frozen and FFPE samples. Approximately 92% of the uniquely called FFPE variants were in the low allelic frequency range (< 5%), and collectively shared a "C > T|G > A" mutational signature known to be representative of FFPE artifacts resulting from cytosine deamination. Based on control samples and FFPE-frozen replicates, we derived an effective filtering strategy with associated empirical false-discovery estimates.
Through this study, we demonstrated the feasibility of calling and filtering genetic variants from FFPE tissue samples using a combined strategy of molecular barcodes, optimized DNA extraction, and bioinformatics methods incorporating genomic context such as mutational signature and variant allelic frequency.
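The two properties reported above for FFPE-unique calls (a "C > T|G > A" substitution and an allelic frequency below 5%) can be sketched as a simple flagging rule. This is only an illustration and not the filtering strategy derived in the study: the Variant record, the function names, and the use of 5% as a hard cutoff are assumptions made for the example.

```python
from dataclasses import dataclass

# Assumed cutoff, borrowed from the "< 5%" low allelic frequency range in the text.
LOW_VAF_CUTOFF = 0.05

# Substitutions matching the "C > T|G > A" cytosine-deamination pattern.
DEAMINATION_CHANGES = {("C", "T"), ("G", "A")}


@dataclass(frozen=True)
class Variant:
    """Hypothetical minimal variant record used only for this sketch."""
    chrom: str
    pos: int
    ref: str
    alt: str
    vaf: float  # variant allelic frequency as a fraction in [0, 1]


def is_likely_ffpe_artifact(v: Variant, cutoff: float = LOW_VAF_CUTOFF) -> bool:
    """Flag a call that is both a deamination-type change and low frequency."""
    change = (v.ref.upper(), v.alt.upper())
    return change in DEAMINATION_CHANGES and v.vaf < cutoff


def split_calls(variants):
    """Return (kept, flagged) lists, preserving the input order."""
    kept = [v for v in variants if not is_likely_ffpe_artifact(v)]
    flagged = [v for v in variants if is_likely_ffpe_artifact(v)]
    return kept, flagged
```

A rule like this flags candidates for review rather than proving that a call is an artifact; a true low-frequency C > T variant would be flagged as well, which is why the study pairs filtering with empirical false-discovery estimates.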
Cholangiocarcinoma (CCA) is a lethal hepatobiliary neoplasm originating from the biliary apparatus. In humans, CCA risk factors include hepatobiliary inflammation and fibrosis. The recently ...identified interleukin (IL)−1 family member, IL‐33, has been shown to be a biliary mitogen which also promotes liver inflammation and fibrosis. Our aim was to generate a mouse model of CCA mimicking the human disease. Ectopic oncogene expression in the biliary tract was accomplished by the Sleeping Beauty transposon transfection system with transduction of constitutively active AKT (myr‐AKT) and Yes‐associated protein. Intrabiliary instillation of the transposon–transposase complex was coupled with lobar bile duct ligation in C57BL/6 mice, followed by administration of IL‐33 for 3 consecutive days. Tumors developed in 72% of the male mice receiving both oncogenes plus IL‐33 by 10 weeks but in only 20% of the male mice transduced with the oncogenes alone. Tumors expressed SOX9 and pancytokeratin (features of CCA) but were negative for HepPar1 (a marker of hepatocellular carcinoma). Substantive overlap with human CCA specimens was revealed by RNA profiling. Not only did IL‐33 induce IL‐6 expression by human cholangiocytes but it likely facilitated tumor development in vivo by an IL‐6–sensitive process as tumor development was significantly attenuated in Il‐6–/– male animals. Furthermore, tumor formation occurred at a similar rate when IL‐6 was substituted for IL‐33 in this model. Conclusion: The transposase‐mediated transduction of constitutively active AKT and Yes‐associated protein in the biliary epithelium coupled with lobar obstruction and IL‐33 administration results in the development of CCA with morphological and biochemical features of the human disease; this model highlights the role of inflammatory cytokines in CCA oncogenesis. (Hepatology 2015;61:1627–1642)
The accurate detection of point mutations from pathology slides using sequencing data is of great importance in cancer genomics and precision oncology. Formalin-fixation paraffin-embedding (FFPE) is ...a widely used technique to preserve pathology tissues. The FFPE process introduces artificial C > T mutations in next-generation sequencing, so we set out to develop a method to score and filter such spurious variants. FFPE mutational artifacts follow a mutational signature. By using the FFPE signature and Bayes' formula, we can calculate the probability of a mutation resulting from the FFPE process and use this probability to filter FFPE variants. We implement this method as an R package. We tested the method by simulating mutations across all 60 baseline mutational signatures from the Catalog of Somatic Mutations in Cancer (COSMIC) and combining them with mutations following the FFPE mutational signature. The sensitivity and specificity of the method are adversely affected by the cosine similarity between the baseline and FFPE signatures. Higher percentages of FFPE mutations result in increased sensitivity and reduced specificity. The specificity and sensitivity of the method can be predicted by a linear model with an interaction term using the signature similarity and the percentage of FFPE mutations, with an ... and 0.79, respectively. Finally, we tested the method using six RNA sequencing cancer samples and observed concordant trends of specificity and sensitivity with respect to our simulated data. The R package can be used to annotate and filter FFPE-induced mutations in cancer genomics. Our method is adversely affected by the signature similarity and the percentage of FFPE mutations.
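The Bayes' formula step described above can be illustrated with a small sketch. It assumes a two-component mixture in which a known fraction of mutations follows the FFPE signature and the rest follows a single baseline signature; this mixture model, the function names, and the three-channel toy signatures are assumptions for the example (real signatures are defined over many more mutation channels), and the code does not reproduce the R package's interface.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two signatures given as equal-length sequences."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def ffpe_posterior(channel, ffpe_sig, baseline_sig, ffpe_fraction):
    """P(mutation came from FFPE | its channel), by Bayes' formula.

    ffpe_sig and baseline_sig map a mutation channel to its probability under
    each signature; ffpe_fraction is the assumed prior share of FFPE mutations.
    """
    numerator = ffpe_fraction * ffpe_sig[channel]
    denominator = numerator + (1.0 - ffpe_fraction) * baseline_sig[channel]
    return numerator / denominator if denominator > 0 else 0.0


# Toy signatures over three channels (hypothetical values, each summing to 1).
FFPE_SIG = {"C>T": 0.8, "C>A": 0.1, "T>C": 0.1}
BASELINE_SIG = {"C>T": 0.2, "C>A": 0.5, "T>C": 0.3}
```

With these toy values and an FFPE fraction of 0.5, a C>T mutation receives a posterior of 0.4 / (0.4 + 0.1) = 0.8. The more the baseline signature resembles the FFPE signature, the closer the posterior moves to the prior for every channel, which is consistent with the reported loss of performance at high cosine similarity.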
Next-generation sequencing provides comprehensive information about individuals' genetic makeup and is commonplace in precision oncology practice. Due to the heterogeneity of individual patients' ...disease conditions and treatment journeys, targeted therapies are not always initiated despite actionable mutations. To better understand and support the clinical decision-making process in precision oncology, there is a need to examine real-world associations between patients' genetic information and treatment choices. To fill the gap of insufficient use of real-world data (RWD) in electronic health records (EHRs), we generated a single Resource Description Framework (RDF) resource, called PO2RDF (precision oncology to RDF), by integrating information regarding genes, variants, diseases, and drugs from genetic reports and EHRs. PO2RDF contains a total of 2,309,014 triples. Among them, 32,815 triples are related to Gene, 34,695 to Variant, 8,787 to Disease, and 26,154 to Drug. We performed two use case analyses to demonstrate the usability of PO2RDF: (1) we examined real-world associations between EGFR mutations and targeted therapies to confirm existing knowledge and detect off-label use; (2) we examined differences in prognosis for lung cancer patients with and without TP53 mutations. In conclusion, our work proposes using RDF to organize and distribute clinical RWD that is otherwise inaccessible externally. Our work serves as a pilot study that will lead to new clinical applications and could ultimately stimulate progress in the field of precision oncology.
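To make the idea of gene, variant, and drug triples concrete, the sketch below stores subject-predicate-object triples as plain tuples and answers a question in the spirit of the first use case (which drugs were given to patients carrying a variant in a given gene). Every identifier, predicate name, and data row here is hypothetical toy content; it does not reflect the PO2RDF schema or its data, and a real RDF resource would normally be queried with SPARQL rather than hand-written matching.

```python
# Toy triples; all identifiers and predicates are invented for this sketch.
TOY_TRIPLES = [
    ("ex:patient1", "ex:hasVariant", "ex:EGFR_L858R"),
    ("ex:EGFR_L858R", "ex:inGene", "ex:EGFR"),
    ("ex:patient1", "ex:receivedDrug", "ex:erlotinib"),
    ("ex:patient2", "ex:hasVariant", "ex:TP53_R175H"),
    ("ex:TP53_R175H", "ex:inGene", "ex:TP53"),
    ("ex:patient2", "ex:receivedDrug", "ex:carboplatin"),
]


def match(triples, s=None, p=None, o=None):
    """Yield triples matching a pattern; None acts as a wildcard."""
    for subj, pred, obj in triples:
        if (s is None or subj == s) and (p is None or pred == p) and (o is None or obj == o):
            yield (subj, pred, obj)


def drugs_for_gene(triples, gene):
    """Map each patient with a variant in `gene` to the sorted drugs they received."""
    triples = list(triples)  # allow several passes over the data
    variants = {subj for subj, _, _ in match(triples, p="ex:inGene", o=gene)}
    patients = {
        subj for subj, _, obj in match(triples, p="ex:hasVariant") if obj in variants
    }
    return {
        patient: sorted(obj for _, _, obj in match(triples, s=patient, p="ex:receivedDrug"))
        for patient in sorted(patients)
    }
```

For example, drugs_for_gene(TOY_TRIPLES, "ex:EGFR") walks from gene to variant to patient to drug, which is the kind of multi-hop association a triple representation is meant to make easy to express.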
Topological data analysis (TDA) is a powerful method for reducing data dimensionality, mining underlying data relationships, and intuitively representing the data structure. The Mapper algorithm is ...one such tool that projects high-dimensional data to 1-dimensional space by using a filter function that is subsequently used to reconstruct the data topology relationships. However, domain context information and prior knowledge have not been considered in current TDA modeling frameworks. Here, we report the development and evaluation of a semi-supervised topological analysis (STA) framework that incorporates discrete or continuously labeled data points and selects the most relevant filter functions accordingly. We validate the proposed STA framework with simulation data and then apply it to samples from Genotype-Tissue Expression data and ovarian cancer transcriptome datasets. The graphs generated by STA for these 2 datasets, based on gene expression profiles, are consistent with prior knowledge, thereby supporting the effectiveness of the proposed framework.
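The plain Mapper construction mentioned above (filter function, overlapping cover of the filter range, clustering within each cover element, and edges between clusters that share points) can be sketched in a few lines. This is a minimal illustration of unsupervised Mapper only; it does not implement the label-driven filter selection of the STA framework, and the function names, the threshold-based single-linkage clustering, and the default parameters are assumptions chosen for brevity.

```python
import math


def single_linkage(points, members, eps):
    """Group member indices into clusters of points chained by distances <= eps."""
    clusters = []
    unvisited = set(members)
    while unvisited:
        seed = unvisited.pop()
        cluster, frontier = {seed}, [seed]
        while frontier:
            current = frontier.pop()
            near = {j for j in unvisited if math.dist(points[current], points[j]) <= eps}
            unvisited -= near
            cluster |= near
            frontier.extend(near)
        clusters.append(cluster)
    return clusters


def mapper_graph(points, filter_fn, n_intervals=4, overlap=0.25, eps=1.0):
    """Return (nodes, edges): nodes are sets of point indices, edges are index pairs."""
    values = [filter_fn(p) for p in points]
    lo, hi = min(values), max(values)
    length = (hi - lo) / n_intervals or 1.0  # avoid zero-length intervals
    nodes = []
    for i in range(n_intervals):
        # Each interval is widened on both sides so that neighbours overlap.
        start = lo + i * length - overlap * length
        end = lo + (i + 1) * length + overlap * length
        members = [idx for idx, v in enumerate(values) if start <= v <= end]
        nodes.extend(single_linkage(points, members, eps))
    # Two nodes are connected when their clusters share at least one point.
    edges = {
        (a, b)
        for a in range(len(nodes))
        for b in range(a + 1, len(nodes))
        if nodes[a] & nodes[b]
    }
    return nodes, edges
```

Calling mapper_graph on five evenly spaced points along a line, with the x-coordinate as the filter and two intervals, yields two nodes joined by one edge through the shared middle point. The choice of filter function drives the shape of the resulting graph, which is the step the STA framework proposes to guide with labeled data.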
Gastroblastoma is a rare distinctive biphasic tumor of the stomach. The molecular biology of gastroblastoma has not been studied, and no affirmative diagnostic markers have been developed. We ...retrieved two gastroblastomas from the consultation practices of the authors and performed transcriptome sequencing on formalin-fixed paraffin-embedded tissue. Recurrent predicted fusion genes were validated at genomic and RNA levels. The presence of the fusion gene was confirmed on two additional paraffin-embedded cases of gastroblastoma. Control cases of histologic mimics (biphasic synovial sarcoma, leiomyoma, leiomyosarcoma, desmoid-type fibromatosis, EWSR1-FLI1-positive Ewing sarcoma, Wilms' tumor, gastrointestinal stromal tumor, plexiform fibromyxoma, and Sonic hedgehog-type medulloblastomas) and normal gastric mucosa and muscularis propria were also analyzed. The gastroblastomas affected two males and two females aged 9-56 years. Transcriptome sequencing identified recurrent somatic MALAT1-GLI1 fusion genes, which were predicted to retain the key domains of GLI1. The MALAT1-GLI1 fusion gene was validated by break-apart and dual-fusion FISH and RT-PCR. The additional two gastroblastomas were also positive for the MALAT1-GLI1 fusion gene. None of the control cases harbored MALAT1-GLI1. Overexpression of GLI1 in the gastroblastomas was confirmed at RNA and protein levels. Pathway analysis revealed activation of the Sonic hedgehog pathway in gastroblastoma, and gene expression profiling showed that gastroblastomas grouped together and were most similar to Sonic hedgehog-type medulloblastomas. In summary, we have identified an oncogenic MALAT1-GLI1 fusion gene in all cases of gastroblastoma that may serve as a diagnostic biomarker. The fusion gene is predicted to encode a protein that includes the zinc finger domains of GLI1 and results in overexpression of GLI1 protein and activation of the Sonic hedgehog pathway.