Advanced cholangiocarcinoma continues to harbor a difficult prognosis and therapeutic options have been limited. During the course of a clinical trial of whole genomic sequencing seeking druggable ...targets, we examined six patients with advanced cholangiocarcinoma. Integrated genome-wide and whole transcriptome sequence analyses were performed on tumors from six patients with advanced, sporadic intrahepatic cholangiocarcinoma (SIC) to identify potential therapeutically actionable events. Among the somatic events captured in our analysis, we uncovered two novel therapeutically relevant genomic contexts that when acted upon, resulted in preliminary evidence of anti-tumor activity. Genome-wide structural analysis of sequence data revealed recurrent translocation events involving the FGFR2 locus in three of six assessed patients. These observations and supporting evidence triggered the use of FGFR inhibitors in these patients. In one example, preliminary anti-tumor activity of pazopanib (in vitro FGFR2 IC50≈350 nM) was noted in a patient with an FGFR2-TACC3 fusion. After progression on pazopanib, the same patient also had stable disease on ponatinib, a pan-FGFR inhibitor (in vitro, FGFR2 IC50≈8 nM). In an independent non-FGFR2 translocation patient, exome and transcriptome analysis revealed an allele specific somatic nonsense mutation (E384X) in ERRFI1, a direct negative regulator of EGFR activation. Rapid and robust disease regression was noted in this ERRFI1 inactivated tumor when treated with erlotinib, an EGFR kinase inhibitor. FGFR2 fusions and ERRFI mutations may represent novel targets in sporadic intrahepatic cholangiocarcinoma and trials should be characterized in larger cohorts of patients with these aberrations.
The aim of this study was to investigate the promoter hypermethylation as diagnostic markers to detect malignant prostate cells and as prognostic markers to predict the clinical recurrence of ...prostate cancer.
DNA was isolated from prostate cancer and normal adjacent tissues. After bisulfite conversion, methylation of 14,495 genes was evaluated using the Methylation27 microarrays in 238 prostate tissues. We analyzed methylation profiles in four different groups: (i) tumor (n = 198) versus matched normal tissues (n = 40), (ii) recurrence (n = 123) versus nonrecurrence (n = 75), (iii) clinical recurrence (n = 80) versus biochemical recurrence (n = 43), and (iv) systemic recurrence (n = 36) versus local recurrence (n = 44). Group 1, 2, 3, and 4 genes signifying biomarkers for diagnosis, prediction of recurrence, clinical recurrence, and systemic progression were determined. Univariate and multivariate analyses were conducted to predict risk of recurrence. We validated the methylation of genes in 20 independent tissues representing each group by pyrosequencing.
Microarray analysis revealed significant methylation of genes in four different groups of prostate cancer tissues. The sensitivity and specificity of methylation for 25 genes from 1, 2, and 4 groups and 7 from group 3 were shown. Validation of genes by pyrosequencing from group 1 (GSTP1, HIF3A, HAAO, and RARβ), group 2 (CRIP1, FLNC, RASGRF2, RUNX3, and HS3ST2), group 3 (PHLDA3, RASGRF2, and TNFRSF10D), and group 4 (BCL11B, POU3F3, and RASGRF2) confirmed the microarray results.
Our study provides a global assessment of DNA methylation in prostate cancer and identifies the significance of genes as diagnostic and progression biomarkers of prostate cancer.
Massive parallel sequencing has the potential to replace microarrays as the method for transcriptome profiling. Currently there are two protocols: full-length RNA sequencing (RNA-SEQ) and 3'-tag ...digital gene expression (DGE). In this preliminary effort, we evaluated the 3' DGE approach using two reference RNA samples from the MicroArray Quality Control Consortium (MAQC).
Using Brain RNA sample from multiple runs, we demonstrated that the transcript profiles from 3' DGE were highly reproducible between technical and biological replicates from libraries constructed by the same lab and even by different labs, and between two generations of Illumina's Genome Analyzers. Approximately 65% of all sequence reads mapped to mitochondrial genes, ribosomal RNAs, and canonical transcripts. The expression profiles of brain RNA and universal human reference RNA were compared which demonstrated that DGE was also highly quantitative with excellent correlation of differential expression with quantitative real-time PCR. Furthermore, one lane of 3' DGE sequencing, using the current sequencing chemistry and image processing software, had wider dynamic range for transcriptome profiling and was able to detect lower expressed genes which are normally below the detection threshold of microarrays.
3' tag DGE profiling with massive parallel sequencing achieved high sensitivity and reproducibility for transcriptome profiling. Although it lacks the ability of detecting alternative splicing events compared to RNA-SEQ, it is much more affordable and clearly out-performed microarrays (Affymetrix) in detecting lower abundant transcripts.
AEBP1 encodes the aortic carboxypeptidase-like protein (ACLP) that associates with collagens in the extracellular matrix (ECM) and has several roles in development, tissue repair, and fibrosis. ACLP ...is expressed in bone, the vasculature, and dermal tissues and is involved in fibroblast proliferation and mesenchymal stem cell differentiation into collagen-producing cells. Aebp1−/− mice have abnormal, delayed wound repair correlating with defects in fibroblast proliferation. In this study, we describe four individuals from three unrelated families that presented with a unique constellation of clinical findings including joint laxity, redundant and hyperextensible skin, poor wound healing with abnormal scarring, osteoporosis, and other features reminiscent of Ehlers-Danlos syndrome (EDS). Analysis of skin biopsies revealed decreased dermal collagen with abnormal collagen fibrils that were ragged in appearance. Exome sequencing revealed compound heterozygous variants in AEBP1 (c.1470delC p.Asn490_Met495delins(40) and c.1743C>A p.Cys581∗) in the first individual, a homozygous variant (c.1320_1326del p.Arg440Serfs∗3) in the second individual, and a homozygous splice site variant (c.1630+1G>A) in two siblings from the third family. We show that ACLP enhances collagen polymerization and binds to several fibrillar collagens via its discoidin domain. These studies support the conclusion that bi-allelic pathogenic variants in AEBP1 are the cause of this autosomal-recessive EDS subtype.
Modern Next Generation- and Third Generation- Sequencing methods such as Illumina and PacBio Circular Consensus Sequencing platforms provide accurate sequencing data. Parallel developments in Deep ...Learning have enabled the application of Deep Neural Networks to variant calling, surpassing the accuracy of classical approaches in many settings. DeepVariant, arguably the most popular among such methods, transforms the problem of variant calling into one of image recognition where a Deep Neural Network analyzes sequencing data that is formatted as images, achieving high accuracy. In this paper, we explore an alternative approach to designing Deep Neural Networks for variant calling, where we use meticulously designed Deep Neural Network architectures and customized variant inference functions that account for the underlying nature of sequencing data instead of converting the problem to one of image recognition.
Results from 27 whole-genome variant calling experiments spanning Illumina, PacBio and hybrid Illumina-PacBio settings suggest that our method allows vastly smaller Deep Neural Networks to outperform the Inception-v3 architecture used in DeepVariant for indel and substitution-type variant calls. For example, our method reduces the number of indel call errors by up to 18%, 55% and 65% for Illumina, PacBio and hybrid Illumina-PacBio variant calling respectively, compared to a similarly trained DeepVariant pipeline. In these cases, our models are between 7 and 14 times smaller.
We believe that the improved accuracy and problem-specific customization of our models will enable more accurate pipelines and further method development in the field. HELLO is available at https://github.com/anands-repo/hello.
As reliable, efficient genome sequencing becomes ubiquitous, the need for similarly reliable and efficient variant calling becomes increasingly important. The Genome Analysis Toolkit (GATK), ...maintained by the Broad Institute, is currently the widely accepted standard for variant calling software. However, alternative solutions may provide faster variant calling without sacrificing accuracy. One such alternative is Sentieon DNASeq, a toolkit analogous to GATK but built on a highly optimized backend. We conducted an independent evaluation of the DNASeq single-sample variant calling pipeline in comparison to that of GATK. Our results support the near-identical accuracy of the two software packages, showcase optimal scalability and great speed from Sentieon, and describe computational performance considerations for the deployment of DNASeq.
De novo germline mutations in the RNA helicase DDX3X account for 1%–3% of unexplained intellectual disability (ID) cases in females and are associated with autism, brain malformations, and epilepsy. ...Yet, the developmental and molecular mechanisms by which DDX3X mutations impair brain function are unknown. Here, we use human and mouse genetics and cell biological and biochemical approaches to elucidate mechanisms by which pathogenic DDX3X variants disrupt brain development. We report the largest clinical cohort to date with DDX3X mutations (n = 107), demonstrating a striking correlation between recurrent dominant missense mutations, polymicrogyria, and the most severe clinical outcomes. We show that Ddx3x controls cortical development by regulating neuron generation. Severe DDX3X missense mutations profoundly disrupt RNA helicase activity, induce ectopic RNA-protein granules in neural progenitors and neurons, and impair translation. Together, these results uncover key mechanisms underlying DDX3X syndrome and highlight aberrant RNA metabolism in the pathogenesis of neurodevelopmental disease.
Display omitted
•Discovery of 107 mutations in the RNA helicase DDX3X causing cortical malformations•Clinical severity is linked to reduced helicase activity and RNA-protein granules•Ddx3x is required in neural progenitors to produce cortical neurons during development•Severe missense mutations cause polymicrogyria and impair translation of targets
Using human and mouse genetics, Lennox et al. identify 107 mutations in DDX3X, demonstrating DDX3X is essential for cortical development. A striking correlation between the severity of clinical mutations and abnormal RNA metabolism highlights unappreciated mechanisms of DDX3X syndrome.
Bioinformatics pipelines are an integral component of next-generation sequencing (NGS). Processing raw sequence data to detect genomic alterations has significant impact on disease management and ...patient care. Because of the lack of published guidance, there is currently a high degree of variability in how members of the global molecular genetics and pathology community establish and validate bioinformatics pipelines. Improperly developed, validated, and/or monitored pipelines may generate inaccurate results that may have negative consequences for patient care. To address this unmet need, the Association of Molecular Pathology, with organizational representation from the College of American Pathologists and the American Medical Informatics Association, has developed a set of 17 best practice consensus recommendations for the validation of clinical NGS bioinformatics pipelines. Recommendations include practical guidance for laboratories regarding NGS bioinformatics pipeline design, development, and operation, with additional emphasis on the role of a properly trained and qualified molecular professional to achieve optimal NGS testing quality.
Abstract
Motivation
Next-generation sequencing is rapidly improving diagnostic rates in rare Mendelian diseases, but even with whole genome or whole exome sequencing, the majority of cases remain ...unsolved. Increasingly, RNA sequencing is being used to solve many cases that evade diagnosis through sequencing alone. Specifically, the detection of aberrant splicing in many rare disease patients suggests that identifying RNA splicing outliers is particularly useful for determining causal Mendelian disease genes. However, there is as yet a paucity of statistical methodologies to detect splicing outliers.
Results
We developed LeafCutterMD, a new statistical framework that significantly improves the previously published LeafCutter in the context of detecting outlier splicing events. Through simulations and analysis of real patient data, we demonstrate that LeafCutterMD has better power than the state-of-the-art methodology while controlling false-positive rates. When applied to a cohort of disease-affected probands from the Mayo Clinic Center for Individualized Medicine, LeafCutterMD recovered all aberrantly spliced genes that had previously been identified by manual curation efforts.
Availability and implementation
The source code for this method is available under the opensource Apache 2.0 license in the latest release of the LeafCutter software package available online at http://davidaknowles.github.io/leafcutter.
Supplementary information
Supplementary data are available at Bioinformatics online.