Oxidative stress (OS) is a key pathophysiological mechanism in Crohn's disease (CD). OS-related genes can be affected by environmental factors, intestinal inflammation, gut microbiota, and epigenetic ...changes. However, the role of OS as a potential CD etiological factor or triggering factor is unknown, as differentially expressed OS genes in CD can be either a cause or a subsequent change of intestinal inflammation. Herein, we used a multi-omics summary data-based Mendelian randomization (SMR) approach to identify putative causal effects and underlying mechanisms of OS genes in CD.
OS-related genes were extracted from the GeneCards database. Intestinal transcriptome datasets were collected from the Gene Expression Omnibus (GEO) database and meta-analyzed to identify differentially expressed genes (DEGs) related to OS in CD. Integration analyses of the largest CD genome-wide association study (GWAS) summaries with expression quantitative trait loci (eQTLs) and DNA methylation QTLs (mQTLs) from the blood were performed using SMR methods to prioritize putative blood OS genes and their regulatory elements associated with CD risk. Up-to-date intestinal eQTLs and fecal microbial QTLs (mbQTLs) were integrated to uncover potential interactions between host OS gene expression and gut microbiota through SMR and colocalization analysis. Two additional Mendelian randomization (MR) methods were used as sensitivity analyses. Putative results were validated in an independent multi-omics cohort from the First Affiliated Hospital of Sun Yat-sen University (FAH-SYS).
A meta-analysis from six datasets identified 438 OS-related DEGs enriched in intestinal enterocytes in CD from 817 OS-related genes. Five genes from blood tissue were prioritized as candidate CD-causal genes using three-step SMR methods: BAD, SHC1, STAT3, MUC1, and GPX3. Furthermore, SMR analysis also identified five putative intestinal genes, three of which were involved in gene-microbiota interactions through colocalization analysis: MUC1, CD40, and PRKAB1. Validation results showed that 88.79% of DEGs were replicated in the FAH-SYS cohort. Associations between pairs of MUC1-Bacillus aciditolerans and PRKAB1-Escherichia coli in the FAH-SYS cohort were consistent with eQTL-mbQTL colocalization.
This multi-omics integration study highlighted that OS genes causal to CD are regulated by DNA methylation and host-microbiota interactions. This provides evidence for future targeted functional research aimed at developing suitable therapeutic interventions and disease prevention.
Whole -genome sequencing projects of millions of subjects contain enormous genotypes, entailing a huge memory burden and time for computation. Here, we present GBC, a toolkit for rapidly compressing ...large-scale genotypes into highly addressable byte-encoding blocks under an optimized parallel framework. We demonstrate that GBC is up to 1000 times faster than state-of-the-art methods to access and manage compressed large-scale genotypes while maintaining a competitive compression ratio. We also showed that conventional analysis would be substantially sped up if built on GBC to access genotypes of a large population. GBC's data structure and algorithms are valuable for accelerating large-scale genomic research.
The usage of expressed somatic mutations may have a unique advantage in identifying active cancer driver mutations. However, accurately calling mutations from RNA-seq data is difficult due to ...confounding factors such as RNA-editing, reverse transcription, and gap alignment. In the present study, we proposed a framework (named RNA-SSNV,
https://github.com/pmglab/RNA-SSNV
) to call somatic single nucleotide variants (SSNV) from tumor bulk RNA-seq data. Based on a comprehensive multi-filtering strategy and a machine-learning classification model trained with comprehensively curated features, RNA-SSNV achieved the best precision–recall rate (0.880–0.884) in a testing dataset and robustly retained 0.94 AUC for the precision–recall curve in three validation adult-based TCGA (The Cancer Genome Atlas) datasets. We further showed that the somatic mutations called by RNA-SSNV tended to have a higher functional impact and therapeutic power in known driver genes. Furthermore, VAF (variant allele fraction) analysis revealed that subclonal harboring expressed mutations had evolutional selection advantage and RNA had higher detection power to rescue DNA-omitted mutations. In sum, RNA-SSNV will be a useful approach to accurately call expressed somatic mutations for a more insightful analysis of cancer drive genes and carcinogenic mechanisms.
Linkage disequilibrium and disease-associated variants in the non-coding regions make it difficult to distinguish the truly associated genes from the redundantly associated genes for complex ...diseases. In this study, we proposed a new conditional gene-based framework called eDESE that leveraged an improved effective chi-squared statistic to control the type I error rates and remove the redundant associations. eDESE initially performed the association analysis by mapping variants to genes according to their physical distance. We further demonstrated that the isoform-level eQTLs could be more powerful than the gene-level eQTLs in the association analysis using a simulation study. Then the eQTL-guided strategies, that is, mapping variants to genes according to their gene/isoform-level variant-gene
-eQTLs associations, were also integrated with eDESE. We then applied eDESE to predict the potential susceptibility genes of schizophrenia and found that the potential susceptibility genes were enriched with many neuronal or synaptic signaling-related terms in the Gene Ontology knowledgebase and antipsychotics-gene interaction terms in the drug-gene interaction database (DGIdb). More importantly, seven potential susceptibility genes identified by eDESE were the target genes of multiple antipsychotics in DrugBank. Comparing the potential susceptibility genes identified by eDESE and other benchmark approaches (i.e., MAGMA and S-PrediXcan) implied that strategy based on the isoform-level eQTLs could be an important supplement for the other two strategies (physical distance and gene-level eQTLs). We have implemented eDESE in our integrative platform KGGSEE (http://pmglab.top/kggsee/#/) and hope that eDESE can facilitate the prediction of candidate susceptibility genes and isoforms for complex diseases in a multi-tissue context.
De novo mutations as causes of schizophrenia Yang, Zhenxing; Li, Longkun; Li, Miaoxin
Psychiatry research,
December 2018, 2018-Dec, 2018-12-00, 20181201, Letnik:
270
Journal Article
Mendelian randomization using GWAS summary statistics has become a popular method to infer causal relationships across complex diseases. However, the widespread pleiotropy observed in GWAS has made ...the selection of valid instrumental variables problematic, leading to possible violations of Mendelian randomization assumptions and thus potentially invalid inferences concerning causation. Furthermore, current MR methods can examine causation in only one direction, so that two separate analyses are required for bi-directional analysis. In this study, we propose a ststistical framework, MRCI (Mixture model Reciprocal Causation Inference), to estimate reciprocal causation between two phenotypes simultaneously using the genome-scale summary statistics of the two phenotypes and reference linkage disequilibrium information. Simulation studies, including strong correlated pleiotropy, showed that MRCI obtained nearly unbiased estimates of causation in both directions, and correct Type I error rates under the null hypothesis. In applications to real GWAS data, MRCI detected significant bi-directional and uni-directional causal influences between common diseases and putative risk factors.
Due to the dyeing process, learning samples used for color prediction of pre-colored fiber blends should be re-prepared once the batches of the fiber change. The preparation of the sample is ...time-consuming and leads to manpower and material waste. The two-constant Kubelka-Munk theory is selected in this article to investigate the feasibility to minimize and optimize the learning samples for the theory since it has the highest prediction accuracy and moderate learning sample size requirement among all the color prediction models. Results show that two samples, namely, a masstone obtained by 100% pre-colored fiber and a tint mixed by 40% pre-colored fiber and 60% white fiber, are enough to determine the absorption and scattering coefficients of a pre-colored fiber. In addition, the optimal sample for the single-constant Kubelka-Munk theory is also explored.
The driver tissues or cell types in which susceptibility genes initiate diseases remain elusive. We develop a unified framework to detect the causal tissues of complex diseases or traits according to ...selective expression of disease-associated genes in genome-wide association studies (GWASs). This framework consists of three components which run iteratively to produce a converged prioritization list of driver tissues. Additionally, this framework also outputs a list of prioritized genes as a byproduct. We apply the framework to six representative complex diseases or traits with GWAS summary statistics, which leads to the estimation of the lung as an associated tissue of rheumatoid arthritis.
Genome-wide association studies (GWASs) have identified multiple susceptibility loci for Alzheimer's disease (AD), which is characterized by early and progressive damage to the hippocampus. However, ...the association of hippocampal gene expression with AD and the underlying neurobiological pathways remain largely unknown. Based on the genomic and transcriptomic data of 111 hippocampal samples and the summary data of two large-scale meta-analyses of GWASs, a transcriptome-wide association study (TWAS) was performed to identify genes with significant associations between hippocampal expression and AD. We identified 54 significantly associated genes using an AD-GWAS meta-analysis of 455,258 individuals; 36 of the genes were confirmed in another AD-GWAS meta-analysis of 63,926 individuals. Fine-mapping models further prioritized 24 AD-related genes whose effects on AD were mediated by hippocampal expression, including APOE and two novel genes (PTPN9 and PCDHA4). These genes are functionally related to amyloid-beta formation, phosphorylation/dephosphorylation, neuronal apoptosis, neurogenesis and telomerase-related processes. By integrating the predicted hippocampal expression and neuroimaging data, we found that the hippocampal expression of QPCTL and ERCC2 showed significant difference between AD patients and cognitively normal elderly individuals as well as correlated with hippocampal volume. Mediation analysis further demonstrated that hippocampal volume mediated the effect of hippocampal gene expression (QPCTL and ERCC2) on AD. This study identifies two novel genes associated with AD by integrating hippocampal gene expression and genome-wide association data and reveals candidate hippocampus-mediated neurobiological pathways from gene expression to AD.
Abstract
Background
The high mutation rate throughout the entire melanoma genome presents a major challenge in stratifying true driver events from the background mutations. Numerous recurrent ...non-coding alterations, such as those in enhancers, can shape tumor evolution, thereby emphasizing the importance in systematically deciphering enhancer disruptions in melanoma.
Results
Here, we leveraged 297 melanoma whole-genome sequencing samples to prioritize highly recurrent regions. By performing a genome-scale CRISPR interference (CRISPRi) screen on highly recurrent region-associated enhancers in melanoma cells, we identified 66 significant hits which could have tumor-suppressive roles. These functional enhancers show unique mutational patterns independent of classical significantly mutated genes in melanoma. Target gene analysis for the essential enhancers reveal many known and hidden mechanisms underlying melanoma growth. Utilizing extensive functional validation experiments, we demonstrate that a super enhancer element could modulate melanoma cell proliferation by targeting
MEF2A
, and another distal enhancer is able to sustain
PTEN
tumor-suppressive potential via long-range interactions.
Conclusions
Our study establishes a catalogue of crucial enhancers and their target genes in melanoma growth and progression, and illuminates the identification of novel mechanisms of dysregulation for melanoma driver genes and new therapeutic targeting strategies.