RNA editing in the form of substituting adenine with inosine (A-to-I editing) is the most frequent type of RNA editing in many metazoan species. In most species, A-to-I editing sites tend to form ...clusters and editing at clustered sites depends on editing of the adjacent sites. Although functionally important in some specific cases, A-to-I editing usually is rare. The exception occurs in soft-bodied coleoid cephalopods, where tens of thousands of potentially important A-to-I editing sites have been identified, making coleoids an ideal model for studying of properties and evolution of A-to-I editing sites. Here, we apply several diverse techniques to demonstrate a strong tendency of coleoid RNA editing sites to cluster along the transcript. We show that clustering of editing sites and correlated editing substantially contribute to the transcriptome diversity that arises due to extensive RNA editing. Moreover, we identify three distinct types of editing site clusters, varying in size, and describe RNA structural features and mechanisms likely underlying formation of these clusters. In particular, these observations may explain sequence conservation at large distances around editing sites and the observed dependency of editing on mutations in the vicinity of editing sites.
Enhancers play a vital role in gene regulation and are critical in mediating the impact of noncoding genetic variants associated with complex traits. Enhancer activity is a cell‐type‐specific process ...regulated by transcription factors (TFs), epigenetic mechanisms and genetic variants. Despite the strong mechanistic link between TFs and enhancers, we currently lack a framework for jointly analysing them in cell‐type‐specific gene regulatory networks (GRN). Equally important, we lack an unbiased way of assessing the biological significance of inferred GRNs since no complete ground truth exists. To address these gaps, we present GRaNIE (Gene Regulatory Network Inference including Enhancers) and GRaNPA (Gene Regulatory Network Performance Analysis). GRaNIE (https://git.embl.de/grp‐zaugg/GRaNIE) builds enhancer‐mediated GRNs based on covariation of chromatin accessibility and RNA‐seq across samples (e.g. individuals), while GRaNPA (https://git.embl.de/grp‐zaugg/GRaNPA) assesses the performance of GRNs for predicting cell‐type‐specific differential expression. We demonstrate their power by investigating gene regulatory mechanisms underlying the response of macrophages to infection, cancer and common genetic traits including autoimmune diseases. Finally, our methods identify the TF PURA as a putative regulator of pro‐inflammatory macrophage polarisation.
Synopsis
GRaNIE builds enhancer‐based gene regulatory networks (eGRNs) using chromatin accessibility and RNA‐seq data. GRaNPA assesses the biological significance of GRNs and transcription factors. Together, they provide insights into cell‐type‐specific gene regulation.
GRaNIE builds gene regulatory networks that encompass transcription factors, regulatory regions and genes, enabling a comprehensive view of gene regulation.
GRaNPA provides an unbiased evaluation method for cell‐type‐specific GRNs by testing their ability to predict cell‐type‐specific differential expression.
GRaNPA can identify important transcription factors that drive differential expression, leading to insights into biological mechanisms.
GRaNIE and GRaNPA analyses identified PURA as a promising candidate for regulating pro‐inflammatory macrophage polarisation.
GRaNIE builds enhancer‐based gene regulatory networks (GRNs) using chromatin accessibility and RNA‐seq data. GRaNPA assesses the biological significance of GRNs and transcription factors. Together, they provide insights into cell‐type‐specific gene regulation.
Abstract
eIF4G2 (DAP5 or Nat1) is a homologue of the canonical translation initiation factor eIF4G1 in higher eukaryotes but its function remains poorly understood. Unlike eIF4G1, eIF4G2 does not ...interact with the cap-binding protein eIF4E and is believed to drive translation under stress when eIF4E activity is impaired. Here, we show that eIF4G2 operates under normal conditions as well and promotes scanning downstream of the eIF4G1-mediated 40S recruitment and cap-proximal scanning. Specifically, eIF4G2 facilitates leaky scanning for a subset of mRNAs. Apparently, eIF4G2 replaces eIF4G1 during scanning of 5′ UTR and the necessity for eIF4G2 only arises when eIF4G1 dissociates from the scanning complex. In particular, this event can occur when the leaky scanning complexes interfere with initiating or elongating 80S ribosomes within a translated uORF. This mechanism is therefore crucial for higher eukaryotes which are known to have long 5′ UTRs with highly frequent uORFs. We suggest that uORFs are not the only obstacle on the way of scanning complexes towards the main start codon, because certain eIF4G2 mRNA targets lack uORF(s). Thus, higher eukaryotes possess two distinct scanning complexes: the principal one that binds mRNA and initiates scanning, and the accessory one that rescues scanning when the former fails.
Abstract
Motivation
The increasing volume of data from high-throughput experiments including parallel reporter assays facilitates the development of complex deep-learning approaches for modeling DNA ...regulatory grammar.
Results
Here, we introduce LegNet, an EfficientNetV2-inspired convolutional network for modeling short gene regulatory regions. By approaching the sequence-to-expression regression problem as a soft classification task, LegNet secured first place for the autosome.org team in the DREAM 2022 challenge of predicting gene expression from gigantic parallel reporter assays. Using published data, here, we demonstrate that LegNet outperforms existing models and accurately predicts gene expression per se as well as the effects of single-nucleotide variants. Furthermore, we show how LegNet can be used in a diffusion network manner for the rational design of promoter sequences yielding the desired expression level.
Availability and implementation
https://github.com/autosome-ru/LegNet. The GitHub repository includes Jupyter Notebook tutorials and Python scripts under the MIT license to reproduce the results presented in the study.