Long non-coding RNAs were once considered as “junk” RNA produced by aberrant DNA transcription. They are now understood to play central roles in diverse cellular processes from proliferation and ...migration to differentiation, senescence and DNA damage control. LncRNAs are classed as transcripts longer than 200 nucleotides that do not encode a peptide. They are relevant to many physiological and pathophysiological processes through their control of fundamental molecular functions. This review summarises the recent progress in lncRNA research and highlights the far-reaching physiological relevance of lncRNAs. The main areas of lncRNA research encompassing their characterisation, classification and mechanisms of action will be discussed. In particular, the regulation of gene expression and chromatin landscape through lncRNA control of proteins, DNA and other RNAs will be introduced. This will be exemplified with a selected number of lncRNAs that have been described in numerous physiological contexts and that should be largely representative of the tens-of-thousands of mammalian lncRNAs. To some extent, these lncRNAs have inspired the current thinking on the central dogmas of epigenetics, RNA and DNA mechanisms.
Abstract
MicroRNAs (miRNAs) are post-transcriptional regulators that finetune gene expression via translational repression or degradation of their target mRNAs. Despite their functional relevance, ...frameworks for the scalable and accurate detection of miRNA orthologs are missing. Consequently, there is still no comprehensive picture of how miRNAs and their associated regulatory networks have evolved. Here we present ncOrtho, a synteny informed pipeline for the targeted search of miRNA orthologs in unannotated genome sequences. ncOrtho matches miRNA annotations from multi-tissue transcriptomes in precision, while scaling to the analysis of hundreds of custom-selected species. The presence-absence pattern of orthologs to 266 human miRNA families across 402 vertebrate species reveals four bursts of miRNA acquisition, of which the most recent event occurred in the last common ancestor of higher primates. miRNA families are rarely modified or lost, but notable exceptions for both events exist. miRNA co-ortholog numbers faithfully indicate lineage-specific whole genome duplications, and miRNAs are powerful markers for phylogenomic analyses. Their exceptionally low genetic diversity makes them suitable to resolve clades where the phylogenetic signal is blurred by incomplete lineage sorting of ancestral alleles. In summary, ncOrtho allows to routinely consider miRNAs in evolutionary analyses that were thus far reserved to protein-coding genes.
Graphical Abstract
Graphical Abstract
Compared to their protein-coding counterparts, almost nothing is known about the role of long noncoding RNAs (lncRNAs) in cardiac fibrosis. In the current report, Liang and Pan et al. characterized ...the pro-fibrotic lncRNA PFL in respect to cardiac fibrosis in mice. PFL was upregulated in the hearts of mice after myocardial infarction and in fibrotic cardiac fibroblasts. Moreover, PFL competitively sponged the cardio-protective miRNA let-7d in cardiac fibroblasts. Knockdown of platelet activating factor receptor (PTAFR) was shown to affect the pro-fibrotic collagen production mediated by PFL. PTAFR overexpression also led to collagen production and RNA abundance of PTAFR was also regulated by miRNA let-7d. Therefore, the PFL/PTAFR/let-7d-dependent gene regulatory mechanism proposed by the authors manifests the hypothesis of competing endogenous RNAs to cardiac fibrosis.
Hoogsteen base pairing enables the association of an RNA strand with a DNA double-helix, allowing the formation of RNA-DNA triplexes, which now have been identified to occur in vivo.Such ...identifications require specific bioinformatic, biochemical, and biophysical methods, offering experimental opportunities, but also render the field complex.Triplex formation has been best studied for long non-coding RNAs (lncRNAs).Numerous lncRNAs fine-tune gene expression through triplex formation.Many aspects of triplex biology still await clarification, making this research field highly dynamic.
Interactions of RNA with DNA are principles of gene expression control that have recently gained considerable attention. Among RNA–DNA interactions are R-loops and RNA-DNA hybrid G-quadruplexes, as well as RNA-DNA triplexes. It is proposed that RNA-DNA triplexes guide RNA-associated regulatory proteins to specific genomic locations, influencing transcription and epigenetic decision making. Although triplex formation initially was considered solely an in vitro event, recent progress in computational, biochemical, and biophysical methods support in vivo functionality with relevance for gene expression control. Here, we review the central methodology and biology of triplexes, outline paradigms required for triplex function, and provide examples of physiologically important triplex-forming long non-coding RNAs.
Interactions of RNA with DNA are principles of gene expression control that have recently gained considerable attention. Among RNA–DNA interactions are R-loops and RNA-DNA hybrid G-quadruplexes, as well as RNA-DNA triplexes. It is proposed that RNA-DNA triplexes guide RNA-associated regulatory proteins to specific genomic locations, influencing transcription and epigenetic decision making. Although triplex formation initially was considered solely an in vitro event, recent progress in computational, biochemical, and biophysical methods support in vivo functionality with relevance for gene expression control. Here, we review the central methodology and biology of triplexes, outline paradigms required for triplex function, and provide examples of physiologically important triplex-forming long non-coding RNAs.
Understanding how epigenetic variation in non-coding regions is involved in distal gene-expression regulation is an important problem. Regulatory regions can be associated to genes using large-scale ...datasets of epigenetic and expression data. However, for regions of complex epigenomic signals and enhancers that regulate many genes, it is difficult to understand these associations. We present StitchIt, an approach to dissect epigenetic variation in a gene-specific manner for the detection of regulatory elements (REMs) without relying on peak calls in individual samples. StitchIt segments epigenetic signal tracks over many samples to generate the location and the target genes of a REM simultaneously. We show that this approach leads to a more accurate and refined REM detection compared to standard methods even on heterogeneous datasets, which are challenging to model. Also, StitchIt REMs are highly enriched in experimentally determined chromatin interactions and expression quantitative trait loci. We validated several newly predicted REMs using CRISPR-Cas9 experiments, thereby demonstrating the reliability of StitchIt. StitchIt is able to dissect regulation in superenhancers and predicts thousands of putative REMs that go unnoticed using peak-based approaches suggesting that a large part of the regulome might be uncharted water.
Within the family of NADPH oxidases, NOX4 is unique as it is predominantly localized in the endoplasmic reticulum, has constitutive activity, and generates hydrogen peroxide (H2O2). We hypothesize ...that these features are consequences of a so far unidentified NOX4-interacting protein. Two-dimensional blue native (BN) electrophorese combined with SDS-PAGE yielded NOX4 to reside in macromolecular complexes. Interacting proteins were screened by quantitative SILAC (stable isotope labeling of amino acids in cell culture) co-immunoprecipitation (Co-IP) in HEK293 cells stably overexpressing NOX4. By this technique, several interacting proteins were identified with calnexin showing the most robust interaction. Calnexin also resided in NOX4-containing complexes as demonstrated by complexome profiling from BN-PAGE. The calnexin NOX4 interaction could be confirmed by reverse Co-IP and proximity ligation assay, whereas NOX1, NOX2, or NOX5 did not interact with calnexin. Calnexin deficiency as studied in mouse embryonic fibroblasts from calnexin−/− mice or in response to calnexin shRNA reduced cellular NOX4 protein expression and reactive oxygen species formation. Our results suggest that endogenous NOX4 forms macromolecular complexes with calnexin, which are needed for the proper maturation, processing, and function of NOX4 in the endoplasmic reticulum.
In vascular endothelial cells, cysteine metabolism by the cystathionine γ lyase (CSE), generates hydrogen sulfide-related sulfane sulfur compounds (H
S
), that exert their biological actions via ...cysteine
-sulfhydration of target proteins. This study set out to map the "
-sulfhydrome" (ie, the spectrum of proteins targeted by H
S
) in human endothelial cells.
Liquid chromatography with tandem mass spectrometry was used to identify
-sulfhydrated cysteines in endothelial cell proteins and β3 integrin intraprotein disulfide bond rearrangement. Functional studies included endothelial cell adhesion, shear stress-induced cell alignment, blood pressure measurements, and flow-induced vasodilatation in endothelial cell-specific CSE knockout mice and in a small collective of patients with endothelial dysfunction.
Three paired sample sets were compared: (1) native human endothelial cells isolated from plaque-free mesenteric arteries (CSE activity high) and plaque-containing carotid arteries (CSE activity low); (2) cultured human endothelial cells kept under static conditions or exposed to fluid shear stress to decrease CSE expression; and (3) cultured endothelial cells exposed to shear stress to decrease CSE expression and treated with solvent or the slow-releasing H
S
donor, SG1002. The endothelial cell "
-sulfhydrome" consisted of 3446 individual cysteine residues in 1591 proteins. The most altered family of proteins were the integrins and focusing on β3 integrin in detail we found that
-sulfhydration affected intraprotein disulfide bond formation and was required for the maintenance of an extended-open conformation of the β leg. β3 integrin
-sulfhydration was required for endothelial cell mechanotransduction in vitro as well as flow-induced dilatation in murine mesenteric arteries. In cultured cells, the loss of
-sulfhydration impaired interactions between β3 integrin and Gα13 (guanine nucleotide-binding protein subunit α 13), resulting in the constitutive activation of RhoA (ras homolog family member A) and impaired flow-induced endothelial cell realignment. In humans with atherosclerosis, endothelial function correlated with low H
S
generation, impaired flow-induced dilatation, and failure to detect β3 integrin
-sulfhydration, all of which were rescued after the administration of an H
S
supplement.
Vascular disease is associated with marked changes in the
-sulfhydration of endothelial cell proteins involved in mediating responses to flow. Short-term H
S
supplementation improved vascular reactivity in humans highlighting the potential of interfering with this pathway to treat vascular disease.
Long non-coding RNAs (lncRNAs) impact cell function via numerous mechanisms. In the nucleus, interactions between lncRNAs and DNA and the consequent formation of non-canonical nucleic acid structures ...seems to be particularly relevant. Along with interactions between single-stranded RNA (ssRNA) and single-stranded DNA (ssDNA), such as R-loops, ssRNA can also interact with double-stranded DNA (dsDNA) to form DNA:DNA:RNA triplexes. A major challenge in the study of DNA:DNA:RNA triplexes is the identification of the precise RNA component interacting with specific regions of the dsDNA. As this is a crucial step towards understanding lncRNA function, there exist several computational methods designed to predict these sequences. This review summarises the recent progress in the prediction of triplex formation and highlights important DNA:DNA:RNA triplexes. In particular, different prediction tools (
,
,
,
,
,
and
) will be discussed and their use exemplified by selected lncRNAs, whose DNA:DNA:RNA triplex forming potential was validated experimentally. Collectively, these tools revealed that DNA:DNA:RNA triplexes are likely to be numerous and make important contributions to gene expression regulation.
Ribosome biogenesis is fundamental for cellular life, but surprisingly little is known about the underlying pathway. In eukaryotes a comprehensive collection of experimentally verified ribosome ...biogenesis factors (RBFs) exists only for Saccharomyces cerevisiae. Far less is known for other fungi, animals or plants, and insights are even more limited for archaea. Starting from 255 yeast RBFs, we integrated ortholog searches, domain architecture comparisons and, in part, manual curation to investigate the inventories of RBF candidates in 261 eukaryotes, 26 archaea and 57 bacteria. The resulting phylogenetic profiles reveal the evolutionary ancestry of the yeast pathway. The oldest core comprising 20 RBF lineages dates back to the last universal common ancestor, while the youngest 20 factors are confined to the Saccharomycotina. On this basis, we outline similarities and differences of ribosome biogenesis across contemporary species. Archaea, so far a rather uncharted domain, possess 38 well-supported RBF candidates of which some are known to form functional sub-complexes in yeast. This provides initial evidence that ribosome biogenesis in eukaryotes and archaea follows similar principles. Within eukaryotes, RBF repertoires vary considerably. A comparison of yeast and human reveals that lineage-specific adaptation via RBF exclusion and addition characterizes the evolution of this ancient pathway.
Monoamine oxidases (MAOs) generate H2O2 as a by-product of their catalytic cycle. Whether MAOs are mediators of endothelial dysfunction is unknown and was determined here in the angiotensin II and ...lipopolysaccharide-models of vascular dysfunction in mice. Quantitative real-time polymerase chain reaction revealed that mouse aortas contain enzymes involved in catecholamine generation and MAO-A and MAO-B mRNA. MAO-A and -B proteins could be detected by Western blot not only in mouse aortas but also in human umbilical vein endothelial cells. Ex vivo incubation of mouse aorta with recombinant MAO-A increased H2O2 formation and induced endothelial dysfunction that was attenuated by polyethylene glycol-catalase and MAO inhibitors. In vivo lipopolysaccharide (8 mg/kg IP overnight) or angiotensin II (1 mg/kg per day, 2 weeks, minipump) treatment induced vascular MAO-A and -B expressions and resulted in attenuated endothelium-dependent relaxation of the aorta in response to acetylcholine. MAO inhibitors reduced the lipopolysaccharide- and angiotensin II–induced aortic reactive oxygen species formation by 50% (ferrous oxidation xylenol orange assay) and partially normalized endothelium-dependent relaxation. MAO-A and MAO-B inhibitors had an additive effect; combined application completely restored endothelium-dependent relaxation. To determine how MAO-dependent H2O2 formation induces endothelial dysfunction, cyclic GMP was measured. Histamine stimulation of human umbilical vein endothelial cells to activate endothelial NO synthase resulted in an increase in cyclic GMP, which was almost abrogated by MAO-A exposure. MAO inhibition prevented this effect, suggesting that MAO-induced H2O2 formation is sufficient to attenuate endothelial NO release. Thus, MAO-A and MAO-B are both expressed in the mouse aorta, induced by in vivo lipopolysaccharide and angiotensin II treatment and contribute via the generation of H2O2 to endothelial dysfunction in vascular disease models.