The nucleocapsid protein (N) of SARS-CoV-2 is essential for virus replication, genome packaging, evading host immunity, and virus maturation. N is a multidomain protein composed of an independently ...folded monomeric N-terminal domain that is the primary site for RNA binding and a dimeric C-terminal domain that is essential for efficient phase separation and condensate formation with RNA. The domains are separated by a disordered Ser/Arg-rich region preceding a self-associating Leu-rich helix. Phosphorylation in the Ser/Arg region in infected cells decreases the viscosity of N:RNA condensates promoting viral replication and host immune evasion. The molecular level effect of phosphorylation, however, is missing from our current understanding. Using NMR spectroscopy and analytical ultracentrifugation, we show that phosphorylation destabilizes the self-associating Leu-rich helix 30 amino-acids distant from the phosphorylation site. NMR and gel shift assays demonstrate that RNA binding by the linker is dampened by phosphorylation, whereas RNA binding to the full-length protein is not significantly affected presumably due to retained strong interactions with the primary RNA-binding domain. Introducing a switchable self-associating domain to replace the Leu-rich helix confirms the importance of linker self-association to droplet formation and suggests that phosphorylation not only increases solubility of the positively charged elongated Ser/Arg region as observed in other RNA-binding proteins but can also inhibit self-association of the Leu-rich helix. These data highlight the effect of phosphorylation both at local sites and at a distant self-associating hydrophobic helix in regulating liquid–liquid phase separation of the entire protein.
Positive-strand RNA viruses use long open reading frames to express large polyproteins that are processed into individual proteins by viral proteases. Polyprotein processing is highly regulated and ...yields intermediate species with different functions than the fully processed proteins, increasing the biochemical diversity of the compact viral genome while also presenting challenges in that proteins must remain stably folded in multiple contexts. We have used circular dichroism spectroscopy and single molecule microscopy to examine the solution structure and self-association of the poliovirus P3 region protein composed of membrane binding 3A, RNA priming 3B (VPg), 3Cpro protease, and 3Dpol RNA-dependent RNA polymerase proteins. Our data indicate that co-folding interactions within the 3ABC segment stabilize the conformational state of the 3C protease region, and this stabilization requires the full-length 3A and 3B proteins. Enzymatic activity assays show that 3ABC is also an active protease, and it cleaves peptide substrates at rates comparable to 3Cpro. The cleavage of a larger polyprotein substrate is stimulated by the addition of RNA, and 3ABCpro becomes 20-fold more active than 3Cpro in the presence of stoichiometric amounts of viral cre RNA. The data suggest that co-folding within the 3ABC region results in a protease that can be highly activated toward certain cleavage sites by localization to specific RNA elements within the viral replication center, providing a mechanism for regulating viral polyprotein processing.
The recognition of cis-regulatory RNA motifs in human transcripts by RNA binding proteins (RBPs) is essential for gene regulation. The molecular features that determine RBP specificity are often ...poorly understood. Here, we combined NMR structural biology with high-throughput iCLIP approaches to identify a regulatory mechanism for U2AF2 RNA recognition. We found that the intrinsically disordered linker region connecting the two RNA recognition motif (RRM) domains of U2AF2 mediates autoinhibitory intramolecular interactions to reduce nonproductive binding to weak Py-tract RNAs. This proofreading favors binding of U2AF2 at stronger Py-tracts, as required to define 3′ splice sites at early stages of spliceosome assembly. Mutations that impair the linker autoinhibition enhance the affinity for weak Py-tracts result in promiscuous binding of U2AF2 along mRNAs and impact on splicing fidelity. Our findings highlight an important role of intrinsically disordered linkers to modulate RNA interactions of multidomain RBPs.
Protein‐RNA interactions play vital roles in plethora of biological processes such as regulation of gene expression, protein synthesis, mRNA processing and biogenesis. Identification of RNA‐binding ...residues (RBRs) in proteins is essential to understand RNA‐mediated protein functioning, to perform site‐directed mutagenesis and to develop novel targeted drug therapies. Moreover, the extensive gap between sequence and structural data restricts the identification of binding sites in unsolved structures. However, efficient use of computational methods demanding only sequence to identify binding residues can bridge this huge sequence‐structure gap. In this study, we have extensively studied protein‐RNA interface in known RNA‐binding proteins (RBPs). We find that the interface is highly enriched in basic and polar residues with Gly being the most common interface neighbor. We investigated several amino acid features and developed a method to predict putative RBRs from amino acid sequence. We have implemented balanced random forest (BRF) classifier with local residue features of protein sequences for prediction. With 5‐fold cross‐validations, the sequence pattern derived dipeptide composition based BRF model (DCP‐BRF) resulted in an accuracy of 87.9%, specificity of 88.8%, sensitivity of 82.2%, Mathew's correlation coefficient of 0.60 and AUC of 0.93, performing better than few existing methods. We further validated our prediction model on known human RBPs through RBR prediction and could map ~54% of them. Further, knowledge of binding site preferences obtained from computational predictions combined with experimental validations of potential RNA binding sites can enhance our understanding of protein‐RNA interactions. This may serve to accelerate investigations on functional roles of many novel RBPs.
•Evolution of the CHIKV 3′ UTR is shaped by fitness concerns in different hosts.•The 5′ UTR can antagonize host innate immune defenses.•3′ UTR interactions with miRNAs determine cellular tropism and ...disease pathogenesis.•Viral RNA stability is mediated by cellular HuR protein interaction with the 3′ UTR.
The non-coding regions found at the 5′ and 3′ ends of alphavirus genomes regulate viral gene expression, replication, translation and virus–host interactions, which have significant implications for viral evolution, host range, and pathogenesis. The functions of these non-coding regions are mediated by a combination of linear sequence and structural elements. The capped 5′ untranslated region (UTR) contains promoter elements, translational regulatory sequences that modulate dependence on cellular translation factors, and structures that help to avoid innate immune defenses. The polyadenylated 3′ UTR contains highly conserved sequence elements for viral replication, binding sites for cellular miRNAs that determine cell tropism, host range, and pathogenesis, and conserved binding regions for a cellular protein that influences viral RNA stability. Nonetheless, there are additional conserved elements in non-coding regions of the virus (e.g., the repeated sequence elements in the 3′ UTR) whose function remains obscure. Thus, key questions remain as to the function of these short yet influential untranslated segments of alphavirus RNAs.
Proteins and RNA functionally and physically intersect in multiple biological processes, however, currently no universal method is available to purify protein-RNA complexes. Here, we introduce XRNAX, ...a method for the generic purification of protein-crosslinked RNA, and demonstrate its versatility to study the composition and dynamics of protein-RNA interactions by various transcriptomic and proteomic approaches. We show that XRNAX captures all RNA biotypes and use this to characterize the sub-proteomes that interact with coding and non-coding RNAs (ncRNAs) and to identify hundreds of protein-RNA interfaces. Exploiting the quantitative nature of XRNAX, we observe drastic remodeling of the RNA-bound proteome during arsenite-induced stress, distinct from autophagy-related changes in the total proteome. In addition, we combine XRNAX with crosslinking immunoprecipitation sequencing (CLIP-seq) to validate the interaction of ncRNA with lamin B1 and EXOSC2. Thus, XRNAX is a resourceful approach to study structural and compositional aspects of protein-RNA interactions to address fundamental questions in RNA-biology.
Display omitted
•XRNAX purifies protein-crosslinked RNA of all biotypes from UV-crosslinked cells•Discovery of the WKF RNA-binding domain•Discovery of more than 700 proteins interacting with non-polyadenylated RNA•Profiling of stress-induced changes in RNA-binding proteomes
A general approach for characterizing cellular RNA-protein interactions allows examination of dynamic changes to the RNA-bound proteome.
Display omitted
•RNA-binding proteins are generally enriched in intrinsic disorder.•mRNA-, rRNA- and snRNA-binding proteins are enriched in disorder.•tRNA-binding proteins are depleted in ...disorder.•Disorder is rarely utilized in the RNA-binding regions.•Disorder in RNA-binding proteins facilitates DNA interactions and harbors PTM sites.
Although RNA-binding proteins (RBPs) are known to be enriched in intrinsic disorder, no previous analysis focused on RBPs interacting with specific RNA types. We fill this gap with a comprehensive analysis of the putative disorder in RBPs binding to six common RNA types: messenger RNA (mRNA), transfer RNA (tRNA), small nuclear RNA (snRNA), non-coding RNA (ncRNA), ribosomal RNA (rRNA), and internal ribosome RNA (irRNA). We also analyze the amount of putative intrinsic disorder in the RNA-binding domains (RBDs) and non-RNA-binding-domain regions (non-RBD regions). Consistent with previous studies, we show that in comparison with human proteome, RBPs are significantly enriched in disorder. However, closer examination finds significant enrichment in predicted disorder for the mRNA-, rRNA- and snRNA-binding proteins, while the proteins that interact with ncRNA and irRNA are not enriched in disorder, and the tRNA-binding proteins are significantly depleted in disorder. We show a consistent pattern of significant disorder enrichment in the non-RBD regions coupled with low levels of disorder in RBDs, which suggests that disorder is relatively rarely utilized in the RNA-binding regions. Our analysis of the non-RBD regions suggests that disorder harbors posttranslational modification sites and is involved in the putative interactions with DNA. Importantly, we utilize experimental data from DisProt and independent data from Pfam to validate the above observations that rely on the disorder predictions. This study provides new insights into the distribution of disorder across proteins that bind different RNA types and the functional role of disorder in the regions where it is enriched.
It is a well‐known fact that RNA is the target of a plethora of modifications which currently amount to over a hundred. The vast majority of these modifications was observed in the two most abundant ...classes of RNA, rRNA and tRNA. With the recent advance in mapping technologies, modifications have been discovered also in mRNA and in less abundant non‐coding RNA species. These developments have sparked renewed interest in elucidating the nature and functions of those “epitransciptomic” modifications in RNA. N6‐methyladenosine (m6A) is the best understood and most frequent mark of mRNA with demonstrated functions ranging from pre‐mRNA processing, translation, miRNA biogenesis to mRNA decay. By contrast, much less research has been conducted on 5‐methylcytosine (m5C), which was detected in tRNAs and rRNAs and more recently in poly(A)RNAs. In this review, we discuss recent developments in the discovery of m5C RNA methylomes, the functions of m5C as well as the proteins installing, translating and manipulating this modification. Although our knowledge about m5C in RNA transcripts is just beginning to consolidate, it has become clear that cytosine methylation represents a powerful mechanistic strategy to regulate cellular processes on an epitranscriptomic level.
This article is categorized under:
RNA Processing > RNA Editing and Modification
RNA Interactions with Proteins and Other Molecules > Protein–RNA Interactions: Functional Implications
RNA Processing > tRNA Processing
RNA Turnover and Surveillance > Regulation of RNA Stability
5‐methylcytosine is emerging as an important epitranscriptomic mark of RNA. Various RNA methyltransferases that reside at different locations in the cell install this mark on a variety of RNA types. These range from cytoplasmic and mitochondrial rRNA and tRNA to mRNA and to various other non‐coding RNAs. Similar to the mulitude of targets, many if not most aspects of RNA metabolism may be affected by the m5C mark.
Head and neck squamous cell carcinoma (HNSCC) constitute approximately 4% of all cancers worldwide. In this study, we analyzed the expression profile of the long noncoding RNA (lncRNA) of 502 HNSCC ...patients from The Cancer Genome Atlas database. Among the differentially expressed lncRNAs between HNSCC and normal samples, LNCAROD is overexpressed in HNSCC and associated with advanced T stage and shortened overall survival. The N6‐methyladenosine (m6A) modification mediated by METTL3 and METTL14 enhanced the stability of LNCAROD in HNSCC cells. Depletion of LNCAROD attenuated cell proliferation, mobility in vitro, and tumorigenicity in vivo, whereas overexpression of LNCAROD exerted opposite effects. LNCAROD is mainly distributed in nucleus and binds with YBX1 and HSPA1A proteins. Silencing either YBX1 or HSPA1A did not affect the level of LNCAROD. However, loss of LNCAROD led to shortened half‐life of YBX1 protein. Mechanistically, LNCAROD protected YBX1 from proteasomal degradation by facilitating YBX1‐HSPA1A protein–protein interaction. Depletion of HSPA1A in LNCAROD‐overexpressing cells resulted in accelerated proteasomal degradation of YBX1 protein. Moreover, re‐expression of Flag‐YBX1 in LNCAROD‐silenced cells rescued malignant behavior of HNSCC cells. Our study indicates that LNCAROD is an oncogenic lncRNA and dysregulation of m6A modification might account for aberrant expression of LNCAROD in HNSCC. LNCAROD acts as a scaffold for the interaction between YBX1 and HSPA1A, preventing proteasomal degradation of YBX1 in HNSCC cells.
Dysregulation of long noncoding RNAs (lncRNAs) is implicated in cancer development and progression. In this study, Ban et al. demonstrated that an oncogenic lncRNA, LNCAROD, is stabilized by m6A methylation and overexpressed in head and neck squamous cell carcinoma (HNSCC). LNCAROD forms a ternary complex with HSPA1A and YBX1, preventing proteasomal degradation of YBX1. This study provides a new paradigm of the mechanisms of lncRNAs in HNSCC development.