RNA has a dual role as an informational molecule and a direct effector of biological tasks. The latter function is enabled by RNA's ability to adopt complex secondary and tertiary folds and thus has ...motivated extensive computational and experimental efforts for determining RNA structures. Existing approaches for evaluating RNA structure have been largely limited to in vitro systems, yet the thermodynamic forces which drive RNA folding in vitro may not be sufficient to predict stable RNA structures in vivo. Indeed, the presence of RNA-binding proteins and ATP-dependent helicases can influence which structures are present inside cells. Here we present an approach for globally monitoring RNA structure in native conditions in vivo with single-nucleotide precision. This method is based on in vivo modification with dimethyl sulphate (DMS), which reacts with unpaired adenine and cytosine residues, followed by deep sequencing to monitor modifications. Our data from yeast and mammalian cells are in excellent agreement with known messenger RNA structures and with the high-resolution crystal structure of the Saccharomyces cerevisiae ribosome. Comparison between in vivo and in vitro data reveals that in rapidly dividing cells there are vastly fewer structured mRNA regions in vivo than in vitro. Even thermostable RNA structures are often denatured in cells, highlighting the importance of cellular processes in regulating RNA structure. Indeed, analysis of mRNA structure under ATP-depleted conditions in yeast shows that energy-dependent processes strongly contribute to the predominantly unfolded state of mRNAs inside cells. Our studies broadly enable the functional analysis of physiological RNA structures and reveal that, in contrast to the Anfinsen view of protein folding whereby the structure formed is the most thermodynamically favourable, thermodynamics have an incomplete role in determining mRNA structure in vivo.
SARS-CoV-2 is a betacoronavirus with a single-stranded, positive-sense, 30-kilobase RNA genome responsible for the ongoing COVID-19 pandemic. Although population average structure models of the ...genome were recently reported, there is little experimental data on native structural ensembles, and most structures lack functional characterization. Here we report secondary structure heterogeneity of the entire SARS-CoV-2 genome in two lines of infected cells at single nucleotide resolution. Our results reveal alternative RNA conformations across the genome and at the critical frameshifting stimulation element (FSE) that are drastically different from prevailing population average models. Importantly, we find that this structural ensemble promotes frameshifting rates much higher than the canonical minimal FSE and similar to ribosome profiling studies. Our results highlight the value of studying RNA in its full length and cellular context. The genomic structures detailed here lay groundwork for coronavirus RNA biology and will guide the design of SARS-CoV-2 RNA-based therapeutics.
Bacterial mRNAs are organized into operons consisting of discrete open reading frames (ORFs) in a single polycistronic mRNA. Individual ORFs on the mRNA are differentially translated, with rates ...varying as much as 100-fold. The signals controlling differential translation are poorly understood. Our genome-wide mRNA secondary structure analysis indicated that operonic mRNAs are comprised of ORF-wide units of secondary structure that vary across ORF boundaries such that adjacent ORFs on the same mRNA molecule are structurally distinct. ORF translation rate is strongly correlated with its mRNA structure in vivo, and correlation persists, albeit in a reduced form, with its structure when translation is inhibited and with that of in vitro refolded mRNA. These data suggest that intrinsic ORF mRNA structure encodes a rough blueprint for translation efficiency. This structure is then amplified by translation, in a self-reinforcing loop, to provide the structure that ultimately specifies the translation of each ORF.
Human immunodeficiency virus 1 (HIV-1) is a retrovirus with a ten-kilobase single-stranded RNA genome. HIV-1 must express all of its gene products from a single primary transcript, which undergoes ...alternative splicing to produce diverse protein products that include structural proteins and regulatory factors
. Despite the critical role of alternative splicing, the mechanisms that drive the choice of splice site are poorly understood. Synonymous RNA mutations that lead to severe defects in splicing and viral replication indicate the presence of unknown cis-regulatory elements
. Here we use dimethyl sulfate mutational profiling with sequencing (DMS-MaPseq) to investigate the structure of HIV-1 RNA in cells, and develop an algorithm that we name 'detection of RNA folding ensembles using expectation-maximization' (DREEM), which reveals the alternative conformations that are assumed by the same RNA sequence. Contrary to previous models that have analysed population averages
, our results reveal heterogeneous regions of RNA structure across the entire HIV-1 genome. In addition to confirming that in vitro characterized
alternative structures for the HIV-1 Rev responsive element also exist in cells, we discover alternative conformations at critical splice sites that influence the ratio of transcript isoforms. Our simultaneous measurement of splicing and intracellular RNA structure provides evidence for the long-standing hypothesis
that heterogeneity in RNA conformation regulates splice-site use and viral gene expression.
Ribosome profiling data report on the distribution of translating ribosomes, at steady‐state, with codon‐level resolution. We present a robust method to extract codon translation rates and protein ...synthesis rates from these data, and identify causal features associated with elongation and translation efficiency in physiological conditions in yeast. We show that neither elongation rate nor translational efficiency is improved by experimental manipulation of the abundance or body sequence of the rare AGG tRNA. Deletion of three of the four copies of the heavily used ACA tRNA shows a modest efficiency decrease that could be explained by other rate‐reducing signals at gene start. This suggests that correlation between codon bias and efficiency arises as selection for codons to utilize translation machinery efficiently in highly translated genes. We also show a correlation between efficiency and RNA structure calculated both computationally and from recent structure probing data, as well as the Kozak initiation motif, which may comprise a mechanism to regulate initiation.
Synopsis
Ribosome profiling experiments in wild‐type yeast and in mutants with altered tRNA levels illustrate that neither elongation rate nor translational efficiency is affected by tRNA abundance under physiological conditions.
A novel statistical model provides robust inference of codon translation rates and protein synthesis rates and hence better measures translation efficiency.
Codon translation rates have insignificant correlation with measures of codon bias.
Direct experimental manipulation of tRNA abundance does not affect elongation rates on affected codons or translation efficiency of overall genes.
Other sequence signals, such as mRNA structure and an initiation sequence motif, correlate to translation efficiency and may be causal determinants.
Ribosome profiling experiments in wild‐type yeast and in mutants with altered tRNA levels illustrate that neither elongation rate nor translational efficiency is affected by tRNA abundance under physiological conditions.
The conserved transcriptional regulator heat shock factor 1 (Hsf1) is a key sensor of proteotoxic and other stress in the eukaryotic cytosol. We surveyed Hsf1 activity in a genome-wide ...loss-of-function library in Saccaromyces cerevisiae as well as ∼78,000 double mutants and found Hsf1 activity to be modulated by highly diverse stresses. These included disruption of a ribosome-bound complex we named the Ribosome Quality Control Complex (RQC) comprising the Ltn1 E3 ubiquitin ligase, two highly conserved but poorly characterized proteins (Tae2 and Rqc1), and Cdc48 and its cofactors. Electron microscopy and biochemical analyses revealed that the RQC forms a stable complex with 60S ribosomal subunits containing stalled polypeptides and triggers their degradation. A negative feedback loop regulates the RQC, and Hsf1 senses an RQC-mediated translation-stress signal distinctly from other stresses. Our work reveals the range of stresses Hsf1 monitors and elucidates a conserved cotranslational protein quality control mechanism.
Display omitted
► Comprehensive characterization of the stresses sensed by Hsf1 ► Characterization of a complex that targets ribosomes stalled at translation ► An autoregulatory loop regulates activity of the complex ► Discovery of a translation-stress signaling pathway from the ribosome to Hsf1
A ribosome-bound complex designated RQC associates with 60S ribosomal subunits containing stalled polypeptides to trigger their degradation.
Hybrid RNA:DNA origami, in which a long RNA scaffold strand folds into a target nanostructure via thermal annealing with complementary DNA oligos, has only been explored to a limited extent despite ...its unique potential for biomedical delivery of mRNA, tertiary structure characterization of long RNAs, and fabrication of artificial ribozymes. Here, we investigate design principles of three-dimensional wireframe RNA-scaffolded origami rendered as polyhedra composed of dual-duplex edges. We computationally design, fabricate, and characterize tetrahedra folded from an EGFP-encoding messenger RNA and de Bruijn sequences, an octahedron folded with M13 transcript RNA, and an octahedron and pentagonal bipyramids folded with 23S ribosomal RNA, demonstrating the ability to make diverse polyhedral shapes with distinct structural and functional RNA scaffolds. We characterize secondary and tertiary structures using dimethyl sulfate mutational profiling and cryo-electron microscopy, revealing insight into both global and local, base-level structures of origami. Our top-down sequence design strategy enables the use of long RNAs as functional scaffolds for complex wireframe origami.
Coupling of structure-specific in vivo chemical modification to next-generation sequencing is transforming RNA secondary structure studies in living cells. The dominant strategy for detecting in vivo ...chemical modifications uses reverse transcriptase truncation products, which introduce biases and necessitate population-average assessments of RNA structure. Here we present dimethyl sulfate (DMS) mutational profiling with sequencing (DMS-MaPseq), which encodes DMS modifications as mismatches using a thermostable group II intron reverse transcriptase. DMS-MaPseq yields a high signal-to-noise ratio, can report multiple structural features per molecule, and allows both genome-wide studies and focused in vivo investigations of even low-abundance RNAs. We apply DMS-MaPseq for the first analysis of RNA structure within an animal tissue and to identify a functional structure involved in noncanonical translation initiation. Additionally, we use DMS-MaPseq to compare the in vivo structure of pre-mRNAs with their mature isoforms. These applications illustrate DMS-MaPseq's capacity to dramatically expand in vivo analysis of RNA structure.
A pan-viral DNA microarray, the Virochip (University of California, San Francisco), was used to detect human parainfluenzavirus 4 (HPIV-4) infection in an immunocompetent adult presenting with a ...life-threatening acute respiratory illness. The virus was identified in an endotracheal aspirate specimen, and the microarray results were confirmed by specific polymerase chain reaction and serological analysis for HPIV-4. Conventional clinical laboratory testing using an extensive panel of microbiological tests failed to yield a diagnosis. This case suggests that the potential severity of disease caused by HPIV-4 in adults may be greater than previously appreciated and illustrates the clinical utility of a microarray for broad-based viral pathogen screening.
•DMS-MaPseq can be used to probe the secondary structure of viral RNA.•DMS-MaPseq protocol is presented for virally infected or transfected cells as well as virions.•DMS-modified RNA can be used for ...RT-PCR or whole-genome library generation.•Library generation quality control and DMS-MaPseq data for HIV-1 TAR is presented.
RNA structure is critically important to RNA viruses in every part of the replication cycle. RNA structure is also utilized by DNA viruses in order to regulate gene expression and interact with host factors. Advances in next-generation sequencing have greatly enhanced the utility of chemical probing in order to analyze RNA structure. This review will cover some recent viral RNA structural studies using chemical probing and next-generation sequencing as well as the advantages of dimethyl sulfate (DMS)-mutational profiling and sequencing (MaPseq). DMS-MaPseq is a robust assay that can easily modify RNA in vitro, in cell and in virion. A detailed protocol for whole-genome DMS-MaPseq from cells transfected with HIV-1 and the structure of TAR as determined by DMS-MaPseq is presented. DMS-MaPseq has the ability to answer a variety of integral questions about viral RNA, including how they change in different environments and when interacting with different host factors.