Ribosome profiling has revealed pervasive but largely uncharacterized translation outside of canonical coding sequences (CDSs). In this work, we exploit a systematic CRISPR-based screening strategy ...to identify hundreds of noncanonical CDSs that are essential for cellular growth and whose disruption elicits specific, robust transcriptomic and phenotypic changes in human cells. Functional characterization of the encoded microproteins reveals distinct cellular localizations, specific protein binding partners, and hundreds of microproteins that are presented by the human leukocyte antigen system. We find multiple microproteins encoded in upstream open reading frames, which form stable complexes with the main, canonical protein encoded on the same messenger RNA, thereby revealing the use of functional bicistronic operons in mammals. Together, our results point to a family of functional human microproteins that play critical and diverse cellular roles.
A fundamental goal of genomics is to identify the complete set of expressed proteins. Automated annotation strategies rely on assumptions about protein-coding sequences (CDSs), e.g., they are ...conserved, do not overlap, and exceed a minimum length. However, an increasing number of newly discovered proteins violate these rules. Here we present an experimental and analytical framework, based on ribosome profiling and linear regression, for systematic identification and quantification of translation. Application of this approach to lipopolysaccharide-stimulated mouse dendritic cells and HCMV-infected human fibroblasts identifies thousands of novel CDSs, including micropeptides and variants of known proteins, that bear the hallmarks of canonical translation and exhibit translation levels and dynamics comparable to that of annotated CDSs. Remarkably, many translation events are identified in both mouse and human cells even when the peptide sequence is not conserved. Our work thus reveals an unexpected complexity to mammalian translation suited to provide both conserved regulatory or protein-based functions.
Display omitted
•ORF-RATER robustly identifies and quantifies translation from ribosome profiling data•ORF-RATER reveals thousands of novel micropeptides and variants of mammalian proteins•Hundreds of novel CDSs show evidence of protein-coding conservation among mammals•Many ORFs are translated in both mice and humans but lack protein-coding conservation
Fields et al. describe a ribosome profiling-based approach for empirical annotation of protein-coding regions of the genome. Of the thousands of previously unknown translated ORFs they identify in mouse and human, many are conserved or dynamically regulated. Surprisingly, a considerable subset is translated in both species despite weak sequence conservation.
We recently found that nucleosomes directly block access of CRISPR/Cas9 to DNA (Horlbeck et al., 2016). Here, we build on this observation with a comprehensive algorithm that incorporates chromatin, ...position, and sequence features to accurately predict highly effective single guide RNAs (sgRNAs) for targeting nuclease-dead Cas9-mediated transcriptional repression (CRISPRi) and activation (CRISPRa). We use this algorithm to design next-generation genome-scale CRISPRi and CRISPRa libraries targeting human and mouse genomes. A CRISPRi screen for essential genes in K562 cells demonstrates that the large majority of sgRNAs are highly active. We also find CRISPRi does not exhibit any detectable non-specific toxicity recently observed with CRISPR nuclease approaches. Precision-recall analysis shows that we detect over 90% of essential genes with minimal false positives using a compact 5 sgRNA/gene library. Our results establish CRISPRi and CRISPRa as premier tools for loss- or gain-of-function studies and provide a general strategy for identifying Cas9 target sites.
How the immune system readies for battleAlthough gene expression is tightly controlled at both the RNA and protein levels, the quantitative contribution of each step, especially during dynamic ...responses, remains largely unknown. Indeed, there has been much debate whether changes in RNA level contribute substantially to protein-level regulation. Jovanovic et al. built a genome-scale model of the temporal dynamics of differential protein expression during the stimulation of immunological dendritic cells (see the Perspective by Li and Biggin). Newly stimulated functions involved the up-regulation of specific RNAs and concomitant increases in the levels of the proteins they encode, whereas housekeeping functions were regulated posttranscriptionally at the protein level.Science, this issue 10.1126/science.1259038; see also p. 1066 Protein expression is regulated by the production and degradation of messenger RNAs (mRNAs) and proteins, but their specific relationships remain unknown. We combine measurements of protein production and degradation and mRNA dynamics so as to build a quantitative genomic model of the differential regulation of gene expression in lipopolysaccharide-stimulated mouse dendritic cells. Changes in mRNA abundance play a dominant role in determining most dynamic fold changes in protein levels. Conversely, the preexisting proteome of proteins performing basic cellular functions is remodeled primarily through changes in protein production or degradation, accounting for more than half of the absolute change in protein molecules in the cell. Thus, the proteome is regulated by transcriptional induction for newly activated cellular functions and by protein life-cycle changes for remodeling of preexisting functions.
Anti-Brownian electrokinetic traps have been used to trap and study the free-solution dynamics of large protein complexes and long chains of DNA. Small molecules in solution have thus far proved too ...mobile to trap by any means. Here we explore the ultimate limits on trapping single molecules. We developed a feedback-based anti-Brownian electrokinetic trap in which classical thermal noise is compensated to the maximal extent allowed by quantum measurement noise. We trapped single fluorophores with a molecular weight of < 1 kDa and a hydrodynamic radius of 6.7 Ã for longer than one second, in aqueous buffer at room temperature. This achievement represents an 800-fold decrease in the mass of objects trapped in solution, and opens the possibility to trap and manipulate any soluble molecule that can be fluorescently labeled. To illustrate the use of this trap, we studied the binding of unlabeled RecA to fluorescently labeled single-stranded DNA. Binding of RecA induced changes in the DNA diffusion coefficient, electrophoretic mobility, and brightness, all of which were measured simultaneously and on a molecule-by-molecule basis. This device greatly extends the size range of molecules that can be studied by room temperature feedback trapping, and opens the door to further studies of the binding of unmodified proteins to DNA in free solution.
In the Circulating Cell-free Genome Atlas (NCT02889978) substudy 1, we evaluate several approaches for a circulating cell-free DNA (cfDNA)-based multi-cancer early detection (MCED) test by defining ...clinical limit of detection (LOD) based on circulating tumor allele fraction (cTAF), enabling performance comparisons. Among 10 machine-learning classifiers trained on the same samples and independently validated, when evaluated at 98% specificity, those using whole-genome (WG) methylation, single nucleotide variants with paired white blood cell background removal, and combined scores from classifiers evaluated in this study show the highest cancer signal detection sensitivities. Compared with clinical stage and tumor type, cTAF is a more significant predictor of classifier performance and may more closely reflect tumor biology. Clinical LODs mirror relative sensitivities for all approaches. The WG methylation feature best predicts cancer signal origin. WG methylation is the most promising technology for MCED and informs development of a targeted methylation MCED test.
Display omitted
•Clinical LOD is a useful benchmark to assess cfDNA-based test performance•cTAF accounts for cfDNA cancer signal variation across cancer types and stages•cfDNA methylation was the most promising genomic feature for cancer signal detection•The results informed the development of a cfDNA-based multi-cancer early detection test
Jamshidi et al. compare several approaches for circulating cell-free DNA (cfDNA)-based multi-cancer early detection (MCED) tests. A whole-genome methylation-based approach has the best performance among those evaluated. In addition, they define a metric—clinical limit of detection (LOD)—based on tumor fraction to enable future comparison of cfDNA-based tests.
Genomic analyses in budding yeast have helped define the foundational principles of eukaryotic gene expression. However, in the absence of empirical methods for defining coding regions, these ...analyses have historically excluded specific classes of possible coding regions, such as those initiating at non-AUG start codons. Here, we applied an experimental approach to globally annotate translation initiation sites in yeast and identified 149 genes with alternative N-terminally extended protein isoforms initiating from near-cognate codons upstream of annotated AUG start codons. These isoforms are produced in concert with canonical isoforms and translated with high specificity, resulting from initiation at only a small subset of possible start codons. The non-AUG initiation driving their production is enriched during meiosis and induced by low eIF5A, which is seen in this context. These findings reveal widespread production of non-canonical protein isoforms and unexpected complexity to the rules by which even a simple eukaryotic genome is decoded.
Display omitted
•TIS-profiling reveals widespread translation of non-canonical ORFs in budding yeast•Production of non-AUG-initiated extended isoforms is prevalent and inefficient•A small subset of possible near-cognate sites is used for translation initiation•eIF5A-based regulation allows conditional unmasking of non-AUG initiation in meiosis
Eisenberg et al. identify translation initiation sites genome-wide in budding yeast. They define a class of 149 genes with alternate extended isoforms that initiate at non-AUG start codons. These isoforms are produced in concert with their corresponding canonical isoforms but typically at lower abundance, and are selectively induced during meiosis.
Fluorescence imaging is used to study the dynamics of a wide variety of single molecules in solution or attached to a surface. Two key challenges in this pursuit are (1) to image immobilized single ...molecules in the presence of a high level of fluorescent background and (2) to image freely diffusing single molecules for long times. Strategies that perform well by one measure often perform poorly by the other. Here, we present a simple modification to a wide-field fluorescence microscope that addresses both challenges and dramatically improves single-molecule imaging. The technique of convex lens-induced confinement (CLIC) restricts molecules to a wedge-shaped gap of nanoscale depth, formed between a plano-convex lens and a planar coverslip. The shallow depth of the imaging volume leads to 20-fold greater rejection of background fluorescence than is achieved with total internal reflection fluorescence (TIRF) imaging. Elimination of out-of-plane diffusion leads to an approximately 10,000-fold longer diffusion-limited observation time per molecule than is achieved with confocal fluorescence correlation spectroscopy. The CLIC system also provides a new means to determine molecular size. The CLIC system does not require any nanofabrication, nor any custom optics, electronics, or computer control.
The bending stiffness of double-stranded DNA (dsDNA) at high curvatures is fundamental to its biological activity, yet this regime has been difficult to probe experimentally, and literature results ...have not been consistent. We created a 'molecular vise' in which base-pairing interactions generated a compressive force on sub-persistence length segments of dsDNA. Short dsDNA strands (<41 base pairs) resisted this force and remained straight; longer strands became bent, a phenomenon called 'Euler buckling'. We monitored the buckling transition via Förster Resonance Energy Transfer (FRET) between appended fluorophores. For low-to-moderate concentrations of monovalent salt (up to ∼150 mM), our results are in quantitative agreement with the worm-like chain (WLC) model of DNA elasticity, without the need to invoke any 'kinked' states. Greater concentrations of monovalent salts or 1 mM Mg(2+) induced an apparent softening of the dsDNA, which was best accounted for by a kink in the region of highest curvature. We tested the effects of all single-nucleotide mismatches on the DNA bending. Remarkably, the propensity to kink correlated with the thermodynamic destabilization of the mismatched DNA relative the perfectly complementary strand, suggesting that the kinked state is locally melted. The molecular vise is exquisitely sensitive to the sequence-dependent linear and nonlinear elastic properties of dsDNA.
Optimal tracking of a Brownian particle Fields, Alexander P; Cohen, Adam E
Optics express,
2012-Sep-24, 2012-09-24, 20120924, Volume:
20, Issue:
20
Journal Article
Peer reviewed
Open access
Optical tracking of a fluorescent particle in solution faces fundamental constraints due to Brownian motion, diffraction, and photon shot noise. Background photons and imperfect tracking apparatus ...further degrade tracking precision. Here we use a model of particle motion to combine information from multiple time-points to improve the localization precision. We derive successive approximations that enable real-time particle tracking with well controlled tradeoffs between precision and computational cost. We present the theory in the context of feedback electrokinetic trapping, though the results apply to optical tracking of any particle subject to diffusion and drift. We use numerical simulations and experimental data to validate the algorithms' performance.