Ribosome profiling has revealed pervasive but largely uncharacterized translation outside of canonical coding sequences (CDSs). In this work, we exploit a systematic CRISPR-based screening strategy ...to identify hundreds of noncanonical CDSs that are essential for cellular growth and whose disruption elicits specific, robust transcriptomic and phenotypic changes in human cells. Functional characterization of the encoded microproteins reveals distinct cellular localizations, specific protein binding partners, and hundreds of microproteins that are presented by the human leukocyte antigen system. We find multiple microproteins encoded in upstream open reading frames, which form stable complexes with the main, canonical protein encoded on the same messenger RNA, thereby revealing the use of functional bicistronic operons in mammals. Together, our results point to a family of functional human microproteins that play critical and diverse cellular roles.
Genome editing is a method for making targeted sequence changes to the genomes of living cells. Prime editing, recently reported by Anzalone et al. (2019), is a new technology that uses reverse ...transcription to “write” programmed sequence changes into genomic DNA and thus promises significant technical advances.
Genome editing is a method for making targeted sequence changes to the genomes of living cells. Prime editing, recently reported by Anzalone et al. (2019), is a new technology that uses reverse transcription to “write” programmed sequence changes into genomic DNA and thus promises significant technical advances.
While the catalog of mammalian transcripts and their expression levels in different cell types and disease states is rapidly expanding, our understanding of transcript function lags behind. We ...present a robust technology enabling systematic investigation of the cellular consequences of repressing or inducing individual transcripts. We identify rules for specific targeting of transcriptional repressors (CRISPRi), typically achieving 90%–99% knockdown with minimal off-target effects, and activators (CRISPRa) to endogenous genes via endonuclease-deficient Cas9. Together they enable modulation of gene expression over a ∼1,000-fold range. Using these rules, we construct genome-scale CRISPRi and CRISPRa libraries, each of which we validate with two pooled screens. Growth-based screens identify essential genes, tumor suppressors, and regulators of differentiation. Screens for sensitivity to a cholera-diphtheria toxin provide broad insights into the mechanisms of pathogen entry, retrotranslocation and toxicity. Our results establish CRISPRi and CRISPRa as powerful tools that provide rich and complementary information for mapping complex pathways.
Display omitted
•CRISPRi and CRISPRa provide complementary information for mapping complex pathways•CRISPRi/a expression series (up to ∼1,000-fold) reveal how gene dose controls function•CRISPRi provides strong (typically 90%–99%) knockdown with minimal off-target effects•Genome-scale screens elucidate pathways controlling cholera/diphtheria toxicity
Genome-scale-specific targeting of transcriptional repressors (CRISPRi) and activators (CRISPRa) to endogenous genes via endonuclease-deficient Cas9 have been applied to growth and toxin-resistance screens, establishing CRISPRi and CRISPRa as powerful tools that provide rich and complementary information.
Cells repair DNA double-strand breaks (DSBs) through a complex set of pathways critical for maintaining genomic integrity. To systematically map these pathways, we developed a high-throughput ...screening approach called Repair-seq that measures the effects of thousands of genetic perturbations on mutations introduced at targeted DNA lesions. Using Repair-seq, we profiled DSB repair products induced by two programmable nucleases (Cas9 and Cas12a) in the presence or absence of oligonucleotides for homology-directed repair (HDR) after knockdown of 476 genes involved in DSB repair or associated processes. The resulting data enabled principled, data-driven inference of DSB end joining and HDR pathways. Systematic interrogation of this data uncovered unexpected relationships among DSB repair genes and demonstrated that repair outcomes with superficially similar sequence architectures can have markedly different genetic dependencies. This work provides a foundation for mapping DNA repair pathways and for optimizing genome editing across diverse modalities.
Display omitted
•Repair-seq maps the genetic dependencies of DNA repair outcomes•High-resolution signatures of gene function identify unexpected gene relationships•DSB-induced mutations with similar sequences can result from distinct mechanisms•Repair-seq can be adapted to study a broad range of genome editing tools
Measuring the effects of many genetic perturbations on the spectrum of mutations produced at targeted DNA breaks allows systematic mapping of DNA repair pathways.
Repair of DNA double-strand breaks is critical to genomic stability and the prevention of developmental disorders and cancer. A central pathway for this repair is homologous recombination (HR). Most ...knowledge of HR is derived from work in prokaryotic and eukaryotic model organisms. We carried out a genome-wide siRNA-based screen in human cells. Among positive regulators of HR we identified networks of DNA-damage-response and pre-mRNA-processing proteins, and among negative regulators we identified a phosphatase network. Three candidate proteins localized to DNA lesions, including RBMX, a heterogeneous nuclear ribonucleoprotein that has a role in alternative splicing. RBMX accumulated at DNA lesions through multiple domains in a poly(ADP-ribose) polymerase 1-dependent manner and promoted HR by facilitating proper BRCA2 expression. Our screen also revealed that off-target depletion of RAD51 is a common source of RNAi false positives, raising a cautionary note for siRNA screens and RNAi-based studies of HR.
Programmable C•G-to-G•C base editors (CGBEs) have broad scientific and therapeutic potential, but their editing outcomes have proved difficult to predict and their editing efficiency and product ...purity are often low. We describe a suite of engineered CGBEs paired with machine learning models to enable efficient, high-purity C•G-to-G•C base editing. We performed a CRISPR interference (CRISPRi) screen targeting DNA repair genes to identify factors that affect C•G-to-G•C editing outcomes and used these insights to develop CGBEs with diverse editing profiles. We characterized ten promising CGBEs on a library of 10,638 genomically integrated target sites in mammalian cells and trained machine learning models that accurately predict the purity and yield of editing outcomes (R = 0.90) using these data. These CGBEs enable correction to the wild-type coding sequence of 546 disease-related transversion single-nucleotide variants (SNVs) with >90% precision (mean 96%) and up to 70% efficiency (mean 14%). Computational prediction of optimal CGBE-single-guide RNA pairs enables high-purity transversion base editing at over fourfold more target sites than achieved using any single CGBE variant.
Functional genomics efforts face tradeoffs between number of perturbations examined and complexity of phenotypes measured. We bridge this gap with Perturb-seq, which combines droplet-based ...single-cell RNA-seq with a strategy for barcoding CRISPR-mediated perturbations, allowing many perturbations to be profiled in pooled format. We applied Perturb-seq to dissect the mammalian unfolded protein response (UPR) using single and combinatorial CRISPR perturbations. Two genome-scale CRISPR interference (CRISPRi) screens identified genes whose repression perturbs ER homeostasis. Subjecting ∼100 hits to Perturb-seq enabled high-precision functional clustering of genes. Single-cell analyses decoupled the three UPR branches, revealed bifurcated UPR branch activation among cells subject to the same perturbation, and uncovered differential activation of the branches across hits, including an isolated feedback loop between the translocon and IRE1α. These studies provide insight into how the three sensors of ER homeostasis monitor distinct types of stress and highlight the ability of Perturb-seq to dissect complex cellular responses.
Display omitted
•Perturb-seq allows parallel screening with rich phenotypic output from single cells•Simultaneous delivery and identification of up to three CRISPR perturbations•Genome-scale screens dissect the mammalian unfolded protein response•Analytical methods separate perturbation responses from confounding effects
A strategy for barcoding CRISPR-mediated perturbations allows pooled expression profiling via single-cell RNA sequencing. Application to the mammalian unfolded protein response then enabled systematic delineation of the transcriptional arms of the response and functional clustering of genes affecting ER homeostasis.
We recently found that nucleosomes directly block access of CRISPR/Cas9 to DNA (Horlbeck et al., 2016). Here, we build on this observation with a comprehensive algorithm that incorporates chromatin, ...position, and sequence features to accurately predict highly effective single guide RNAs (sgRNAs) for targeting nuclease-dead Cas9-mediated transcriptional repression (CRISPRi) and activation (CRISPRa). We use this algorithm to design next-generation genome-scale CRISPRi and CRISPRa libraries targeting human and mouse genomes. A CRISPRi screen for essential genes in K562 cells demonstrates that the large majority of sgRNAs are highly active. We also find CRISPRi does not exhibit any detectable non-specific toxicity recently observed with CRISPR nuclease approaches. Precision-recall analysis shows that we detect over 90% of essential genes with minimal false positives using a compact 5 sgRNA/gene library. Our results establish CRISPRi and CRISPRa as premier tools for loss- or gain-of-function studies and provide a general strategy for identifying Cas9 target sites.
Seminal yeast studies have established the value of comprehensively mapping genetic interactions (GIs) for inferring gene function. Efforts in human cells using focused gene sets underscore the ...utility of this approach, but the feasibility of generating large-scale, diverse human GI maps remains unresolved. We developed a CRISPR interference platform for large-scale quantitative mapping of human GIs. We systematically perturbed 222,784 gene pairs in two cancer cell lines. The resultant maps cluster functionally related genes, assigning function to poorly characterized genes, including TMEM261, a new electron transport chain component. Individual GIs pinpoint unexpected relationships between pathways, exemplified by a specific cholesterol biosynthesis intermediate whose accumulation induces deoxynucleotide depletion, causing replicative DNA damage and a synthetic-lethal interaction with the ATR/9-1-1 DNA repair pathway. Our map provides a broad resource, establishes GI maps as a high-resolution tool for dissecting gene function, and serves as a blueprint for mapping the genetic landscape of human cells.
Display omitted
•Genetic interaction (GI) mapping enables elucidation of human gene function•Large-scale, diverse maps of 222,784 gene pairs reveal buffering and synthetic GIs•Clustering of GIs identifies novel members of functional complexes•Specific GIs can define the physiological impact of biosynthetic metabolites
A large-scale genetic interaction map in human cells reveals unexpected interdependencies between core pathways and exposes potential combination therapies for cancer.
Single-cell CRISPR screens enable the exploration of mammalian gene function and genetic regulatory networks. However, use of this technology has been limited by reliance on indirect indexing of ...single-guide RNAs (sgRNAs). Here we present direct-capture Perturb-seq, a versatile screening approach in which expressed sgRNAs are sequenced alongside single-cell transcriptomes. Direct-capture Perturb-seq enables detection of multiple distinct sgRNA sequences from individual cells and thus allows pooled single-cell CRISPR screens to be easily paired with combinatorial perturbation libraries that contain dual-guide expression vectors. We demonstrate the utility of this approach for high-throughput investigations of genetic interactions and, leveraging this ability, dissect epistatic interactions between cholesterol biogenesis and DNA repair. Using direct capture Perturb-seq, we also show that targeting individual genes with multiple sgRNAs per cell improves efficacy of CRISPR interference and activation, facilitating the use of compact, highly active CRISPR libraries for single-cell screens. Last, we show that hybridization-based target enrichment permits sensitive, specific sequencing of informative transcripts from single-cell RNA-seq experiments.