While the catalog of mammalian transcripts and their expression levels in different cell types and disease states is rapidly expanding, our understanding of transcript function lags behind. We ...present a robust technology enabling systematic investigation of the cellular consequences of repressing or inducing individual transcripts. We identify rules for specific targeting of transcriptional repressors (CRISPRi), typically achieving 90%–99% knockdown with minimal off-target effects, and activators (CRISPRa) to endogenous genes via endonuclease-deficient Cas9. Together they enable modulation of gene expression over a ∼1,000-fold range. Using these rules, we construct genome-scale CRISPRi and CRISPRa libraries, each of which we validate with two pooled screens. Growth-based screens identify essential genes, tumor suppressors, and regulators of differentiation. Screens for sensitivity to a cholera-diphtheria toxin provide broad insights into the mechanisms of pathogen entry, retrotranslocation and toxicity. Our results establish CRISPRi and CRISPRa as powerful tools that provide rich and complementary information for mapping complex pathways.
Display omitted
•CRISPRi and CRISPRa provide complementary information for mapping complex pathways•CRISPRi/a expression series (up to ∼1,000-fold) reveal how gene dose controls function•CRISPRi provides strong (typically 90%–99%) knockdown with minimal off-target effects•Genome-scale screens elucidate pathways controlling cholera/diphtheria toxicity
Genome-scale-specific targeting of transcriptional repressors (CRISPRi) and activators (CRISPRa) to endogenous genes via endonuclease-deficient Cas9 have been applied to growth and toxin-resistance screens, establishing CRISPRi and CRISPRa as powerful tools that provide rich and complementary information.
Genetic interaction (GI) maps, comprising pairwise measures of how strongly the function of one gene depends on the presence of a second, have enabled the systematic exploration of gene function in ...microorganisms. Here, we present a two-stage strategy to construct high-density GI maps in mammalian cells. First, we use ultracomplex pooled shRNA libraries (25 shRNAs/gene) to identify high-confidence hit genes for a given phenotype and effective shRNAs. We then construct double-shRNA libraries from these to systematically measure GIs between hits. A GI map focused on ricin susceptibility broadly recapitulates known pathways and provides many unexpected insights. These include a noncanonical role for COPI, a previously uncharacterized protein complex affecting toxin clearance, a specialized role for the ribosomal protein RPS25, and functionally distinct mammalian TRAPP complexes. The ability to rapidly generate mammalian GI maps provides a potentially transformative tool for defining gene function and designing combination therapies based on synergistic pairs.
Display omitted
► Ultracomplex shRNA library minimizes false positives/negatives in genome-wide screens ► Pooled double-shRNA strategy systematically maps genetic interactions between hits ► Application of two-step strategy identifies pathways controlling ricin susceptibility ► The resulting map uncovers functionally distinct mammalian TRAPP complexes
A high-throughput method that relies on the use of ultracomplex shRNA libraries makes it possible to create genetic interaction maps in mammalian cells. This approach will be applicable to many cellular processes and conditions, as illustrated by the discovery of distinct TRAPP complexes involved in endocytosis.
How cellular and organismal complexity emerges from combinatorial expression of genes is a central question in biology. High-content phenotyping approaches such as Perturb-seq (single-cell ...RNA-sequencing pooled CRISPR screens) present an opportunity for exploring such genetic interactions (GIs) at scale. Here, we present an analytical framework for interpreting high-dimensional landscapes of cell states (manifolds) constructed from transcriptional phenotypes. We applied this approach to Perturb-seq profiling of strong GIs mined from a growth-based, gain-of-function GI map. Exploration of this manifold enabled ordering of regulatory pathways, principled classification of GIs (e.g., identifying suppressors), and mechanistic elucidation of synergistic interactions, including an unexpected synergy between
and
driving erythroid differentiation. Finally, we applied recommender system machine learning to predict interactions, facilitating exploration of vastly larger GI manifolds.
Functional genomics efforts face tradeoffs between number of perturbations examined and complexity of phenotypes measured. We bridge this gap with Perturb-seq, which combines droplet-based ...single-cell RNA-seq with a strategy for barcoding CRISPR-mediated perturbations, allowing many perturbations to be profiled in pooled format. We applied Perturb-seq to dissect the mammalian unfolded protein response (UPR) using single and combinatorial CRISPR perturbations. Two genome-scale CRISPR interference (CRISPRi) screens identified genes whose repression perturbs ER homeostasis. Subjecting ∼100 hits to Perturb-seq enabled high-precision functional clustering of genes. Single-cell analyses decoupled the three UPR branches, revealed bifurcated UPR branch activation among cells subject to the same perturbation, and uncovered differential activation of the branches across hits, including an isolated feedback loop between the translocon and IRE1α. These studies provide insight into how the three sensors of ER homeostasis monitor distinct types of stress and highlight the ability of Perturb-seq to dissect complex cellular responses.
Display omitted
•Perturb-seq allows parallel screening with rich phenotypic output from single cells•Simultaneous delivery and identification of up to three CRISPR perturbations•Genome-scale screens dissect the mammalian unfolded protein response•Analytical methods separate perturbation responses from confounding effects
A strategy for barcoding CRISPR-mediated perturbations allows pooled expression profiling via single-cell RNA sequencing. Application to the mammalian unfolded protein response then enabled systematic delineation of the transcriptional arms of the response and functional clustering of genes affecting ER homeostasis.
We recently found that nucleosomes directly block access of CRISPR/Cas9 to DNA (Horlbeck et al., 2016). Here, we build on this observation with a comprehensive algorithm that incorporates chromatin, ...position, and sequence features to accurately predict highly effective single guide RNAs (sgRNAs) for targeting nuclease-dead Cas9-mediated transcriptional repression (CRISPRi) and activation (CRISPRa). We use this algorithm to design next-generation genome-scale CRISPRi and CRISPRa libraries targeting human and mouse genomes. A CRISPRi screen for essential genes in K562 cells demonstrates that the large majority of sgRNAs are highly active. We also find CRISPRi does not exhibit any detectable non-specific toxicity recently observed with CRISPR nuclease approaches. Precision-recall analysis shows that we detect over 90% of essential genes with minimal false positives using a compact 5 sgRNA/gene library. Our results establish CRISPRi and CRISPRa as premier tools for loss- or gain-of-function studies and provide a general strategy for identifying Cas9 target sites.
The human genome produces thousands of long noncoding RNAs (lncRNAs)-transcripts >200 nucleotides long that do not encode proteins. Although critical roles in normal biology and disease have been ...revealed for a subset of lncRNAs, the function of the vast majority remains untested. We developed a CRISPR interference (CRISPRi) platform targeting 16,401 lncRNA loci in seven diverse cell lines, including six transformed cell lines and human induced pluripotent stem cells (iPSCs). Large-scale screening identified 499 lncRNA loci required for robust cellular growth, of which 89% showed growth-modifying function exclusively in one cell type. We further found that lncRNA knockdown can perturb complex transcriptional networks in a cell type-specific manner. These data underscore the functional importance and cell type specificity of many lncRNAs.
Noncoding mutations in cancer genomes are frequent but challenging to interpret. PVT1 encodes an oncogenic lncRNA, but recurrent translocations and deletions in human cancers suggest alternative ...mechanisms. Here, we show that the PVT1 promoter has a tumor-suppressor function that is independent of PVT1 lncRNA. CRISPR interference of PVT1 promoter enhances breast cancer cell competition and growth in vivo. The promoters of the PVT1 and the MYC oncogenes, located 55 kb apart on chromosome 8q24, compete for engagement with four intragenic enhancers in the PVT1 locus, thereby allowing the PVT1 promoter to regulate pause release of MYC transcription. PVT1 undergoes developmentally regulated monoallelic expression, and the PVT1 promoter inhibits MYC expression only from the same chromosome via promoter competition. Cancer genome sequencing identifies recurrent mutations encompassing the human PVT1 promoter, and genome editing verified that PVT1 promoter mutation promotes cancer cell growth. These results highlight regulatory sequences of lncRNA genes as potential disease-associated DNA elements.
Display omitted
•Silencing PVT1 promoter enhances breast cancer cell competition•PVT1 promoter inhibits MYC transcription independent of PVT1 lncRNA•PVT1 and MYC promoters compete for enhancer contact in cis•Mutations encompassing PVT1 promoter are recurrent in human cancers
Recurrent mutations in human cancer are found encompassing the promotor for the lncRNA gene PVT1, which regulates MYC transcription via promoter competition for a shared set of enhancers.
Seminal yeast studies have established the value of comprehensively mapping genetic interactions (GIs) for inferring gene function. Efforts in human cells using focused gene sets underscore the ...utility of this approach, but the feasibility of generating large-scale, diverse human GI maps remains unresolved. We developed a CRISPR interference platform for large-scale quantitative mapping of human GIs. We systematically perturbed 222,784 gene pairs in two cancer cell lines. The resultant maps cluster functionally related genes, assigning function to poorly characterized genes, including TMEM261, a new electron transport chain component. Individual GIs pinpoint unexpected relationships between pathways, exemplified by a specific cholesterol biosynthesis intermediate whose accumulation induces deoxynucleotide depletion, causing replicative DNA damage and a synthetic-lethal interaction with the ATR/9-1-1 DNA repair pathway. Our map provides a broad resource, establishes GI maps as a high-resolution tool for dissecting gene function, and serves as a blueprint for mapping the genetic landscape of human cells.
Display omitted
•Genetic interaction (GI) mapping enables elucidation of human gene function•Large-scale, diverse maps of 222,784 gene pairs reveal buffering and synthetic GIs•Clustering of GIs identifies novel members of functional complexes•Specific GIs can define the physiological impact of biosynthetic metabolites
A large-scale genetic interaction map in human cells reveals unexpected interdependencies between core pathways and exposes potential combination therapies for cancer.
Long non-coding RNAs (lncRNAs) comprise a diverse class of transcripts that can regulate molecular and cellular processes in brain development and disease. LncRNAs exhibit cell type- and ...tissue-specific expression, but little is known about the expression and function of lncRNAs in the developing human brain. Furthermore, it has been unclear whether lncRNAs are highly expressed in subsets of cells within tissues, despite appearing lowly expressed in bulk populations.
We use strand-specific RNA-seq to deeply profile lncRNAs from polyadenylated and total RNA obtained from human neocortex at different stages of development, and we apply this reference to analyze the transcriptomes of single cells. While lncRNAs are generally detected at low levels in bulk tissues, single-cell transcriptomics of hundreds of neocortex cells reveal that many lncRNAs are abundantly expressed in individual cells and are cell type-specific. Notably, LOC646329 is a lncRNA enriched in single radial glia cells but is detected at low abundance in tissues. CRISPRi knockdown of LOC646329 indicates that this lncRNA regulates cell proliferation.
The discrete and abundant expression of lncRNAs among individual cells has important implications for both their biological function and utility for distinguishing neural cell types.
The prokaryotic CRISPR (clustered regularly interspaced palindromic repeats)-associated protein, Cas9, has been widely adopted as a tool for editing, imaging, and regulating eukaryotic genomes. ...However, our understanding of how to select single-guide RNAs (sgRNAs) that mediate efficient Cas9 activity is incomplete, as we lack insight into how chromatin impacts Cas9 targeting. To address this gap, we analyzed large-scale genetic screens performed in human cell lines using either nuclease-active or nuclease-dead Cas9 (dCas9). We observed that highly active sgRNAs for Cas9 and dCas9 were found almost exclusively in regions of low nucleosome occupancy. In vitro experiments demonstrated that nucleosomes in fact directly impede Cas9 binding and cleavage, while chromatin remodeling can restore Cas9 access. Our results reveal a critical role of eukaryotic chromatin in dictating the targeting specificity of this transplanted bacterial enzyme, and provide rules for selecting Cas9 target sites distinct from and complementary to those based on sequence properties.