Computational single-cell RNA-seq (scRNA-seq) methods have been successfully applied to experiments representing a single condition, technology, or species to discover and define cellular phenotypes. ...However, identifying subpopulations of cells that are present across multiple data sets remains challenging. Here, we introduce an analytical strategy for integrating scRNA-seq data sets based on common sources of variation, enabling the identification of shared populations across data sets and downstream comparative analysis. We apply this approach, implemented in our R toolkit Seurat (http://satijalab.org/seurat/), to align scRNA-seq data sets of peripheral blood mononuclear cells under resting and stimulated conditions, hematopoietic progenitors sequenced using two profiling technologies, and pancreatic cell 'atlases' generated from human and mouse islets. In each case, we learn distinct or transitional cell states jointly across data sets, while boosting statistical power through integrated analysis. Our approach facilitates general comparisons of scRNA-seq data sets, potentially deepening our understanding of how distinct cell states respond to perturbation, disease, and evolution.
Despite rapid developments in single cell sequencing, sample-specific batch effects, detection of cell multiplets, and experimental costs remain outstanding challenges. Here, we introduce Cell ...Hashing, where oligo-tagged antibodies against ubiquitously expressed surface proteins uniquely label cells from distinct samples, which can be subsequently pooled. By sequencing these tags alongside the cellular transcriptome, we can assign each cell to its original sample, robustly identify cross-sample multiplets, and "super-load" commercial droplet-based systems for significant cost reduction. We validate our approach using a complementary genetic approach and demonstrate how hashing can generalize the benefits of single cell multiplexing to diverse samples and experimental designs.
The simultaneous measurement of multiple modalities represents an exciting frontier for single-cell genomics and necessitates computational methods that can define cellular states based on multimodal ...data. Here, we introduce “weighted-nearest neighbor” analysis, an unsupervised framework to learn the relative utility of each data type in each cell, enabling an integrative analysis of multiple modalities. We apply our procedure to a CITE-seq dataset of 211,000 human peripheral blood mononuclear cells (PBMCs) with panels extending to 228 antibodies to construct a multimodal reference atlas of the circulating immune system. Multimodal analysis substantially improves our ability to resolve cell states, allowing us to identify and validate previously unreported lymphoid subpopulations. Moreover, we demonstrate how to leverage this reference to rapidly map new datasets and to interpret immune responses to vaccination and coronavirus disease 2019 (COVID-19). Our approach represents a broadly applicable strategy to analyze single-cell multimodal datasets and to look beyond the transcriptome toward a unified and multimodal definition of cellular identity.
Display omitted
•“Weighted nearest neighbor” analysis integrates multimodal single-cell data•A multimodal reference “atlas” of the circulating human immune system•Identification and validation of novel sources of lymphoid heterogeneity•“Reference-based” mapping of query datasets onto a multimodal atlas
A framework that allows for the integration of multiple data types using single cells is applied to understand distinct immune cell states, previously unidentified immune populations, and to interpret immune responses to vaccinations.
To better understand host-virus genetic dependencies and find potential therapeutic targets for COVID-19, we performed a genome-scale CRISPR loss-of-function screen to identify host factors required ...for SARS-CoV-2 viral infection of human alveolar epithelial cells. Top-ranked genes cluster into distinct pathways, including the vacuolar ATPase proton pump, Retromer, and Commander complexes. We validate these gene targets using several orthogonal methods such as CRISPR knockout, RNA interference knockdown, and small-molecule inhibitors. Using single-cell RNA-sequencing, we identify shared transcriptional changes in cholesterol biosynthesis upon loss of top-ranked genes. In addition, given the key role of the ACE2 receptor in the early stages of viral entry, we show that loss of RAB7A reduces viral entry by sequestering the ACE2 receptor inside cells. Overall, this work provides a genome-scale, quantitative resource of the impact of the loss of each host gene on fitness/response to viral infection.
Display omitted
•Genome-wide CRISPR knockout screen identifies host factors for SARS-CoV-2 infection•Top-ranked genes include vacuolar ATPases, Retromer, Commander, and Arp2/3 complex•Validation using CRISPR knockout, RNA interference, and small molecule inhibitors•Reduced infection via increased cholesterol biosynthesis and sequestration of ACE2
To identify potential therapeutic targets for SARS-CoV-2, Daniloski et al. conduct a genome-wide CRISPR screen in human lung epithelial cells. They identify genes and pathways required for SARS-CoV-2 infection, including the vacuolar ATPase proton pump, Retromer, and Commander complexes. Using single-cell transcriptomics, they identify upregulation of cholesterol biosynthesis as a common mechanism underlying viral resistance, in addition to ACE2 sequestration.
Multimodal single-cell assays provide high-resolution snapshots of complex cell populations, but are mostly limited to transcriptome plus an additional modality. Here, we describe expanded ...CRISPR-compatible cellular indexing of transcriptomes and epitopes by sequencing (ECCITE-seq) for the high-throughput characterization of at least five modalities of information from each single cell. We demonstrate application of ECCITE-seq to multimodal CRISPR screens with robust direct single-guide RNA capture and to clonotype-aware multimodal phenotyping of cancer samples.
Recent technological advances have enabled massively parallel chromatin profiling with scATAC-seq (single-cell assay for transposase accessible chromatin by sequencing). Here we present ATAC with ...select antigen profiling by sequencing (ASAP-seq), a tool to simultaneously profile accessible chromatin and protein levels. Our approach pairs sparse scATAC-seq data with robust detection of hundreds of cell surface and intracellular protein markers and optional capture of mitochondrial DNA for clonal tracking, capturing three distinct modalities in single cells. ASAP-seq uses a bridging approach that repurposes antibody:oligonucleotide conjugates designed for existing technologies that pair protein measurements with single-cell RNA sequencing. Together with DOGMA-seq, an adaptation of CITE-seq (cellular indexing of transcriptomes and epitopes by sequencing) for measuring gene activity across the central dogma of gene regulation, we demonstrate the utility of systematic multi-omic profiling by revealing coordinated and distinct changes in chromatin, RNA and surface proteins during native hematopoietic differentiation and peripheral blood mononuclear cell stimulation and as a combinatorial decoder and reporter of multiplexed perturbations in primary T cells.
Homeostatic mechanisms regulate the abundance of several components in small-RNA pathways. We used Drosophila and mammalian systems to demonstrate a conserved homeostatic system in which the status ...of miRNA biogenesis controls Argonaute protein stability. Clonal analyses of multiple mutants of core Drosophila miRNA factors revealed that stability of the miRNA effector AGO1 is dependent on miRNA biogenesis. Reciprocally, ectopic transcription of miRNAs within in vivo clones induced accumulation of AGO1, as did genetic interference with the ubiquitin-proteasome system. In mouse cells, we found that the stability of Ago2 declined in Dicer-knockout cells and was rescued by proteasome blockade or introduction of either Dicer plasmid or Dicer-independent miRNA constructs. Notably, Dicer-dependent miRNA constructs generated pre-miRNAs that bound Ago2 but did not rescue Ago2 stability. We conclude that Argonaute levels are finely tuned by cellular availability of mature miRNAs and the ubiquitin-proteasome system.
Intragenomic conflicts are fueled by rapidly evolving selfish genetic elements, which induce selective pressures to innovate opposing repressive mechanisms. This is patently manifest in sex-ratio ...(SR) meiotic drive systems, in which distorter and suppressor factors bias and restore equal transmission of X and Y sperm. Here, we reveal that multiple SR suppressors in Drosophila simulans (Nmy and Tmy) encode related hairpin RNAs (hpRNAs), which generate endo-siRNAs that repress the paralogous distorters Dox and MDox. All components in this drive network are recently evolved and largely testis restricted. To connect SR hpRNA function to the RNAi pathway, we generated D. simulans null mutants of Dcr-2 and AGO2. Strikingly, these core RNAi knockouts massively derepress Dox and MDox and are in fact completely male sterile and exhibit highly defective spermatogenesis. Altogether, our data reveal how the adaptive capacity of hpRNAs is critically deployed to restrict selfish gonadal genetic systems that can exterminate a species.
Display omitted
•Selfish sex ratio (SR) distorters impede son transmission and must be silenced•Multiple SR suppressors in Drosophila simulans correspond to hairpin RNA-siRNA loci•Knockout of RNAi pathway derepresses SR drivers and disrupts spermatogenesis•Rapidly evolving intragenomic conflicts and suppression by RNAi may underlie speciation
Lin and Hu et al. reveal a critical biological usage of RNAi in Drosophila simulans to tame meiotic drive systems. Multiple autosomal hairpin-siRNA loci are deployed to suppress X-encoded distorter loci that bias progeny sex ratio. These loci are rapidly evolving and testis restricted, and such intragenomic conflicts may fuel speciation.
Cellular imbalances of cholesterol and fatty acid metabolism result in pathological processes, including atherosclerosis and metabolic syndrome. Recent work from our group and others has shown that ...the intronic microRNAs hsa-miR-33a and hsa-miR-33b are located within the sterol regulatory element-binding protein-2 and -1 genes, respectively, and regulate cholesterol homeostasis in concert with their host genes. Here, we show that miR-33a and -b also regulate genes involved in fatty acid metabolism and insulin signaling. miR-33a and -b target key enzymes involved in the regulation of fatty acid oxidation, including carnitine O-octaniltransferase, carnitine palmitoyltransferase 1A, hydroxyacyl-CoA-dehydrogenase, Sirtuin 6 (SIRT6), and AMP kinase subunit-α. Moreover, miR-33a and -b also target the insulin receptor substrate 2, an essential component of the insulin-signaling pathway in the liver. Overexpression of miR-33a and -b reduces both fatty acid oxidation and insulin signaling in hepatic cell lines, whereas inhibition of endogenous miR-33a and -b increases these two metabolic pathways. Together, these data establish that miR-33a and -b regulate pathways controlling three of the risk factors of metabolic syndrome, namely levels of HDL, triglycerides, and insulin signaling, and suggest that inhibitors of miR-33a and -b may be useful in the treatment of this growing health concern.
Modifications are present on many classes of RNA, including tRNA, rRNA, and mRNA. These modifications modulate diverse biological processes such as genetic recoding and mRNA export and folding. In ...addition, modifications can be introduced to RNA molecules using chemical probing strategies that reveal RNA structure and dynamics. Many methods exist to detect RNA modifications by short-read sequencing; however, limitations on read length inherent to short-read-based methods dissociate modifications from their native context, preventing single-molecule modification analysis. Here, we demonstrate direct RNA nanopore sequencing to detect endogenous and exogenous RNA modifications on long RNAs at the single-molecule level. We detect endogenous 2′-O-methyl and base modifications across E. coli and S. cerevisiae ribosomal RNAs as shifts in current signal and dwell times distally through interactions with the helicase motor protein. We further use the 2′-hydroxyl reactive SHAPE reagent acetylimidazole to probe RNA structure at the single-molecule level with readout by direct nanopore sequencing.
Display omitted
Nanopore sequencing detects endogenous RNA modifications at a single-molecule level2′-O-methyl (Nm) and pseudouridine (Ψ) alter nanopore translocation kineticsAcIm is a small-adduct-generating SHAPE-MaP reagentAcIm enables full-length single-molecule structural profiling on RNA
Stephenson et al. employ direct RNA nanopore sequencing to detect endogenous and exogenous modifications on single RNA molecules. The authors demonstrate detection of endogenous 2′-O-methylation (Nm) on native ribosomal RNAs, confirming known modification patterns. They describe the development of nanoSHAPE, a method that involves exogenously labeling RNA with a small-adduct-generating chemical probe that can reveal RNA structure using long-read sequencing.