Spatial localization is a key determinant of cellular fate and behavior, but methods for spatially resolved, transcriptome-wide gene expression profiling across complex tissues are lacking. RNA ...staining methods assay only a small number of transcripts, whereas single-cell RNA-seq, which measures global gene expression, separates cells from their native spatial context. Here we present Seurat, a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns. We applied Seurat to spatially map 851 single cells from dissociated zebrafish (Danio rerio) embryos and generated a transcriptome-wide map of spatial patterning. We confirmed Seurat's accuracy using several experimental approaches, then used the strategy to identify a set of archetypal expression patterns and spatial markers. Seurat correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups. Seurat will be applicable to mapping cellular localization within complex patterned tissues in diverse systems.
The scale and capabilities of single-cell RNA-sequencing methods have expanded rapidly in recent years, enabling major discoveries and large-scale cell mapping efforts. However, these methods have ...not been systematically and comprehensively benchmarked. Here, we directly compare seven methods for single-cell and/or single-nucleus profiling-selecting representative methods based on their usage and our expertise and resources to prepare libraries-including two low-throughput and five high-throughput methods. We tested the methods on three types of samples: cell lines, peripheral blood mononuclear cells and brain tissue, generating 36 libraries in six separate experiments in a single center. To directly compare the methods and avoid processing differences introduced by the existing pipelines, we developed scumi, a flexible computational pipeline that can be used with any single-cell RNA-sequencing method. We evaluated the methods for both basic performance, such as the structure and alignment of reads, sensitivity and extent of multiplets, and for their ability to recover known biological information in the samples.
Regulation of gene expression is central to many biological processes. Although reconstruction of regulatory circuits from genomic data alone is therefore desirable, this remains a major ...computational challenge. Comparative approaches that examine the conservation and divergence of circuits and their components across strains and species can help reconstruct circuits as well as provide insights into the evolution of gene regulatory processes and their adaptive contribution. In recent years, advances in genomic and computational tools have led to a wealth of methods for such analysis at the sequence, expression, pathway, module, and entire network level. Here, we review computational methods developed to study transcriptional regulatory networks using comparative genomics, from sequence to functional data. We highlight how these methods use evolutionary conservation and divergence to reliably detect regulatory components as well as estimate the extent and rate of divergence. Finally, we discuss the promise and open challenges in linking regulatory divergence to phenotypic divergence and adaptation.
The recent advent of methods for high-throughput single-cell and spatial profiling have opened the way to complete the 150-year-old endeavor of identifying all cell types in the human body, by their ...distinctive molecular profiles, and to relate this information to other cellular descriptions, physiological phenotypes, molecular mechanisms and functions. Our effort to build a comprehensive reference map of the molecular state of cells in healthy human tissues is propelling the systematic study of physiological states, developmental trajectories, regulatory circuitry and interactions of cells, provides a framework for understanding cellular dysregulation in human disease, and suggests the possibility of predicting cell types and behaviors, towards a “periodic table of our cells”. In this talk, I describe our foundational work underlying single cell genomics and the conceptual framework and impact of our understanding of cell and tissue biology in health, as well as how we use it to shed light on rare disease, cancer, and COVID-19.
The immune system varies in cell types, states, and locations. The complex networks, interactions, and responses of immune cells produce diverse cellular ecosystems composed of multiple cell types, ...accompanied by genetic diversity in antigen receptors. Within this ecosystem, innate and adaptive immune cells maintain and protect tissue function, integrity, and homeostasis upon changes in functional demands and diverse insults. Characterizing this inherent complexity requires studies at single-cell resolution. Recent advances such as massively parallel single-cell RNA sequencing and sophisticated computational methods are catalyzing a revolution in our understanding of immunology. Here we provide an overview of the state of single-cell genomics methods and an outlook on the use of single-cell techniques to decipher the adaptive and innate components of immunity.
Malignant abdominal fluid (ascites) frequently develops in women with advanced high-grade serous ovarian cancer (HGSOC) and is associated with drug resistance and a poor prognosis
. To ...comprehensively characterize the HGSOC ascites ecosystem, we used single-cell RNA sequencing to profile ~11,000 cells from 22 ascites specimens from 11 patients with HGSOC. We found significant inter-patient variability in the composition and functional programs of ascites cells, including immunomodulatory fibroblast sub-populations and dichotomous macrophage populations. We found that the previously described immunoreactive and mesenchymal subtypes of HGSOC, which have prognostic implications, reflect the abundance of immune infiltrates and fibroblasts rather than distinct subsets of malignant cells
. Malignant cell variability was partly explained by heterogeneous copy number alteration patterns or expression of a stemness program. Malignant cells shared expression of inflammatory programs that were largely recapitulated in single-cell RNA sequencing of ~35,000 cells from additionally collected samples, including three ascites, two primary HGSOC tumors and three patient ascites-derived xenograft models. Inhibition of the JAK/STAT pathway, which was expressed in both malignant cells and cancer-associated fibroblasts, had potent anti-tumor activity in primary short-term cultures and patient-derived xenograft models. Our work contributes to resolving the HSGOC landscape
and provides a resource for the development of novel therapeutic approaches.
De novo assembly of RNA-seq data enables researchers to study transcriptomes without the need for a genome sequence; this approach can be usefully applied, for instance, in research on 'non-model ...organisms' of ecological and evolutionary importance, cancer samples or the microbiome. In this protocol we describe the use of the Trinity platform for de novo transcriptome assembly from RNA-seq data in non-model organisms. We also present Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes. In the procedure, we provide a workflow for genome-independent transcriptome analysis leveraging the Trinity platform. The software, documentation and demonstrations are freely available from http://trinityrnaseq.sourceforge.net. The run time of this protocol is highly dependent on the size and complexity of data to be analyzed. The example data set analyzed in the procedure detailed herein can be processed in less than 5 h.
Analyzing the spatial organization of molecules in cells and tissues is a cornerstone of biological research and clinical practice. However, despite enormous progress in molecular profiling of ...cellular constituents, spatially mapping them remains a disjointed and specialized machinery-intensive process, relying on either light microscopy or direct physical registration. Here, we demonstrate DNA microscopy, a distinct imaging modality for scalable, optics-free mapping of relative biomolecule positions. In DNA microscopy of transcripts, transcript molecules are tagged in situ with randomized nucleotides, labeling each molecule uniquely. A second in situ reaction then amplifies the tagged molecules, concatenates the resulting copies, and adds new randomized nucleotides to uniquely label each concatenation event. An algorithm decodes molecular proximities from these concatenated sequences and infers physical images of the original transcripts at cellular resolution with precise sequence information. Because its imaging power derives entirely from diffusive molecular dynamics, DNA microscopy constitutes a chemically encoded microscopy system.
Display omitted
•DNA microscopy is a distinct imaging modality that encodes physical images into DNA•A stand-alone reaction first writes inter-molecular proximities into DNA•An algorithm then reconstructs a supra-molecular image from this DNA recording•DNA microscopy thereby constitutes a chemically encoded microscopy system
DNA microscopy is an optics-free imaging method based on chemical reactions and a computational algorithm to infer spatial organization of transcripts while simultaneously preserving full sequence information.
How transcription factors (TFs) interpret cis-regulatory DNA sequence to control gene expression remains unclear, largely because past studies using native and engineered sequences had insufficient ...scale. Here, we measure the expression output of >100 million synthetic yeast promoter sequences that are fully random. These sequences yield diverse, reproducible expression levels that can be explained by their chance inclusion of functional TF binding sites. We use machine learning to build interpretable models of transcriptional regulation that predict ~94% of the expression driven from independent test promoters and ~89% of the expression driven from native yeast promoter fragments. These models allow us to characterize each TF's specificity, activity and interactions with chromatin. TF activity depends on binding-site strand, position, DNA helical face and chromatin context. Notably, expression level is influenced by weak regulatory interactions, which confound designed-sequence studies. Our analyses show that massive-throughput assays of fully random DNA can provide the big data necessary to develop complex, predictive models of gene regulation.
Cells of the immune system routinely respond to cues from their local environment and feed back to their surroundings through transient responses, choice of differentiation trajectories, plastic ...changes in cell state, and malleable adaptation to their tissue of residence. Genomic approaches have opened the way for comprehensive interrogation of such orchestrated responses. Focusing on genomic profiling of transcriptional and epigenetic cell states, we discuss how they are applied to investigate immune cells faced with various environmental cues. We highlight some of the emerging principles on the role of dense regulatory circuitry, epigenetic memory, cell type fluidity, and reuse of regulatory modules in achieving and maintaining appropriate responses to a changing environment.These provide a first step toward a systematic understanding of molecular circuits in complex tissues.