Methods for profiling RNA and protein expression in a spatially resolved manner are rapidly evolving, making it possible to comprehensively characterize cells and tissues in health and disease. To ...maximize the biological insights obtained using these techniques, it is critical to both clearly articulate the key biological questions in spatial analysis of tissues and develop the requisite computational tools to address them. Developers of analytical tools need to decide on the intrinsic molecular features of each cell that need to be considered, and how cell shape and morphological features are incorporated into the analysis. Also, optimal ways to compare different tissue samples at various length scales are still being sought. Grouping these biological problems and related computational algorithms into classes across length scales, thus characterizing common issues that need to be addressed, will facilitate further progress in spatial transcriptomics and proteomics.
Spatial omics data are advancing the study of tissue organization and cellular communication at an unprecedented scale. Flexible tools are required to store, integrate and visualize the large ...diversity of spatial omics data. Here, we present Squidpy, a Python framework that brings together tools from omics and image analysis to enable scalable description of spatial molecular data, such as transcriptome or multivariate proteins. Squidpy provides efficient infrastructure and numerous analysis methods that allow to efficiently store, manipulate and interactively visualize spatial omics data. Squidpy is extensible and can be interfaced with a variety of already existing libraries for the scalable analysis of spatial omics data.
Transcription factors (TFs) regulate many cellular processes and can therefore serve as readouts of the signaling and regulatory state. Yet for many TFs, the mode of action—repressing or activating ...transcription of target genes—is unclear. Here, we present diffTF (https://git.embl.de/grp-zaugg/diffTF) to calculate differential TF activity (basic mode) and classify TFs into putative transcriptional activators or repressors (classification mode). In basic mode, it combines genome-wide chromatin accessibility/activity with putative TF binding sites that, in classification mode, are integrated with RNA-seq. We apply diffTF to compare (1) mutated and unmutated chronic lymphocytic leukemia patients and (2) two hematopoietic progenitor cell types. In both datasets, diffTF recovers most known biology and finds many previously unreported TFs. It classifies almost 40% of TFs based on their mode of action, which we validate experimentally. Overall, we demonstrate that diffTF recovers known biology, identifies less well-characterized TFs, and classifies TFs into transcriptional activators or repressors.
Display omitted
•diffTF estimates differential TF activity based on chromatin data (basic mode)•Integration with RNA-seq identifies activator/repressor TFs (classification mode)•We applied diffTF to subtypes of CLL and hematopoietic differentiation•The TF classification was experimentally validated in both case studies
Berest et al. present a computational tool (diffTF) to estimate differential TF activity and classify TFs into activators or repressors. It requires active chromatin data (accessibility/ChIP-seq) and integrates with RNA-seq for classification. The authors apply it to two case studies (CLL and hematopoietic differentiation) and validate their predictions experimentally.
Enhancers play a vital role in gene regulation and are critical in mediating the impact of noncoding genetic variants associated with complex traits. Enhancer activity is a cell‐type‐specific process ...regulated by transcription factors (TFs), epigenetic mechanisms and genetic variants. Despite the strong mechanistic link between TFs and enhancers, we currently lack a framework for jointly analysing them in cell‐type‐specific gene regulatory networks (GRN). Equally important, we lack an unbiased way of assessing the biological significance of inferred GRNs since no complete ground truth exists. To address these gaps, we present GRaNIE (Gene Regulatory Network Inference including Enhancers) and GRaNPA (Gene Regulatory Network Performance Analysis). GRaNIE (https://git.embl.de/grp‐zaugg/GRaNIE) builds enhancer‐mediated GRNs based on covariation of chromatin accessibility and RNA‐seq across samples (e.g. individuals), while GRaNPA (https://git.embl.de/grp‐zaugg/GRaNPA) assesses the performance of GRNs for predicting cell‐type‐specific differential expression. We demonstrate their power by investigating gene regulatory mechanisms underlying the response of macrophages to infection, cancer and common genetic traits including autoimmune diseases. Finally, our methods identify the TF PURA as a putative regulator of pro‐inflammatory macrophage polarisation.
Synopsis
GRaNIE builds enhancer‐based gene regulatory networks (eGRNs) using chromatin accessibility and RNA‐seq data. GRaNPA assesses the biological significance of GRNs and transcription factors. Together, they provide insights into cell‐type‐specific gene regulation.
GRaNIE builds gene regulatory networks that encompass transcription factors, regulatory regions and genes, enabling a comprehensive view of gene regulation.
GRaNPA provides an unbiased evaluation method for cell‐type‐specific GRNs by testing their ability to predict cell‐type‐specific differential expression.
GRaNPA can identify important transcription factors that drive differential expression, leading to insights into biological mechanisms.
GRaNIE and GRaNPA analyses identified PURA as a promising candidate for regulating pro‐inflammatory macrophage polarisation.
GRaNIE builds enhancer‐based gene regulatory networks (GRNs) using chromatin accessibility and RNA‐seq data. GRaNPA assesses the biological significance of GRNs and transcription factors. Together, they provide insights into cell‐type‐specific gene regulation.
Latent factor modeling applied to single-cell RNA sequencing (scRNA-seq) data is a useful approach to discover gene signatures. However, it is often unclear what methods are best suited for specific ...tasks and how latent factors should be interpreted.
Here, we compare four state-of-the-art methods and propose an approach to assign derived latent factors to pathway activities and specific cell subsets. By applying this framework to scRNA-seq datasets from biopsies of patients with rheumatoid arthritis and systemic lupus erythematosus, we discover disease-relevant gene signatures in specific cellular subsets. In rheumatoid arthritis, we identify an inflammatory OSMR signaling signature active in a subset of synovial fibroblasts and an efferocytic signature in a subset of synovial monocytes.
Overall, we provide insights into latent factors models for the analysis of scRNA-seq data, develop a framework to identify cell subtypes in a phenotype-driven way, and use it to identify novel pathways dysregulated in rheumatoid arthritis.
Display omitted
•Benchmarking four latent factor models for analysis of scRNA-seq data•Map biological pathways associated with latent factors to specific cell subsets•OSMR pathway dysregulated in a subset of RA synovial fibroblasts•MERTK-expressing monocytes with efferocytic function depleted in RA
Immunology; Bioinformatics; Transcriptomics
Recent advances in single-cell technologies have enabled high-throughput molecular profiling of cells across modalities and locations. Single-cell transcriptomics data can now be complemented by ...chromatin accessibility, surface protein expression, adaptive immune receptor repertoire profiling and spatial information. The increasing availability of single-cell data across modalities has motivated the development of novel computational methods to help analysts derive biological insights. As the field grows, it becomes increasingly difficult to navigate the vast landscape of tools and analysis steps. Here, we summarize independent benchmarking studies of unimodal and multimodal single-cell analysis across modalities to suggest comprehensive best-practice workflows for the most common analysis steps. Where independent benchmarks are not available, we review and contrast popular methods. Our article serves as an entry point for novices in the field of single-cell (multi-)omic analysis and guides advanced users to the most recent best practices.
Abstract Summary Spatial omics technologies are increasingly leveraged to characterize how disease disrupts tissue organization and cellular niches. While multiple methods to analyze spatial ...variation within a sample have been published, statistical and computational approaches to compare cell spatial organization across samples or conditions are mostly lacking. We present GraphCompass, a comprehensive set of omics-adapted graph analysis methods to quantitatively evaluate and compare the spatial arrangement of cells in samples representing diverse biological conditions. GraphCompass builds upon the Squidpy spatial omics toolbox and encompasses various statistical approaches to perform cross-condition analyses at the level of individual cell types, niches, and samples. Additionally, GraphCompass provides custom visualization functions that enable effective communication of results. We demonstrate how GraphCompass can be used to address key biological questions, such as how cellular organization and tissue architecture differ across various disease states and which spatial patterns correlate with a given pathological condition. GraphCompass can be applied to various popular omics techniques, including, but not limited to, spatial proteomics (e.g. MIBI-TOF), spot-based transcriptomics (e.g. 10× Genomics Visium), and single-cell resolved transcriptomics (e.g. Stereo-seq). In this work, we showcase the capabilities of GraphCompass through its application to three different studies that may also serve as benchmark datasets for further method development. With its easy-to-use implementation, extensive documentation, and comprehensive tutorials, GraphCompass is accessible to biologists with varying levels of computational expertise. By facilitating comparative analyses of cell spatial organization, GraphCompass promises to be a valuable asset in advancing our understanding of tissue function in health and disease.
Gastruloids are 3D structures generated from pluripotent stem cells recapitulating fundamental principles of embryonic pattern formation. Using single-cell genomic analysis, we provide a resource ...mapping cell states and types during gastruloid development and compare them with the in vivo embryo. We developed a high-throughput handling and imaging pipeline to spatially monitor symmetry breaking during gastruloid development and report an early spatial variability in pluripotency determining a binary response to Wnt activation. Although cells in the gastruloid-core revert to pluripotency, peripheral cells become primitive streak-like. These two populations subsequently break radial symmetry and initiate axial elongation. By performing a compound screen, perturbing thousands of gastruloids, we derive a phenotypic landscape and infer networks of genetic interactions. Finally, using a dual Wnt modulation, we improve the formation of anterior structures in the existing gastruloid model. This work provides a resource to understand how gastruloids develop and generate complex patterns in vitro.
Display omitted
•Spatial map of cell-type emergence in gastruloids at single-cell resolution•Phenotypic compound screen reveals functional modules underlying gastruloid formation•Spatial and temporal variabilities in the pluripotency state determines binary Wnt responses•Dual Wnt modulation improves the representation of anterior structures in gastruloids
Suppinger et al. employ scRNA-seq, an image-based trajectory and a phenotypic compound screen, to provide a resource characterizing murine gastruloid formation. Focusing on the early stages of gastruloid development, they show that variability in pluripotency states determines Wnt response and symmetry breaking. They, then, use dual Wnt modulation enriching for anterior structures.