Abstract
Motivation
Increasing numbers of large scale single cell RNA-Seq projects are leading to a data explosion, which can only be fully exploited through data integration. A number of methods ...have been developed to combine diverse datasets by removing technical batch effects, but most are computationally intensive. To overcome the challenge of enormous datasets, we have developed BBKNN, an extremely fast graph-based data integration algorithm. We illustrate the power of BBKNN on large scale mouse atlasing data, and favourably benchmark its run time against a number of competing methods.
Availability and implementation
BBKNN is available at https://github.com/Teichlab/bbknn, along with documentation and multiple example notebooks, and can be installed from pip.
Supplementary information
Supplementary data are available at Bioinformatics online.
Advancements in technology and architecture enable mixed-use development while normative settings like the European Commission’s New Leipzig Charter support the concept of a productive city. ...Nonetheless, small urban manufacturers (SUMs) including crafts still face displacement due to property prices, conflicts with housing, planning laws, and building regulations. Urban planning and economic development emphasise the importance of identifying and redeveloping suitable sites for urban manufacturing companies. Largely unanswered, however, is whether the next generation of manufacturers (apprentices) want mixed-use locations within the city or space sharing, and if so, under which conditions. Based on two written surveys, this article examines the location requirements of SUMs in Germany and the willingness of apprentices in the Ruhr area to embrace mixed-use buildings and shared spaces. The study focuses on three craft groups: store crafts, workshop crafts, and construction site crafts. The results show that SUMs in Germany and manufacturing apprentices in the Ruhr prioritise car- and security-related infrastructure, as well as low real-estate costs. Store crafts specifically seek affordable and well-connected ground-floor locations. Construction site crafts prioritise (un)loading facilities for trucks on industrial land over sustainable transport infrastructure, and they differ significantly from the other craft groups in terms of mixed-use preferences. However, all craft groups express openness to mixed-use locations with offices and additional workshops and shared spaces like garages, canteens, and showrooms. The article suggests that commercial courtyards could effectively meet the requirements and desires of apprentices and urban planners alike.
The origin of human metaplastic states and their propensity for cancer is poorly understood. Barrett's esophagus is a common metaplastic condition that increases the risk for esophageal ...adenocarcinoma, and its cellular origin is enigmatic. To address this, we harvested tissues spanning the gastroesophageal junction from healthy and diseased donors, including isolation of esophageal submucosal glands. A combination of single-cell transcriptomic profiling, in silico lineage tracing from methylation, open chromatin and somatic mutation analyses, and functional studies in organoid models showed that Barrett's esophagus originates from gastric cardia through c-MYC and HNF4A-driven transcriptional programs. Furthermore, our data indicate that esophageal adenocarcinoma likely arises from undifferentiated Barrett's esophagus cell types even in the absence of a pathologically identifiable metaplastic precursor, illuminating early detection strategies.
Lung disease accounts for every sixth death globally. Profiling the molecular state of all lung cell types in health and disease is currently revolutionizing the identification of disease mechanisms ...and will aid the design of novel diagnostic and personalized therapeutic regimens. Recent progress in high-throughput techniques for single-cell genomic and transcriptomic analyses has opened up new possibilities to study individual cells within a tissue, classify these into cell types, and characterize variations in their molecular profiles as a function of genetics, environment, cell-cell interactions, developmental processes, aging, or disease. Integration of these cell state definitions with spatial information allows the in-depth molecular description of cellular neighborhoods and tissue microenvironments, including the tissue resident structural and immune cells, the tissue matrix, and the microbiome. The Human Cell Atlas consortium aims to characterize all cells in the healthy human body and has prioritized lung tissue as one of the flagship projects. Here, we present the rationale, the approach, and the expected impact of a Human Lung Cell Atlas.
T helper type 2 (Th2) cells are important regulators of mammalian adaptive immunity and have relevance for infection, autoimmunity, and tumor immunology. Using a newly developed, genome-wide ...retroviral CRISPR knockout (KO) library, combined with RNA-seq, ATAC-seq, and ChIP-seq, we have dissected the regulatory circuitry governing activation and differentiation of these cells. Our experiments distinguish cell activation versus differentiation in a quantitative framework. We demonstrate that these two processes are tightly coupled and are jointly controlled by many transcription factors, metabolic genes, and cytokine/receptor pairs. There are only a small number of genes regulating differentiation without any role in activation. By combining biochemical and genetic data, we provide an atlas for Th2 differentiation, validating known regulators and identifying factors, such as Pparg and Bhlhe40, as part of the core regulatory network governing Th2 helper cell fates.
Display omitted
•A retroviral CRISPR sgRNA library enables screening in primary mouse T helper cells•Genes frequently affect both T helper cell activation and differentiation•Genes of all functional categories impact activation and differentiation•The transcription factor PPARG appears particularly important for Th2 gene regulation
A CRISPR-based screen reveals gene networks central to modulating multiple aspects of Th2 cell function.
Abstract
Expression Atlas is EMBL-EBI’s resource for gene and protein expression. It sources and compiles data on the abundance and localisation of RNA and proteins in various biological systems and ...contexts and provides open access to this data for the research community. With the increased availability of single cell RNA-Seq datasets in the public archives, we have now extended Expression Atlas with a new added-value service to display gene expression in single cells. Single Cell Expression Atlas was launched in 2018 and currently includes 123 single cell RNA-Seq studies from 12 species. The website can be searched by genes within or across species to reveal experiments, tissues and cell types where this gene is expressed or under which conditions it is a marker gene. Within each study, cells can be visualized using a pre-calculated t-SNE plot and can be coloured by different features or by cell clusters based on gene expression. Within each experiment, there are links to downloadable files, such as RNA quantification matrices, clustering results, reports on protocols and associated metadata, such as assigned cell types.
The thymus provides a nurturing environment for the differentiation and selection of T cells, a process orchestrated by their interaction with multiple thymic cell types. We used single-cell RNA ...sequencing to create a cell census of the human thymus across the life span and to reconstruct T cell differentiation trajectories and T cell receptor (TCR) recombination kinetics. Using this approach, we identified and located in situ CD8αα
T cell populations, thymic fibroblast subtypes, and activated dendritic cell states. In addition, we reveal a bias in TCR recombination and selection, which is attributed to genomic position and the kinetics of lineage commitment. Taken together, our data provide a comprehensive atlas of the human thymus across the life span with new insights into human T cell development.
Genetic risk for breast cancer is conferred by a combination of multiple variants of small effect. To better understand how risk loci might combine, we examined whether risk-associated genes share ...regulatory mechanisms. We created a breast cancer gene regulatory network comprising transcription factors and groups of putative target genes (regulons) and asked whether specific regulons are enriched for genes associated with risk loci via expression quantitative trait loci (eQTLs). We identified 36 overlapping regulons that were enriched for risk loci and formed a distinct cluster within the network, suggesting shared biology. The risk transcription factors driving these regulons are frequently mutated in cancer and lie in two opposing subgroups, which relate to estrogen receptor (ER)(+) luminal A or luminal B and ER(-) basal-like cancers and to different luminal epithelial cell populations in the adult mammary gland. Our network approach provides a foundation for determining the regulatory circuits governing breast cancer, to identify targets for intervention, and is transferable to other disease settings.
The cellular landscape of the human intestinal tract is dynamic throughout life, developing in utero and changing in response to functional requirements and environmental exposures. Here, to ...comprehensively map cell lineages, we use single-cell RNA sequencing and antigen receptor analysis of almost half a million cells from up to 5 anatomical regions in the developing and up to 11 distinct anatomical regions in the healthy paediatric and adult human gut. This reveals the existence of transcriptionally distinct BEST4 epithelial cells throughout the human intestinal tract. Furthermore, we implicate IgG sensing as a function of intestinal tuft cells. We describe neural cell populations in the developing enteric nervous system, and predict cell-type-specific expression of genes associated with Hirschsprung's disease. Finally, using a systems approach, we identify key cell players that drive the formation of secondary lymphoid tissue in early human development. We show that these programs are adopted in inflammatory bowel disease to recruit and retain immune cells at the site of inflammation. This catalogue of intestinal cells will provide new insights into cellular programs in development, homeostasis and disease.
Genetic mapping studies have identified multiple cancer susceptibility regions at chromosome 8q24, upstream of the MYC oncogene. MYC has been widely presumed as the regulated target gene, but ...definitive evidence functionally linking these cancer regions with MYC has been difficult to obtain. Here we examined candidate functional variants of a haplotype block at 8q24 encompassing the two independent risk alleles for prostate and breast cancer, rs620861 and rs13281615. We used the mapping of DNase I hypersensitive sites as a tool to prioritise regions for further functional analysis. This approach identified rs378854, which is in complete linkage disequilibrium (LD) with rs620861, as a novel functional prostate cancer-specific genetic variant. We demonstrate that the risk allele (G) of rs378854 reduces binding of the transcription factor YY1 in vitro. This factor is known to repress global transcription in prostate cancer and is a candidate tumour suppressor. Additional experiments showed that the YY1 binding site is occupied in vivo in prostate cancer, but not breast cancer cells, consistent with the observed cancer-specific effects of this single nucleotide polymorphism (SNP). Using chromatin conformation capture (3C) experiments, we found that the region surrounding rs378854 interacts with the MYC and PVT1 promoters. Moreover, expression of the PVT1 oncogene in normal prostate tissue increased with the presence of the risk allele of rs378854, while expression of MYC was not affected. In conclusion, we identified a new functional prostate cancer risk variant at the 8q24 locus, rs378854 allele G, that reduces binding of the YY1 protein and is associated with increased expression of PVT1 located 0.5 Mb downstream.