Normalization is a critical step in the analysis of single-cell RNA-sequencing (scRNA-seq) datasets. Its main goal is to make gene counts comparable within and between cells. To do so, normalization ...methods must account for technical and biological variability. Numerous normalization methods have been developed addressing different sources of dispersion and making specific assumptions about the count data.
The selection of a normalization method has a direct impact on downstream analysis, for example differential gene expression and cluster identification. Thus, the objective of this review is to guide the reader in making an informed decision on the most appropriate normalization method to use. To this aim, we first give an overview of the different single cell sequencing platforms and methods commonly used including isolation and library preparation protocols. Next, we discuss the inherent sources of variability of scRNA-seq datasets. We describe the categories of normalization methods and include examples of each. We also delineate imputation and batch-effect correction methods. Furthermore, we describe data-driven metrics commonly used to evaluate the performance of normalization methods. We also discuss common scRNA-seq methods and toolkits used for integrated data analysis.
According to the correction performed, normalization methods can be broadly classified as within and between-sample algorithms. Moreover, with respect to the mathematical model used, normalization methods can further be classified into: global scaling methods, generalized linear models, mixed methods, and machine learning-based methods. Each of these methods depict pros and cons and make different statistical assumptions. However, there is no better performing normalization method. Instead, metrics such as silhouette width, K-nearest neighbor batch-effect test, or Highly Variable Genes are recommended to assess the performance of normalization methods.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Long non-coding RNAs (lncRNAs) (> 200 bp) play crucial roles in transcriptional regulation during numerous biological processes. However, it is challenging to comprehensively identify lncRNAs, ...because they are often expressed at low levels and with more cell-type specificity than are protein-coding genes. In the present study, we performed ab initio transcriptome reconstruction using eight purified cell populations from mouse cortex and detected more than 5000 lncRNAs. Predicting the functions of lncRNAs using cell-type specific data revealed their potential functional roles in Central Nervous System (CNS) development. We performed motif searches in ENCODE DNase I digital footprint data and Mouse ENCODE promoters to infer transcription factor (TF) occupancy. By integrating TF binding and cell-type specific transcriptomic data, we constructed a novel framework that is useful for systematically identifying lncRNAs that are potentially essential for brain cell fate determination. Based on this integrative analysis, we identified lncRNAs that are regulated during Oligodendrocyte Precursor Cell (OPC) differentiation from Neural Stem Cells (NSCs) and that are likely to be involved in oligodendrogenesis. The top candidate, lnc-OPC, shows highly specific expression in OPCs and remarkable sequence conservation among placental mammals. Interestingly, lnc-OPC is significantly up-regulated in glial progenitors from experimental autoimmune encephalomyelitis (EAE) mouse models compared to wild-type mice. OLIG2-binding sites in the upstream regulatory region of lnc-OPC were identified by ChIP (chromatin immunoprecipitation)-Sequencing and validated by luciferase assays. Loss-of-function experiments confirmed that lnc-OPC plays a functional role in OPC genesis. Overall, our results substantiated the role of lncRNA in OPC fate determination and provided an unprecedented data source for future functional investigations in CNS cell types. We present our datasets and analysis results via the interactive genome browser at our laboratory website that is freely accessible to the research community. This is the first lncRNA expression database of collective populations of glia, vascular cells, and neurons. We anticipate that these studies will advance the knowledge of this major class of non-coding genes and their potential roles in neurological development and diseases.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Abstract
Chromatin architecture influences transcription by modulating the physical access of regulatory factors to DNA, playing fundamental roles in cell identity. Studies on dopaminergic ...differentiation have identified coding genes, but the relationship with non-coding genes or chromatin accessibility remains elusive. Using RNA-Seq and ATAC-Seq we profiled differentially expressed transcripts and open chromatin regions during early dopaminergic neuron differentiation. Hierarchical clustering of differentially expressed genes, resulted in 6 groups with unique characteristics. Surprisingly, the abundance of long non-coding RNAs (lncRNAs) was high in the most downregulated transcripts, and depicted positive correlations with target mRNAs. We observed that open chromatin regions decrease upon differentiation. Enrichment analyses of accessibility depict an association between open chromatin regions and specific functional pathways and gene-sets. A bioinformatic search for motifs allowed us to identify transcription factors and structural nuclear proteins that potentially regulate dopaminergic differentiation. Interestingly, we also found changes in protein and mRNA abundance of the CCCTC-binding factor, CTCF, which participates in genome organization and gene expression. Furthermore, assays demonstrated co-localization of CTCF with Polycomb-repressed chromatin marked by H3K27me3 in pluripotent cells, progressively decreasing in neural precursor cells and differentiated neurons. Our work provides a unique resource of transcription factors and regulatory elements, potentially involved in the acquisition of human dopaminergic neuron cell identity.
Single-cell RNA-sequencing of the brain Cuevas-Diaz Duran, Raquel; Wei, Haichao; Wu, Jia Qian
Clinical and translational medicine,
06/2017, Letnik:
6, Številka:
1
Journal Article
Recenzirano
Odprti dostop
Single-cell RNA-sequencing (scRNA-seq) is revolutionizing our understanding of the genomic, transcriptomic and epigenomic landscapes of cells within organs. The mammalian brain is composed of a ...complex network of millions to billions of diverse cells with either highly specialized functions or support functions. With scRNA-seq it is possible to comprehensively dissect the cellular heterogeneity of brain cells, and elucidate their specific functions and state. In this review, we describe the current experimental methods used for scRNA-seq. We also review bioinformatic tools and algorithms for data analyses and discuss critical challenges. Additionally, we summarized recent mouse brain scRNA-seq studies and systematically compared their main experimental approaches, computational tools implemented, and important findings. scRNA-seq has allowed researchers to identify diverse cell subpopulations within many brain regions, pinpointing gene signatures and novel cell markers, as well as addressing functional differences. Due to the complexity of the brain, a great deal of work remains to be accomplished. Defining specific brain cell types and functions is critical for understanding brain function as a whole in development, health, and diseases.
Spinal cord injury (SCI) is one of the most devastating neural injuries without effective therapeutic solutions. Astrocytes are the predominant component of the scar. Understanding the complex ...contributions of reactive astrocytes to SCI pathophysiologies is fundamentally important for developing therapeutic strategies. We have studied the molecular changes in the injury environment and the astrocyte-specific responses by astrocyte purification from injured spinal cords from acute to chronic stages. In addition to protein-coding genes, we have systematically analyzed the expression profiles of long non-coding RNAs (lncRNAs) (>200 bp), which are regulatory RNAs that play important roles in the CNS. We have identified a highly conserved lncRNA, Zeb2os, and demonstrated using functional assays that it plays an important role in reactive astrogliosis through the Zeb2os/Zeb2/Stat3 axis. These studies provide valuable insights into the molecular basis of reactive astrogliosis and fill the knowledge gap regarding the function(s) of lncRNAs in astrogliosis and SCI.
Display omitted
•Transcriptomes reveal expression dynamics of both coding and long non-coding RNAs•Zeb2os and Zeb2 are upregulated in astrocytes in acute and chronic stages and co-localize•Knockdown of Zeb2os leads to reduced astrogliosis, lesion size, and Zeb2 and Stat3 expression
Wei et al. comprehensively investigate the coding and long non-coding gene expression changes and astrocyte-specific responses in injured spinal cord tissue and purified astrocytes from acute to chronic stages. Bioinformatic and functional analysis identify a conserved lncRNA Zeb2os that plays an essential role in reactive astrogliosis through the Zeb2os/Zeb2/Stat3 axis.
Spinal cord injury (SCI) remains one of the most debilitating neurological disorders and the majority of SCI patients are in the chronic phase. Previous studies of SCI have usually focused on few ...genes and pathways at a time. In particular, the biological roles of long non-coding RNAs (lncRNAs) have never been characterized in SCI. Our study is the first to comprehensively investigate alterations in the expression of both coding and long non-coding genes in the sub-chronic and chronic stages of SCI using RNA-Sequencing. Through pathway analysis and network construction, the functions of differentially expressed genes were analyzed systematically. Furthermore, we predicted the potential regulatory function of non-coding transcripts, revealed enriched motifs of transcription factors in the upstream regulatory regions of differentially expressed lncRNAs, and identified differentially expressed lncRNAs homologous to human genomic regions which contain single-nucleotide polymorphisms associated with diseases. Overall, these results revealed critical pathways and networks that exhibit sustained alterations at the sub-chronic and chronic stages of SCI, highlighting the temporal regulation of pathological processes including astrogliosis. This study also provided an unprecedented resource and a new catalogue of lncRNAs potentially involved in the regulation and progression of SCI.
Breast cancer (BC) is a leading cause of cancer-related deaths among women worldwide. Neoadjuvant therapy (NAT) is increasingly being used to reduce tumor burden prior to surgical resection. However, ...current techniques for assessing tumor response have significant limitations. Additionally, drug resistance is commonly observed, raising a need to identify biomarkers that can predict treatment sensitivity and survival outcomes. Circulating microRNAs (miRNAs) are small non-coding RNAs that regulate gene expression and have been shown to play a significant role in cancer progression as tumor inducers or suppressors. The expression of circulating miRNAs has been found to be significantly altered in breast cancer patients. Moreover, recent studies have suggested that circulating miRNAs can serve as non-invasive biomarkers for predicting response to NAT. Therefore, this review provides a brief overview of recent studies that have demonstrated the potential of circulating miRNAs as biomarkers for predicting the clinical response to NAT in BC patients. The findings of this review will strengthen future research on developing miRNA-based biomarkers and their translation into medical practice, which could significantly improve the clinical management of BC patients undergoing NAT.
Androgenetic alopecia is a highly prevalent condition mainly affecting men. This complex trait is related to aging and genetics; however, multiple other factors, for example, lifestyle, are also ...involved. Despite its prevalence, the underlying biology of androgenetic alopecia remains elusive, and thus advances in its treatment have been hindered. Herein, we review the functional anatomy of hair follicles and the cell signaling events that play a role in follicle cycling. We also discuss the pathology of androgenetic alopecia and the known molecular mechanisms underlying this condition. Additionally, we describe studies comparing the transcriptional differences in hair follicles between balding and non-balding scalp regions. Given the genetic contribution, we also discuss the most significant risk variants found to be associated with androgenetic alopecia. A more comprehensive understanding of this pathology may be generated through using multi-omics approaches.
Oligodendrocytes, responsible for axon ensheathment, are critical for central nervous system (CNS) development, function, and diseases. OLIG2 is an important transcription factor (TF) that acts ...during oligodendrocyte development and performs distinct functions at different stages. Previous studies have shown that lncRNAs (long non-coding RNAs; > 200 bp) have important functions during oligodendrocyte development, but their roles have not been systematically characterized and their regulation is not yet clear.
We performed an integrated study of genome-wide OLIG2 binding and the epigenetic modification status of both coding and non-coding genes during three stages of oligodendrocyte differentiation in vivo: neural stem cells (NSCs), oligodendrocyte progenitor cells (OPCs), and newly formed oligodendrocytes (NFOs). We found that 613 lncRNAs have OLIG2 binding sites and are expressed in at least one cell type, which can potentially be activated or repressed by OLIG2. Forty-eight of them have increased expression in oligodendrocyte lineage cells. Predicting lncRNA functions by using a "guilt-by-association" approach revealed that the functions of these 48 lncRNAs were enriched in "oligodendrocyte development and differentiation." Additionally, bivalent genes are known to play essential roles during embryonic stem cell differentiation. We identified bivalent genes in NSCs, OPCs, and NFOs and found that some bivalent genes bound by OLIG2 are dynamically regulated during oligodendrocyte development. Importantly, we unveiled a previously unknown mechanism that, in addition to transcriptional regulation via DNA binding, OLIG2 could self-regulate through the 3' UTR of its own mRNA.
Our studies have revealed the missing links in the mechanisms regulating oligodendrocyte development at the transcriptional level and after transcription. The results of our research have improved the understanding of fundamental cell fate decisions during oligodendrocyte lineage formation, which can enable insights into demyelination diseases and regenerative medicine.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Recent studies have revealed the heterogeneous nature of astrocytes; however, how diverse constituents of astrocyte-lineage cells are regulated in adult spinal cord after injury and contribute to ...regeneration remains elusive. We perform single-cell RNA sequencing of GFAP-expressing cells from sub-chronic spinal cord injury models and identify and compare with the subpopulations in acute-stage data. We find subpopulations with distinct functional enrichment and their identities defined by subpopulation-specific transcription factors and regulons. Immunohistochemistry, RNAscope experiments, and quantification by stereology verify the molecular signature, location, and morphology of potential resident neural progenitors or neural stem cells in the adult spinal cord before and after injury and uncover the populations of the intermediate cells enriched in neuronal genes that could potentially transition into other subpopulations. This study has expanded the knowledge of the heterogeneity and cell state transition of glial progenitors in adult spinal cord before and after injury.
Display omitted
•scRNA-seq revealed GFAP+ populations with distinct functional enrichment and dynamics•Regulatory network analysis predicted regulons defining specific subpopulations•Laminectomy sham surgery was sufficient to induce the activation of astrocyte-lineage cells•This study unveiled additional sources/pools of resident progenitors in adult spinal cord
Wei et al. report the identification of the subpopulations of GFAP-expressing cells in healthy and injured adult spinal cords using single-cell RNA-seq. Bioinformatic and functional investigations of these subpopulations revealed the signature, location, morphology, and proliferation and differentiation abilities of the resident progenitors in the spinal cord.