Transcriptome profiling followed by differential gene expression analysis often leads to lists of genes that are hard to analyze and interpret. Functional genomics tools are powerful approaches for ...downstream analysis, as they summarize the large and noisy gene expression space into a smaller number of biological meaningful features. In particular, methods that estimate the activity of processes by mapping transcripts level to process members are popular. However, footprints of either a pathway or transcription factor (TF) on gene expression show superior performance over mapping-based gene sets. These footprints are largely developed for humans and their usability in the broadly-used model organism Mus musculus is uncertain. Evolutionary conservation of the gene regulatory system suggests that footprints of human pathways and TFs can functionally characterize mice data. In this paper we analyze this hypothesis. We perform a comprehensive benchmark study exploiting two state-of-the-art footprint methods, DoRothEA and an extended version of PROGENy. These methods infer TF and pathway activity, respectively. Our results show that both can recover mouse perturbations, confirming our hypothesis that footprints are conserved between mice and humans. Subsequently, we illustrate the usability of PROGENy and DoRothEA by recovering pathway/TF-disease associations from newly generated disease sets. Additionally, we provide pathway and TF activity scores for a large collection of human and mouse perturbation and disease experiments (2374). We believe that this resource, available for interactive exploration and download (https://saezlab.shinyapps.io/footprint_scores/), can have broad applications including the study of diseases and therapeutics.
•Functional analysis with human gene sets recovers pathway and TF perturbation in mice.•Mice data can be functionally characterized with human gene sets.•Transcriptomic perturbation signatures are largely conserved between mouse and human.•Pathway/TF-disease associations are recovered across both species.
Many functional analysis tools have been developed to extract functional and mechanistic insight from bulk transcriptome data. With the advent of single-cell RNA sequencing (scRNA-seq), it is in ...principle possible to do such an analysis for single cells. However, scRNA-seq data has characteristics such as drop-out events and low library sizes. It is thus not clear if functional TF and pathway analysis tools established for bulk sequencing can be applied to scRNA-seq in a meaningful way.
To address this question, we perform benchmark studies on simulated and real scRNA-seq data. We include the bulk-RNA tools PROGENy, GO enrichment, and DoRothEA that estimate pathway and transcription factor (TF) activities, respectively, and compare them against the tools SCENIC/AUCell and metaVIPER, designed for scRNA-seq. For the in silico study, we simulate single cells from TF/pathway perturbation bulk RNA-seq experiments. We complement the simulated data with real scRNA-seq data upon CRISPR-mediated knock-out. Our benchmarks on simulated and real data reveal comparable performance to the original bulk data. Additionally, we show that the TF and pathway activities preserve cell type-specific variability by analyzing a mixture sample sequenced with 13 scRNA-seq protocols. We also provide the benchmark data for further use by the community.
Our analyses suggest that bulk-based functional analysis tools that use manually curated footprint gene sets can be applied to scRNA-seq data, partially outperforming dedicated single-cell tools. Furthermore, we find that the performance of functional analysis tools is more sensitive to the gene sets than to the statistic used.
The growing availability of single-cell data, especially transcriptomics, has sparked an increased interest in the inference of cell-cell communication. Many computational tools were developed for ...this purpose. Each of them consists of a resource of intercellular interactions prior knowledge and a method to predict potential cell-cell communication events. Yet the impact of the choice of resource and method on the resulting predictions is largely unknown. To shed light on this, we systematically compare 16 cell-cell communication inference resources and 7 methods, plus the consensus between the methods' predictions. Among the resources, we find few unique interactions, a varying degree of overlap, and an uneven coverage of specific pathways and tissue-enriched proteins. We then examine all possible combinations of methods and resources and show that both strongly influence the predicted intercellular interactions. Finally, we assess the agreement of cell-cell communication methods with spatial colocalisation, cytokine activities, and receptor protein abundance and find that predictions are generally coherent with those data modalities. To facilitate the use of the methods and resources described in this work, we provide LIANA, a LIgand-receptor ANalysis frAmework as an open-source interface to all the resources and methods.
Comparing SARS-CoV-2 infection-induced gene expression signatures to drug treatment-induced gene expression signatures is a promising bioinformatic tool to repurpose existing drugs against ...SARS-CoV-2. The general hypothesis of signature-based drug repurposing is that drugs with inverse similarity to a disease signature can reverse disease phenotype and thus be effective against it. However, in the case of viral infection diseases, like SARS-CoV-2, infected cells also activate adaptive, antiviral pathways, so that the relationship between effective drug and disease signature can be more ambiguous. To address this question, we analysed gene expression data from in vitro SARS-CoV-2 infected cell lines, and gene expression signatures of drugs showing anti-SARS-CoV-2 activity. Our extensive functional genomic analysis showed that both infection and treatment with in vitro effective drugs leads to activation of antiviral pathways like NFkB and JAK-STAT. Based on the similarity-and not inverse similarity-between drug and infection-induced gene expression signatures, we were able to predict the in vitro antiviral activity of drugs. We also identified SREBF1/2, key regulators of lipid metabolising enzymes, as the most activated transcription factors by several in vitro effective antiviral drugs. Using a fluorescently labeled cholesterol sensor, we showed that these drugs decrease the cholesterol levels of plasma-membrane. Supplementing drug-treated cells with cholesterol reversed the in vitro antiviral effect, suggesting the depleting plasma-membrane cholesterol plays a key role in virus inhibitory mechanism. Our results can help to more effectively repurpose approved drugs against SARS-CoV-2, and also highlights key mechanisms behind their antiviral effect.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
It is still not possible to predict whether a given molecule will have a perceived odor or what olfactory percept it will produce. We therefore organized the crowd-sourced DREAM Olfaction Prediction ...Challenge. Using a large olfactory psychophysical data set, teams developed machine-learning algorithms to predict sensory attributes of molecules based on their chemoinformatic features. The resulting models accurately predicted odor intensity and pleasantness and also successfully predicted 8 among 19 rated semantic descriptors (“garlic,” “fish,” “sweet,” “fruit,” “burnt,” “spices,” “flower,” and “sour”). Regularized linear models performed nearly as well as random forest–based ones, with a predictive accuracy that closely approaches a key theoretical limit. These models help to predict the perceptual qualities of virtually any molecule with high accuracy and also reverse-engineer the smell of a molecule.
High dimensional characterization of drug targets, compound effects and disease phenotypes are crucial for increased efficiency of drug discovery. High-throughput gene expression measurements are one ...of the most frequently used data acquisition methods for such a systems level analysis of biological phenotypes. RNA sequencing allows genome wide quantification of transcript abundances, recently even on the level of single cells. However, the correct, mechanistic interpretation of transcriptomic measurements is complicated by the fact that gene expression changes can be both the cause and the consequence of altered phenotype. Perturbation gene expression profiles, where gene expression is measured after a genetic or chemical perturbation, can help to overcome these problems by directly connecting the causal perturbations to their gene expression consequences. In this Review, we discuss the main large scale perturbation gene expression profile datasets, and their application in the drug discovery process, covering mechanisms of action identification, drug repurposing, pathway activity analysis and quantitative modelling.
Prostate cancer is the second most occurring cancer in men worldwide. To better understand the mechanisms of tumorigenesis and possible treatment responses, we developed a mathematical model of ...prostate cancer which considers the major signalling pathways known to be deregulated. We personalised this Boolean model to molecular data to reflect the heterogeneity and specific response to perturbations of cancer patients. A total of 488 prostate samples were used to build patient-specific models and compared to available clinical data. Additionally, eight prostate cell line-specific models were built to validate our approach with dose-response data of several drugs. The effects of single and combined drugs were tested in these models under different growth conditions. We identified 15 actionable points of interventions in one cell line-specific model whose inactivation hinders tumorigenesis. To validate these results, we tested nine small molecule inhibitors of five of those putative targets and found a dose-dependent effect on four of them, notably those targeting HSP90 and PI3K. These results highlight the predictive power of our personalised Boolean models and illustrate how they can be used for precision oncology.
Babur et al. (2021) developed the CausalPath tool to infer causal signaling interactions in high-throughput proteomics data that may foster mechanical understanding from large-scale biological ...datasets.
Babur et al. (2021) developed the CausalPath tool to infer causal signaling interactions in high-throughput proteomics data that may foster mechanical understanding from large-scale biological datasets.
Angiotensin II (AngII) is a vasoactive peptide hormone, which, under pathological conditions, contributes to the development of cardiovascular diseases. Oxysterols, including 25-hydroxycholesterol ...(25-HC), the product of cholesterol-25-hydroxylase (CH25H), also have detrimental effects on vascular health by affecting vascular smooth muscle cells (VSMCs). We investigated AngII-induced gene expression changes in VSMCs to explore whether AngII stimulus and 25-HC production have a connection in the vasculature. RNA-sequencing revealed that
is significantly upregulated in response to AngII stimulus. The
mRNA levels were elevated robustly (~50-fold) 1 h after AngII (100 nM) stimulation compared to baseline levels. Using inhibitors, we specified that the AngII-induced
upregulation is type 1 angiotensin II receptor- and G
activity-dependent. Furthermore, p38 MAPK has a crucial role in the upregulation of
. We performed LC-MS/MS to identify 25-HC in the supernatant of AngII-stimulated VSMCs. In the supernatants, 25-HC concentration peaked 4 h after AngII stimulation. Our findings provide insight into the pathways mediating AngII-induced
upregulation. Our study elucidates a connection between AngII stimulus and 25-HC production in primary rat VSMCs. These results potentially lead to the identification and understanding of new mechanisms in the pathogenesis of vascular impairments.
G Protein Coupled Receptors (GPCR) can form dimers or higher ordered oligomers, the process of which can remarkably influence the physiological and pharmacological function of these receptors. ...Quantitative Bioluminescence Resonance Energy Transfer (qBRET) measurements are the gold standards to prove the direct physical interaction between the protomers of presumed GPCR dimers. For the correct interpretation of these experiments, the expression of the energy donor Renilla luciferase labeled receptor has to be maintained constant, which is hard to achieve in expression systems. To analyze the effects of non-constant donor expression on qBRET curves, we performed Monte Carlo simulations. Our results show that the decrease of donor expression can lead to saturation qBRET curves even if the interaction between donor and acceptor labeled receptors is non-specific leading to false interpretation of the dimerization state. We suggest here a new approach to the analysis of qBRET data, when the BRET ratio is plotted as a function of the acceptor labeled receptor expression at various donor receptor expression levels. With this method, we were able to distinguish between dimerization and non-specific interaction when the results of classical qBRET experiments were ambiguous. The simulation results were confirmed experimentally using rapamycin inducible heterodimerization system. We used this new method to investigate the dimerization of various GPCRs, and our data have confirmed the homodimerization of V2 vasopressin and CaSR calcium sensing receptors, whereas our data argue against the heterodimerization of these receptors with other studied GPCRs, including type I and II angiotensin, β2 adrenergic and CB1 cannabinoid receptors.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK