Genome-wide association studies have become a popular strategy to find associations of genes to traits of interest. Despite the high-resolution available today to carry out genotyping studies, the ...success of its application in real studies has been limited by the testing strategy used. As an alternative to brute force solutions involving the use of very large cohorts, we propose the use of the Gene Set Analysis (GSA), a different analysis strategy based on testing the association of modules of functionally related genes. We show here how the Gene Set-based Analysis of Polymorphisms (GeSBAP), which is a simple implementation of the GSA strategy for the analysis of genome-wide association studies, provides a significant increase in the power testing for this type of studies. GeSBAP is freely available at http://bioinfo.cipf.es/gesbap/
We present a new version of Babelomics, a complete suite of web tools for functional analysis of genome-scale experiments, with new and improved tools. New functionally relevant terms have been ...included such as CisRed motifs or bioentities obtained by text-mining procedures. An improved indexing has considerably speeded up several of the modules. An improved version of the FatiScan method for studying the coordinate behaviour of groups of functionally related genes is presented, along with a similar tool, the Gene Set Enrichment Analysis. Babelomics is now more oriented to test systems biology inspired hypotheses. Babelomics can be found at http://www.babelomics.org.
Abstract
Summary
bollito is an automated, flexible and parallelizable computational pipeline for the comprehensive analysis of single-cell RNA-seq data. Starting from FASTQ files or preprocessed ...expression matrices, bollito performs both basic and advanced tasks in single-cell analysis integrating >30 state-of-the-art tools. This includes quality control, read alignment, dimensionality reduction, clustering, cell-marker detection, differential expression, functional analysis, trajectory inference and RNA velocity. bollito is built using the Snakemake workflow management system, which easily connects each execution step and facilitates the reproducibility of results. bollito’s modular design makes it easy to incorporate other packages into the pipeline enabling its expansion with new functionalities.
Availability and implementation
Source code is freely available at https://gitlab.com/bu_cnio/bollito under the MIT license.
Supplementary information
Supplementary data are available at Bioinformatics online.
The ultimate goal of any genome-scale experiment is to provide a functional interpretation of the data, relating the available information with the hypotheses that originated the experiment. Thus, ...functional profiling methods have become essential in diverse scenarios such as microarray experiments, proteomics, etc. We present the FatiGO+, a web-based tool for the functional profiling of genome-scale experiments, specially oriented to the interpretation of microarray experiments. In addition to different functional annotations (gene ontology, KEGG pathways, Interpro motifs, Swissprot keywords and text-mining based bioentities related to diseases and chemical compounds) FatiGO+ includes, as a novelty, regulatory and structural information. The regulatory information used includes predictions of targets for distinct regulatory elements (obtained from the Transfac and CisRed databases). Additionally FatiGO+ uses predictions of target motifs of miRNA to infer which of these can be activated or deactivated in the sample of genes studied. Finally, properties of gene products related to their relative location and connections in the interactome have also been used. Also, enrichment of any of these functional terms can be directly analysed on chromosomal coordinates. FatiGO+ can be found at: http://www.fatigoplus.org and within the Babelomics environment http://www.babelomics.org
Calreticulin (CALR) mutations are frequent, disease-initiating events in myeloproliferative neoplasms (MPNs). Although the biological mechanism by which CALR mutations cause MPNs has been elucidated, ...there currently are no clonally selective therapies for CALR-mutant MPNs. To identify unique genetic dependencies in CALR-mutant MPNs, we performed a whole-genome clustered regularly interspaced short palindromic repeats (CRISPR) knockout depletion screen in mutant CALR-transformed hematopoietic cells. We found that genes in the N-glycosylation pathway (among others) were differentially depleted in mutant CALR-transformed cells as compared with control cells. Using a focused pharmacological in vitro screen targeting unique vulnerabilities uncovered in the CRISPR screen, we found that chemical inhibition of N-glycosylation impaired the growth of mutant CALR-transformed cells, through a reduction in MPL cell surface expression. We treated Calr-mutant knockin mice with the N-glycosylation inhibitor 2-deoxy-glucose (2-DG) and found a preferential sensitivity of Calr-mutant cells to 2-DG as compared with wild-type cells and normalization of key MPNs disease features. To validate our findings in primary human cells, we performed megakaryocyte colony-forming unit (CFU-MK) assays. We found that N-glycosylation inhibition significantly reduced CFU-MK formation in patient-derived CALR-mutant bone marrow as compared with bone marrow derived from healthy donors. In aggregate, our findings advance the development of clonally selective treatments for CALR-mutant MPNs.
Large-sequencing cancer genome projects have shown that tumors have thousands of molecular alterations and their frequency is highly heterogeneous. In such scenarios, physicians and oncologists ...routinely face lists of cancer genomic alterations where only a minority of them are relevant biomarkers to drive clinical decision-making. For this reason, the medical community agrees on the urgent need of methodologies to establish the relevance of tumor alterations, assisting in genomic profile interpretation, and, more importantly, to prioritize those that could be clinically actionable for cancer therapy.
We present PanDrugs, a new computational methodology to guide the selection of personalized treatments in cancer patients using the variant lists provided by genome-wide sequencing analyses. PanDrugs offers the largest database of drug-target associations available from well-known targeted therapies to preclinical drugs. Scoring data-driven gene cancer relevance and drug feasibility PanDrugs interprets genomic alterations and provides a prioritized evidence-based list of anticancer therapies. Our tool represents the first drug prescription strategy applying a rational based on pathway context, multi-gene markers impact and information provided by functional experiments. Our approach has been systematically applied to TCGA patients and successfully validated in a cancer case study with a xenograft mouse model demonstrating its utility.
PanDrugs is a feasible method to identify potentially druggable molecular alterations and prioritize drugs to facilitate the interpretation of genomic landscape and clinical decision-making in cancer patients. Our approach expands the search of druggable genomic alterations from the concept of cancer driver genes to the druggable pathway context extending anticancer therapeutic options beyond already known cancer genes. The methodology is public and easily integratable with custom pipelines through its programmatic API or its docker image. The PanDrugs webtool is freely accessible at http://www.pandrugs.org .
In mammalian cells, chromosomal replication starts at thousands of origins at which replisomes are assembled. Replicative stress triggers additional initiation events from 'dormant' origins whose ...genomic distribution and regulation are not well understood. In this study, we have analyzed origin activity in mouse embryonic stem cells in the absence or presence of mild replicative stress induced by aphidicolin, a DNA polymerase inhibitor, or by deregulation of origin licensing factor CDC6. In both cases, we observe that the majority of stress-responsive origins are also active in a small fraction of the cell population in a normal S phase, and stress increases their frequency of activation. In a search for the molecular determinants of origin efficiency, we compared the genetic and epigenetic features of origins displaying different levels of activation, and integrated their genomic positions in three-dimensional chromatin interaction networks derived from high-depth Hi-C and promoter-capture Hi-C data. We report that origin efficiency is directly proportional to the proximity to transcriptional start sites and to the number of contacts established between origin-containing chromatin fragments, supporting the organization of origins in higher-level DNA replication factories.
The causes for malignant progression of disseminated tumors and the reasons recurrence rates differ in women with different breast cancer subtypes are unknown. Here, we report novel mechanisms of ...tumor plasticity that are mandated by microenvironmental factors and show that recurrence rates are not strictly due to cell-intrinsic properties. Specifically, outgrowth of the same population of incipient tumors is accelerated in mice with triple-negative breast cancer (TNBC) relative to those with luminal breast cancer. Systemic signals provided by overt TNBCs cause the formation of a tumor-supportive microenvironment enriched for EGF and insulin-like growth factor-I (IGF-I) at distant indolent tumor sites. Bioavailability of EGF and IGF-I enhances the expression of transcription factors associated with pluripotency, proliferation, and epithelial-mesenchymal transition. Combinatorial therapy with EGF receptor and IGF-I receptor inhibitors prevents malignant progression. These results suggest that plasticity and recurrence rates can be dictated by host systemic factors and offer novel therapeutic potential for patients with TNBC.
Abstract
Genomics studies routinely confront researchers with long lists of tumor alterations detected in patients. Such lists are difficult to interpret since only a minority of the alterations are ...relevant biomarkers for diagnosis and for designing therapeutic strategies. PanDrugs is a methodology that facilitates the interpretation of tumor molecular alterations and guides the selection of personalized treatments. To do so, PanDrugs scores gene actionability and drug feasibility to provide a prioritized evidence-based list of drugs. Here, we introduce PanDrugs2, a major upgrade of PanDrugs that, in addition to somatic variant analysis, supports a new integrated multi-omics analysis which simultaneously combines somatic and germline variants, copy number variation and gene expression data. Moreover, PanDrugs2 now considers cancer genetic dependencies to extend tumor vulnerabilities providing therapeutic options for untargetable genes. Importantly, a novel intuitive report to support clinical decision-making is generated. PanDrugs database has been updated, integrating 23 primary sources that support >74K drug–gene associations obtained from 4642 genes and 14 659 unique compounds. The database has also been reimplemented to allow semi-automatic updates to facilitate maintenance and release of future versions. PanDrugs2 does not require login and is freely available at https://www.pandrugs.org/.
Graphical Abstract
Graphical Abstract