High-throughput screening (HTS) is one of the main strategies to identify novel entry points for the development of small molecule chemical probes and drugs and is now commonly accessible to public ...sector research. Large amounts of data generated in HTS campaigns are submitted to public repositories such as PubChem, which is growing at an exponential rate. The diversity and quantity of available HTS assays and screening results pose enormous challenges to organizing, standardizing, integrating, and analyzing the datasets and thus to maximize the scientific and ultimately the public health impact of the huge investments made to implement public sector HTS capabilities. Novel approaches to organize, standardize and access HTS data are required to address these challenges.
We developed the first ontology to describe HTS experiments and screening results using expressive description logic. The BioAssay Ontology (BAO) serves as a foundation for the standardization of HTS assays and data and as a semantic knowledge model. In this paper we show important examples of formalizing HTS domain knowledge and we point out the advantages of this approach. The ontology is available online at the NCBO bioportal http://bioportal.bioontology.org/ontologies/44531.
After a large manual curation effort, we loaded BAO-mapped data triples into a RDF database store and used a reasoner in several case studies to demonstrate the benefits of formalized domain knowledge representation in BAO. The examples illustrate semantic querying capabilities where BAO enables the retrieval of inferred search results that are relevant to a given query, but are not explicitly defined. BAO thus opens new functionality for annotating, querying, and analyzing HTS datasets and the potential for discovering new knowledge by means of inference.
Abstract
The Library of Integrated Network-Based Cellular Signatures (LINCS) is an NIH Common Fund program with the goal of generating a large-scale and comprehensive catalogue of ...perturbation-response signatures by utilizing a diverse collection of perturbations across many model systems and assay types. The LINCS Data Portal (LDP) has been the primary access point for the compendium of LINCS data and has been widely utilized. Here, we report the first major update of LDP (http://lincsportal.ccs.miami.edu/signatures) with substantial changes in the data architecture and APIs, a completely redesigned user interface, and enhanced curated metadata annotations to support more advanced, intuitive and deeper querying, exploration and analysis capabilities. The cornerstone of this update has been the decision to reprocess all high-level LINCS datasets and make them accessible at the data point level enabling users to directly access and download any subset of signatures across the entire library independent from the originating source, project or assay. Access to the individual signatures also enables the newly implemented signature search functionality, which utilizes the iLINCS platform to identify conditions that mimic or reverse gene set queries. A newly designed query interface enables global metadata search with autosuggest across all annotations associated with perturbations, model systems, and signatures.
Depending on endoplasmic reticulum (ER) stress levels, the ER transmembrane multidomain protein IRE1α promotes either adaptation or apoptosis. Unfolded ER proteins cause IRE1α lumenal domain ...homo-oligomerization, inducing trans autophosphorylation that further drives homo-oligomerization of its cytosolic kinase/endoribonuclease (RNase) domains to activate mRNA splicing of adaptive XBP1 transcription factor. However, under high/chronic ER stress, IRE1α surpasses an oligomerization threshold that expands RNase substrate repertoire to many ER-localized mRNAs, leading to apoptosis. To modulate these effects, we developed ATP-competitive IRE1α Kinase-Inhibiting RNase Attenuators—KIRAs—that allosterically inhibit IRE1α’s RNase by breaking oligomers. One optimized KIRA, KIRA6, inhibits IRE1α in vivo and promotes cell survival under ER stress. Intravitreally, KIRA6 preserves photoreceptor functional viability in rat models of ER stress-induced retinal degeneration. Systemically, KIRA6 preserves pancreatic β cells, increases insulin, and reduces hyperglycemia in Akita diabetic mice. Thus, IRE1α powerfully controls cell fate but can itself be controlled with small molecules to reduce cell degeneration.
Display omitted
•ER stress triggers a “terminal UPR” via hyperactivation of IRE1α, an ER kinase/RNase•Hyperactive IRE1α’s RNase degrades ER-localized mRNAs and miRNAs to cause apoptosis•Kinase-Inhibiting RNase-Attenuators (KIRAs) reduce IRE1α’s RNase activity•KIRA6—an advanced KIRA—preserves cell survival and function in many ER stress models
Allosteric inhibitors of the unfolded protein response sensor IRE1α promote cell survival and function under ER stress and in rodent models of diabetes and retinitis pigmentosa.
The Illuminating the Druggable Genome (IDG) project aims to improve our understanding of understudied proteins and our ability to study them in the context of disease biology by perturbing them with ...small molecules, biologics, or other therapeutic modalities. Two main products from the IDG effort are the Target Central Resource Database (TCRD) (http://juniper.health.unm.edu/tcrd/), which curates and aggregates information, and Pharos (https://pharos.nih.gov/), a web interface for fusers to extract and visualize data from TCRD. Since the 2021 release, TCRD/Pharos has focused on developing visualization and analysis tools that help reveal higher-level patterns in the underlying data. The current iterations of TCRD and Pharos enable users to perform enrichment calculations based on subsets of targets, diseases, or ligands and to create interactive heat maps and UpSet charts of many types of annotations. Using several examples, we show how to address disease biology and drug discovery questions through enrichment calculations and UpSet charts.
PNCK, or CAMK1b, is an understudied kinase of the calcium-calmodulin dependent kinase family which recently has been identified as a marker of cancer progression and survival in several large-scale ...multi-omics studies. The biology of PNCK and its relation to oncogenesis has also begun to be elucidated, with data suggesting various roles in DNA damage response, cell cycle control, apoptosis and HIF-1-alpha related pathways. To further explore PNCK as a clinical target, potent small-molecule molecular probes must be developed. Currently, there are no targeted small molecule inhibitors in pre-clinical or clinical studies for the CAMK family. Additionally, there exists no experimentally derived crystal structure for PNCK. We herein report a three-pronged chemical probe discovery campaign which utilized homology modeling, machine learning, virtual screening and molecular dynamics to identify small molecules with low-micromolar potency against PNCK activity from commercially available compound libraries. We report the discovery of a hit-series for the first targeted effort towards discovering PNCK inhibitors that will serve as the starting point for future medicinal chemistry efforts for hit-to-lead optimization of potent chemical probes.
Abstract
Millions of transcriptome samples were generated by the Library of Integrated Network-based Cellular Signatures (LINCS) program. When these data are processed into searchable signatures ...along with signatures extracted from Genotype-Tissue Expression (GTEx) and Gene Expression Omnibus (GEO), connections between drugs, genes, pathways and diseases can be illuminated. SigCom LINCS is a webserver that serves over a million gene expression signatures processed, analyzed, and visualized from LINCS, GTEx, and GEO. SigCom LINCS is built with Signature Commons, a cloud-agnostic skeleton Data Commons with a focus on serving searchable signatures. SigCom LINCS provides a rapid signature similarity search for mimickers and reversers given sets of up and down genes, a gene set, a single gene, or any search term. Additionally, users of SigCom LINCS can perform a metadata search to find and analyze subsets of signatures and find information about genes and drugs. SigCom LINCS is findable, accessible, interoperable, and reusable (FAIR) with metadata linked to standard ontologies and vocabularies. In addition, all the data and signatures within SigCom LINCS are available via a well-documented API. In summary, SigCom LINCS, available at https://maayanlab.cloud/sigcom-lincs, is a rich webserver resource for accelerating drug and target discovery in systems pharmacology.
Graphical Abstract
Graphical Abstract
SigCom LINCS user interface workflow map. SigCom LINCS has several entry points to query the database for mimicking and reversing signatures. Users can submit any search term. The search term can be converted to a gene set using the Geneshot or the Enrichr API or used to retrieve SigCom LINCS signatures. Users of SigCom LINCS can also start with a single human gene. The single human gene can be converted into up and down gene sets using co-expression data from ARCHS4 or submitted for reverse search using two Appyters. Users of SigCom LINCS can also submit a gene set or up and down gene sets for signature search.
Glioblastoma multiforme (GBM) is the most malignant primary adult brain tumor. The current standard of care is surgical resection, radiation, and chemotherapy treatment, which extends life in most ...cases. Unfortunately, tumor recurrence is nearly universal and patients with recurrent glioblastoma typically survive <1 year. Therefore, new therapies and therapeutic combinations need to be developed that can be quickly approved for use in patients. However, in order to gain approval, therapies need to be safe as well as effective. One possible means of attaining rapid approval is repurposing FDA approved compounds for GBM therapy. However, candidate compounds must be able to penetrate the blood-brain barrier (BBB) and therefore a selection process has to be implemented to identify such compounds that can eliminate GBM tumor expansion. We review here psychiatric and non-psychiatric compounds that may be effective in GBM, as well as potential drugs targeting cell death pathways. We also discuss the potential of data-driven computational approaches to identify compounds that induce cell death in GBM cells, enabled by large reference databases such as the Library of Integrated Network Cell Signatures (LINCS). Finally, we argue that identifying pathways dysregulated in GBM in a patient specific manner is essential for effective repurposing in GBM and other gliomas.
Glioblastoma (GBM) is the most common primary adult brain tumor. Despite extensive efforts, the median survival for GBM patients is approximately 14 months. GBM therapy could benefit greatly from ...patient-specific targeted therapies that maximize treatment efficacy. Here we report a platform termed SynergySeq to identify drug combinations for the treatment of GBM by integrating information from The Cancer Genome Atlas (TCGA) and the Library of Integrated Network-Based Cellular Signatures (LINCS). We identify differentially expressed genes in GBM samples and devise a consensus gene expression signature for each compound using LINCS L1000 transcriptional profiling data. The SynergySeq platform computes disease discordance and drug concordance to identify combinations of FDA-approved drugs that induce a synergistic response in GBM. Collectively, our studies demonstrate that combining disease-specific gene expression signatures with LINCS small molecule perturbagen-response signatures can identify preclinical combinations for GBM, which can potentially be tested in humans.
Deep learning is a machine learning technique to model high-level abstractions in data by utilizing a graph composed of multiple processing layers that experience various linear and non-linear ...transformations. This technique has been shown to perform well for applications in drug discovery, utilizing structural features of small molecules to predict activity. Here, we report a large-scale study to predict the activity of small molecules across the human kinome-a major family of drug targets, particularly in anti-cancer agents. While small-molecule kinase inhibitors exhibit impressive clinical efficacy in several different diseases, resistance often arises through adaptive kinome reprogramming or subpopulation diversity. Polypharmacology and combination therapies offer potential therapeutic strategies for patients with resistant diseases. Their development would benefit from a more comprehensive and dense knowledge of small-molecule inhibition across the human kinome. Leveraging over 650,000 bioactivity annotations for more than 300,000 small molecules, we evaluated multiple machine learning methods to predict the small-molecule inhibition of 342 kinases across the human kinome. Our results demonstrated that multi-task deep neural networks outperformed classical single-task methods, offering the potential for conducting large-scale virtual screening, predicting activity profiles, and bridging the gaps in the available data.