Bridging chemical and biological space is the key to drug discovery and development. Typically, cheminformatics methods operate under the assumption that similar chemicals have similar biological ...activity. Ideally then, one could predict a drug's biological function(s) given only its chemical structure by similarity searching in libraries of compounds with known activities. In practice, effectively choosing a similarity metric is case dependent. This work compares both 2D and 3D chemical descriptors as tools for predicting the biological targets of ligand probes, on the basis of their similarity to reference molecules in a 46 000 compound, biologically annotated chemical database. Overall, we found that the 2D methods employed here outperform the 3D (88% vs 67% success) in correct target prediction. However, the 3D descriptors proved superior in cases of probes with low structural similarity to other compounds in the database (singletons). Additionally, the 3D method (FEPOPS) shows promise for providing pharmacophoric alignment of the small molecules' chemical features consistent with those seen in experimental ligand/ receptor complexes. These results suggest that querying annotated chemical databases with a systematic combination of both 2D and 3D descriptors will prove more effective than employing single methods.
Preclinical Safety Pharmacology (PSP) attempts to anticipate adverse drug reactions (ADRs) during early phases of drug discovery by testing compounds in simple, in vitro binding assays (that is, ...preclinical profiling). The selection of PSP targets is based largely on circumstantial evidence of their contribution to known clinical ADRs, inferred from findings in clinical trials, animal experiments, and molecular studies going back more than forty years. In this work we explore PSP chemical space and its relevance for the prediction of adverse drug reactions. Firstly, in silico (computational) Bayesian models for 70 PSP‐related targets were built, which are able to detect 93 % of the ligands binding at IC50≤10 μM at an overall correct classification rate of about 94 %. Secondly, employing the World Drug Index (WDI), a model for adverse drug reactions was built directly based on normalized side‐effect annotations in the WDI, which does not require any underlying functional knowledge. This is, to our knowledge, the first attempt to predict adverse drug reactions across hundreds of categories from chemical structure alone. On average 90 % of the adverse drug reactions observed with known, clinically used compounds were detected, an overall correct classification rate of 92 %. Drugs withdrawn from the market (Rapacuronium, Suprofen) were tested in the model and their predicted ADRs align well with known ADRs. The analysis was repeated for acetylsalicylic acid and Benperidol which are still on the market. Importantly, features of the models are interpretable and back‐projectable to chemical structure, raising the possibility of rationally engineering out adverse effects. By combining PSP and ADR models new hypotheses linking targets and adverse effects can be proposed and examples for the opioid μ and the muscarinic M2 receptors, as well as for cyclooxygenase‐1 are presented. It is hoped that the generation of predictive models for adverse drug reactions is able to help support early SAR to accelerate drug discovery and decrease late stage attrition in drug discovery projects. In addition, models such as the ones presented here can be used for compound profiling in all development stages.
Preclinical Safety Pharmacology attempts to anticipate adverse drug reactions (ADRs) during early phases of drug discovery by testing compounds in simple, in vitro binding assays. In this paper we describe the successful application of cheminformatics methods to predict adverse side effects of drugs to accelerate drug discovery and decrease late stage attrition in drug discovery projects.
We present a workflow that leverages data from chemogenomics based target predictions with Systems Biology databases to better understand off-target related toxicities. By analyzing a set of ...compounds that share a common toxic phenotype and by comparing the pathways they affect with pathways modulated by nontoxic compounds we are able to establish links between pathways and particular adverse effects. We further link these predictive results with literature data in order to explain why a certain pathway is predicted. Specifically, relevant pathways are elucidated for the side effects rhabdomyolysis and hypotension. Prospectively, our approach is valuable not only to better understand toxicities of novel compounds early on but also for drug repurposing exercises to find novel uses for known drugs.
Computational target prediction methods using chemical descriptors have been applied exhaustively in drug discovery to elucidate the mechanisms-of-action (MOAs) of small molecules. To predict truly ...novel and unexpected small molecule–target interactions, compounds must be compared by means other than their chemical structure alone. Here we investigated predictions made by a method, HTS fingerprints (HTSFPs), that matches patterns of activities in experimental screens. Over 1,400 drugs and 1,300 natural products (NPs) were screened in more than 200 diverse assays, creating encodable activity patterns. The comparison of these activity patterns to an MOA-annotated reference panel led to the prediction of 5,281 and 2,798 previously unknown targets for the NP and drug sets, respectively. Intriguingly, there was limited overlap among the targets predicted; the drugs were more biased toward membrane receptors and the NPs toward soluble enzymes, consistent with the idea that they represent unexplored pharmacologies. Importantly, HTSFPs inferred targets that were beyond the prediction capabilities of standard chemical descriptors, especially for NPs but also for the more explored drug set. Of 65 drug–target predictions that we tested in vitro, 48 (73.8%) were confirmed with AC50 values ranging from 38 nM to 29 μM. Among these interactions was the inhibition of cyclooxygenases 1 and 2 by the HIV protease inhibitor Tipranavir. These newly discovered targets that are phylogenetically and phylochemically distant to the primary target provide an explanation for spontaneous bleeding events observed for patients treated with this drug, a physiological effect that was previously difficult to reconcile with the drug’s known MOA.
The increasing amount of chemogenomics data, that is, activity measurements of many compounds across a variety of biological targets, allows for better understanding of pharmacology in a broad ...biological context. Rather than assessing activity at individual biological targets, today understanding of compound interaction with complex biological systems and molecular pathways is often sought in phenotypic screens. This perspective poses novel challenges to structure–activity relationship (SAR) assessment. Today, the bottleneck of drug discovery lies in the understanding of SAR of rich datasets that go beyond single targets in the context of biological pathways, potential off-targets, and complex selectivity profiles. To aid in the understanding and interpretation of such complex SAR, we introduce Chemotography (chemotype chromatography), which encodes chemical space using a color spectrum by combining clustering and multidimensional scaling. Rich biological data in our approach were visualized using spatial dimensions traditionally reserved for chemical space. This allowed us to analyze SAR in the context of target hierarchies and phylogenetic trees, two-target activity scatter plots, and biological pathways. Chemotography, in combination with the Kyoto Encyclopedia of Genes and Genomes (KEGG), also allowed us to extract pathway-relevant SAR from the ChEMBL database. We identified chemotypes showing polypharmacology and selectivity-conferring scaffolds, even in cases where individual compounds have not been tested against all relevant targets. In addition, we analyzed SAR in ChEMBL across the entire Kinome, going beyond individual compounds. Our method combines the strengths of chemical space visualization for SAR analysis and graphical representation of complex biological data. Chemotography is a new paradigm for chemogenomic data visualization and its versatile applications presented here may allow for improved assessment of SAR in biological context, such as phenotypic assay hit lists.
•Ongoing efforts at our company to integrate and structure chemogenomics data are described.•The usefulness of a chemogenomics database for common applications in drug discovery is ...highlighted.•Applications such as focused library design, model building and MoA deconvolution are shown.•Common caveats and pitfalls in capturing and interpreting chemogenomics data are discussed.
Increasing amounts of biological data are accumulating in the pharmaceutical industry and academic institutions. However, data does not equal actionable information, and guidelines for appropriate data capture, harmonization, integration, mining, and visualization need to be established to fully harness its potential. Here, we describe ongoing efforts at Merck & Co. to structure data in the area of chemogenomics. We are integrating complementary data from both internal and external data sources into one chemogenomics database (Chemical Genetic Interaction Enterprise; CHEMGENIE). Here, we demonstrate how this well-curated database facilitates compound set design, tool compound selection, target deconvolution in phenotypic screening, and predictive model building.
Conventional similarity searching of molecules compares single (or multiple) active query structures to each other in a relative framework, by means of a structural descriptor and a similarity ...measure. While this often works well, depending on the target, we show here that retrieval rates can be improved considerably by incorporating an external framework describing ligand bioactivity space for comparisons (“Bayes affinity fingerprints”). Structures are described by Bayes scores for a ligand panel comprising about 1000 activity classes extracted from the WOMBAT database. The comparison of structures is performed via the Pearson correlation coefficient of activity classes, that is, the order in which two structures are similar to the panel activity classes. Compound retrieval on a recently published data set could be improved by as much as 24% relative (9% absolute). Knowledge about the shape of the “bioactive chemical universe” is thus beneficial to identifying similar bioactivities. Principal component analysis was employed to further analyze activity space with the objective to define orthogonal ligand bioactive chemical space, leading to nine major (roughly orthogonal) activity axes. Employing only those nine activity classes, retrieval rates are still comparable to original Bayes affinity fingerprints; thus, the concept of orthogonal bioactive ligand chemical space was validated as being an information-rich but low-dimensional representation of bioactivity space. Correlations between activity classes are a major determinant to gauge whether the desired multitarget activity of drugs is (on the basis of current knowledge) a feasible concept because it measures the extent to which activities can be optimized independently, or only by strongly influencing one another.
Autophagy is a dynamic process that regulates lysosomal-dependent degradation of cellular components. Until recently the study of autophagy has been hampered by the lack of reliable pharmacological ...tools, but selective inhibitors are now available to modulate the PI 3-kinase VPS34, which is required for autophagy. Here we describe the discovery of potent and selective VPS34 inhibitors, their pharmacokinetic (PK) properties, and ability to inhibit autophagy in cellular and mouse models.