•DILIrank contains the largest number of drugs (N=1036) ranked by their risk for causing DILI.•The existing drug labeling-based DILI annotation was enhanced by weighing evidence of causality.•Drugs ...were classified as verified vMost-, vLess-, vNo-DILI-concern, leaving out terminology ‘Ambiguous DILI-concern’ drugs.•DILIrank is invaluable for the development of predictive models using emerging technologies.
High-throughput methods are powerful tools to develop predictive models for assessing drug-induced liver injury (DILI). However, the development of predictive models requires a drug reference list with an accurate annotation of DILI risk in humans. We previously developed a DILI annotation schema based on information curated from the US Food and Drug Administration (FDA)-approved drug labeling for 287 drugs. In this article, we refine the schema by weighing the evidence of causality (i.e., a verification process to evaluate a drug as the cause of DILI) and generate a data set that ranks the DILI risk (DILIrank) in humans for 1036 FDA-approved drugs, providing the largest annotated data set of such drugs in the public domain.
Persistent organic pollutants (POPs) present in foods have been a major concern for food safety due to their persistence and toxic effects. To ensure food safety and protect human health from POPs, ...it is critical to achieve a better understanding of POP pathways into food and develop strategies to reduce human exposure. POPs could present in food in the raw stages, transferred from the environment or artificially introduced during food preparation steps. Exposure to these pollutants may cause various health problems such as endocrine disruption, cardiovascular diseases, cancers, diabetes, birth defects, and dysfunctional immune and reproductive systems. This review describes potential sources of POP food contamination, analytical approaches to measure POP levels in food and efforts to control food contamination with POPs.
Drug-induced liver injury (DILI) presents a significant challenge to drug development and regulatory science. The FDA's Liver Toxicity Knowledge Base (LTKB) evaluated >1000 drugs for their likelihood ...of causing DILI in humans, of which >700 drugs were classified into three categories (most-DILI, less-DILI, and no-DILI). Based on this dataset, we developed and compared 2-class and 3-class DILI prediction models using the machine learning algorithm of Decision Forest (DF) with Mold2 structural descriptors. The models were evaluated through 1000 iterations of 5-fold cross-validations, 1000 bootstrapping validations and 1000 permutation tests (that assessed the chance correlation). Furthermore, prediction confidence analysis was conducted, which provides an additional parameter for proper interpretation of prediction results. We revealed that the 3-class model not only had a higher resolution to estimate DILI risk but also showed an improved capability to differentiate most-DILI drugs from no-DILI drugs in comparison with the 2-class DILI model. We demonstrated the utility of the models for drug ingredients with warnings very recently issued by the FDA. Moreover, we identified informative molecular features important for assessing DILI risk. Our results suggested that the 3-class model presents a better option than the binary model (which most publications are focused on) for drug safety evaluation.
Drug‐induced liver injury (DILI) is a major public health concern, and improving its prediction remains an unmet challenge. Recently, we reported the Rule‐of‐2 (RO2) and found lipophilicity (logP ≥3) ...and daily dose ≥100 mg of oral medications to be associated with significant risk for DILI; however, the RO2 failed to estimate grades of DILI severity. In an effort to develop a quantitative metrics, we analyzed the association of daily dose, logP, and formation of reactive metabolites (RM) in a large set of Food and Drug Administration‐approved oral medications and found factoring RM into the RO2 to highly improve DILI prediction. Based on these parameters and by considering n = 354 drugs, an algorithm to assign a DILI score was developed. In univariate and multivariate logistic regression analyses the algorithm (i.e., DILI score model) defined the relative contribution of daily dose, logP, and RM and permitted a quantitative assessment of risk of clinical DILI. Furthermore, a clear relationship between calculated DILI scores and DILI risk was obtained when applied to three independent studies. The DILI score model was also functional with drug pairs defined by similar chemical structure and mode of action but divergent toxicities. Specifically, for drug pairs where the RO2 failed, the DILI score correctly identified toxic drugs. Finally, the model was applied to n = 159 clinical cases collected from the National Institutes of Health's LiverTox database to demonstrate that the DILI score correlated with the severity of clinical outcome. Conclusions: Based on daily dose, lipophilicity, and RM, a DILI score algorithm was developed that provides a scale of assessing the severity of DILI risk in humans associated with oral medications. (Hepatology 2016;64:931‐940)
Abstract
Animal studies are unavoidable in evaluating chemical and drug safety. Generative Adversarial Networks (GANs) can generate synthetic animal data by learning from the legacy animal study ...results, thus may serve as an alternative approach to assess untested chemicals. AnimalGAN, a GAN method to simulate 38 rat clinical pathology measures, was developed with significant robustness even for the drugs that vary significantly from these used during training, both in terms of chemical structure, drug class, and the year of FDA approval. AnimalGAN showed comparable results in hepatotoxicity assessment as using the real animal data and outperformed 12 conventional quantitative structure-activity relationship approaches. Using AnimalGAN, a virtual experiment of 100,000 rats ranked hepatotoxicity of three structurally similar drugs in a similar trend that has been observed in human population. AnimalGAN represented a significant step with artificial intelligence towards the global effort in replacement, reduction, and refinement (3Rs) of animal use.
Drug-induced liver injury (DILI) is a leading cause of drugs failing during clinical trials and being withdrawn from the market. Comparative analysis of drugs based on their DILI potential is an ...effective approach to discover key DILI mechanisms and risk factors. However, assessing the DILI potential of a drug is a challenge with no existing consensus methods. We proposed a systematic classification scheme using FDA-approved drug labeling to assess the DILI potential of drugs, which yielded a benchmark dataset with 287 drugs representing a wide range of therapeutic categories and daily dosage amounts. The method is transparent and reproducible with a potential to serve as a common practice to study the DILI of marketed drugs for supporting drug discovery and biomarker development.
Toxicogenomics enjoyed considerable attention as a ground-breaking addition to conventional toxicology assays at its inception. However, the pace at which toxicogenomics was expected to perform has ...been tempered in recent years. Next to cost, the lack of advanced knowledge discovery and data mining tools significantly hampered progress in this new field of toxicological sciences. Recently, two of the largest toxicogenomics databases were made freely available to the public. These comprehensive studies are expected to stimulate knowledge discovery and development of novel data mining tools, which are essential to advance this field. In this review, we provide a concise summary of each of these two databases with a brief discussion on the commonalities and differences between them. We place our emphasis on some key questions in toxicogenomics and how these questions can be appropriately addressed with the two databases. Finally, we provide a perspective on the future direction of toxicogenomics and how new technologies such as RNA-Seq may impact this field.
Combinatorial drug therapy can improve the therapeutic effect and reduce the corresponding adverse events.
strategies to classify synergistic vs. antagonistic drug pairs is more efficient than ...experimental strategies. However, most of the developed methods have been applied only to cancer therapies. In this study, we introduce a novel method, XGBoost, based on five features of drugs and biomolecular networks of their targets, to classify synergistic vs. antagonistic drug combinations from different drug categories. We found that XGBoost outperformed other classifiers in both stratified fivefold cross-validation (CV) and independent validation. For example, XGBoost achieved higher predictive accuracy than other models (0.86, 0.78, 0.78, and 0.83 for XGBoost, logistic regression, naïve Bayesian, and random forest, respectively) for an independent validation set. We also found that the five-feature XGBoost model is much more effective at predicting combinatorial therapies that have synergistic effects than those with antagonistic effects. The five-feature XGBoost model was also validated on TCGA data with accuracy of 0.79 among the 61 tested drug pairs, which is comparable to that of DeepSynergy. Among the 14 main anatomical/pharmacological groups classified according to WHO Anatomic Therapeutic Class, for drugs belonging to five groups, their prediction accuracy was significantly increased (odds ratio < 1) or reduced (odds ratio > 1) (Fisher's exact test,
< 0.05). This study concludes that our five-feature XGBoost model has significant benefits for classifying synergistic vs. antagonistic drug combinations.
All biological processes are inherently dynamic. Biological systems evolve transiently or sustainably according to sequential time points after perturbation by environment insults, drugs and ...chemicals. Investigating the temporal behavior of molecular events has been an important subject to understand the underlying mechanisms governing the biological system in response to, such as, drug treatment. The intrinsic complexity of time series data requires appropriate computational algorithms for data interpretation. In this study, we propose, for the first time, the application of dynamic topic models (DTM) for analyzing time-series gene expression data.
A large time-series toxicogenomics dataset was studied. It contains over 3144 microarrays of gene expression data corresponding to rat livers treated with 131 compounds (most are drugs) at two doses (control and high dose) in a repeated schedule containing four separate time points (4-, 8-, 15- and 29-day). We analyzed, with DTM, the topics (consisting of a set of genes) and their biological interpretations over these four time points. We identified hidden patterns embedded in this time-series gene expression profiles. From the topic distribution for compound-time condition, a number of drugs were successfully clustered by their shared mode-of-action such as PPARɑ agonists and COX inhibitors. The biological meaning underlying each topic was interpreted using diverse sources of information such as functional analysis of the pathways and therapeutic uses of the drugs. Additionally, we found that sample clusters produced by DTM are much more coherent in terms of functional categories when compared to traditional clustering algorithms.
We demonstrated that DTM, a text mining technique, can be a powerful computational approach for clustering time-series gene expression profiles with the probabilistic representation of their dynamic features along sequential time frames. The method offers an alternative way for uncovering hidden patterns embedded in time series gene expression profiles to gain enhanced understanding of dynamic behavior of gene regulation in the biological system.
•Next-generation sequencing technology accelerates rare disease research.•Accumulated genomics information aids orphan drug development.•Novel drug repositioning approaches for rare diseases ...facilitate precision medicine.•Efforts from different stakeholders boost rare disease treatment development.
There are tremendous unmet needs in drug development for rare diseases. Computational drug repositioning is a promising approach and has been successfully applied to the development of treatments for diseases. However, how to utilize this knowledge and effectively conduct and implement computational drug repositioning approaches for rare disease therapies is still an open issue. Here, we focus on the means of utilizing accumulated genomic data for accelerating and facilitating drug repositioning for rare diseases. First, we summarize the current genome landscape of rare diseases. Second, we propose several promising bioinformatics approaches and pipelines for computational drug repositioning for rare diseases. Finally, we discuss recent regulatory incentives and other enablers in rare disease drug development and outline the remaining challenges.