Abstract
In drug development, preclinical safety and pharmacokinetics assessments of candidate drugs to ensure the safety profile are a must. While in vivo and in vitro tests are traditionally used, ...experimental determinations have disadvantages, as they are usually time-consuming and costly. In silico predictions of these preclinical endpoints have each been developed in the past decades. However, only a few web-based tools have integrated different models to provide a simple one-step platform to help researchers thoroughly evaluate potential drug candidates. To efficiently achieve this approach, a platform for preclinical evaluation must not only predict key ADMET (absorption, distribution, metabolism, excretion and toxicity) properties but also provide some guidance on structural modifications to improve the undesired properties. In this review, we organized and compared several existing integrated web servers that can be adopted in preclinical drug development projects to evaluate the subject of interest. We also introduced our new web server, Virtual Rat, as an alternative choice to profile the properties of drug candidates. In Virtual Rat, we provide not only predictions of important ADMET properties but also possible reasons as to why the model made those structural predictions. Multiple models were implemented into Virtual Rat, including models for predicting human ether-a-go-go-related gene (hERG) inhibition, cytochrome P450 (CYP) inhibition, mutagenicity (Ames test), blood–brain barrier penetration, cytotoxicity and Caco-2 permeability. Virtual Rat is free and has been made publicly available at https://virtualrat.cmdm.tw/.
Abstract
Summary
P-glycoprotein (P-gp) is a member of ABC transporter family that actively pumps xenobiotics out of cells to protect organisms from toxic compounds. P-gp substrates can be easily ...pumped out of the cells to reduce their absorption; conversely P-gp inhibitors can reduce such pumping activity. Hence, it is crucial to know if a drug is a P-gp substrate or inhibitor in view of pharmacokinetics. Here we present PgpRules, an online P-gp substrate and P-gp inhibitor prediction server with ruled-sets. The two models were built using classification and regression tree algorithm. For each compound uploaded, PgpRules not only predicts whether the compound is a P-gp substrate or a P-gp inhibitor, but also provides the rules containing chemical structural features for further structural optimization.
Availability and implementation
PgpRules is freely accessible at https://pgprules.cmdm.tw/.
Supplementary information
Supplementary data are available at Bioinformatics online.
Metabolomics is a powerful tool for understanding phenotypes and discovering biomarkers. Combinations of multiple batches or data sets in large cross-sectional epidemiology studies are frequently ...utilized in metabolomics, but various systematic biases can introduce both batch and injection order effects and often require proper calibrations prior to chemometric analyses. We present a novel algorithm, Batch Normalizer, to calibrate large scale metabolomic data. Batch Normalizer utilizes a regression model with consideration of the total abundance of each sample to improve its calibration performance, and it is able to remove both batch effect and injection order effects. This calibration method was tested using liquid chromatography/time-of-flight mass spectrometry (LC/TOF-MS) chromatograms of 228 plasma samples and 23 pooled quality control (QC) samples. We evaluated the performance of Batch Normalizer by examining the distribution of relative standard deviation (RSD) for all peaks detected in the pooled QC samples, the average Pearson correlation coefficients for all peaks between any two of QC samples, and the distribution of QC samples in the scores plot of a principal component analysis (PCA). After calibration by Batch Normalizer, the number of peaks in QC samples with RSD less than 15% increased from 11 to 914, all of the QC samples were closely clustered in PCA scores plot, and the average Pearson correlation coefficients for all peaks of QC samples increased from 0.938 to 0.976. This method was compared to 7 commonly used calibration methods. We discovered that using Batch Normalizer to calibrate LC/TOF-MS data produces the best calibration results.
Hepatotoxicity, drug-induced liver injury, and competitive Cytochrome P-450 (CYP) isozyme binding are serious problems associated with drug use. It would be favorable to avoid or to understand ...potential CYP inhibition at the developmental stages. However, current in silico CYP prediction models or available public prediction servers can provide only yes/no classification results for just one or a few CYP enzymes. In this study, we utilized a rule-based C5.0 algorithm with different descriptors, including PaDEL, Mold2, and PubChem fingerprints, to construct rule-based inhibition prediction models for five major CYP enzymesCYP1A2, CYP2C9, CYP2C19, CYP2D6 and CYP3A4that account for 90% of drug oxidation or hydrolysis. We also developed a rational sampling algorithm for the selection of compounds in the training data set, to enhance the performance of these CYP prediction models. The optimized models include several improved features. First, the final models significantly outperformed all of the currently available models. Second, the final models can also be used for rapid virtual screening of a large set of compounds due to their ruleset-based nature. Moreover, such rule-based prediction models can provide rulesets for structural features related to the five major CYP enzymes. The five most significant rules for CYP inhibition were identified for each CYP enzymes and discussed. An example was chosen for each of the five CYP enzymes to demonstrate how rule-based models can be used to gain insights into structural features that correspond with CYP inhibitions. A newer version of the freely accessible CYP prediction server, CypRules, is presented here as a result of the aforementioned improvements.
Obesity, dyslipidemia, insulin resistance, oxidative stress, and inflammation are key clinical risk factors for the progression of non-alcoholic fatty liver disease (NAFLD). Currently, there is no ...comprehensive metabolic profile of a well-established animal model that effectively mimics the etiology and pathogenesis of NAFLD in humans. Here, we report the pathophysiological and metabolomic changes associated with NAFLD development in a C57BL/6J mouse model in which NAFLD was induced by feeding a high-fat diet (HFD) for 4, 8, 12, and 16 weeks. Serum metabolomic analysis was conducted using ultrahigh-performance liquid chromatography coupled with quadrupole time-of-flight mass spectrometry (UHPLC–QTOF–MS) and gas chromatography–mass spectrometry (GC–MS) to establish a metabolomic profile. Analysis of the metabolomic profile in combination with principal component analysis revealed marked differences in metabolites between the control and HFD group depending upon NAFLD severity. A total of 30 potential biomarkers were strongly associated with the development of NAFLD. Among these, 11 metabolites were mainly related to carbohydrate metabolism, hepatic biotransformation, collagen synthesis, and gut microbial metabolism, which are characteristics of obesity, as well as significantly increased serum glucose, total cholesterol, and hepatic triglyceride levels during the onset of NAFLD (4 weeks). At 8 weeks, 5 additional metabolites that are chiefly involved in perturbation of lipid metabolism and insulin secretion were found to be associated with hyperinsulinemia, hyperlipidemia, and hepatic steatosis in the mid-term of NAFLD progression. At the end of 12 and 16 weeks, 14 additional metabolites were predominantly correlated to abnormal bile acid synthesis, oxidative stress, and inflammation, representing hepatic inflammatory infiltration during NAFLD development. These results provide potential biomarkers for early risk assessment of NAFLD and further insights into NAFLD development.
With advances in the development and application of Ames mutagenicity in silico prediction tools, the International Conference on Harmonisation (ICH) has amended its M7 guideline to reflect the use ...of such prediction models for the detection of mutagenic activity in early drug safety evaluation processes. Since current Ames mutagenicity prediction tools only focus on functional group alerts or side chain modifications of an analog series, these tools are unable to identify mutagenicity derived from core structures or specific scaffolds of a compound. In this study, a large collection of 6512 compounds are used to perform scaffold tree analysis. By relating different scaffolds on constructed scaffold trees with Ames mutagenicity, four major and one minor novel mutagenic groups of scaffold are identified. The recognized mutagenic groups of scaffold can serve as a guide for medicinal chemists to prevent the development of potentially mutagenic therapeutic agents in early drug design or development phases, by modifying the core structures of mutagenic compounds to form non-mutagenic compounds. In addition, five series of substructures are provided as recommendations, for direct modification of potentially mutagenic scaffolds to decrease associated mutagenic activities.
Abstract
Summary
Drug discovery targeting G protein-coupled receptors (GPCRs), the largest known class of therapeutic targets, is challenging. To facilitate the rapid discovery and development of ...GPCR drugs, we built a system, PanGPCR, to predict multiple potential GPCR targets and their expression locations in the tissues, side effects and possible repurposing of GPCR drugs. With PanGPCR, the compound of interest is docked to a library of 36 experimentally determined crystal structures comprising of 46 docking sites for human GPCRs, and a ranked list is generated from the docking studies to assess all GPCRs and their binding affinities. Users can determine a given compound’s GPCR targets and its repurposing potential accordingly. Moreover, potential side effects collected from the SIDER (Side-Effect Resource) database and mapped to 45 tissues and organs are provided by linking predicted off-targets and their expressed sequence tag profiles. With PanGPCR, multiple targets, repurposing potential and side effects can be determined by simply uploading a small ligand.
Availability and implementation
PanGPCR is freely accessible at https://gpcrpanel.cmdm.tw/index.html.
Supplementary information
Supplementary data are available at Bioinformatics online.
Studies on metabolomes of carcinogenic pollutants among children and adolescents are limited. We aim to identify metabolic perturbations in 107 children and adolescents (aged 9–15) exposed to ...multiple carcinogens in a polluted area surrounding the largest petrochemical complex in Taiwan. We measured urinary concentrations of eight carcinogen exposure biomarkers (heavy metals and polycyclic aromatic hydrocarbons (PAHs) represented by 1-hydroxypyrene), and urinary oxidative stress biomarkers and serum acylcarnitines as biomarkers of early health effects. Serum metabolomics was analyzed using a liquid chromatography mass spectrometry-based method. Pathway analysis and “meet-in-the-middle” approach were applied to identify potential metabolites and biological mechanisms linking carcinogens exposure with early health effects. We found 10 potential metabolites possibly linking increased exposure to IARC group 1 carcinogens (As, Cd, Cr, Ni) and group 2 carcinogens (V, Hg, PAHs) with elevated oxidative stress and deregulated serum acylcarnitines, including inosine monophosphate and adenosine monophosphate (purine metabolism), malic acid and oxoglutaric acid (citrate cycle), carnitine (fatty acid metabolism), and pyroglutamic acid (glutathione metabolism). Purine metabolism was identified as the possible mechanism affected by children and adolescents’ exposure to carcinogens. These findings contribute to understanding the health effects of childhood and adolescence exposure to multiple industrial carcinogens during critical periods of development.
Able to detect known and unknown metabolites, untargeted metabolomics has shown great potential in identifying novel biomarkers. However, elucidating all possible liquid chromatography/time-of-flight ...mass spectrometry (LC/TOF-MS) ion signals in a complex biological sample remains challenging since many ions are not the products of metabolites. Methods of reducing ions not related to metabolites or simply directly detecting metabolite related (pure) ions are important. In this work, we describe PITracer, a novel algorithm that accurately detects the pure ions of a LC/TOF-MS profile to extract pure ion chromatograms and detect chromatographic peaks. PITracer estimates the relative mass difference tolerance of ions and calibrates the mass over charge (m/z) values for peak detection algorithms with an additional option to further mass correction with respect to a user-specified metabolite. PITracer was evaluated using two data sets containing 373 human metabolite standards, including 5 saturated standards considered to be split peaks resultant from huge m/z fluctuation, and 12 urine samples spiked with 50 forensic drugs of varying concentrations. Analysis of these data sets show that PITracer correctly outperformed existing state-of-art algorithm and extracted the pure ion chromatograms of the 5 saturated standards without generating split peaks and detected the forensic drugs with high recall, precision, and F-score and small mass error.