Reductions in genotyping costs and improvements in computational power have made conducting genome-wide association studies (GWAS) standard practice for many complex diseases. GWAS is the assessment ...of genetic variants across the genome of many individuals to determine which, if any, genetic variants are associated with a specific trait. As with any analysis, there are evolving best practices that should be followed to ensure scientific rigor and reliability in the conclusions. This article presents a brief summary for many of the key bioinformatics considerations when either planning or evaluating GWAS. This review is meant to serve as a guide to those without deep expertise in bioinformatics and GWAS and give them tools to critically evaluate this popular approach to investigating complex diseases. In addition, a checklist is provided that can be used by investigators to evaluate whether a GWAS has appropriately accounted for the many potential sources of bias and generally followed current best practices.
Metabolic disease in children is increasing worldwide and predisposes a wide array of chronic comorbid conditions with severe impacts on quality of life. Tools for early detection are needed to ...promptly intervene to prevent or slow the development of these long-term complications.
No clinically available tools are currently in widespread use that can predict the onset of metabolic diseases in pediatric patients. Here, we use interpretable deep learning, leveraging longitudinal clinical measurements, demographical data, and diagnosis codes from electronic health record data from a large integrated health system to predict the onset of prediabetes, type 2 diabetes (T2D), and metabolic syndrome in pediatric cohorts.
The cohort included 49 517 children with overweight or obesity aged 2-18 (54.9% male, 73% Caucasian), with a median follow-up time of 7.5 years and mean body mass index (BMI) percentile of 88.6%. Our model demonstrated area under receiver operating characteristic curve (AUC) accuracies up to 0.87, 0.79, and 0.79 for predicting T2D, metabolic syndrome, and prediabetes, respectively. Whereas most risk calculators use only recently available data, incorporating longitudinal data improved AUCs by 13.04%, 11.48%, and 11.67% for T2D, syndrome, and prediabetes, respectively, versus models using the most recent BMI (P < 2.2 × 10-16).
Despite most risk calculators using only the most recent data, incorporating longitudinal data improved the model accuracies because utilizing trajectories provides a more comprehensive characterization of the patient's health history. Our interpretable model indicated that BMI trajectories were consistently identified as one of the most influential features for prediction, highlighting the advantages of incorporating longitudinal data when available.
Reports indicate that coronavirus disease 2019 (COVID-19) may impact pancreatic function and increase type 2 diabetes (T2D) risk, although real-world COVID-19 impacts on HbA1c and T2D are unknown. We ...tested whether COVID-19 increased HbA1c, risk of T2D, or diabetic ketoacidosis (DKA). We compared pre- and post-COVID-19 HbA1c and T2D risk in a large real-world clinical cohort of 8,755 COVID-19(+) patients and 11,998 COVID-19(-) matched control subjects. We investigated whether DKA risk was modified in COVID-19(+) patients with type 1 diabetes (T1D) (N = 701) or T2D (N = 21,830), or by race and sex. We observed a statistically significant, albeit clinically insignificant, HbA1c increase post-COVID-19 (all patients ΔHbA1c = 0.06%; with T2D ΔHbA1c = 0.1%) and no increase among COVID-19(-) patients. COVID-19(+) patients were 40% more likely to be diagnosed with T2D compared with COVID-19(-) patients and 28% more likely for the same HbA1c change as COVID-19(-) patients, indicating that COVID-19-attributed T2D risk may be due to increased recognition during COVID-19 management. DKA in COVID-19(+) patients with T1D was not increased. COVID-19(+) Black patients with T2D displayed disproportionately increased DKA risk (hazard ratio 2.46 95% CI 1.48-6.09, P = 0.004) compared with White patients, suggesting a need for further clinical awareness and investigation.
High-throughput in vitro toxicity screening can provide an efficient way to identify potential biological targets for chemicals. However, relying on nominal assay concentrations may misrepresent ...potential in vivo effects of these chemicals due to differences in bioavailability, clearance, and exposure. Hepatic metabolic clearance and plasma protein binding were experimentally measured for 239 ToxCast Phase I chemicals. The experimental data were used in a population-based in vitro-to-in vivo extrapolation model to estimate the daily human oral dose, called the oral equivalent dose, necessary to produce steady-state in vivo blood concentrations equivalent to in vitro AC50 (concentration at 50% of maximum activity) or lowest effective concentration values across more than 500 in vitro assays. The estimated steady-state oral equivalent doses associated with the in vitro assays were compared with chronic aggregate human oral exposure estimates to assess whether in vitro bioactivity would be expected at the dose-equivalent level of human exposure. A total of 18 (9.9%) chemicals for which human oral exposure estimates were available had oral equivalent doses at levels equal to or less than the highest estimated U.S. population exposures. Ranking the chemicals by nominal assay concentrations would have resulted in different chemicals being prioritized. The in vitro assay endpoints with oral equivalent doses lower than the human exposure estimates included cell growth kinetics, cytokine and cytochrome P450 expression, and cytochrome P450 inhibition. The incorporation of dosimetry and exposure provide necessary context for interpretation of in vitro toxicity screening data and are important considerations in determining chemical testing priorities.
We demonstrate a computational network model that integrates 18 in vitro, high-throughput screening assays measuring estrogen receptor (ER) binding, dimerization, chromatin binding, transcriptional ...activation, and ER-dependent cell proliferation. The network model uses activity patterns across the in vitro assays to predict whether a chemical is an ER agonist or antagonist, or is otherwise influencing the assays through a manner dependent on the physics and chemistry of the technology platform ("assay interference"). The method is applied to a library of 1812 commercial and environmental chemicals, including 45 ER positive and negative reference chemicals. Among the reference chemicals, the network model correctly identified the agonists and antagonists with the exception of very weak compounds whose activity was outside the concentration range tested. The model agonist score also correlated with the expected potency class of the active reference chemicals. Of the 1812 chemicals evaluated, 111 (6.1%) were predicted to be strongly ER active in agonist or antagonist mode. This dataset and model were also used to begin a systematic investigation of assay interference. The most prominent cause of false-positive activity (activity in an assay that is likely not due to interaction of the chemical with ER) is cytotoxicity. The model provides the ability to prioritize a large set of important environmental chemicals with human exposure potential for additional in vivo endocrine testing. Finally, this model is generalizable to any molecular pathway for which there are multiple upstream and downstream assays available.
The U.S. Tox21 program has screened a library of approximately 10,000 (10K) environmental chemicals and drugs in three independent runs for estrogen receptor alpha (ERα) agonist and antagonist ...activity using two types of ER reporter gene cell lines, one with an endogenous full length ERα (ER-luc; BG1 cell line) and the other with a transfected partial receptor consisting of the ligand binding domain (ER-bla; ERα β-lactamase cell line), in a quantitative high-throughput screening (qHTS) format. The ability of the two assays to correctly identify ERα agonists and antagonists was evaluated using a set of 39 reference compounds with known ERα activity. Although both assays demonstrated adequate (i.e. >80%) predictivity, the ER-luc assay was more sensitive and the ER-bla assay more specific. The qHTS assay results were compared with results from previously published ERα binding assay data and showed >80% consistency. Actives identified from both the ER-bla and ER-luc assays were analyzed for structure-activity relationships (SARs) revealing known and potentially novel ERα active structure classes. The results demonstrate the feasibility of qHTS to identify environmental chemicals with the potential to interact with the ERα signaling pathway and the two different assay formats improve the confidence in correctly identifying these chemicals.
Abstract Introduction The Alzheimer's Disease Research Summits of 2012 and 2015 incorporated experts from academia, industry, and nonprofit organizations to develop new research directions to ...transform our understanding of Alzheimer's disease (AD) and propel the development of critically needed therapies. In response to their recommendations, big data at multiple levels are being generated and integrated to study network failures in disease. We used metabolomics as a global biochemical approach to identify peripheral metabolic changes in AD patients and correlate them to cerebrospinal fluid pathology markers, imaging features, and cognitive performance. Methods Fasting serum samples from the Alzheimer's Disease Neuroimaging Initiative (199 control, 356 mild cognitive impairment, and 175 AD participants) were analyzed using the AbsoluteIDQ-p180 kit. Performance was validated in blinded replicates, and values were medication adjusted. Results Multivariable-adjusted analyses showed that sphingomyelins and ether-containing phosphatidylcholines were altered in preclinical biomarker-defined AD stages, whereas acylcarnitines and several amines, including the branched-chain amino acid valine and α-aminoadipic acid, changed in symptomatic stages. Several of the analytes showed consistent associations in the Rotterdam, Erasmus Rucphen Family, and Indiana Memory and Aging Studies. Partial correlation networks constructed for Aβ1–42 , tau, imaging, and cognitive changes provided initial biochemical insights for disease-related processes. Coexpression networks interconnected key metabolic effectors of disease. Discussion Metabolomics identified key disease-related metabolic changes and disease-progression-related changes. Defining metabolic changes during AD disease trajectory and its relationship to clinical phenotypes provides a powerful roadmap for drug and biomarker discovery.
Background: Chemical toxicity testing is being transformed by advances in biology and computer modeling, concerns over animal use, and the thousands of environmental chemicals lacking toxicity data. ...The U.S. Environmental Protection Agency's ToxCast program aims to address these concerns by screening and prioritizing chemicals for potential human toxicity using in vitro assays and in silico approaches. Objectives: This project aims to evaluate the use of in vitro assays for understanding the types of molecular and pathway perturbations caused by environmental chemicals and to build initial prioritization models of in vivo toxicity. Methods: We tested 309 mostly pesticide active chemicals in 467 assays across nine technologies, including high-throughput cell-free assays and cell-based assays, in multiple human primary cells and cell lines plus rat primary hepatocytes. Both individual and composite scores for effects on genes and pathways were analyzed. Results: Chemicals displayed a broad spectrum of activity at the molecular and pathway levels. We saw many expected interactions, including endocrine and xenobiotic metabolism enzyme activity. Chemicals ranged in promiscuity across pathways, from no activity to affecting dozens of pathways. We found a statistically significant inverse association between the number of pathways perturbed by a chemical at low in vitro concentrations and the lowest in vivo dose at which a chemical causes toxicity. We also found associations between a small set of in vitro assays and rodent liver lesion formation. Conclusions: This approach promises to provide meaningful data on the thousands of untested environmental chemicals and to guide targeted testing of environmental contaminants.
Exposure to environmental chemicals adds to the burden of disease in humans and wildlife to a degree that is difficult to estimate and, thus, mitigate. The ability to assess the impact of existing ...chemicals for which little to no toxicity data are available or to foresee such effects during early stages of chemical development and use, and before potential exposure occurs, is a pressing need. However, the capacity of the current toxicity evaluation approaches to meet this demand is limited by low throughput and high costs. In the context of EPA’s ToxCast project, we have evaluated a novel cellular biosensor system (Factorial) that enables rapid, high-content assessment of a compound’s impact on gene regulatory networks. The Factorial biosensors combined libraries of cis- and trans-regulated transcription factor reporter constructs with a highly homogeneous method of detection enabling simultaneous evaluation of multiplexed transcription factor activities. Here, we demonstrate the application of the technology toward determining bioactivity profiles by quantitatively evaluating the effects of 309 environmental chemicals on 25 nuclear receptors and 48 transcription factor response elements. We demonstrate coherent transcription factor activity across nuclear receptors and their response elements and that Nrf2 activity, a marker of oxidative stress, is highly correlated to the overall promiscuity of a chemical. Additionally, as part of the ToxCast program, we identify molecular targets that associate with in vivo end points and represent modes of action that can serve as potential toxicity pathway biomarkers and inputs for predictive modeling of in vivo toxicity.