The application of multiple omics technologies in biomedical cohorts has the potential to reveal patient-level disease characteristics and individualized response to treatment. However, the scale and ...heterogeneous nature of multi-modal data makes integration and inference a non-trivial task. We developed a deep-learning-based framework, multi-omics variational autoencoders (MOVE), to integrate such data and applied it to a cohort of 789 people with newly diagnosed type 2 diabetes with deep multi-omics phenotyping from the DIRECT consortium. Using in silico perturbations, we identified drug-omics associations across the multi-modal datasets for the 20 most prevalent drugs given to people with type 2 diabetes with substantially higher sensitivity than univariate statistical tests. From these, we among others, identified novel associations between metformin and the gut microbiota as well as opposite molecular responses for the two statins, simvastatin and atorvastatin. We used the associations to quantify drug-drug similarities, assess the degree of polypharmacy and conclude that drug effects are distributed across the multi-omics modalities.
Differences in glucose metabolism among categories of prediabetes have not been systematically investigated. In this longitudinal study, participants (N = 2,111) underwent a 2-h 75-g oral glucose ...tolerance test (OGTT) at baseline and 48 months. HbA1c was also measured. We classified participants as having isolated prediabetes defect (impaired fasting glucose IFG, impaired glucose tolerance IGT, or HbA1c indicative of prediabetes IA1c), two defects (IFG+IGT, IFG+IA1c, or IGT+IA1c), or all defects (IFG+IGT+IA1c). β-Cell function (BCF) and insulin sensitivity were assessed from OGTT. At baseline, in pooling of participants with isolated defects, they showed impairment in both BCF and insulin sensitivity compared with healthy control subjects. Pooled groups with two or three defects showed progressive further deterioration. Among groups with isolated defect, those with IGT showed lower insulin sensitivity, insulin secretion at reference glucose (ISRr), and insulin secretion potentiation (P < 0.002). Conversely, those with IA1c showed higher insulin sensitivity and ISRr (P < 0.0001). Among groups with two defects, we similarly found differences in both BCF and insulin sensitivity. At 48 months, we found higher type 2 diabetes incidence for progressively increasing number of prediabetes defects (odds ratio >2, P < 0.008). In conclusion, the prediabetes groups showed differences in type/degree of glucometabolic impairment. Compared with the pooled group with isolated defects, those with double or triple defect showed progressive differences in diabetes incidence.
Abstract
We evaluate the shared genetic regulation of mRNA molecules, proteins and metabolites derived from whole blood from 3029 human donors. We find abundant allelic heterogeneity, where multiple ...variants regulate a particular molecular phenotype, and pleiotropy, where a single variant associates with multiple molecular phenotypes over multiple genomic regions. The highest proportion of share genetic regulation is detected between gene expression and proteins (66.6%), with a further median shared genetic associations across 49 different tissues of 78.3% and 62.4% between plasma proteins and gene expression. We represent the genetic and molecular associations in networks including 2828 known GWAS variants, showing that GWAS variants are more often connected to gene expression in trans than other molecular phenotypes in the network. Our work provides a roadmap to understanding molecular networks and deriving the underlying mechanism of action of GWAS variants using different molecular phenotypes in an accessible tissue.
Display omitted
•Variants in HFE and TMPRSS6 are associated with higher liver iron.•There is genetic evidence that higher central obesity causes higher liver iron.•Liver iron variants are not organ ...specific and associate with multiple diseases.
Excess liver iron content is common and is linked to the risk of hepatic and extrahepatic diseases. We aimed to identify genetic variants influencing liver iron content and use genetics to understand its link to other traits and diseases.
First, we performed a genome-wide association study (GWAS) in 8,289 individuals from UK Biobank, whose liver iron level had been quantified by magnetic resonance imaging, before validating our findings in an independent cohort (n = 1,513 from IMI DIRECT). Second, we used Mendelian randomisation to test the causal effects of 25 predominantly metabolic traits on liver iron content. Third, we tested phenome-wide associations between liver iron variants and 770 traits and disease outcomes.
We identified 3 independent genetic variants (rs1800562 C282Y and rs1799945 H63D in HFE and rs855791 V736A in TMPRSS6) associated with liver iron content that reached the GWAS significance threshold (p <5 × 10−8). The 2 HFE variants account for ∼85% of all cases of hereditary haemochromatosis. Mendelian randomisation analysis provided evidence that higher central obesity plays a causal role in increased liver iron content. Phenome-wide association analysis demonstrated shared aetiopathogenic mechanisms for elevated liver iron, high blood pressure, cirrhosis, malignancies, neuropsychiatric and rheumatological conditions, while also highlighting inverse associations with anaemias, lipidaemias and ischaemic heart disease.
Our study provides genetic evidence that mechanisms underlying higher liver iron content are likely systemic rather than organ specific, that higher central obesity is causally associated with higher liver iron, and that liver iron shares common aetiology with multiple metabolic and non-metabolic diseases.
Excess liver iron content is common and is associated with liver diseases and metabolic diseases including diabetes, high blood pressure, and heart disease. We identified 3 genetic variants that are linked to an increased risk of developing higher liver iron content. We show that the same genetic variants are linked to higher risk of many diseases, but they may also be associated with some health advantages. Finally, we use genetic variants associated with waist-to-hip ratio as a tool to show that central obesity is causally associated with increased liver iron content.
In regression settings, a su?cient dimension reduction (SDR) method seeks the core information in a p-vector predictor that completely captures its relationship with a response. The reduced predictor ...may reside in a lower dimension d < p, improving ability to visualize data and predict future observations, and mitigating dimensionality issues when carrying out further analysis. We introduce ldr, a new R software package that implements three recently proposed likelihood-based methods for SDR: covariance reduction, likelihood acquired directions, and principal fitted components. All three methods reduce the dimensionality of the data by pro jection into lower dimensional subspaces. The package also implements a variable screening method built upon principal ?tted components which makes use of ?exible basis functions to capture the dependencies between the predictors and the response. Examples are given to demonstrate likelihood-based SDR analyses using ldr, including estimation of the dimension of reduction subspaces and selection of basis functions. The ldr package provides a framework that we hope to grow into a comprehensive library of likelihood-based SDR methodologies.
A high dimensional regression setting is considered with p predictors X = (X1,...,X p)T and a response Y. The interest is with large p, possibly much larger than n the number of observations. Three ...novel methodologies based on Principal Fitted Components models (PFC; Cook, 2007) are presented: (1) Screening by PFC (SPFC) for variable screening when p is excessively large, (2) Prediction by PFC (PPFC), and (3) Sparse PFC (SpPFC) for variable selection. SPFC uses a test statistic to detect all predictors marginally related to the outcome. We show that SPFC subsumes the Sure Independence Screening of Fan and Lv (2008). PPFC is a novel methodology for prediction in regression where p can be large or larger than n. PPFC assumes that X∣Y has a normal distribution and applies to continuous response variables regardless of their distribution. It yields accuracy in prediction better than current leading methods. We adapt the Sparse Principal Components Analysis (Zou et al., 2006) to the PFC model to develop SpPFC. SpPFC performs variable selection as good as forward linear model methods like the lasso (Tibshirani, 1996), but moreover, it encompasses cases where the distribution of Y∣X is non-normal or the predictors and the response are not linearly related.
Manifold optimization appears in a wide variety of computational problems in the applied sciences. In recent statistical methodologies such as sufficient dimension reduction and regression envelopes, ...estimation relies on the optimization of likelihood functions over spaces of matrices such as the Stiefel or Grassmann manifolds. Recently, Huang, Absil, Gallivan, and Hand (2016) have introduced the library ROPTLIB, which provides a framework and state of the art algorithms to optimize real-valued objective functions over commonly used matrix-valued Riemannian manifolds. This article presents ManifoldOptim, an R package that wraps the C++ library ROPTLIB. ManifoldOptim enables users to access functionality in ROPTLIB through R so that optimization problems can easily be constructed, solved, and integrated into larger R codes. Computationally intensive problems can be programmed with Rcpp and RcppArmadillo, and otherwise accessed through R. We illustrate the practical use of ManifoldOptim through several motivating examples involving dimension reduction and envelope methods in regression.