Aims/hypothesis
Five clusters based on clinical characteristics have been suggested as diabetes subtypes: one autoimmune and four subtypes of type 2 diabetes. In the current study we replicate and ...cross-validate these type 2 diabetes clusters in three large cohorts using variables readily measured in the clinic.
Methods
In three independent cohorts, in total 15,940 individuals were clustered based on age, BMI, HbA
1c
, random or fasting C-peptide, and HDL-cholesterol. Clusters were cross-validated against the original clusters based on HOMA measures. In addition, between cohorts, clusters were cross-validated by re-assigning people based on each cohort’s cluster centres. Finally, we compared the time to insulin requirement for each cluster.
Results
Five distinct type 2 diabetes clusters were identified and mapped back to the original four All New Diabetics in Scania (ANDIS) clusters. Using C-peptide and HDL-cholesterol instead of HOMA2-B and HOMA2-IR, three of the clusters mapped with high sensitivity (80.6–90.7%) to the previously identified severe insulin-deficient diabetes (SIDD), severe insulin-resistant diabetes (SIRD) and mild obesity-related diabetes (MOD) clusters. The previously described ANDIS mild age-related diabetes (MARD) cluster could be mapped to the two milder groups in our study: one characterised by high HDL-cholesterol (mild diabetes with high HDL-cholesterol MDH cluster), and the other not having any extreme characteristic (mild diabetes MD). When these two milder groups were combined, they mapped well to the previously labelled MARD cluster (sensitivity 79.1%). In the cross-validation between cohorts, particularly the SIDD and MDH clusters cross-validated well, with sensitivities ranging from 73.3% to 97.1%. SIRD and MD showed a lower sensitivity, ranging from 36.1% to 92.3%, where individuals shifted from SIRD to MD and vice versa. People belonging to the SIDD cluster showed the fastest progression towards insulin requirement, while the MDH cluster showed the slowest progression.
Conclusions/interpretation
Clusters based on C-peptide instead of HOMA2 measures resemble those based on HOMA2 measures, especially for SIDD, SIRD and MOD. By adding HDL-cholesterol, the MARD cluster based upon HOMA2 measures resulted in the current clustering into two clusters, with one cluster having high HDL levels. Cross-validation between cohorts showed generally a good resemblance between cohorts. Together, our results show that the clustering based on clinical variables readily measured in the clinic (age, HbA
1c
, HDL-cholesterol, BMI and C-peptide) results in informative clusters that are representative of the original ANDIS clusters and stable across cohorts. Adding HDL-cholesterol to the clustering resulted in the identification of a cluster with very slow glycaemic deterioration.
Graphical abstract
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Considerable heterogeneity exists in Type 2 diabetes disease onset and subsequent outcomes. Multi-omics analysis may provide a means to better assess inter-patient variations and design personalised ...approaches. Previous T2D biomarker discovery mainly focused on associating biomarkers with clinical pre-assigned patients. Here, we attempt to group T2D patients with similar multi-omics profiles using a multi-omics integrating clustering approach, Similarity Network Fusion (SNF). Each T2D subgroup’s unique omics profiles were then associated with T2D progression. In both cohorts, 180 and 1195 circulating lipids and proteins from a total of 1,134 subjects in DCS and GoDARTS cohorts group patients into two subgroups. Two subgroups are likely to represent the different stages of T2D insulin insensitivity with differences at HOMA2 (p=0.0008;4.2e-11;1.1e-09, -B, -IR and -S, DCS), C-peptide (p=3.7e-11;2.5e-06) and overall disease progression (HR=0.6;0.7). This is the first study solely relying on integrated lipidomics and proteomics to capture the stages of T2D insulin insensitivity, allowing potential novel biomarker discovery which may be masked by traditional approaches. In both cohorts, a number of discriminative omics features can be observed. Immune proteins, such as IL-18R, CLP, IL-1R and COD antigen showed strong but differing associations with insulin insensitivity, which may be related to the inflammation associated with T2D. Moreover, growth factors such as GHR and IGFs also exhibit strong associations with the present study. For lipids, patients with less severe insulin insensitivity exhibit elevated levels of sphingomyelins. In conclusion, we systematically analysed the multi-omics profiles associated with T2D insulin insensitivity. The molecular signatures may be used to investigate molecular mechanisms underpinning progression and provide insights for precision medicine.
Disclosure
S.Li: None. I.J.Dragan: None. K.Simons: Other Relationship; Lipotype, I have no there compoanz. L.A.Donnelly: None. V.T.Tran: None. F.Mehl: None. L.M.'t hart: Research Support; European Union. M.Ibberson: Consultant; Novo Nordisk Foundation. G.A.Rutter: Consultant; Sun Pharmaceutical Industries Ltd. E.Pearson: Speaker's Bureau; Novo Nordisk, Lilly, Illumina. R.Slieker: None. J.Beulens: None. M.K.Hansen: None. D.Kuznetsov: None. M.J.Gerl: Employee; Lipotype GmbH.
We identify biomarkers for disease progression in three type 2 diabetes cohorts encompassing 2,973 individuals across three molecular classes, metabolites, lipids and proteins. Homocitrulline, ...isoleucine and 2-aminoadipic acid, eight triacylglycerol species, and lowered sphingomyelin 42:2;2 levels are predictive of faster progression towards insulin requirement. Of ~1,300 proteins examined in two cohorts, levels of GDF15/MIC-1, IL-18Ra, CRELD1, NogoR, FAS, and ENPP7 are associated with faster progression, whilst SMAC/DIABLO, SPOCK1 and HEMK2 predict lower progression rates. In an external replication, proteins and lipids are associated with diabetes incidence and prevalence. NogoR/RTN4R injection improved glucose tolerance in high fat-fed male mice but impaired it in male db/db mice. High NogoR levels led to islet cell apoptosis, and IL-18R antagonised inflammatory IL-18 signalling towards nuclear factor kappa-B in vitro. This comprehensive, multi-disciplinary approach thus identifies biomarkers with potential prognostic utility, provides evidence for possible disease mechanisms, and identifies potential therapeutic avenues to slow diabetes progression.
Type 2 diabetes (T2D) onset, progression and outcomes differ substantially between individuals. Multi-omics analyses may allow a deeper understanding of these differences and ultimately facilitate ...personalised treatments. Here, in an unsupervised "bottom-up" approach, we attempt to group T2D patients based solely on -omics data generated from plasma.
Circulating plasma lipidomic and proteomic data from two independent clinical cohorts, Hoorn Diabetes Care System (DCS) and Genetics of Diabetes Audit and Research in Tayside Scotland (GoDARTS), were analysed using Similarity Network Fusion. The resulting patient network was analysed with Logistic and Cox regression modelling to explore relationships between plasma -omic profiles and clinical characteristics.
From a total of 1,134 subjects in the two cohorts, levels of 180 circulating plasma lipids and 1195 proteins were used to separate patients into two subgroups. These differed in terms of glycaemic deterioration (Hazard Ratio=0.56;0.73), insulin sensitivity and secretion (C-peptide,
=3.7e-11;2.5e-06, DCS and GoDARTS, respectively; Homeostatic model assessment 2 (HOMA2)-B; -IR; -S, p=0.0008;4.2e-11;1.1e-09, only in DCS). The main molecular signatures separating the two groups included triacylglycerols, sphingomyelin, testican-1 and interleukin 18 receptor.
Using an unsupervised network-based fusion method on plasma lipidomics and proteomics data from two independent cohorts, we were able to identify two subgroups of T2D patients differing in terms of disease severity. The molecular signatures identified within these subgroups provide insights into disease mechanisms and possibly new prognostic markers for T2D.
Obesity is considered by many as a lifestyle choice rather than a chronic progressive disease. The Innovative Medicines Initiative (IMI) SOPHIA (Stratification of Obesity Phenotypes to Optimize ...Future Obesity Therapy) project is part of a momentum shift aiming to provide better tools for the stratification of people with obesity according to disease risk and treatment response. One of the challenges to achieving these goals is that many clinical cohorts are siloed, limiting the potential of combined data for biomarker discovery. In SOPHIA, we have addressed this challenge by setting up a federated database building on open-source DataSHIELD technology. The database currently federates 16 cohorts that are accessible via a central gateway. The database is multi-modal, including research studies, clinical trials, and routine health data, and is accessed using the R statistical programming environment where statistical and machine learning analyses can be performed at a distance without any disclosure of patient-level data. We demonstrate the use of the database by providing a proof-of-concept analysis, performing a federated linear model of BMI and systolic blood pressure, pooling all data from 16 studies virtually without any analyst seeing individual patient-level data. This analysis provided similar point estimates compared to a meta-analysis of the 16 individual studies. Our approach provides a benchmark for reproducible, safe federated analyses across multiple study types provided by multiple stakeholders.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Since ancient times, to survive this society it has required affectivity, empathy and compassion. Thus in a world of speed, a world that seems to have lost nature, we can see that these feelings are ...actually necessary. Without them, humanity would be represented by robots, which would have a stable schedule, work without communication and cooperation. Daily we take part in the sufferings and joys of others, our vocation as persons is to help them, to develop them, to get them out of the deadlock, but also to be present at their joys. Human behavior must be full of delicacy, purity, transparency, showing deep compassion through these feelings of ours.
Type 2 diabetes is a multifactorial disease with multiple underlying aetiologies. To address this heterogeneity, investigators of a previous study clustered people with diabetes according to five ...diabetes subtypes. The aim of the current study is to investigate the etiology of these clusters by comparing their molecular signatures. In three independent cohorts, in total 15,940 individuals were clustered based on five clinical characteristics. In a subset, genetic (
= 12,828), metabolomic (
= 2,945), lipidomic (
= 2,593), and proteomic (
= 1,170) data were obtained in plasma. For each data type, each cluster was compared with the other four clusters as the reference. The insulin-resistant cluster showed the most distinct molecular signature, with higher branched-chain amino acid, diacylglycerol, and triacylglycerol levels and aberrant protein levels in plasma were enriched for proteins in the intracellular PI3K/Akt pathway. The obese cluster showed higher levels of cytokines. The mild diabetes cluster with high HDL showed the most beneficial molecular profile with effects opposite of those seen in the insulin-resistant cluster. This study shows that clustering people with type 2 diabetes can identify underlying molecular mechanisms related to pancreatic islets, liver, and adipose tissue metabolism. This provides novel biological insights into the diverse aetiological processes that would not be evident when type 2 diabetes is viewed as a homogeneous disease.
Aims/hypothesis
People with type 2 diabetes are heterogeneous in their disease trajectory, with some progressing more quickly to insulin initiation than others. Although classical biomarkers such as ...age, HbA
1c
and diabetes duration are associated with glycaemic progression, it is unclear how well such variables predict insulin initiation or requirement and whether newly identified markers have added predictive value.
Methods
In two prospective cohort studies as part of IMI-RHAPSODY, we investigated whether clinical variables and three types of molecular markers (metabolites, lipids, proteins) can predict time to insulin requirement using different machine learning approaches (lasso, ridge, GRridge, random forest). Clinical variables included age, sex, HbA
1c
, HDL-cholesterol and C-peptide. Models were run with unpenalised clinical variables (i.e. always included in the model without weights) or penalised clinical variables, or without clinical variables. Model development was performed in one cohort and the model was applied in a second cohort. Model performance was evaluated using Harrel’s C statistic.
Results
Of the 585 individuals from the Hoorn Diabetes Care System (DCS) cohort, 69 required insulin during follow-up (1.0–11.4 years); of the 571 individuals in the Genetics of Diabetes Audit and Research in Tayside Scotland (GoDARTS) cohort, 175 required insulin during follow-up (0.3–11.8 years). Overall, the clinical variables and proteins were selected in the different models most often, followed by the metabolites. The most frequently selected clinical variables were HbA
1c
(18 of the 36 models, 50%), age (15 models, 41.2%) and C-peptide (15 models, 41.2%). Base models (age, sex, BMI, HbA
1c
) including only clinical variables performed moderately in both the DCS discovery cohort (C statistic 0.71 95% CI 0.64, 0.79) and the GoDARTS replication cohort (C 0.71 95% CI 0.69, 0.75). A more extensive model including HDL-cholesterol and C-peptide performed better in both cohorts (DCS, C 0.74 95% CI 0.67, 0.81; GoDARTS, C 0.73 95% CI 0.69, 0.77). Two proteins, lactadherin and proto-oncogene tyrosine-protein kinase receptor, were most consistently selected and slightly improved model performance.
Conclusions/interpretation
Using machine learning approaches, we show that insulin requirement risk can be modestly well predicted by predominantly clinical variables. Inclusion of molecular markers improves the prognostic performance beyond that of clinical variables by up to 5%. Such prognostic models could be useful for identifying people with diabetes at high risk of progressing quickly to treatment intensification.
Data availability
Summary statistics of lipidomic, proteomic and metabolomic data are available from a Shiny dashboard at
https://rhapdata-app.vital-it.ch
.
Graphical Abstract
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Type 2 diabetes is a multifactorial disease with multiple underlying aetiologies. To address this heterogeneity a previous study clustered people with diabetes into five diabetes subtypes. The aim of ...the current study is to investigate the aetiology of these clusters by comparing their molecular signatures. In three independent cohorts, in total 15,940 individuals were clustered based on five clinical characteristics. In a subset, genetic- (N=12828), metabolomic- (N=2945), lipidomic- (N=2593) and proteomic (N=1170) data were obtained in plasma. In each datatype each cluster was compared with the other four clusters as the reference. The insulin resistant cluster showed the most distinct molecular signature, with higher BCAAs, DAG and TAG levels and aberrant protein levels in plasma enriched for proteins in the intracellular PI3K/Akt pathway. The obese cluster showed higher cytokines. A subset of the mild diabetes cluster with high HDL showed the most beneficial molecular profile with opposite effects to those seen in the insulin resistant cluster. This study showed that clustering people with type 2 diabetes can identify underlying molecular mechanisms related to pancreatic islets, liver, and adipose tissue metabolism. This provides novel biological insights into the diverse aetiological processes that would not be evident when type 2 diabetes is viewed as a homogeneous disease.