Mendelian randomisation (MR) analysis is an important tool to elucidate the causal relevance of environmental and biological risk factors for disease. However, causal inference is undermined if ...genetic variants used to instrument a risk factor also influence alternative disease-pathways (horizontal pleiotropy). Here we report how the 'no horizontal pleiotropy assumption' is strengthened when proteins are the risk factors of interest. Proteins are typically the proximal effectors of biological processes encoded in the genome. Moreover, proteins are the targets of most medicines, so MR studies of drug targets are becoming a fundamental tool in drug development. To enable such studies, we introduce a mathematical framework that contrasts MR analysis of proteins with that of risk factors located more distally in the causal chain from gene to disease. We illustrate key model decisions and introduce an analytical framework for maximising power and evaluating the robustness of analyses.
Because low-grade inflammation may play a role in the pathogenesis of coronary heart disease (CHD), and pro-inflammatory cytokines govern inflammatory cascades, this study aimed to assess the ...associations of several pro-inflammatory cytokines and CHD risk in a new prospective study, including meta-analysis of prospective studies.
Interleukin-6 (IL-6), IL-18, matrix metalloproteinase-9 (MMP-9), soluble CD40 ligand (sCD40L), and tumour necrosis factor-α (TNF-α) were measured at baseline in a case-cohort study of 1514 participants and 833 incident CHD events within population-based prospective cohorts at the Danish Research Centre for Prevention and Health. Age- and sex-adjusted hazard ratios (HRs) for CHD per 1-SD higher log-transformed baseline levels were: 1.37 (95% CI: 1.21-1.54) for IL-6, 1.26 (1.11-1.44) for IL-18, 1.30 (1.16-1.46) for MMP-9, 1.01 (0.89-1.15) for sCD40L, and 1.13 (1.01-1.27) for TNF-α. Multivariable adjustment for conventional vascular risk factors attenuated the HRs to: 1.26 (1.08-1.46) for IL-6, 1.12 (0.95-1.31) for IL-18, 1.21 (1.05-1.39) for MMP-9, 0.93 (0.78-1.11) for sCD40L, and 1.14 (1.00-1.31) for TNF-α. In meta-analysis of up to 29 population-based prospective studies, adjusted relative risks for non-fatal MI or CHD death per 1-SD higher levels were: 1.25 (1.19-1.32) for IL-6; 1.13 (1.05-1.20) for IL-18; 1.07 (0.97-1.19) for MMP-9; 1.07 (0.95-1.21) for sCD40L; and 1.17 (1.09-1.25) for TNF-α.
Several different pro-inflammatory cytokines are each associated with CHD risk independent of conventional risk factors and in an approximately log-linear manner. The findings lend support to the inflammation hypothesis in vascular disease, but further studies are needed to assess causality.
Inflammation, which is directly regulated by interleukin-6 (IL-6) signaling, is implicated in the etiology of several chronic diseases. Although a common, non-synonymous variant in the IL-6 receptor ...gene (IL6R Asp358Ala; rs2228145 A>C) is associated with the risk of several common diseases, with the 358Ala allele conferring protection from coronary heart disease (CHD), rheumatoid arthritis (RA), atrial fibrillation (AF), abdominal aortic aneurysm (AAA), and increased susceptibility to asthma, the variant's effect on IL-6 signaling is not known. Here we provide evidence for the association of this non-synonymous variant with the risk of type 1 diabetes (T1D) in two independent populations and confirm that rs2228145 is the major determinant of the concentration of circulating soluble IL-6R (sIL-6R) levels (34.6% increase in sIL-6R per copy of the minor allele 358Ala; rs2228145 C). To further investigate the molecular mechanism of this variant, we analyzed expression of IL-6R in peripheral blood mononuclear cells (PBMCs) in 128 volunteers from the Cambridge BioResource. We demonstrate that, although 358Ala increases transcription of the soluble IL6R isoform (P = 8.3×10⁻²²) and not the membrane-bound isoform, 358Ala reduces surface expression of IL-6R on CD4+ T cells and monocytes (up to 28% reduction per allele; P≤5.6×10⁻²²). Importantly, reduced expression of membrane-bound IL-6R resulted in impaired IL-6 responsiveness, as measured by decreased phosphorylation of the transcription factors STAT3 and STAT1 following stimulation with IL-6 (P≤5.2×10⁻⁷). Our findings elucidate the regulation of IL-6 signaling by IL-6R, which is causally relevant to several complex diseases, identify mechanisms for new approaches to target the IL-6/IL-6R axis, and anticipate differences in treatment response to IL-6 therapies based on this common IL6R variant.
Previous Mendelian randomization studies have suggested that, while low-density lipoprotein cholesterol (LDL-c) and triglycerides are causally implicated in coronary artery disease (CAD) risk, ...high-density lipoprotein cholesterol (HDL-c) may not be, with causal effect estimates compatible with the null.
The causal effects of these three lipid fractions can be better identified using the extended methods of 'multivariable Mendelian randomization'. We employ this approach using published data on 185 lipid-related genetic variants and their associations with lipid fractions in 188,578 participants, and with CAD risk in 22,233 cases and 64,762 controls. Our results suggest that HDL-c may be causally protective of CAD risk, independently of the effects of LDL-c and triglycerides. Estimated causal odds ratios per standard deviation increase, based on 162 variants not having pleiotropic associations with either blood pressure or body mass index, are 1.57 (95% credible interval 1.45 to 1.70) for LDL-c, 0.91 (0.83 to 0.99, p-value = 0.028) for HDL-c, and 1.29 (1.16 to 1.43) for triglycerides.
Some interventions on HDL-c concentrations may influence risk of CAD, but to a lesser extent than interventions on LDL-c. A causal interpretation of these estimates relies on the assumption that the genetic variants do not have pleiotropic associations with risk factors on other pathways to CAD. If they do, a weaker conclusion is that genetic predictors of LDL-c, HDL-c and triglycerides each have independent associations with CAD risk.
Interest in the application of machine learning (ML) to the design, conduct, and analysis of clinical trials has grown, but the evidence base for such applications has not been surveyed. This ...manuscript reviews the proceedings of a multi-stakeholder conference to discuss the current and future state of ML for clinical research. Key areas of clinical trial methodology in which ML holds particular promise and priority areas for further investigation are presented alongside a narrative review of evidence supporting the use of ML across the clinical trial spectrum.
Conference attendees included stakeholders, such as biomedical and ML researchers, representatives from the US Food and Drug Administration (FDA), artificial intelligence technology and data analytics companies, non-profit organizations, patient advocacy groups, and pharmaceutical companies. ML contributions to clinical research were highlighted in the pre-trial phase, cohort selection and participant management, and data collection and analysis. A particular focus was paid to the operational and philosophical barriers to ML in clinical research. Peer-reviewed evidence was noted to be lacking in several areas.
ML holds great promise for improving the efficiency and quality of clinical research, but substantial barriers remain, the surmounting of which will require addressing significant gaps in evidence.
Genetic variants regulating RNA splicing and transcript usage have been implicated in both common and rare diseases. Although transcript usage quantitative trait loci (tuQTLs) have been mapped across ...multiple cell types and contexts, it is challenging to distinguish between the main molecular mechanisms controlling transcript usage: promoter choice, splicing and 3' end choice. Here, we analysed RNA-seq data from human macrophages exposed to three inflammatory and one metabolic stimulus. In addition to conventional gene-level and transcript-level analyses, we also directly quantified promoter usage, splicing and 3' end usage. We found that promoters, splicing and 3' ends were predominantly controlled by independent genetic variants enriched in distinct genomic features. Promoter usage QTLs were also 50% more likely to be context-specific than other tuQTLs and constituted 25% of the transcript-level colocalisations with complex traits. Thus, promoter usage might be an underappreciated molecular mechanism mediating complex trait associations in a context-specific manner.
Machine learning (ML) is increasingly used in research for subtype definition and risk prediction, particularly in cardiovascular diseases. No existing ML models are routinely used for cardiovascular ...disease management, and their phase of clinical utility is unknown, partly due to a lack of clear criteria. We evaluated ML for subtype definition and risk prediction in heart failure (HF), acute coronary syndromes (ACS) and atrial fibrillation (AF).
For ML studies of subtype definition and risk prediction, we conducted a systematic review in HF, ACS and AF, using PubMed, MEDLINE and Web of Science from January 2000 until December 2019. By adapting published criteria for diagnostic and prognostic studies, we developed a seven-domain, ML-specific checklist.
Of 5918 studies identified, 97 were included. Across studies for subtype definition (n = 40) and risk prediction (n = 57), there was variation in data source, population size (median 606 and median 6769), clinical setting (outpatient, inpatient, different departments), number of covariates (median 19 and median 48) and ML methods. All studies were single disease, most were North American (n = 61/97) and only 14 studies combined definition and risk prediction. Subtype definition and risk prediction studies respectively had limitations in development (e.g. 15.0% and 78.9% of studies related to patient benefit; 15.0% and 15.8% had low patient selection bias), validation (12.5% and 5.3% externally validated) and impact (32.5% and 91.2% improved outcome prediction; no effectiveness or cost-effectiveness evaluations).
Studies of ML in HF, ACS and AF are limited by number and type of included covariates, ML methods, population size, country, clinical setting and focus on single diseases, not overlap or multimorbidity. Clinical utility and implementation rely on improvements in development, validation and impact, facilitated by simple checklists. We provide clear steps prior to safe implementation of machine learning in clinical practice for cardiovascular diseases and other disease areas.
Asymmetric dimethylarginine (ADMA) inhibits the production of nitric oxide, a key regulator of the vascular tone, and may be important in the development of cardiovascular disease (CVD). Our aim was ...to reliably quantify the association of ADMA and its isomer symmetric dimethylarginine (SDMA) with the risk of CVD outcomes in long-term cohort studies.
Data were collated from 22 prospective studies involving a total of 19 842 participants, which have recorded 2339 CVD, 997 coronary heart disease, and 467 stroke outcomes during a mean follow-up of 7.1 years. In a comparison of individuals in the top with those in the bottom third of baseline ADMA values, the combined risk ratios were 1.42 (95% confidence interval: 1.29 to 1.56) for CVD, 1.39 for coronary heart disease (1.19 to 1.62), and 1.60 for stroke (1.33 to 1.91). Broadly similar results were observed according to participants' baseline disease status (risk ratios for CVD: 1.35 1.18 to 1.54 in general populations; 1.47 1.16 to 1.87 in individuals with pre-existing CVD; and 1.52 1.26 to 1.84 in individuals with pre-existing kidney disease) and by different study characteristics, including geographical location, sample type, assay method, number of incident outcomes, and level of statistical adjustment (all P values>0.05). In contrast, in 8 prospective studies involving 9070 participants and 848 outcomes, the corresponding estimate for SDMA concentration was 1.32 (0.92 to 1.90) for CVD.
Available prospective studies suggest associations between circulating ADMA concentration and CVD outcomes under a broad range of circumstances. Further research is needed to better clarify these associations, particularly in large general population studies.
Metabolite levels in urine may provide insights into genetic mechanisms shaping their related pathways. We therefore investigate the cumulative contribution of rare, exonic genetic variants on urine ...levels of 1487 metabolites and 53,714 metabolite ratios among 4864 GCKD study participants. Here we report the detection of 128 significant associations involving 30 unique genes, 16 of which are known to underlie inborn errors of metabolism. The 30 genes are strongly enriched for shared expression in liver and kidney (odds ratio = 65, p-FDR = 3e-7), with hepatocytes and proximal tubule cells as driving cell types. Use of UK Biobank whole-exome sequencing data links genes to diseases connected to the identified metabolites. In silico constraint-based modeling of gene knockouts in a virtual whole-body, organ-resolved metabolic human correctly predicts the observed direction of metabolite changes, highlighting the potential of linking population genetics to modeling. Our study implicates candidate variants and genes for inborn errors of metabolism.
Cardiovascular ageing is a process that begins early in life and leads to a progressive change in structure and decline in function due to accumulated damage across diverse cell types, tissues and ...organs contributing to multi-morbidity. Damaging biophysical, metabolic and immunological factors exceed endogenous repair mechanisms resulting in a pro-fibrotic state, cellular senescence and end-organ damage, however the genetic architecture of cardiovascular ageing is not known. Here we use machine learning approaches to quantify cardiovascular age from image-derived traits of vascular function, cardiac motion and myocardial fibrosis, as well as conduction traits from electrocardiograms, in 39,559 participants of UK Biobank. Cardiovascular ageing is found to be significantly associated with common or rare variants in genes regulating sarcomere homeostasis, myocardial immunomodulation, and tissue responses to biophysical stress. Ageing is accelerated by cardiometabolic risk factors and we also identify prescribed medications that are potential modifiers of ageing. Through large-scale modelling of ageing across multiple traits our results reveal insights into the mechanisms driving premature cardiovascular ageing and reveal potential molecular targets to attenuate age-related processes.