ABSTRACT
Developments in genome‐wide association studies and the increasing availability of summary genetic association data have made application of Mendelian randomization relatively ...straightforward. However, obtaining reliable results from a Mendelian randomization investigation remains problematic, as the conventional inverse‐variance weighted method only gives consistent estimates if all of the genetic variants in the analysis are valid instrumental variables. We present a novel weighted median estimator for combining data on multiple genetic variants into a single causal estimate. This estimator is consistent even when up to 50% of the information comes from invalid instrumental variables. In a simulation analysis, it is shown to have better finite‐sample Type 1 error rates than the inverse‐variance weighted method, and is complementary to the recently proposed MR‐Egger (Mendelian randomization‐Egger) regression method. In analyses of the causal effects of low‐density lipoprotein cholesterol and high‐density lipoprotein cholesterol on coronary artery disease risk, the inverse‐variance weighted method suggests a causal effect of both lipid fractions, whereas the weighted median and MR‐Egger regression methods suggest a null effect of high‐density lipoprotein cholesterol that corresponds with the experimental evidence. Both median‐based and MR‐Egger regression methods should be considered as sensitivity analyses for Mendelian randomization investigations with multiple genetic variants.
Ethanol is a classic teratogen capable of inducing a wide range of developmental abnormalities. Studies in animal models suggest that differences in timing and dosage underlie this variability, with ...three particularly important developmental periods: preconception, preimplantation, and gastrulation. These periods of teratogenesis correlate with peak periods of epigenetic reprogramming which, together with the evidence that ethanol interferes with one-carbon metabolism, DNA methylation, histone modifications, and noncoding RNA, suggests an important role for epigenetic mechanisms in the etiology of fetal alcohol spectrum disorders (FASDs). In addition to a number of testable hypotheses, an epigenetic model suggests that the concept of a "fetal alcohol spectrum" should be expanded to include "preconceptional effects." This proposal has important public health implications, highlighting the urgency of research into the epigenetic basis of FASDs.
Risk factors for pancreatic cancer include a cluster of metabolic conditions such as obesity, hypertension, dyslipidemia, insulin resistance, and type 2 diabetes. Given that these risk factors are ...correlated, separating out causal from confounded effects is challenging. Mendelian randomization (MR), or the use of genetic instrumental variables, may facilitate the identification of the metabolic drivers of pancreatic cancer.
We identified genetic instruments for obesity, body shape, dyslipidemia, insulin resistance, and type 2 diabetes in order to evaluate their causal role in pancreatic cancer etiology. These instruments were analyzed in relation to risk using a likelihood-based MR approach within a series of 7110 pancreatic cancer patients and 7264 control subjects using genome-wide data from the Pancreatic Cancer Cohort Consortium (PanScan) and the Pancreatic Cancer Case-Control Consortium (PanC4). Potential unknown pleiotropic effects were assessed using a weighted median approach and MR-Egger sensitivity analyses.
Results indicated a robust causal association of increasing body mass index (BMI) with pancreatic cancer risk (odds ratio OR = 1.34, 95% confidence interval CI = 1.09 to 1.65, for each standard deviation increase in BMI 4.6 kg/m2). There was also evidence that genetically increased fasting insulin levels were causally associated with an increased risk of pancreatic cancer (OR = 1.66, 95% CI = 1.05 to 2.63, per SD 44.4 pmol/L). Notably, no evidence of a causal relationship was observed for type 2 diabetes, nor for dyslipidemia. Sensitivity analyses did not indicate that pleiotropy was an important source of bias.
Our results suggest a causal role of BMI and fasting insulin in pancreatic cancer etiology.
Results from genome-wide association studies (GWAS) can be used to infer causal relationships between phenotypes, using a strategy known as 2-sample Mendelian randomization (2SMR) and bypassing the ...need for individual-level data. However, 2SMR methods are evolving rapidly and GWAS results are often insufficiently curated, undermining efficient implementation of the approach. We therefore developed MR-Base (<ext-link ext-link-type="uri" xlink:href="http://www.mrbase.org">http://www.mrbase.org</ext-link>): a platform that integrates a curated database of complete GWAS results (no restrictions according to statistical significance) with an application programming interface, web app and R packages that automate 2SMR. The software includes several sensitivity analyses for assessing the impact of horizontal pleiotropy and other violations of assumptions. The database currently comprises 11 billion single nucleotide polymorphism-trait associations from 1673 GWAS and is updated on a regular basis. Integrating data with software ensures more rigorous application of hypothesis-driven analyses and allows millions of potential causal relationships to be efficiently evaluated in phenome-wide association studies.
The human proteome is a major source of therapeutic targets. Recent genetic association analyses of the plasma proteome enable systematic evaluation of the causal consequences of variation in plasma ...protein levels. Here we estimated the effects of 1,002 proteins on 225 phenotypes using two-sample Mendelian randomization (MR) and colocalization. Of 413 associations supported by evidence from MR, 130 (31.5%) were not supported by results of colocalization analyses, suggesting that genetic confounding due to linkage disequilibrium is widespread in naïve phenome-wide association studies of proteins. Combining MR and colocalization evidence in cis-only analyses, we identified 111 putatively causal effects between 65 proteins and 52 disease-related phenotypes ( https://www.epigraphdb.org/pqtl/ ). Evaluation of data from historic drug development programs showed that target-indication pairs with MR and colocalization support were more likely to be approved, evidencing the value of this approach in identifying and prioritizing potential therapeutic targets.
Mendelian randomization (MR) is an increasingly important tool for appraising causality in observational epidemiology. The technique exploits the principle that genotypes are not generally ...susceptible to reverse causation bias and confounding, reflecting their fixed nature and Mendel’s first and second laws of inheritance. The approach is, however, subject to important limitations and assumptions that, if unaddressed or compounded by poor study design, can lead to erroneous conclusions. Nevertheless, the advent of 2-sample approaches (in which exposure and outcome are measured in separate samples) and the increasing availability of open-access data from large consortia of genome-wide association studies and population biobanks mean that the approach is likely to become routine practice in evidence synthesis and causal inference research. In this article we provide an overview of the design, analysis, and interpretation of MR studies, with a special emphasis on assumptions and limitations. We also consider different analytic strategies for strengthening causal inference. Although impossible to prove causality with any single approach, MR is a highly cost-effective strategy for prioritizing intervention targets for disease prevention and for strengthening the evidence base for public health policy.
Both positive and negative associations between higher body mass index (BMI) and Parkinson disease (PD) have been reported in observational studies, but it has been difficult to establish causality ...because of the possibility of residual confounding or reverse causation. To our knowledge, Mendelian randomisation (MR)-the use of genetic instrumental variables (IVs) to explore causal effects-has not previously been used to test the effect of BMI on PD.
Two-sample MR was undertaken using genome-wide association (GWA) study data. The associations between the genetic instruments and BMI were obtained from the GIANT consortium and consisted of the per-allele difference in mean BMI for 77 independent variants that reached genome-wide significance. The per-allele difference in log-odds of PD for each of these variants was estimated from a recent meta-analysis, which included 13,708 cases of PD and 95,282 controls. The inverse-variance weighted method was used to estimate a pooled odds ratio (OR) for the effect of a 5-kg/m2 higher BMI on PD. Evidence of directional pleiotropy averaged across all variants was sought using MR-Egger regression. Frailty simulations were used to assess whether causal associations were affected by mortality selection. A combined genetic IV expected to confer a lifetime exposure of 5-kg/m2 higher BMI was associated with a lower risk of PD (OR 0.82, 95% CI 0.69-0.98). MR-Egger regression gave similar results, suggesting that directional pleiotropy was unlikely to be biasing the result (intercept 0.002; p = 0.654). However, the apparent protective influence of higher BMI could be at least partially induced by survival bias in the PD GWA study, as demonstrated by frailty simulations. Other important limitations of this application of MR include the inability to analyse non-linear associations, to undertake subgroup analyses, and to gain mechanistic insights.
In this large study using two-sample MR, we found that variants known to influence BMI had effects on PD in a manner consistent with higher BMI leading to lower risk of PD. The mechanism underlying this apparent protective effect warrants further study.
In the present study, it was hypothesized that disruption of imprinting control in the H19/Igf2 domain may be a mechanism of ethanol-induced growth retardation--a key clinical feature of the fetal ...alcohol spectrum disorders (FASD). To test this prediction, genomic bisulphite sequencing was carried out on 473 bp of the H19 imprinting control region in DNA obtained from midgestation F(1) hybrid mouse embryos (C57BL/6 x Mus musculus castaneus) exposed to ethanol during preimplantation development. Although ethanol-exposed placentae and embryos were severely growth retarded in comparison with saline-treated controls, DNA methylation at paternal and maternal alleles was unaffected in embryos. However, paternal alleles were significantly less methylated in ethanol-treated placentae in comparison with saline-treated controls. Partial correlations suggested that the relationship between ethanol and placental weight partly depended on DNA methylation at a CCCTC-binding factor site on the paternal allele in placentae, suggesting a novel mechanism of ethanol-induced growth retardation. In contrast, partial correlations suggested that embryo growth retardation was independent of placental growth retardation. Relaxation of allele-specific DNA methylation in control placentae in comparison with control embryos was also observed, consistent with a model of imprinting in which 1) regulation of allele-specific DNA methylation in the placenta depends on a stochastic interplay between silencer and enhancer chromatin assembly factors and 2) imprinting control mechanisms in the embryo are more robust to environmental perturbations.
Despite early interest in the health effects of polyunsaturated fatty acids (PUFA), there is still substantial controversy and uncertainty on the evidence linking PUFA to cardiovascular diseases ...(CVDs). We investigated the effect of plasma concentration of omega-3 PUFA (i.e. docosahexaenoic acid (DHA) and total omega-3 PUFA) and omega-6 PUFA (i.e. linoleic acid and total omega-6 PUFA) on the risk of CVDs using Mendelian randomization.
We conducted the largest genome-wide association study (GWAS) of circulating PUFA to date including a sample of 114,999 individuals and incorporated these data in a two-sample Mendelian randomization framework to investigate the involvement of circulating PUFA on a wide range of CVDs in up to 1,153,768 individuals of European ancestry (i.e. coronary artery disease, ischemic stroke, haemorrhagic stroke, heart failure, atrial fibrillation, peripheral arterial disease, aortic aneurysm, venous thromboembolism and aortic valve stenosis).
GWAS identified between 46 and 64 SNPs for the four PUFA traits, explaining 4.8-7.9% of circulating PUFA variance and with mean F statistics >100. Higher genetically predicted DHA (and total omega-3 fatty acids) concentration was related to higher risk of some cardiovascular endpoints; however, these findings did not pass our criteria for multiple testing correction and were attenuated when accounting for LDL-cholesterol through multivariable Mendelian randomization or excluding SNPs in the vicinity of the FADS locus. Estimates for the relation between higher genetically predicted linoleic acid (and total omega-6) concentration were inconsistent across different cardiovascular endpoints and Mendelian randomization methods. There was weak evidence of higher genetically predicted linoleic acid being related to lower risk of ischemic stroke and peripheral artery disease when accounting by LDL-cholesterol.
We have conducted the largest GWAS of circulating PUFA to date and the most comprehensive Mendelian randomization analyses. Overall, our Mendelian randomization findings do not support a protective role of circulating PUFA concentration on the risk of CVDs. However, horizontal pleiotropy via lipoprotein-related traits could be a key source of bias in our analyses.
In Mendelian randomization (MR) analysis, variants that exert horizontal pleiotropy are typically treated as a nuisance. However, they could be valuable in identifying alternative pathways to the ...traits under investigation. Here, we develop MR-TRYX, a framework that exploits horizontal pleiotropy to discover putative risk factors for disease. We begin by detecting outliers in a single exposure-outcome MR analysis, hypothesising they are due to horizontal pleiotropy. We search across hundreds of complete GWAS summary datasets to systematically identify other (candidate) traits that associate with the outliers. We develop a multi-trait pleiotropy model of the heterogeneity in the exposure-outcome analysis due to pathways through candidate traits. Through detailed investigation of several causal relationships, many pleiotropic pathways are uncovered with already established causal effects, validating the approach, but also alternative putative causal pathways. Adjustment for pleiotropic pathways reduces the heterogeneity across the analyses.