The number of Mendelian randomization (MR) analyses including large numbers of genetic variants is rapidly increasing. This is due to the proliferation of genome‐wide association studies, and the ...desire to obtain more precise estimates of causal effects. Since it is unlikely that all genetic variants will be valid instrumental variables, several robust methods have been proposed. We compare nine robust methods for MR based on summary data that can be implemented using standard statistical software. Methods were compared in three ways: by reviewing their theoretical properties, in an extensive simulation study, and in an empirical example. In the simulation study, the best method, judged by mean squared error was the contamination mixture method. This method had well‐controlled Type 1 error rates with up to 50% invalid instruments across a range of scenarios. Other methods performed well according to different metrics. Outlier‐robust methods had the narrowest confidence intervals in the empirical example. With isolated exceptions, all methods performed badly when over 50% of the variants were invalid instruments. Our recommendation for investigators is to perform a variety of robust methods that operate in different ways and rely on different assumptions for valid inferences to assess the reliability of MR analyses.
Obesity is a worldwide epidemic that has been associated with a plurality of diseases in observational studies. The aim of this study was to summarize the evidence from Mendelian randomization (MR) ...studies of the association between body mass index (BMI) and chronic diseases.
PubMed and Embase were searched for MR studies on adult BMI in relation to major chronic diseases, including diabetes mellitus; diseases of the circulatory, respiratory, digestive, musculoskeletal, and nervous systems; and neoplasms. A meta-analysis was performed for each disease by using results from published MR studies and corresponding de novo analyses based on summary-level genetic data from the FinnGen consortium (n = 218,792 individuals).
In a meta-analysis of results from published MR studies and de novo analyses of the FinnGen consortium, genetically predicted higher BMI was associated with increased risk of type 2 diabetes mellitus, 14 circulatory disease outcomes, asthma, chronic obstructive pulmonary disease, five digestive system diseases, three musculoskeletal system diseases, and multiple sclerosis as well as cancers of the digestive system (six cancer sites), uterus, kidney, and bladder. In contrast, genetically predicted higher adult BMI was associated with a decreased risk of Dupuytren's disease, osteoporosis, and breast, prostate, and non-melanoma cancer, and not associated with Alzheimer's disease, amyotrophic lateral sclerosis, or Parkinson's disease.
The totality of the evidence from MR studies supports a causal role of excess adiposity in a plurality of chronic diseases. Hence, continued efforts to reduce the prevalence of overweight and obesity are a major public health goal.
Mendelian randomization is a powerful tool for inferring the presence, or otherwise, of causal effects from observational data. However, the nature of genetic variants is such that pleiotropy remains ...a barrier to valid causal effect estimation. There are many options in the literature for pleiotropy robust methods when studying the effects of a single risk factor on an outcome. However, there are few pleiotropy robust methods in the multivariable setting, that is, when there are multiple risk factors of interest. In this article we introduce three methods which build on common approaches in the univariable setting: MVMR‐Robust; MVMR‐Median; and MVMR‐Lasso. We discuss the properties of each of these methods and examine their performance in comparison to existing approaches in a simulation study. MVMR‐Robust is shown to outperform existing outlier robust approaches when there are low levels of pleiotropy. MVMR‐Lasso provides the best estimation in terms of mean squared error for moderate to high levels of pleiotropy, and can provide valid inference in a three sample setting. MVMR‐Median performs well in terms of estimation across all scenarios considered, and provides valid inference up to a moderate level of pleiotropy. We demonstrate the methods in an applied example looking at the effects of intelligence, education and household income on the risk of Alzheimer's disease.
Undertake a systematic investigation into associations between genetic predictors of lipid fractions and age-related macular degeneration (AMD) risk.
Two-sample Mendelian randomization investigation ...using published data.
A total of 33 526 individuals (16 144 cases, 17 832 controls) predominantly of European ancestry from the International Age-related Macular Degeneration Genomics Consortium.
We consider 185 variants previously demonstrated to be associated with at least 1 of low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol, or triglycerides at a genome-wide level of significance, and test their associations with AMD. We particularly focus on variants in gene regions that are proxies for specific pharmacologic agents for lipid therapy. We then conduct a 2-sample Mendelian randomization investigation to assess the causal roles of LDL-cholesterol, HDL-cholesterol, and triglycerides on AMD risk. We also conduct parallel investigations for coronary artery disease (CAD) (viewed as a positive control) and Alzheimer's disease (a negative control) for comparison.
Diagnosis of AMD.
We find evidence that HDL-cholesterol is a causal risk factor for AMD, with an odds ratio (OR) estimate of 1.22 (95% confidence interval CI, 1.03–1.44) per 1 standard deviation increase in HDL-cholesterol. No causal effect of LDL-cholesterol or triglycerides was found. Variants in the CETP gene region associated with increased circulating HDL-cholesterol also associate with increased AMD risk, although variants in the LIPC gene region that increase circulating HDL-cholesterol have the opposite direction of association with AMD risk. Parallel analyses suggest that lipids have a greater role for AMD compared with Alzheimer's disease, but a lesser role than for CAD.
Some genetic evidence suggests that HDL-cholesterol is a causal risk factor for AMD risk and that increasing HDL-cholesterol (particularly via CETP inhibition) will increase AMD risk.
Modern high-throughput experiments provide a rich resource to investigate causal determinants of disease risk. Mendelian randomization (MR) is the use of genetic variants as instrumental variables to ...infer the causal effect of a specific risk factor on an outcome. Multivariable MR is an extension of the standard MR framework to consider multiple potential risk factors in a single model. However, current implementations of multivariable MR use standard linear regression and hence perform poorly with many risk factors. Here, we propose a two-sample multivariable MR approach based on Bayesian model averaging (MR-BMA) that scales to high-throughput experiments. In a realistic simulation study, we show that MR-BMA can detect true causal risk factors even when the candidate risk factors are highly correlated. We illustrate MR-BMA by analysing publicly-available summarized data on metabolites to prioritise likely causal biomarkers for age-related macular degeneration.
ABSTRACT
Mendelian randomization analyses are often performed using summarized data. The causal estimate from a one‐sample analysis (in which data are taken from a single data source) with weak ...instrumental variables is biased in the direction of the observational association between the risk factor and outcome, whereas the estimate from a two‐sample analysis (in which data on the risk factor and outcome are taken from non‐overlapping datasets) is less biased and any bias is in the direction of the null. When using genetic consortia that have partially overlapping sets of participants, the direction and extent of bias are uncertain. In this paper, we perform simulation studies to investigate the magnitude of bias and Type 1 error rate inflation arising from sample overlap. We consider both a continuous outcome and a case‐control setting with a binary outcome. For a continuous outcome, bias due to sample overlap is a linear function of the proportion of overlap between the samples. So, in the case of a null causal effect, if the relative bias of the one‐sample instrumental variable estimate is 10% (corresponding to an F parameter of 10), then the relative bias with 50% sample overlap is 5%, and with 30% sample overlap is 3%. In a case‐control setting, if risk factor measurements are only included for the control participants, unbiased estimates are obtained even in a one‐sample setting. However, if risk factor data on both control and case participants are used, then bias is similar with a binary outcome as with a continuous outcome. Consortia releasing publicly available data on the associations of genetic variants with continuous risk factors should provide estimates that exclude case participants from case‐control samples.
Instrumental variable analysis is an approach for obtaining causal inferences on the effect of an exposure (risk factor) on an outcome from observational data. It has gained in popularity over the ...past decade with the use of genetic variants as instrumental variables, known as Mendelian randomization. An instrumental variable is associated with the exposure, but not associated with any confounder of the exposure–outcome association, nor is there any causal pathway from the instrumental variable to the outcome other than via the exposure. Under the assumption that a single instrumental variable or a set of instrumental variables for the exposure is available, the causal effect of the exposure on the outcome can be estimated. There are several methods available for instrumental variable estimation; we consider the ratio method, two-stage methods, likelihood-based methods, and semi-parametric methods. Techniques for obtaining statistical inferences and confidence intervals are presented. The statistical properties of estimates from these methods are compared, and practical advice is given about choosing a suitable analysis method. In particular, bias and coverage properties of estimators are considered, especially with weak instruments. Settings particularly relevant to Mendelian randomization are prioritized in the paper, notably the scenario of a continuous exposure and a continuous or binary outcome.
About half of the Amazon rainforest is subject to seasonal droughts of 3 months or more. Despite this drought, several studies have shown that these forests, under a strongly seasonal climate, do not ...exhibit significant water stress during the dry season. In addition to deep soil water uptake, another contributing explanation for the absence of plant water stress during drought is the process of hydraulic redistribution; the nocturnal transfer of water by roots from moist to dry regions of the soil profile. Here, we present data on patterns of soil moisture and sap flow in roots of three dimorphic-rooted species in the Tapajós Forest, Amazônia, which demonstrate both upward (hydraulic lift) and downward hydraulic redistribution. We measured sap flow in lateral and tap roots of our three study species over a 2-year period using the heat ratio method, a sap-flow technique that allows bi-directional measurement of water flow. On certain nights during the dry season, reverse or acropetal flow (i.e.,in the direction of the soil) in the lateral roots and positive or basipetal sap flow (toward the plant) in the tap roots of Coussarea racemosa (caferana), Manilkara huberi (maçaranduba) and Protium robustum (breu) were observed, a pattern consistent with upward hydraulic redistribution (hydraulic lift). With the onset of heavy rains, this pattern reversed, with continuous night-time acropetal sap flow in the tap root and basipetal sap flow in lateral roots, indicating water movement from wet top soil to dry deeper soils (downward hydraulic redistribution). Both patterns were present in trees within a rainfall exclusion plot (Seca Floresta) and to a more limited extent in the control plot. Although hydraulic redistribution has traditionally been associated with arid or strongly seasonal environments, our findings now suggest that it is important in ameliorating water stress and improving rain infiltration in Amazonian rainforests. This has broad implications for understanding and modeling ecosystem process and forest function in this important biome.
An observational correlation between a suspected risk factor and an outcome does not necessarily imply that interventions on levels of the risk factor will have a causal impact on the outcome ...(correlation is not causation). If genetic variants associated with the risk factor are also associated with the outcome, then this increases the plausibility that the risk factor is a causal determinant of the outcome. However, if the genetic variants in the analysis do not have a specific biological link to the risk factor, then causal claims can be spurious. We review the Mendelian randomization paradigm for making causal inferences using genetic variants. We consider monogenic analysis, in which genetic variants are taken from a single gene region, and polygenic analysis, which includes variants from multiple regions. We focus on answering two questions: When can Mendelian randomization be used to make reliable causal inferences, and when can it be used to make relevant causal inferences?