This article discusses the augmented inverse propensity weighted (AIPW) estimator as an estimator for average treatment effects. The AIPW combines both the properties of the regression-based ...estimator and the inverse probability weighted (IPW) estimator and is therefore a “doubly robust” method in that it requires only either the propensity or outcome model to be correctly specified but not both. Even though this estimator has been known for years, it is rarely used in practice. After explaining the estimator and proving the double robustness property, I conduct a simulation study to compare the AIPW efficiency with IPW and regression under different scenarios of misspecification. In 2 real-world examples, I provide a step-by-step guide on implementing the AIPW estimator in practice. I show that it is an easily usable method that extends the IPW to reduce variability and improve estimation accuracy.
Highlights
• Average treatment effects are often estimated by regression or inverse probability weighting methods, but both are vulnerable to bias.
• The augmented inverse probability weighted estimator is an easy-to-use method for average treatment effects that can be less biased because of the double robustness property.
The statistical analysis of health care cost data is often problematic because these data are usually non-negative, right-skewed and have excess zeros for non-users. This prevents the use of linear ...models based on the Gaussian or Gamma distribution. A common way to counter this is the use of Two-part or Tobit models, which makes interpretation of the results more difficult. In this study, I explore a statistical distribution from the Tweedie family of distributions that can simultaneously model the probability of zero outcome, i.e. of being a non-user of health care utilization and continuous costs for users.
I assess the usefulness of the Tweedie model in a Monte Carlo simulation study that addresses two common situations of low and high correlation of the users and the non-users of health care utilization. Furthermore, I compare the Tweedie model with several other models using a real data set from the RAND health insurance experiment.
I show that the Tweedie distribution fits cost data very well and provides better fit, especially when the number of non-users is low and the correlation between users and non-users is high.
The Tweedie distribution provides an interesting solution to many statistical problems in health economic analyses.
Machine-learning classifiers mostly offer good predictive performance and are increasingly used to support shared decision-making in clinical practice. Focusing on performance and practicability, ...this study evaluates prediction of patient-reported outcomes (PROs) by eight supervised classifiers including a linear model, following hip and knee replacement surgery.
NHS PRO data (130,945 observations) from April 2015 to April 2017 were used to train and test eight classifiers to predict binary postoperative improvement based on minimal important differences. Area under the receiver operating characteristic, J-statistic and several other metrics were calculated. The dependent outcomes were generic and disease-specific improvement based on the EQ-5D-3L visual analogue scale (VAS) as well as the Oxford Hip and Knee Score (Q score).
The area under the receiver operating characteristic of the best training models was around 0.87 (VAS) and 0.78 (Q score) for hip replacement, while it was around 0.86 (VAS) and 0.70 (Q score) for knee replacement surgery. Extreme gradient boosting, random forests, multistep elastic net and linear model provided the highest overall J-statistics. Based on variable importance, the most important predictors for post-operative outcomes were preoperative VAS, Q score and single Q score dimensions. Sensitivity analysis for hip replacement VAS evaluated the influence of minimal important difference, patient selection criteria as well as additional data years. Together with a small benchmark of the NHS prediction model, robustness of our results was confirmed.
Supervised machine-learning implementations, like extreme gradient boosting, can provide better performance than linear models and should be considered, when high predictive performance is needed. Preoperative VAS, Q score and specific dimensions like limping are the most important predictors for postoperative hip and knee PROMs.
Laser cooling and trapping of atoms and atomic ions has led to advances including the observation of exotic phases of matter, the development of precision sensors and state-of-the-art atomic clocks. ...The same level of control in molecules could also lead to important developments such as controlled chemical reactions and sensitive probes of fundamental theories, but the vibrational and rotational degrees of freedom in molecules pose a challenge for controlling their quantum mechanical states. Here we use quantum-logic spectroscopy, which maps quantum information between two ion species, to prepare and non-destructively detect quantum mechanical states in molecular ions. We develop a general technique for optical pumping and preparation of the molecule into a pure initial state. This enables us to observe high-resolution spectra in a single ion (CaH
) and coherent phenomena such as Rabi flopping and Ramsey fringes. The protocol requires a single, far-off-resonant laser that is not specific to the molecule, so many other molecular ions, including polyatomic species, could be treated using the same methods in the same apparatus by changing the molecular source. Combined with the long interrogation times afforded by ion traps, a broad range of molecular ions could be studied with unprecedented control and precision. Our technique thus represents a critical step towards applications such as precision molecular spectroscopy, stringent tests of fundamental physics, quantum computing and precision control of molecular dynamics.
Causal effect estimates for the association of obesity with health care costs can be biased by reversed causation and omitted variables. In this study, we use genetic variants as instrumental ...variables to overcome these limitations, a method that is often called Mendelian randomization (MR). We describe the assumptions, available methods, and potential pitfalls of using genetic information and how to address them. We estimate the effect of body mass index (BMI) on total health care costs using data from a German observational study and from published large-scale data. In a meta-analysis of several MR approaches, we find that models using genetic instruments identify additional annual costs of €280 for a 1-unit increase in BMI. This is more than 3 times higher than estimates from linear regression without instrumental variables (€75). We found little evidence of a nonlinear relationship between BMI and health care costs. Our results suggest that the use of genetic instruments can be a powerful tool for estimating causal effects in health economic evaluation that might be superior to other types of instruments where there is a strong association with a modifiable risk factor.
To assess the independent causal effect of BMI and type 2 diabetes (T2D) on socioeconomic outcomes by applying two-sample Mendelian randomization (MR) analysis.
We performed univariable and ...multivariable two-sample MR to jointly assess the effect of BMI and T2D on socioeconomic outcomes. We used overlapping genome-wide significant single nucleotide polymorphisms for BMI and T2D as instrumental variables. Their causal impact on household income and regional deprivation was assessed using summary-level data from the UK Biobank.
In the univariable analysis, higher BMI was related to lower income (marginal effect of 1-SD increase in BMI β = -0.092; 95% CI -0.138; -0.047) and higher deprivation (β = 0.051; 95% CI 0.022; 0.079). In the multivariable MR, the effect of BMI controlling for diabetes was slightly lower for income and deprivation. Diabetes was not associated with these outcomes.
High BMI, but not diabetes, shows a causal link with socioeconomic outcomes.
Sugar-sweetened beverages (SSBs) are associated with increased body weight and obesity, which induce a wide array of health impairments such as diabetes or cardiovascular disorders. Excise taxes have ...been introduced to counteract SSB consumption. We investigated the effect of sugar taxes on SSB sales in Hungary and France using a synthetic control approach. For France, we found a slight decrease in SSB sales after tax implementation while overall soft drink sales increased. For Hungary, there was only a short-term decrease in SSB sales which disappeared after 2 years, leading to an overall increase in SSB sales. However, both effects are characterized by great uncertainty.
Mixture modeling is a popular approach to accommodate overdispersion, skewness, and multimodality features that are very common for health care utilization data. However, mixture modeling tends to ...rely on subjective judgment regarding the appropriate number of mixture components or some hypothesis about how to cluster the data. In this work, we adopt a nonparametric, variational Bayesian approach to allow the model to select the number of components while estimating their parameters. Our model allows for a probabilistic classification of observations into clusters and simultaneous estimation of a Gaussian regression model within each cluster. When we apply this approach to data on patients with interstitial lung disease, we find distinct subgroups of patients with differences in means and variances of health care costs, health and treatment covariates, and relationships between covariates and costs. The subgroups identified are readily interpretable, suggesting that this nonparametric variational approach to inference can discover valid insights into the factors driving treatment costs. Moreover, the learning algorithm we employed is very fast and scalable, which should make the technique accessible for a broad range of applications.
In 2002-2003 disease management programs (DMPs) for type 2 diabetes and coronary heart disease were introduced in Germany to improve the management of these conditions. Today around 6 million Germans ...aged 56 and older are enrolled in one of the DMPs; however, their effect on health remains unclear.
We estimated the impact of German DMPs on circulatory and all-cause mortality using a synthetic control study. Specifically, using routinely available data, we compared pre and post-intervention trends in mortality of individuals aged 56 and older for 1998-2014 in Germany to trends in other European countries.
Average circulatory and all-cause mortality in Germany and the synthetic control was 1.63 and 3.24 deaths per 100 persons. Independent of model choice, circulatory and all-cause mortality decreased non-significantly less in Germany than in the synthetic control; for the model with a 3 year time lag, for example, by 0.12 (95%-CI: - 0.20; 0.44) and 0.22 (95%-CI: - 0.40; 0.66) deaths per 100 persons, respectively. Further main analyses, as well as sensitivity and subgroup analyses supported these results.
We observed no effect on circulatory or all-cause mortality at the population-level. However, confidence intervals were wide, meaning we could not reject the possibility of a positive effect. Given the substantial costs for administration and operation of the programs, further comparative effectiveness research is needed to clarify the value of German DMPs for type 2 diabetes and CHD.