ObjectivesThere is growing recognition that simultaneously assessing multiple exposures may reduce false positive discoveries and improve epidemiological effect estimates. We evaluated the ...performance of statistical methods for identifying exposure–outcome associations across various data structures typical of environmental and occupational epidemiology analyses.MethodsWe simulated a case–control study, generating 100 data sets for each of 270 different simulation scenarios; varying the number of exposure variables, the correlation between exposures, sample size, the number of effective exposures and the magnitude of effect estimates. We compared conventional analytical approaches, that is, univariable (with and without multiplicity adjustment), multivariable and stepwise logistic regression, with variable selection methods: sparse partial least squares discriminant analysis, boosting, and frequentist and Bayesian penalised regression approaches.ResultsThe variable selection methods consistently yielded more precise effect estimates and generally improved selection accuracy compared with conventional logistic regression methods, especially for scenarios with higher correlation levels. Penalised lasso and elastic net regression both seemed to perform particularly well, specifically when statistical inference based on a balanced weighting of high sensitivity and a low proportion of false discoveries is sought.ConclusionsIn this extensive simulation study with multicollinear data, we found that most variable selection methods consistently outperformed conventional approaches, and demonstrated how performance is influenced by the structure of the data and underlying model.
Abstract
Metal exposure has been suggested as a possible environmental risk factor for Parkinson disease (PD). We searched the PubMed, EMBASE, and Cochrane databases to systematically review the ...literature on the relationship between metal exposure and PD risk and to examine the overall quality of each study and the exposure assessment method. A total of 83 case-control studies and 5 cohort studies published during the period 1963–July 2021 were included, of which 73 were graded as being of low or moderate overall quality. Investigators in 69 studies adopted self-reported exposure and biomonitoring after disease diagnosis for exposure assessment approaches. The meta-analyses showed that concentrations of copper and iron in serum and concentrations of zinc in either serum or plasma were lower, while concentrations of magnesium in CSF and zinc in hair were higher, among PD cases as compared with controls. Cumulative lead levels in bone were found to be associated with increased risk of PD. We did not find associations between other metals and PD. The current level of evidence for associations between metals and PD risk is limited, as biases from methodological limitations cannot be ruled out. High-quality studies assessing metal levels before disease onset are needed to improve our understanding of the role of metals in the etiology of PD.
Diesel engine exhaust (DEE) has recently been classified as a known human carcinogen.
We derived a meta-exposure-response curve (ERC) for DEE and lung cancer mortality and estimated lifetime excess ...risks (ELRs) of lung cancer mortality based on assumed occupational and environmental exposure scenarios.
We conducted a meta-regression of lung cancer mortality and cumulative exposure to elemental carbon (EC), a proxy measure of DEE, based on relative risk (RR) estimates reported by three large occupational cohort studies (including two studies of workers in the trucking industry and one study of miners). Based on the derived risk function, we calculated ELRs for several lifetime occupational and environmental exposure scenarios and also calculated the fractions of annual lung cancer deaths attributable to DEE.
We estimated a lnRR of 0.00098 (95% CI: 0.00055, 0.0014) for lung cancer mortality with each 1-μg/m3-year increase in cumulative EC based on a linear meta-regression model. Corresponding lnRRs for the individual studies ranged from 0.00061 to 0.0012. Estimated numbers of excess lung cancer deaths through 80 years of age for lifetime occupational exposures of 1, 10, and 25 μg/m3 EC were 17, 200, and 689 per 10,000, respectively. For lifetime environmental exposure to 0.8 μg/m3 EC, we estimated 21 excess lung cancer deaths per 10,000. Based on broad assumptions regarding past occupational and environmental exposures, we estimated that approximately 6% of annual lung cancer deaths may be due to DEE exposure.
Combined data from three U.S. occupational cohort studies suggest that DEE at levels common in the workplace and in outdoor air appear to pose substantial excess lifetime risks of lung cancer, above the usually acceptable limits in the United States and Europe, which are generally set at 1/1,000 and 1/100,000 based on lifetime exposure for the occupational and general population, respectively.
The exposome constitutes a promising framework to improve understanding of the effects of environmental exposures on health by explicitly considering multiple testing and avoiding selective ...reporting. However, exposome studies are challenged by the simultaneous consideration of many correlated exposures.
We compared the performances of linear regression-based statistical methods in assessing exposome-health associations.
In a simulation study, we generated 237 exposure covariates with a realistic correlation structure and with a health outcome linearly related to 0 to 25 of these covariates. Statistical methods were compared primarily in terms of false discovery proportion (FDP) and sensitivity.
On average over all simulation settings, the elastic net and sparse partial least-squares regression showed a sensitivity of 76% and an FDP of 44%; Graphical Unit Evolutionary Stochastic Search (GUESS) and the deletion/substitution/addition (DSA) algorithm revealed a sensitivity of 81% and an FDP of 34%. The environment-wide association study (EWAS) underperformed these methods in terms of FDP (average FDP, 86%) despite a higher sensitivity. Performances decreased considerably when assuming an exposome exposure matrix with high levels of correlation between covariates.
Correlation between exposures is a challenge for exposome research, and the statistical methods investigated in this study were limited in their ability to efficiently differentiate true predictors from correlated covariates in a realistic exposome context. Although GUESS and DSA provided a marginally better balance between sensitivity and FDP, they did not outperform the other multivariate methods across all scenarios and properties examined, and computational complexity and flexibility should also be considered when choosing between these methods. Citation: Agier L, Portengen L, Chadeau-Hyam M, Basagaña X, Giorgis-Allemand L, Siroux V, Robinson O, Vlaanderen J, González JR, Nieuwenhuijsen MJ, Vineis P, Vrijheid M, Slama R, Vermeulen R. 2016. A systematic comparison of linear regression-based statistical methods to assess exposome-health associations. Environ Health Perspect 124:1848-1856; http://dx.doi.org/10.1289/EHP172.
Some legacy and emerging environmental contaminants are suspected risk factors for intrauterine growth restriction. However, the evidence is equivocal, in part due to difficulties in disentangling ...the effects of mixtures.
We assessed associations between multiple correlated biomarkers of environmental exposure and birth weight.
We evaluated a cohort of 1,250 term (≥ 37 weeks gestation) singleton infants, born to 513 mothers from Greenland, 180 from Poland, and 557 from Ukraine, who were recruited during antenatal care visits in 2002-2004. Secondary metabolites of diethylhexyl and diisononyl phthalates (DEHP, DiNP), eight perfluoroalkyl acids, and organochlorines (PCB-153 and p,p´-DDE) were quantifiable in 72-100% of maternal serum samples. We assessed associations between exposures and term birth weight, adjusting for co-exposures and covariates, including prepregnancy body mass index. To identify independent associations, we applied the elastic net penalty to linear regression models.
Two phthalate metabolites (MEHHP, MOiNP), perfluorooctanoic acid (PFOA), and p,p´-DDE were most consistently predictive of term birth weight based on elastic net penalty regression. In an adjusted, unpenalized regression model of the four exposures, 2-SD increases in natural log-transformed MEHHP, PFOA, and p,p´-DDE were associated with lower birth weight: -87 g (95% CI: -137, -340 per 1.70 ng/mL), -43 g (95% CI: -108, 23 per 1.18 ng/mL), and -135 g (95% CI: -192, -78 per 1.82 ng/g lipid), respectively; and MOiNP was associated with higher birth weight (46 g; 95% CI: -5, 97 per 2.22 ng/mL).
This study suggests that several of the environmental contaminants, belonging to three chemical classes, may be independently associated with impaired fetal growth. These results warrant follow-up in other cohorts.
The genotoxicity of benzene has been investigated in dozens of biomonitoring studies, mainly by studying (classical) chromosomal aberrations (CAs) or micronuclei (MN) as markers of DNA damage. Both ...have been shown to be predictive of future cancer risk in cohort studies and could, therefore, potentially be used for risk assessment of genotoxicity-mediated cancers.
We sought to estimate an exposure-response curve (ERC) and quantify between-study heterogeneity using all available quantitative evidence on the cytogenetic effects of benzene exposure on CAs and MN respectively.
We carried out a systematic literature review and summarized all available data of sufficient quality using meta-analyses. We assessed the heterogeneity in slope estimates between studies and conducted additional sensitivity analyses to assess how various study characteristics impacted the estimated ERC.
Sixteen CA (1,356 individuals) and
studies (2,097 individuals) were found to be eligible for inclusion in a meta-analysis. Studies where benzene was the primary genotoxic exposure and that had adequate assessment of both exposure and outcomes were used for the primary analysis. Estimated slope estimates were an increase of 0.27% CA (95% CI: 0.08%, 0.47%); based on the results from 4 studies and 0.27% MN (95% CI:
, 0.76%); based on the results from 7 studies per parts-per-million benzene exposure. We observed considerable between-study heterogeneity for both end points (
).
Our study provides a systematic, transparent, and quantitative summary of the literature describing the strong association between benzene exposure and accepted markers of genotoxicity in humans. The derived consensus slope can be used as a best estimate of the quantitative relationship between real-life benzene exposure and genetic damage in future risk assessment. We also quantitate the large between-study heterogeneity that exists in this literature, a factor which is crucial for the interpretation of single-study or consensus slopes. https://doi.org/10.1289/EHP6404.
•Prospective exploration of environmental determinants of headache.•Headache appears to be a transient condition in the population.•Air pollution and urban temperature contribute to the reporting of ...headache.•The largest effect was observed for NO2, PM10, and heat island effect.
Headache is one of the most prevalent and disabling health conditions globally. We prospectively explored the urban exposome in relation to weekly occurrence of headache episodes using data from the Dutch population-based Occupational and Environmental Health Cohort Study (AMIGO).
Participants (N = 7,339) completed baseline and follow-up questionnaires in 2011 and 2015, reporting headache frequency. Information on the urban exposome covered 80 exposures across 10 domains, such as air pollution, electromagnetic fields, and lifestyle and socio-demographic characteristics. We first identified all relevant exposures using the Boruta algorithm and then, for each exposure separately, we estimated the average treatment effect (ATE) and related standard error (SE) by training causal forests adjusted for age, depression diagnosis, painkiller use, general health indicator, sleep disturbance index and weekly occurrence of headache episodes at baseline.
Occurrence of weekly headache was 12.5 % at baseline and 11.1 % at follow-up. Boruta selected five air pollutants (NO2, NOX, PM10, silicon in PM10, iron in PM2.5) and one urban temperature measure (heat island effect) as factors contributing to the occurrence of weekly headache episodes at follow-up. The estimated causal effect of each exposure on weekly headache indicated positive associations. NO2 showed the largest effect (ATE = 0.007 per interquartile range (IQR) increase; SE = 0.004), followed by PM10 (ATE = 0.006 per IQR increase; SE = 0.004), heat island effect (ATE = 0.006 per one-degree Celsius increase; SE = 0.007), NOx (ATE = 0.004 per IQR increase; SE = 0.004), iron in PM2.5 (ATE = 0.003 per IQR increase; SE = 0.004), and silicon in PM10 (ATE = 0.003 per IQR increase; SE = 0.004).
Our results suggested that exposure to air pollution and heat island effects contributed to the reporting of weekly headache episodes in the study population.