Population structure (PS), including population stratification and admixture, is a significant confounder in genome-wide association studies (GWAS), as it may produce spurious associations. Random ...forest (RF) has been increasingly applied in GWAS data analysis because of its advantage in analysing high dimensional genetic data. RF creates importance measures for single nucleotide polymorphisms (SNPs), which are helpful for feature selections. However, if PS is not appropriately corrected, RF tends to give high importance to disease-unrelated SNPs with different frequencies of allele or genotype among subpopulations, leading to inaccurate results.
In this study, the authors propose to correct for the confounding effect of PS by including the information of PS in RF analysis. The correction procedure starts by extracting the information of PS using EIGENSTRAT or multi-dimensional scaling clustering procedure from a large number of structure inference SNPs. Phenotype and genotypes adjusted by the information of PS are then used as the outcome and predictors in RF analysis.
Extensive simulations indicate that the importance measure of the causal SNP is increased following the PS correction. By analysing a real dataset, the proposed correction removes the spurious association between the lactase gene and height.
The authors propose a simple method to correct for PS in RF analysis on GWAS data. Further studies in real GWAS datasets are required to validate the robustness of the proposed approach.
Occupational exposure to endotoxin is associated with decrements in pulmonary function, but how much variation in this association is explained by genetic variants is not well understood.
We aimed to ...identify single nucleotide polymorphisms (SNPs) that are associated with the rate of forced expiratory volume in one second (FEV1) decline by a large scale genetic association study in newly-hired healthy young female cotton textile workers.
DNA samples were genotyped using the Illumina Human CVD BeadChip. Change rate in FEV1 was modeled as a function of each SNP genotype in linear regression model with covariate adjustment. We controlled the type 1 error in study-wide level by permutation method. The false discovery rate (FDR) and the family-wise error rate (FWER) were set to be 0.10 and 0.15 respectively.
Two SNPs were found to be significant (P<6.29×10(-5)), including rs1910047 (P = 3.07×10(-5), FDR = 0.0778) and rs9469089 (P = 6.19×10(-5), FDR = 0.0967), as well as other eight suggestive (P<5×10(-4)) associated SNPs. Gene-gene and gene-environment interactions were also observed, such as rs1910047 and rs1049970 (P = 0.0418, FDR = 0.0895); rs9469089 and age (P = 0.0161, FDR = 0.0264). Genetic risk score analysis showed that the more risk loci the subjects carried, the larger the rate of FEV1 decline occurred (P trend = 3.01×10(-18)). However, the association was different among age subgroups (P = 7.11×10(-6)) and endotoxin subgroups (P = 1.08×10(-2)). Functional network analysis illustrates potential biological connections of all interacted genes.
Genetic variants together with environmental factors interact to affect the rate of FEV1 decline in cotton textile workers.
The development of cancer is driven by the accumulation of many oncogenesis-related genetic alterations and tumorigenesis is triggered by complex networks of involved genes rather than independent ...actions. To explore the epistasis existing among oncogenesis-related genes in lung cancer development, we conducted pairwise genetic interaction analyses among 35,031 SNPs from 2027 oncogenesis-related genes. The genotypes from three independent genome-wide association studies including a total of 24,037 lung cancer patients and 20,401 healthy controls with Caucasian ancestry were analyzed in the study. Using a two-stage study design including discovery and replication studies, and stringent Bonferroni correction for multiple statistical analysis, we identified significant genetic interactions between SNPs in
(OR=0.44,
value=3.27x10
in overall lung cancer and OR=0.41,
value=9.71x10
in non-small cell lung cancer),
(OR=0.73,
value=1.01x10
in adenocarcinoma) and
(OR=1.82,
value=7.62x10
in squamous cell carcinoma) in our analysis. None of these genes have been identified from previous main effect association studies in lung cancer. Further eQTL gene expression analysis in lung tissues provided information supporting the functional role of the identified epistasis in lung tumorigenesis. Gene set enrichment analysis revealed potential pathways and gene networks underlying molecular mechanisms in overall lung cancer as well as histology subtypes development. Our results provide evidence that genetic interactions between oncogenesis-related genes play an important role in lung tumorigenesis and epistasis analysis, combined with functional annotation, provides a valuable tool for uncovering functional novel susceptibility genes that contribute to lung cancer development by interacting with other modifier genes.
Body mass index (BMI) is inversely associated with lung cancer risk in observational studies, even though it increases the risk of several other cancers, which could indicate confounding by tobacco ...smoking or reverse causality. We used the two-sample Mendelian randomization (MR) approach to circumvent these limitations of observational epidemiology by constructing a genetic instrument for BMI, based on results from the GIANT consortium, which was evaluated in relation to lung cancer risk using GWAS results on 16,572 lung cancer cases and 21,480 controls. Results were stratified by histological subtype, smoking status and sex. An increase of one standard deviation (SD) in BMI (4.65 Kg/m(2)) raised the risk for lung cancer overall (OR = 1.13; P = 0.10). This was driven by associations with squamous cell (SQ) carcinoma (OR = 1.45; P = 1.2 × 10(-3)) and small cell (SC) carcinoma (OR = 1.81; P = 0.01). An inverse trend was seen for adenocarcinoma (AD) (OR = 0.82; P = 0.06). In stratified analyses, a 1 SD increase in BMI was inversely associated with overall lung cancer in never smokers (OR = 0.50; P = 0.02). These results indicate that higher BMI may increase the risk of certain types of lung cancer, in particular SQ and SC carcinoma.
The matrix metalloproteinase (MMP) family degrade extracellular matrix and mediate pathways including apoptosis, angiogenesis and immunity. We studied the association between four MMP polymorphisms ...within three MMP genes and esophageal adenocarcinoma (EA) risk and prognosis. A total of 313 EA cases and 455 age and gender frequency-matched controls were genotyped for MMP1 1G/2G, MMP3 6A/5A, MMP12 −82A/G and MMP12 1082A/G. The association between individual MMP polymorphisms and EA risk was evaluated using regression models and adjusted for age, gender, adult body mass index and smoking status. Haplotype analysis was performed to investigate the combined effect of all four linked MMP polymorphisms and EA risk. The MMP1 and MMP3 polymorphisms were associated with increased EA risk: MMP1 1G/2G and 2G/2G had adjusted odds ratios of 1.46 95% confidence interval 1.0–2.1; P = 0.04 and adjusted odds ratio 1.83 (1.2–2.8; P = 0.005), respectively, whereas MMP3 6A/5A had adjusted odds ratio 1.40 (95% confidence interval 1.0–2.1; P = 0.09) and MMP3 5A/5A had 1.61 (95% confidence interval 1.0–2.5; P = 0.03). Two MMP haplotypes MMP1–MMP3–MMP12 (−82) 2G-5A-A (adjusted odds ratio 1.36, 95% confidence interval 1.0–1.8; P = 0.03) and 2G-5A-G (adjusted odds ratio 1.70, 95% confidence interval 1.1–2.6; P = 0.01) were also associated with increased EA risk. The relationship between BE cases with the same set of controls was similar. No association was identified between the MMP polymorphisms and overall survival or progression free survival of patients with EA. MMP1, MMP3 and possibly MMP12 −82A/G polymorphisms and their haplotypes are associated with increased EA risk.
Background: Traffic-related particles (TRPs) are associated with adverse cardiovascular events. The exact mechanisms are unclear, but systemic inflammatory responses likely play a role. Objectives: ...We conducted a repeated measures study among male participants of the Normative Aging Study in the greater Boston, Massachusetts, area to determine whether individual-level residential black carbon (BC), a marker of TRPs, is associated with systemic inflammation and whether coronary heart disease (CHD), diabetes, and obesity modify associations. Methods: We quantified markers of inflammation in 1,163 serum samples from 580 men. Exposure to BC up to 4 weeks prior was predicted from a validated spatiotemporal land-use regression model. Linear mixed effects models estimated the effects of BC on each marker while adjusting for potential confounders. Results: Associations between BC and blood markers were not observed in main effects models or when stratified by obesity status. However, BC was positively associated with markers of inflammation in men with CHD (particularly vascular endothelial growth factor) and in men with diabetes (particularly interleukin-lß and tumor necrosis factor-α). Significant exposure time windows varied by marker, although in general the strongest associations were observed with moving averages of 2-7 days after a lag of several days. Conclusions: In an elderly male population, estimated BC exposures were positively associated with markers of systemic inflammation but only in men with CHD or diabetes.
Arsenic exposure has been associated with low birth weight. However, the underlying mechanisms are not well understood. Alterations to metabolites may act as causal mediators of the effect of arsenic ...exposure on low birth weight. This pilot study aimed to explore the role of metabolites in mediating the association of arsenic exposure on infant birth weight. Study samples were selected from a well-established prospectively enrolled cohort in Bangladesh comprising 35 newborns and a subset of 20 matched mothers. Metabolomics profiling was performed on 35 cord blood samples and 20 maternal peripheral blood samples collected during the second trimester of pregnancy. Inorganic arsenic (iAs) exposure was evaluated via cord blood samples and maternal toenail samples collected during the first trimester. Multiple linear regression and mediation analyses were used to explore the relationship between iAs exposure, metabolite alterations, and low birth weight. Cord blood arsenic level was correlated with elevated levels of 17-methylstearate, laurate (12:0) and 4-vinylphenol sulfate along with lower birth weight. Prenatal maternal toenail iAs level was associated with two peripheral blood metabolites (butyrylqlycine and tartarate), which likely contributed to higher cord blood iAs levels both independently and interactively. Findings of this pilot study indicate that both intrauterine and maternal peripheral blood metabolites appear to influence the toxic effect of inorganic arsenic exposure on low birth weight.
Purpose: To investigate the hypothesis that non-steroidal anti-inflammatory drugs (NSAIDs) lower lung cancer risk. Methods: We analysed pooled individual-level data from seven case-control and one ...cohort study in the International Lung Cancer Consortium (ILCCO). Relative risks for lung cancer associated with self-reported history of aspirin and other NSAID use were estimated within individual studies using logistic regression or proportional hazards models, adjusted for packyears of smoking, age, calendar period, ethnicity and education and were combined using random effects meta-analysis. Results: A total of 4,309 lung cancer cases (mean age at diagnosis 65 years, 45% adenocarcinoma and 22% squamous-cell carcinoma) and 58,301 non-cases/controls were included. Amongst controls, 34% had used NSAIDs in the past (81% of them used aspirin). After adjustment for negative confounding by smoking, ever-NSAID use (affirmative answer to the study-specific question on NSAID use) was associated with a 26% reduction (95% confidence interval 8 to 41%) in lung cancer risk in men, but not in women (3% increase (— 11% to 30%)). In men, the association was stronger in current and former smokers, and for squamous-cell carcinoma than for adenocarcinomas, but there was no trend with duration of use. No differences were found in the effects on lung cancer risk of aspirin and non-aspirin NSAIDs. Conclusions: Evidence from ILCCO suggests that NSAID use in men confers a modest protection for lung cancer, especially amongst ever-smokers. Additional investigation is needed regarding the possible effects of age, duration, dose and type of NSAID and whether effect modification by smoking status or sex exists.