The risk of APOE for Alzheimer's disease (AD) is modified by age. Beyond APOE, the polygenic architecture may also be heterogeneous across age. We aim to investigate age-related genetic heterogeneity ...of AD and identify genomic loci with differential effects across age. Stratified gene-based genome-wide association studies and polygenic variation analyses were performed in the younger (60–79 years, N = 14,895) and older (≥80 years, N = 6559) age-at-onset groups using Alzheimer's Disease Genetics Consortium data. We showed a moderate genetic correlation (rg = 0.64) between the two age groups, supporting genetic heterogeneity. Heritability explained by variants on chromosome 19 (harboring APOE) was significantly larger in younger than in older onset group (p < 0.05). APOE region, BIN1, OR2S2, MS4A4E, and PICALM were identified at the gene-based genome-wide significance (p < 2.73 × 10−6) with larger effects at younger age (except MS4A4E). For the novel gene OR2S2, we further performed leave-one-out analyses, which showed consistent effects across subsamples. Our results suggest using genetically more homogeneous individuals may help detect additional susceptible loci.
•Genetic heterogeneity of Alzheimer's disease between the younger and older patients.•Heritability explained by chromosome 19 was markedly larger at younger than older age.•A novel gene, OR2S2, was found in the younger individuals using gene-based genome-wide association studies.•APOE, BIN1, OR2S2, and PICALM had stronger effects in the younger than the older age.•Our strategy may help identify divergent biological mechanisms and Alzheimer's disease subtypes.
Personality is influenced by genetic and environmental factors and associated with mental health. However, the underlying genetic determinants are largely unknown. We identified six genetic loci, ...including five novel loci, significantly associated with personality traits in a meta-analysis of genome-wide association studies (N = 123,132-260,861). Of these genome-wide significant loci, extraversion was associated with variants in WSCD2 and near PCDH15, and neuroticism with variants on chromosome 8p23.1 and in L3MBTL2. We performed a principal component analysis to extract major dimensions underlying genetic variations among five personality traits and six psychiatric disorders (N = 5,422-18,759). The first genetic dimension separated personality traits and psychiatric disorders, except that neuroticism and openness to experience were clustered with the disorders. High genetic correlations were found between extraversion and attention-deficit-hyperactivity disorder (ADHD) and between openness and schizophrenia and bipolar disorder. The second genetic dimension was closely aligned with extraversion-introversion and grouped neuroticism with internalizing psychopathology (e.g., depression or anxiety).
Lung cancer survivors are at high risk of developing a second primary lung cancer (SPLC). However, SPLC risk factors have not been established and the impact of tobacco smoking remains controversial. ...We examined the risk factors for SPLC across multiple epidemiologic cohorts and evaluated the impact of smoking cessation on reducing SPLC risk.
We analyzed data from 7059 participants in the Multiethnic Cohort (MEC) diagnosed with an initial primary lung cancer (IPLC) between 1993 and 2017. Cause-specific proportional hazards models estimated SPLC risk. We conducted validation studies using the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial (N = 3423 IPLC cases) and European Prospective Investigation into Cancer and Nutrition (N = 4731 IPLC cases) cohorts and pooled the SPLC risk estimates using random effects meta-analysis.
Overall, 163 MEC cases (2.3%) developed SPLC. Smoking pack-years (hazard ratio HR = 1.18 per 10 pack-years, p < 0.001) and smoking intensity (HR = 1.30 per 10 cigarettes per day, p < 0.001) were significantly associated with increased SPLC risk. Individuals who met the 2013 U.S. Preventive Services Task Force’s screening criteria at IPLC diagnosis also had an increased SPLC risk (HR = 1.92; p < 0.001). Validation studies with the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial and European Prospective Investigation into Cancer and Nutrition revealed consistent results. Meta-analysis yielded pooled HRs of 1.16 per 10 pack-years (pmeta < 0.001), 1.25 per 10 cigarettes per day (pmeta < 0.001), and 1.99 (pmeta < 0.001) for meeting the U.S. Preventive Services Task Force’s criteria. In MEC, smoking cessation after IPLC diagnosis was associated with an 83% reduction in SPLC risk (HR = 0.17; p < 0.001).
Tobacco smoking is a risk factor for SPLC. Smoking cessation may reduce the risk of SPLC. Additional strategies for SPLC surveillance and screening are warranted.
We propose an iterative variable selection method for the accelerated failure time model using high-dimensional survival data. Our method pioneers the use of the recently proposed structured ...screen-and-select framework for survival analysis. We use the marginal utility as the measure of association to inform the structured screening process. For the selection steps, we use Bayesian model selection based on non-local priors. We compare the proposed method with a few well-known methods. Assessment in terms of true positive rate and false discovery rate shows the usefulness of our method. We have implemented the method within the R package GWASinlps.
Antipsychotic drugs were incidentally discovered in the 1950s, but their mechanisms of action are still not understood. Better understanding of schizophrenia pathogenesis could shed light on actions ...of current drugs and reveal novel "druggable" pathways for unmet therapeutic needs. Recent genome-wide association studies offer unprecedented opportunities to characterize disease gene networks and uncover drug-disease relationships. Polygenic overlap between schizophrenia risk genes and antipsychotic drug targets has been demonstrated, but specific genes and pathways constituting this overlap are undetermined. Risk genes of polygenic disorders do not operate in isolation but in combination with other genes through protein-protein interactions among gene product.
The protein interactome was used to map antipsychotic drug targets (N=88) to networks of schizophrenia risk genes (N=328).
Schizophrenia risk genes were significantly localized in the interactome, forming a distinct disease module. Core genes of the module were enriched for genes involved in developmental biology and cognition, which may have a central role in schizophrenia etiology. Antipsychotic drug targets overlapped with the core disease module and comprised multiple pathways beyond dopamine. Some important risk genes like CHRN, PCDH, and HCN families were not connected to existing antipsychotics but may be suitable targets for novel drugs or drug repurposing opportunities to treat other aspects of schizophrenia, such as cognitive or negative symptoms.
The network medicine approach provides a platform to collate information of disease genetics and drug-gene interactions to shift focus from development of antipsychotics to multitarget antischizophrenia drugs. This approach is transferable to other diseases.
Abstract
Evaluating gene by environment (G × E) interaction under an additive risk model (i.e., additive interaction) has gained wider attention. Recently, statistical tests have been proposed for ...detecting additive interaction, utilizing an assumption on gene-environment (G-E) independence to boost power, that do not rely on restrictive genetic models such as dominant or recessive models. However, a major limitation of these methods is a sharp increase in type I error when this assumption is violated. Our goal was to develop a robust test for additive G × E interaction under the trend effect of genotype, applying an empirical Bayes-type shrinkage estimator of the relative excess risk due to interaction. The proposed method uses a set of constraints to impose the trend effect of genotype and builds an estimator that data-adaptively shrinks an estimator of relative excess risk due to interaction obtained under a general model for G-E dependence using a retrospective likelihood framework. Numerical study under varying levels of departures from G-E independence shows that the proposed method is robust against the violation of the independence assumption while providing an adequate balance between bias and efficiency compared with existing methods. We applied the proposed method to the genetic data of Alzheimer disease and lung cancer.
Abstract
Motivation
Multiple marker analysis of the genome-wide association study (GWAS) data has gained ample attention in recent years. However, because of the ultra high-dimensionality of GWAS ...data, such analysis is challenging. Frequently used penalized regression methods often lead to large number of false positives, whereas Bayesian methods are computationally very expensive. Motivated to ameliorate these issues simultaneously, we consider the novel approach of using non-local priors in an iterative variable selection framework.
Results
We develop a variable selection method, named, iterative non-local prior based selection for GWAS, or GWASinlps, that combines, in an iterative variable selection framework, the computational efficiency of the screen-and-select approach based on some association learning and the parsimonious uncertainty quantification provided by the use of non-local priors. The hallmark of our method is the introduction of ‘structured screen-and-select’ strategy, that considers hierarchical screening, which is not only based on response-predictor associations, but also based on response-response associations and concatenates variable selection within that hierarchy. Extensive simulation studies with single nucleotide polymorphisms having realistic linkage disequilibrium structures demonstrate the advantages of our computationally efficient method compared to several frequentist and Bayesian variable selection methods, in terms of true positive rate, false discovery rate, mean squared error and effect size estimation error. Further, we provide empirical power analysis useful for study design. Finally, a real GWAS data application was considered with human height as phenotype.
Availability and implementation
An R-package for implementing the GWASinlps method is available at https://cran.r-project.org/web/packages/GWASinlps/index.html.
Supplementary information
Supplementary data are available at Bioinformatics online.
We propose an iterative variable selection scheme for high-dimensional data with binary outcomes. The scheme adopts a structured screen-and-select framework and uses non-local prior-based Bayesian ...model selection within the same. The structured screening is based on the association of the independent variables with the outcome which is measured in terms of the maximum marginal likelihood estimator. Performance comparison with several well-known methods in terms of true positive rate and false discovery rate shows that our proposed method stands to be a competitive alternative for sparse high-dimensional variable selection with binary outcomes. The method has been implemented within the R package GWASinlps.
Abstract
Background
With advancing therapeutics, lung cancer (LC) survivors are rapidly increasing in number. Although mounting evidence suggests LC survivors have high risk of second primary lung ...cancer (SPLC), there is no validated prediction model available for clinical use to identify high-risk LC survivors for SPLC.
Methods
Using data from 6325 ever-smokers in the Multiethnic Cohort (MEC) study diagnosed with initial primary lung cancer (IPLC) in 1993-2017, we developed a prediction model for 10-year SPLC risk after IPLC diagnosis using cause-specific Cox regression. We evaluated the model’s clinical utility using decision curve analysis and externally validated it using 2 population-based data—Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial (PLCO) and National Lung Screening Trial (NLST)—that included 2963 and 2844 IPLC (101 and 93 SPLC cases), respectively.
Results
Over 14 063 person-years, 145 (2.3%) ever-smoking IPLC patients developed SPLC in MEC. Our prediction model demonstrated a high predictive accuracy (Brier score = 2.9, 95% confidence interval CI = 2.4 to 3.3) and discrimination (area under the receiver operating characteristics AUC = 81.9%, 95% CI = 78.2% to 85.5%) based on bootstrap validation in MEC. Stratification by the estimated risk quartiles showed that the observed SPLC incidence was statistically significantly higher in the 4th vs 1st quartile (9.5% vs 0.2%; P < .001). Decision curve analysis indicated that in a wide range of 10-year risk thresholds from 1% to 20%, the model yielded a larger net-benefit vs hypothetical all-screening or no-screening scenarios. External validation using PLCO and NLST showed an AUC of 78.8% (95% CI = 74.6% to 82.9%) and 72.7% (95% CI = 67.7% to 77.7%), respectively.
Conclusions
We developed and validated a SPLC prediction model based on large population-based cohorts. The proposed prediction model can help identify high-risk LC patients for SPLC and can be incorporated into clinical decision making for SPLC surveillance and screening.