It is hoped that advances in our knowledge in disease genomics will contribute to personalized medicine such as individualized preventive strategies or early diagnoses of diseases. With the growth of ...genome-wide association studies (GWAS) in the past decade, how far have we reached this goal? In this study we explored the predictive ability of polygenic risk scores (PRSs) derived from GWAS for a range of complex disease and traits.
We first proposed a new approach to evaluate predictive performances of PRS at arbitrary P -value thresholds. The method was based on corrected estimates of effect sizes, accounting for possible false positives and selection bias. This approach requires no distributional assumptions and only requires summary statistics as input. The validity of the approach was verified in simulations. We explored the predictive power of PRS for ten complex traits, including type 2 diabetes (DM), coronary artery disease (CAD), triglycerides, high- and low-density lipoprotein, total cholesterol, schizophrenia (SCZ), bipolar disorder (BD), major depressive disorder and anxiety disorders. We found that the predictive ability of PRS for CAD and DM were modest (best AUC = 0.608 and 0.607) while for lipid traits the prediction R-squared ranged from 16.1 to 29.8%. For psychiatric disorders, the predictive power for SCZ was estimated to be the highest (best AUC 0.820), followed by BD. Predictive performance of other psychiatric disorders ranged from 0.543 to 0.585. Psychiatric traits tend to have more gradual rise in AUC when significance thresholds increase and achieve the best predictive power at higher P -values than cardiometabolic traits.
hcso@cuhk.edu.hk ; pcsham@hku.hk.
Supplementary data are available at Bioinformatics online.
Historically, association tests were limited to single variants, so that the allele was considered the basic unit for association testing. As marker density increases and indirect approaches are used ...to assess association through linkage disequilibrium, association is now frequently considered at the haplotypic level. We suggest that there are difficulties in replicating association findings at the single-nucleotide–polymorphism (SNP) or the haplotype level, and we propose a shift toward a gene-based approach in which all common variation within a candidate gene is considered jointly. Inconsistencies arising from population differences are more readily resolved by use of a gene-based approach rather than either a SNP-based or a haplotype-based approach. A gene-based approach captures all of the potential risk-conferring variations; thus, negative findings are subject only to the issue of power. In addition, chance findings due to multiple testing can be readily accounted for by use of a genewide-significance level. Meta-analysis procedures can be formalized for gene-based methods through the combination of
P values. It is only a matter of time before all variation within genes is mapped, at which point the gene-based approach will become the natural end point for association analysis and will inform our search for functional variants relevant to disease etiology.
Human neuropsychiatric disorders are associated with genetic and environmental factors affecting the brain, which has been subjected to strong evolutionary pressures resulting in an enlarged cerebral ...cortex and improved cognitive performance. Thus, genes involved in human brain evolution may also play a role in neuropsychiatric disorders. We test whether genes associated with 7 neuropsychiatric phenotypes are enriched in genomic regions that have experienced rapid changes in human evolution (HARs) and importantly, whether HAR status interacts with developmental brain expression to predict associated genes. We used the most recent publicly available GWAS and gene expression data to test for enrichment of HARs, brain expression, and their interaction. These revealed significant interactions between HAR status and whole-brain expression across developmental stages, indicating that the relationship between brain expression and association with schizophrenia and intelligence is stronger among HAR than non-HAR genes. Follow-up regional analyses indicated that predicted HAR-expression interaction effects may vary substantially across regions and developmental stages. Although depression indicated significant enrichment of HAR genes, little support was found for HAR enrichment among bipolar, autism, ADHD, or Alzheimer's associated genes. Our results indicate that intelligence, schizophrenia, and depression-associated genes are enriched for those involved in the evolution of the human brain. These findings highlight promising candidates for follow-up study and considerations for novel drug development, but also caution careful assessment of the translational ability of animal models for studying neuropsychiatric traits in the context of HARs, and the importance of using humanized animal models or human-derived tissues when researching these traits.
Cardiovascular diseases represent a major health issue in patients with schizophrenia (SCZ) and bipolar disorder (BD), but the exact nature of cardiometabolic (CM) abnormalities involved and the ...underlying mechanisms remain unclear. Psychiatric medications are known risk factors, but it is unclear whether there is a connection between the disorders (SCZ/BD) themselves and CM abnormalities.
Using polygenic risk scores and linkage disequilibrium score regression, we investigated the shared genetic bases of SCZ and BD with 28 CM traits. We performed Mendelian randomization (MR) to elucidate causal relationships between the two groups of disorders. The analysis was based on large-scale meta-analyses of genome-wide association studies. We also identified the potential shared genetic variants and inferred the pathways involved.
We found tentative polygenic associations of SCZ with glucose metabolism abnormalities, adverse adipokine profiles, increased waist-to-hip ratio and visceral adiposity (false discovery rate or FDR<0.05). However, there was an inverse association with body mass index. For BD, we observed several polygenic associations with favorable CM profiles at FDR<0.05. MR analysis showed that SCZ may be causally linked to raised triglyceride and that lower fasting glucose may be linked to BD. We also identified numerous single nucleotide polymorphisms and pathways shared between SCZ/BD with CM traits, some of which are related to inflammation or the immune system.
Our findings suggest that SCZ patients may be genetically predisposed to several CM abnormalities independent of medication side effects. On the other hand, CM abnormalities in BD may be more likely to be secondary. However, the findings require further validation.
Individuals affected with different neuropsychiatric disorders such as autism (AUT), schizophrenia (SCZ) and bipolar disorder (BPD), may share similar clinical manifestations, suggesting shared ...genetic influences and common biological mechanisms underlying these disorders. Using brain transcriptome data gathered from postmortem donors affected with AUT, SCZ and BPD, it is now possible to identify shared dysregulated gene sets, i.e., those abnormally expressed in brains of neuropsychiatric patients, compared to non-psychiatric controls. Here, we apply a novel aberrant gene expression analysis method, coupled with consensus co-expression network analysis, to identify gene sets with shared dysregulated expression in cortical brains of individuals affected with AUT, SCZ and BPD. We identify eight gene sets with dysregulated expression shared by AUT, SCZ and BPD, 23 by AUT and SCZ, four by AUT and BPD, and two by SCZ and BPD. The identified genes are enriched with functions relevant to amino acid transport, synapse, neurotransmitter release, oxidative stress, nitric oxide synthase biosynthesis, immune response, protein folding, lysophosphatidic acid-mediated signaling and glycolysis. Our method has been proven to be effective in discovering and revealing multigene sets with dysregulated expression shared by different neuropsychiatric disorders. Our findings provide new insights into the common molecular mechanisms underlying the pathogenesis and progression of AUT, SCZ and BPD, contributing to the study of etiological overlap between these neuropsychiatric disorders.
Polygenic risk scores (PRS) from genome-wide association studies (GWAS) are increasingly used to predict disease risks. However some included variants could be false positives and the raw estimates ...of effect sizes from them may be subject to selection bias. In addition, the standard PRS approach requires testing over a range of p-value thresholds, which are often chosen arbitrarily. The prediction error estimated from the optimized threshold may also be subject to an optimistic bias. To improve genomic risk prediction, we proposed new empirical Bayes approaches to recover the underlying effect sizes and used them as weights to construct PRS. We applied the new PRS to twelve cardio-metabolic traits in the Northern Finland Birth Cohort and demonstrated improvements in predictive power (in R
) when compared to standard PRS at the best p-value threshold. Importantly, for eleven out of the twelve traits studied, the predictive performance from the entire set of genome-wide markers outperformed the best R
from standard PRS at optimal p-value thresholds. Our proposed methodology essentially enables an automatic PRS weighting scheme without the need of choosing tuning parameters. The new method also performed satisfactorily in simulations. It is computationally simple and does not require assumptions on the effect size distributions.
The extended Simes’ test (known as GATES) and scaled chi-square test were proposed to combine a set of dependent genome-wide association signals at multiple single-nucleotide polymorphisms (SNPs) for ...assessing the overall significance of association at the gene or pathway levels. The two tests use different strategies to combine association p values and can outperform each other when the number of and linkage disequilibrium between SNPs vary. In this paper, we introduce a hybrid set-based test (HYST) combining the two tests for genome-wide association studies (GWASs). We describe how HYST can be used to evaluate statistical significance for association at the protein-protein interaction (PPI) level in order to increase power for detecting disease-susceptibility genes of moderate effect size. Computer simulations demonstrated that HYST had a reasonable type 1 error rate and was generally more powerful than its parents and other alternative tests to detect a PPI pair where both genes are associated with the disease of interest. We applied the method to three complex disease GWAS data sets in the public domain; the method detected a number of highly connected significant PPI pairs involving multiple confirmed disease-susceptibility genes not found in the SNP- and gene-based association analyses. These results indicate that HYST can be effectively used to examine a collection of predefined SNP sets based on prior biological knowledge for revealing additional disease-predisposing genes of modest effects in GWASs.
The etiology of depression remains poorly understood. Changes in blood lipid levels were reported to be associated with depression and suicide, however study findings were mixed.
We performed a ...two-sample Mendelian randomisation (MR) analysis to investigate the causal relationship between blood lipids and depression phenotypes, based on large-scale GWAS summary statistics (N = 188 577/480 359 for lipid/depression traits respectively). Five depression-related phenotypes were included, namely major depression (MD; from PGC), depressive symptoms (DS; from SSGAC), longest duration and number of episodes of low mood, and history of deliberate self-harm (DSH)/suicide (from UK Biobank). MR was conducted with inverse-variance weighted (MR-IVW), Egger and Generalised Summary-data-based MR (GSMR) methods.
There was consistent evidence that triglyceride (TG) is causally associated with DS (MR-IVW β for one-s.d. increase in TG = 0.0346, 95% CI 0.0114-0.0578), supported by MR-IVW and GSMR and multiple r2 clumping thresholds. We also observed relatively consistent associations of TG with DSH/suicide (MR-Egger OR = 2.514, CI 1.579-4.003). There was moderate evidence for positive associations of TG with MD and the number of episodes of low mood. For HDL-c, we observed moderate evidence for causal associations with DS and MD. LDL-c and TC did not show robust causal relationships with depression phenotypes, except for weak evidence that LDL-c is inversely related to DSH/suicide. We did not detect significant associations when depression phenotypes were treated as exposures.
This study provides evidence to a causal relationship between TG, and to a lesser extent, altered cholesterol levels with depression phenotypes. Further studies on its mechanistic basis and the effects of lipid-lowering therapies are warranted.
Abstract
Aims
In long QT syndrome (LQTS) patients, modifier genes modulate the arrhythmic risk associated with a disease-causing mutation. Their recognition can improve risk stratification and ...clinical management, but their discovery represents a challenge. We tested whether a cellular-driven approach could help to identify new modifier genes and especially their mechanism of action.
Methods and results
We generated human-induced pluripotent stem cell-derived cardiomyocytes (iPSC-CM) from two patients carrying the same KCNQ1-Y111C mutation, but presenting opposite clinical phenotypes. We showed that the phenotype of the iPSC-CMs derived from the symptomatic patient is due to impaired trafficking and increased degradation of the mutant KCNQ1 and wild-type human ether-a-go-go-related gene. In the iPSC-CMs of the asymptomatic (AS) patient, the activity of an E3 ubiquitin-protein ligase (Nedd4L) involved in channel protein degradation was reduced and resulted in a decreased arrhythmogenic substrate. Two single-nucleotide variants (SNVs) on the Myotubularin-related protein 4 (MTMR4) gene, an interactor of Nedd4L, were identified by whole-exome sequencing as potential contributors to decreased Nedd4L activity. Correction of these SNVs by CRISPR/Cas9 unmasked the LQTS phenotype in AS cells. Importantly, the same MTMR4 variants were present in 77% of AS Y111C mutation carriers of a separate cohort. Thus, genetically mediated interference with Nedd4L activation seems associated with protective effects.
Conclusion
Our finding represents the first demonstration of the cellular mechanism of action of a protective modifier gene in LQTS. It provides new clues for advanced risk stratification and paves the way for the design of new therapies targeting this specific molecular pathway.