The genetic correlation describes the genetic relationship between two traits and can contribute to a better understanding of the shared biological pathways and/or the causality relationships between ...them. The rarity of large family cohorts with recorded instances of two traits, particularly disease traits, has made it difficult to estimate genetic correlations using traditional epidemiological approaches. However, advances in genomic methodologies, such as genome-wide association studies, and widespread sharing of data now allow genetic correlations to be estimated for virtually any trait pair. Here, we review the definition, estimation, interpretation and uses of genetic correlations, with a focus on applications to human disease.
Psychiatric disorders are highly genetically correlated, but little research has been conducted on the genetic differences between disorders. We developed a new method (case-case genome-wide ...association study; CC-GWAS) to test for differences in allele frequency between cases of two disorders using summary statistics from the respective case-control GWAS, transcending current methods that require individual-level data. Simulations and analytical computations confirm that CC-GWAS is well powered with effective control of type I error. We applied CC-GWAS to publicly available summary statistics for schizophrenia, bipolar disorder, major depressive disorder and five other psychiatric disorders. CC-GWAS identified 196 independent case-case loci, including 72 CC-GWAS-specific loci that were not significant at the genome-wide level in the input case-control summary statistics; two of the CC-GWAS-specific loci implicate the genes KLF6 and KLF16 (from the Krüppel-like family of transcription factors), which have been linked to neurite outgrowth and axon regeneration. CC-GWAS loci replicated convincingly in applications to datasets with independent replication data.
Polygenic risk scores suffer reduced accuracy in non-European populations, exacerbating health disparities. We propose PolyPred, a method that improves cross-population polygenic risk scores by ...combining two predictors: a new predictor that leverages functionally informed fine-mapping to estimate causal effects (instead of tagging effects), addressing linkage disequilibrium differences, and BOLT-LMM, a published predictor. When a large training sample is available in the non-European target population, we propose PolyPred
, which further incorporates the non-European training data. We applied PolyPred to 49 diseases/traits in four UK Biobank populations using UK Biobank British training data, and observed relative improvements versus BOLT-LMM ranging from +7% in south Asians to +32% in Africans, consistent with simulations. We applied PolyPred
to 23 diseases/traits in UK Biobank east Asians using both UK Biobank British and Biobank Japan training data, and observed improvements of +24% versus BOLT-LMM and +12% versus PolyPred. Summary statistics-based analogs of PolyPred and PolyPred
attained similar improvements.
The association between major depressive disorder (MDD) and obesity may stem from shared immunometabolic mechanisms particularly evident in MDD with atypical features, characterized by increased ...appetite and/or weight (A/W) during an active episode.
To determine whether subgroups of patients with MDD stratified according to the A/W criterion had a different degree of genetic overlap with obesity-related traits (body mass index BMI and levels of C-reactive protein CRP and leptin).
This multicenter study assembled genome-wide genotypic and phenotypic measures from 14 data sets of the Psychiatric Genomics Consortium. Data sets were drawn from case-control, cohort, and population-based studies, including 26 628 participants with established psychiatric diagnoses and genome-wide genotype data. Data on BMI were available for 15 237 participants. Data were retrieved and analyzed from September 28, 2015, through May 20, 2017.
Lifetime DSM-IV MDD was diagnosed using structured diagnostic instruments. Patients with MDD were stratified into subgroups according to change in the DSM-IV A/W symptoms as decreased or increased.
Data included 11 837 participants with MDD and 14 791 control individuals, for a total of 26 628 participants (59.1% female and 40.9% male). Among participants with MDD, 5347 (45.2%) were classified in the decreased A/W and 1871 (15.8%) in the increased A/W subgroups. Common genetic variants explained approximately 10% of the heritability in the 2 subgroups. The increased A/W subgroup showed a strong and positive genetic correlation (SE) with BMI (0.53 0.15; P = 6.3 × 10-4), whereas the decreased A/W subgroup showed an inverse correlation (-0.28 0.14; P = .06). Furthermore, the decreased A/W subgroup had a higher polygenic risk for increased BMI (odds ratio OR, 1.18; 95% CI, 1.12-1.25; P = 1.6 × 10-10) and levels of CRP (OR, 1.08; 95% CI, 1.02-1.13; P = 7.3 × 10-3) and leptin (OR, 1.09; 95% CI, 1.06-1.12; P = 1.7 × 10-3).
The phenotypic associations between atypical depressive symptoms and obesity-related traits may arise from shared pathophysiologic mechanisms in patients with MDD. Development of treatments effectively targeting immunometabolic dysregulations may benefit patients with depression and obesity, both syndromes with important disability.
Research on gene × environment interaction in major depressive disorder (MDD) has thus far primarily focused on candidate genes, although genetic effects are known to be polygenic.
To test whether ...the effect of polygenic risk scores on MDD is moderated by childhood trauma.
The study sample consisted of 1645 participants with a DSM-IV diagnosis of MDD and 340 screened controls from The Netherlands. Chronic or remitted episodes (severe MDD) were present in 956 participants. The occurrence of childhood trauma was assessed with the Childhood Trauma Interview and the polygenic risk scores were based on genome-wide meta-analysis results from the Psychiatric Genomics Consortium.
The polygenic risk scores and childhood trauma independently affected MDD risk, and evidence was found for interaction as departure from both multiplicativity and additivity, indicating that the effect of polygenic risk scores on depression is increased in the presence of childhood trauma. The interaction effects were similar in predicting all MDD risk and severe MDD risk, and explained a proportion of variation in MDD risk comparable to the polygenic risk scores themselves.
The interaction effect found between polygenic risk scores and childhood trauma implies that (1) studies on direct genetic effect on MDD gain power by focusing on individuals exposed to childhood trauma, and that (2) individuals with both high polygenic risk scores and exposure to childhood trauma are particularly at risk for developing MDD.
The ability to predict the disease course of individuals with major depressive disorder (MDD) is essential for optimal treatment planning. Here, we used a data-driven machine learning approach to ...assess the predictive value of different sets of biological data (whole-blood proteomics, lipid metabolomics, transcriptomics, genetics), both separately and added to clinical baseline variables, for the longitudinal prediction of 2-year remission status in MDD at the individual-subject level.
Prediction models were trained and cross-validated in a sample of 643 patients with current MDD (2-year remission n = 325) and subsequently tested for performance in 161 individuals with MDD (2-year remission n = 82).
Proteomics data showed the best unimodal data predictions (area under the receiver operating characteristic curve = 0.68). Adding proteomic to clinical data at baseline significantly improved 2-year MDD remission predictions (area under the receiver operating characteristic curve = 0.63 vs. 0.78, p = .013), while the addition of other omics data to clinical data did not yield significantly improved model performance. Feature importance and enrichment analysis revealed that proteomic analytes were involved in inflammatory response and lipid metabolism, with fibrinogen levels showing the highest variable importance, followed by symptom severity. Machine learning models outperformed psychiatrists' ability to predict 2-year remission status (balanced accuracy = 71% vs. 55%).
This study showed the added predictive value of combining proteomic data, but not other omics data, with clinical data for the prediction of 2-year remission status in MDD. Our results reveal a novel multimodal signature of 2-year MDD remission status that shows clinical potential for individual MDD disease course predictions from baseline measurements.
Predictors consistently associated with psychosis liability and course of illness in schizophrenia (SCZ) spectrum disorders (SSD), including the need for clozapine treatment, are lacking. ...Longitudinally ascertained medication use may empower studies examining associations between polygenic risk scores (PRSs) and pharmacotherapy choices.
To examine associations between PRS-SCZ loading and groups with different liabilities to SSD (individuals with SSD taking clozapine, individuals with SSD taking other antipsychotics, their parents and siblings, and unrelated healthy controls) and between PRS-SCZ and the likelihood of receiving a prescription of clozapine relative to other antipsychotics.
This genetic association study was a multicenter, observational cohort study with 6 years of follow-up. Included were individuals diagnosed with SSD who were taking clozapine or other antipsychotics, their parents and siblings, and unrelated healthy controls. Data were collected from 2004 until 2021 and analyzed between October 2021 and September 2022.
Polygenic risk scores for SCZ.
Multinomial logistic regression was used to examine possible differences between groups by computing risk ratios (RRs), ie, ratios of the probability of pertaining to a particular group divided by the probability of healthy control status. We also computed PRS-informed odd ratios (ORs) for clozapine use relative to other antipsychotics.
Polygenic risk scores for SCZ were generated for 2344 participants (mean SD age, 36.95 years 14.38; 994 female individuals 42.4%) who remained after quality control screening (557 individuals with SSD taking clozapine, 350 individuals with SSD taking other antipsychotics during the 6-year follow-up, 542 parents and 574 siblings of individuals with SSD, and 321 unrelated healthy controls). All RRs were significantly different from 1; RRs were highest for individuals with SSD taking clozapine (RR, 3.24; 95% CI, 2.76-3.81; P = 2.47 × 10-46), followed by individuals with SSD taking other antipsychotics (RR, 2.30; 95% CI, 1.95-2.72; P = 3.77 × 10-22), parents (RR, 1.44; 95% CI, 1.25-1.68; P = 1.76 × 10-6), and siblings (RR, 1.40; 95% CI, 1.21-1.63; P = 8.22 × 10-6). Polygenic risk scores for SCZ were positively associated with clozapine vs other antipsychotic use (OR, 1.41; 95% CI, 1.22-1.63; P = 2.98 × 10-6), suggesting a higher likelihood of clozapine prescriptions among individuals with higher PRS-SCZ.
In this study, PRS-SCZ loading differed between groups of individuals with SSD, their relatives, and unrelated healthy controls, with patients taking clozapine at the far end of PRS-SCZ loading. Additionally, PRS-SCZ was associated with a higher likelihood of clozapine prescribing. Our findings may inform early intervention and prognostic studies of the value of using PRS-SCZ to personalize antipsychotic treatment.
Major depressive disorder (MDD) is clinically heterogeneous with prevalence rates twice as high in women as in men. There are many possible sources of heterogeneity in MDD most of which are not ...measured in a sufficiently comparable way across study samples. Here, we assess genetic heterogeneity based on two fundamental measures, between‐cohort and between‐sex heterogeneity. First, we used genome‐wide association study (GWAS) summary statistics to investigate between‐cohort genetic heterogeneity using the 29 research cohorts of the Psychiatric Genomics Consortium (PGC; N cases = 16,823, N controls = 25,632) and found that some of the cohort heterogeneity can be attributed to ascertainment differences (such as recruitment of cases from hospital vs. community sources). Second, we evaluated between‐sex genetic heterogeneity using GWAS summary statistics from the PGC, Kaiser Permanente GERA, UK Biobank, and the Danish iPSYCH studies but did not find convincing evidence for genetic differences between the sexes. We conclude that there is no evidence that the heterogeneity between MDD data sets and between sexes reflects genetic heterogeneity. Larger sample sizes with detailed phenotypic records and genomic data remain the key to overcome heterogeneity inherent in assessment of MDD.
Genome-wide association studies (GWASs) are an optimal design for discovery of disease risk loci for diseases whose underlying genetic architecture includes many common causal loci of small effect (a ...polygenic architecture). We consider two designs that deserve careful consideration if the true underlying genetic architecture of the trait is polygenic: parent-offspring trios and unscreened control subjects. We assess these designs in terms of quantification of the total contribution of genome-wide genetic markers to disease risk (SNP heritability) and power to detect an associated risk allele. First, we show that trio designs should be avoided when: (1) the disease has a lifetime risk > 1%; (2) trio probands are ascertained from families with more than one affected sibling under which scenario the SNP heritability can drop by more than 50% and power can drop as much as from 0.9 to 0.15 for a sample of 20,000 subjects; or (3) assortative mating occurs (spouse correlation of the underlying liability to the disorder), which decreases the SNP heritability but not the power to detect a single locus in the trio design. Some studies use unscreened rather than screened control subjects because these can be easier to collect; we show that the estimated SNP heritability should then be scaled by dividing by (1 − K × u)2 for disorders with population prevalence K and proportion of unscreened control subjects u. When omitting to scale appropriately, the SNP heritability of, for example, major depressive disorder (K = 0.15) would be underestimated by 28% when none of the control subjects are screened.