The predictive utility of polygenic scores is increasing, and many polygenic scoring methods are available, but it is unclear which method performs best. This study evaluates the predictive utility ...of polygenic scoring methods within a reference-standardized framework, which uses a common set of variants and reference-based estimates of linkage disequilibrium and allele frequencies to construct scores. Eight polygenic score methods were tested: p-value thresholding and clumping (pT+clump), SBLUP, lassosum, LDpred1, LDpred2, PRScs, DBSLMM and SBayesR, evaluating their performance to predict outcomes in UK Biobank and the Twins Early Development Study (TEDS). Strategies to identify optimal p-value thresholds and shrinkage parameters were compared, including 10-fold cross validation, pseudovalidation and infinitesimal models (with no validation sample), and multi-polygenic score elastic net models. LDpred2, lassosum and PRScs performed strongly using 10-fold cross-validation to identify the most predictive p-value threshold or shrinkage parameter, giving a relative improvement of 16-18% over pT+clump in the correlation between observed and predicted outcome values. Using pseudovalidation, the best methods were PRScs, DBSLMM and SBayesR. PRScs pseudovalidation was only 3% worse than the best polygenic score identified by 10-fold cross validation. Elastic net models containing polygenic scores based on a range of parameters consistently improved prediction over any single polygenic score. Within a reference-standardized framework, the best polygenic prediction was achieved using LDpred2, lassosum and PRScs, modeling multiple polygenic scores derived using multiple parameters. This study will help researchers performing polygenic score studies to select the most powerful and predictive analysis methods.
Quantifying the microstructural properties of the human brain's connections is necessary for understanding normal ageing and disease. Here we examine brain white matter magnetic resonance imaging ...(MRI) data in 3,513 generally healthy people aged 44.64-77.12 years from the UK Biobank. Using conventional water diffusion measures and newer, rarely studied indices from neurite orientation dispersion and density imaging, we document large age associations with white matter microstructure. Mean diffusivity is the most age-sensitive measure, with negative age associations strongest in the thalamic radiation and association fibres. White matter microstructure across brain tracts becomes increasingly correlated in older age. This may reflect an age-related aggregation of systemic detrimental effects. We report several other novel results, including age associations with hemisphere and sex, and comparative volumetric MRI analyses. Results from this unusually large, single-scanner sample provide one of the most extensive characterizations of age associations with major white matter tracts in the human brain.
Neuroticism is a relatively stable personality trait characterized by negative emotionality (for example, worry and guilt)
; heritability estimated from twin studies ranges from 30 to 50%
, and ...SNP-based heritability ranges from 6 to 15%
. Increased neuroticism is associated with poorer mental and physical health
, translating to high economic burden
. Genome-wide association studies (GWAS) of neuroticism have identified up to 11 associated genetic loci
. Here we report 116 significant independent loci from a GWAS of neuroticism in 329,821 UK Biobank participants; 15 of these loci replicated at P < 0.00045 in an unrelated cohort (N = 122,867). Genetic signals were enriched in neuronal genesis and differentiation pathways, and substantial genetic correlations were found between neuroticism and depressive symptoms (r
= 0.82, standard error (s.e.) = 0.03), major depressive disorder (MDD; r
= 0.69, s.e. = 0.07) and subjective well-being (r
= -0.68, s.e. = 0.03) alongside other mental health traits. These discoveries significantly advance understanding of neuroticism and its association with MDD.
Male pattern baldness can have substantial psychosocial effects, and it has been phenotypically linked to adverse health outcomes such as prostate cancer and cardiovascular disease. We explored the ...genetic architecture of the trait using data from over 52,000 male participants of UK Biobank, aged 40-69 years. We identified over 250 independent genetic loci associated with severe hair loss (P<5x10-8). By splitting the cohort into a discovery sample of 40,000 and target sample of 12,000, we developed a prediction algorithm based entirely on common genetic variants that discriminated (AUC = 0.78, sensitivity = 0.74, specificity = 0.69, PPV = 59%, NPV = 82%) those with no hair loss from those with severe hair loss. The results of this study might help identify those at greatest risk of hair loss, and also potential genetic targets for intervention.
Alzheimer's disease (AD) is a public health priority for the 21st century. Risk reduction currently revolves around lifestyle changes with much research trying to elucidate the biological ...underpinnings. We show that self-report of parental history of Alzheimer's dementia for case ascertainment in a genome-wide association study of 314,278 participants from UK Biobank (27,696 maternal cases, 14,338 paternal cases) is a valid proxy for an AD genetic study. After meta-analysing with published consortium data (n = 74,046 with 25,580 cases across the discovery and replication analyses), three new AD-associated loci (P < 5 × 10
) are identified. These contain genes relevant for AD and neurodegeneration: ADAM10, BCKDK/KAT8 and ACE. Novel gene-based loci include drug targets such as VKORC1 (warfarin dose). We report evidence that the association of SNPs in the TOMM40 gene with AD is potentially mediated by both gene expression and DNA methylation in the prefrontal cortex. However, it is likely that multiple variants are affecting the trait and gene methylation/expression. Our discovered loci may help to elucidate the biological mechanisms underlying AD and, as they contain genes that are drug targets for other diseases and disorders, warrant further exploration for potential precision medicine applications.
Treatment-resistant depression (TRD) is a major contributor to the disability caused by major depressive disorder (MDD). Primary care electronic health records provide an easily accessible approach ...to investigate TRD clinical and genetic characteristics. MDD defined from primary care records in UK Biobank (UKB) and EXCEED studies was compared with other measures of depression and tested for association with MDD polygenic risk score (PRS). Using prescribing records, TRD was defined from at least two switches between antidepressant drugs, each prescribed for at least 6 weeks. Clinical-demographic characteristics, SNP-based heritability (h
) and genetic overlap with psychiatric and non-psychiatric traits were compared in TRD and non-TRD MDD cases. In 230,096 and 8926 UKB and EXCEED participants with primary care data, respectively, the prevalence of MDD was 8.7% and 14.2%, of which 13.2% and 13.5% was TRD, respectively. In both cohorts, MDD defined from primary care records was strongly associated with MDD PRS, and in UKB it showed overlap of 71-88% with other MDD definitions. In UKB, TRD vs healthy controls and non-TRD vs healthy controls h
was comparable (0.25 SE = 0.04 and 0.19 SE = 0.02, respectively). TRD vs non-TRD was positively associated with the PRS of attention deficit hyperactivity disorder, with lower socio-economic status, obesity, higher neuroticism and other unfavourable clinical characteristics. This study demonstrated that MDD and TRD can be reliably defined using primary care records and provides the first large scale population assessment of the genetic, clinical and demographic characteristics of TRD.
Identifying causal risk factors for self-harm is essential to inform preventive interventions. Epidemiological studies have identified risk factors associated with self-harm, but these associations ...can be subject to confounding. By implementing genetically informed methods to better account for confounding, this study aimed to better identify plausible causal risk factors for self-harm.
Using summary statistics from 24 genome-wide association studies (GWASs) comprising 16,067 to 322,154 individuals, polygenic scores (PSs) were generated to index 24 possible individual risk factors for self-harm (i.e., mental health vulnerabilities, substance use, cognitive traits, personality traits, and physical traits) among a subset of UK Biobank participants (N = 125,925, 56.2% female) who completed an online mental health questionnaire in the period from 13 July 2016 to 27 July 2017. In total, 5,520 (4.4%) of these participants reported having self-harmed in their lifetime. In binomial regression models, PSs indexing 6 risk factors (major depressive disorder MDD, attention deficit/hyperactivity disorder ADHD, bipolar disorder, schizophrenia, alcohol dependence disorder, and lifetime cannabis use) predicted self-harm, with effect sizes ranging from odds ratio (OR) = 1.05 (95% CI 1.02 to 1.07, q = 0.008) for lifetime cannabis use to OR = 1.20 (95% CI 1.16 to 1.23, q = 1.33 × 10-35) for MDD. No systematic differences emerged between suicidal and non-suicidal self-harm. To further probe causal relationships, two-sample Mendelian randomisation (MR) analyses were conducted, with MDD, ADHD, and schizophrenia emerging as the most plausible causal risk factors for self-harm. The genetic liabilities for MDD and schizophrenia were associated with self-harm independently of diagnosis and medication. Main limitations include the lack of representativeness of the UK Biobank sample, that self-harm was self-reported, and the limited power of some of the included GWASs, potentially leading to possible type II error.
In addition to confirming the role of MDD, we demonstrate that ADHD and schizophrenia likely play a role in the aetiology of self-harm using multivariate genetic designs for causal inference. Among the many individual risk factors we simultaneously considered, our findings suggest that systematic detection and treatment of core psychiatric symptoms, including psychotic and impulsivity symptoms, may be beneficial among people at risk for self-harm.
Individuals with lower socio-economic status (SES) are at increased risk of physical and mental illnesses and tend to die at an earlier age 1–3. Explanations for the association between SES and ...health typically focus on factors that are environmental in origin 4. However, common SNPs have been found collectively to explain around 18% of the phenotypic variance of an area-based social deprivation measure of SES 5. Molecular genetic studies have also shown that common physical and psychiatric diseases are partly heritable 6. It is possible that phenotypic associations between SES and health arise partly due to a shared genetic etiology. We conducted a genome-wide association study (GWAS) on social deprivation and on household income using 112,151 participants of UK Biobank. We find that common SNPs explain 21% of the variation in social deprivation and 11% of household income. Two independent loci attained genome-wide significance for household income, with the most significant SNP in each of these loci being rs187848990 on chromosome 2 and rs8100891 on chromosome 19. Genes in the regions of these SNPs have been associated with intellectual disabilities, schizophrenia, and synaptic plasticity. Extensive genetic correlations were found between both measures of SES and illnesses, anthropometric variables, psychiatric disorders, and cognitive ability. These findings suggest that some SNPs associated with SES are involved in the brain and central nervous system. The genetic associations with SES obviously do not reflect direct causal effects and are probably mediated via other partly heritable variables, including cognitive ability, personality, and health.
•Common SNPs explain 21% of social deprivation and 11% of household income•Two loci attained genome-wide significance for household income•Genes in these loci have been linked to synaptic plasticity•Genetic correlations were found between both measures of SES and many other traits
Individuals with lower socio-economic status (SES) are at increased risk of physical and mental illnesses. Hill et al. find extensive genetic correlations between SES and health, psychiatric, and cognitive traits. This suggests that the link between SES and health is driven, in part, by a shared genetic association.
•Living with a spouse or partner was associated with reduced odds of lifetime depression.•No evidence of an association between having two children and lifetime depression.•Having three or more ...children was associated with higher odds of lifetime depression.•Associations were similar across age groups, the sexes, neighbourhood deprivation and genetic predisposition.•Mendelian randomisation analyses indicate a causal effect of number of children on depression.
We examined associations between family status (living with a spouse or partner and number of children) and lifetime depression.
We used data from the UK Biobank, a large prospective study of middle-aged and older adults. Lifetime depression was assessed as part of a follow-up mental health questionnaire. Logistic regression was used to estimate associations between family status and depression. We included extensive adjustment for social, demographic and other potential confounders, including depression polygenic risk scores.
52,078 participants (mean age = 63.6, SD = 7.6; 52% female) were included in our analyses. Living with a spouse or partner was associated with substantially lower odds of lifetime depression (OR = 0.67, 95% CI 0.62-0.74). Compared to individuals without children, we found higher odds of lifetime depression for parents of one child (OR = 1.17, 95% CI 1.07-1.27) and parents of three (OR = 1.11, 95% CI 1.03-1.20) or four or more children (OR = 1.27, 95% CI 1.14-1.42). Amongst those not cohabiting, having any number of children was associated with higher odds of lifetime depression. Our results were consistent across age groups, the sexes, neighbourhood deprivation and genetic risk for depression. Exploratory Mendelian randomisation analyses suggested a causal effect of number of children on lifetime depression.
Our data did not allow distinguishing between non-marital and marital cohabitation. Results may not generalise to all ages or populations.
Living with a spouse or partner was strongly associated with reduced odds of depression. Having one or three or more children was associated with increased odds of depression, especially in individuals not living with a spouse or partner.
The authors investigated the pathways (genetic, environmental, lifestyle, medical) leading to inflammation in major depressive disorder using C-reactive protein (CRP), genetic, and phenotypic data ...from the UK Biobank.
This was a case-control study of 26,894 participants with a lifetime diagnosis of major depressive disorder from the Composite International Diagnostic Interview and 59,001 control subjects who reported no mental disorder and had not reported taking any antidepressant medication. Linear regression models of log CRP level were fitted to regress out the effects of age, sex, body mass index (BMI), and smoking and to test whether the polygenic risk score (PRS) for major depression was associated with log CRP level and whether the association between log CRP level and major depression remained after adjusting for early-life trauma, socioeconomic status, and self-reported health status.
CRP levels were significantly higher in patients with depression relative to control subjects (2.4 mg/L compared with 2.1 mg/L, respectively), and more case than control subjects had CRP levels >3 mg/L (21.2% compared with 16.8%, respectively), indicating low-grade inflammation. The PRS for depression was positively and significantly associated with log CRP levels, but this association was no longer significant after adjustment for BMI and smoking. The association between depression and increased log CRP level was substantially reduced, but still remained significant, after adjustment for the aforementioned clinical and sociodemographic factors.
The data indicate that the "genetic" contribution to increased inflammation in depression is due to regulation of eating and smoking habits rather than an "autoimmune" genetic predisposition. Moreover, the association between depression and increased inflammation even after full adjustment indicates either the presence of yet unknown or unmeasured psychosocial and clinical confounding factors or that a core biological association between depression and increased inflammation exists independently from confounders.