Ageing phenotypes, such as years lived in good health (healthspan), total years lived (lifespan), and survival until an exceptional old age (longevity), are of interest to us all but require ...exceptionally large sample sizes to study genetically. Here we combine existing genome-wide association summary statistics for healthspan, parental lifespan, and longevity in a multivariate framework, increasing statistical power, and identify 10 genomic loci which influence all three phenotypes, of which five (near FOXO3, SLC4A7, LINC02513, ZW10, and FGD6) have not been reported previously at genome-wide significance. The majority of these 10 loci are associated with cardiovascular disease and some affect the expression of genes known to change their activity with age. In total, we implicate 78 genes, and find these to be enriched for ageing pathways previously highlighted in model organisms, such as the response to DNA damage, apoptosis, and homeostasis. Finally, we identify a pathway worthy of further study: haem metabolism.
The effect of genetic variation in the male-specific region of the Y chromosome (MSY) on coronary artery disease and cardiovascular risk factors has been disputed. In this study, we systematically ...assessed the association of MSY genetic variation on these traits using a kin-cohort analysis of family disease history in the largest sample to date.
We tested 90 MSY haplogroups against coronary artery disease, hypertension, blood pressure, classical lipid levels, and all-cause mortality in up to 152 186 unrelated, genomically British individuals from UK Biobank. Unlike previous studies, we did not adjust for heritable lifestyle factors (to avoid collider bias) and instead adjusted for geographic variables and socioeconomic deprivation, given the link between MSY haplogroups and geography. For family history traits, subject MSY haplogroups were tested against father and mother disease as validation and negative control, respectively.
Our models find little evidence for an effect of any MSY haplogroup on cardiovascular risk in participants. Parental models confirm these findings.
Kin-cohort analysis of the Y chromosome uniquely allows for discoveries in subjects to be validated using family history data. Despite our large sample size, improved models, and parental validation, there is little evidence to suggest cardiovascular risk in UK Biobank is influenced by genetic variation in MSY.
Background and Aims
Genome‐wide association studies (GWAS) have identified several risk loci for gallstone disease. As with most polygenic traits, it is likely that many genetic determinants are ...undiscovered. The aim of this study was to identify genetic variants that represent new targets for gallstone research and treatment.
Approach and Results
We performed a GWAS of 28,627 gallstone cases and 348,373 controls in the UK Biobank, replicated findings in a Scottish cohort (1089 cases, 5228 controls), and conducted a GWA meta‐analysis (43,639 cases, 506,798 controls) with the FinnGen cohort. We assessed pathway enrichment using gene‐based then gene‐set analysis and tissue expression of identified genes in Genotype‐Tissue Expression project data. We constructed a polygenic risk score (PRS) and evaluated phenotypic traits associated with the score. Seventy‐five risk loci were identified (p < 5 × 10−8), of which 46 were new. Pathway enrichment revealed associations with lipid homeostasis, glucuronidation, phospholipid metabolism, and gastrointestinal motility. Anoctamin 1 (ANO1) and transmembrane Protein 147 (TMEM147), both in novel, replicated loci, are expressed in the gallbladder and gastrointestinal tract. Both regulate gastrointestinal motility. The gallstone risk allele rs7599‐A leads to suppression of hepatic TMEM147 expression, suggesting that the protein protects against gallstone formation. The highest decile of the PRS demonstrated a 6‐fold increased odds of gallstones compared with the lowest decile. The PRS was strongly associated with increased body mass index, serum liver enzymes, and C‐reactive protein concentrations, and decreased lipoprotein cholesterol concentrations.
Conclusions
This GWAS demonstrates the polygenic nature of gallstone risk and identifies 46 novel susceptibility loci. We implicate genes influencing gastrointestinal motility in the pathogenesis of gallstones.
The use of omic modalities to dissect the molecular underpinnings of common diseases and traits is becoming increasingly common. But multi-omic traits can be genetically predicted, which enables ...highly cost-effective and powerful analyses for studies that do not have multi-omics
. Here we examine a large cohort (the INTERVAL study
; n = 50,000 participants) with extensive multi-omic data for plasma proteomics (SomaScan, n = 3,175; Olink, n = 4,822), plasma metabolomics (Metabolon HD4, n = 8,153), serum metabolomics (Nightingale, n = 37,359) and whole-blood Illumina RNA sequencing (n = 4,136), and use machine learning to train genetic scores for 17,227 molecular traits, including 10,521 that reach Bonferroni-adjusted significance. We evaluate the performance of genetic scores through external validation across cohorts of individuals of European, Asian and African American ancestries. In addition, we show the utility of these multi-omic genetic scores by quantifying the genetic control of biological pathways and by generating a synthetic multi-omic dataset of the UK Biobank
to identify disease associations using a phenome-wide scan. We highlight a series of biological insights with regard to genetic mechanisms in metabolism and canonical pathway associations with disease; for example, JAK-STAT signalling and coronary atherosclerosis. Finally, we develop a portal ( https://www.omicspred.org/ ) to facilitate public access to all genetic scores and validation results, as well as to serve as a platform for future extensions and enhancements of multi-omic genetic scores.
Abstract
The enormous mammal’s lifespan variation is the result of each species’ adaptations to their own biological trade-offs and ecological conditions. Comparative genomics have demonstrated that ...genomic factors underlying both, species lifespans and longevity of individuals, are in part shared across the tree of life. Here, we compared protein-coding regions across the mammalian phylogeny to detect individual amino acid (AA) changes shared by the most long-lived mammals and genes whose rates of protein evolution correlate with longevity. We discovered a total of 2,737 AA in 2,004 genes that distinguish long- and short-lived mammals, significantly more than expected by chance (P = 0.003). These genes belong to pathways involved in regulating lifespan, such as inflammatory response and hemostasis. Among them, a total 1,157 AA showed a significant association with maximum lifespan in a phylogenetic test. Interestingly, most of the detected AA positions do not vary in extant human populations (81.2%) or have allele frequencies below 1% (99.78%). Consequently, almost none of these putatively important variants could have been detected by genome-wide association studies. Additionally, we identified four more genes whose rate of protein evolution correlated with longevity in mammals. Crucially, SNPs located in the detected genes explain a larger fraction of human lifespan heritability than expected, successfully demonstrating for the first time that comparative genomics can be used to enhance interpretation of human genome-wide association studies. Finally, we show that the human longevity-associated proteins are significantly more stable than the orthologous proteins from short-lived mammals, strongly suggesting that general protein stability is linked to increased lifespan.
We propose a novel effective framework for the analysis of the shared genetic background for a set of genetically correlated traits using SNP-level GWAS summary statistics. This framework called ...SHAHER is based on the construction of a linear combination of traits by maximizing the proportion of its genetic variance explained by the shared genetic factors. SHAHER requires only full GWAS summary statistics and matrices of genetic and phenotypic correlations between traits as inputs. Our framework allows both shared and unshared genetic factors to be effectively analyzed. We tested our framework using simulation studies, compared it with previous developments, and assessed its performance using three real datasets: anthropometric traits, psychiatric conditions and lipid concentrations. SHAHER is versatile and applicable to summary statistics from GWASs with arbitrary sample sizes and sample overlaps, allows for the incorporation of different GWAS models (Cox, linear and logistic), and is computationally fast.
Biomarkers may be useful endophenotypes for genetic studies if they share genetic sources of variation with the outcome, for example, with all-cause mortality. Australian adult study participants who ...had reported their parental survival information were included in the study: 14,169 participants had polygenic risk scores (PRS) from genotyping and up to 13,365 had biomarker results. We assessed associations between participants’ biomarker results and parental survival, and between biomarker results and eight parental survival PRS at varying p-value cut-offs. Survival in parents was associated with participants’ serum bilirubin, C-reactive protein, HDL cholesterol, triglycerides and uric acid, and with LDL cholesterol for participants’ fathers but not for their mothers. PRS for all-cause mortality were associated with liver function tests (alkaline phosphatase, butyrylcholinesterase, gamma-glutamyl transferase), metabolic tests (LDL and HDL cholesterol, triglycerides, uric acid), and acute-phase reactants (C-reactive protein, globulins). Association between offspring biomarker results and parental survival demonstrates the existence of familial effects common to both, while associations between biomarker results and PRS for mortality favor at least a partial genetic cause of this covariation. Identification of genetic loci affecting mortality-associated biomarkers offers a route to the identification of additional loci affecting mortality.
Genome‐wide association studies (GWAS) have identified several risk loci for nonalcoholic fatty liver disease (NAFLD). Previous studies have largely relied on small sample sizes and have assessed ...quantitative traits. We performed a case‐control GWAS in the UK Biobank using recorded diagnosis of NAFLD based on diagnostic codes recommended in recent consensus guidelines. We performed a GWAS of 4,761 cases of NAFLD and 373,227 healthy controls without evidence of NAFLD. Sensitivity analyses were performed excluding other co‐existing hepatic pathology, adjusting for body mass index (BMI) and adjusting for alcohol intake. A total of 9,723,654 variants were assessed by logistic regression adjusted for age, sex, genetic principal components, and genotyping batch. We performed a GWAS meta‐analysis using available summary association statistics. Six risk loci were identified (P < 5*10−8) (apolipoprotein E APOE, patatin‐like phospholipase domain containing 3 PNPLA3, transmembrane 6 superfamily member 2 TM6SF2, glucokinase regulator GCKR, mitochondrial amidoxime reducing component 1 MARC1, and tribbles pseudokinase 1 TRIB1). All loci retained significance in sensitivity analyses without co‐existent hepatic pathology and after adjustment for BMI. PNPLA3 and TM6SF2 remained significant after adjustment for alcohol (alcohol intake was known in only 158,388 individuals), with others demonstrating consistent direction and magnitude of effect. All six loci were significant on meta‐analysis. Rs429358 (P = 2.17*10−11) is a missense variant within the APOE gene determining ϵ4 versus ϵ2/ϵ3 alleles. The ϵ4 allele of APOE offered protection against NAFLD (odds ratio for heterozygotes 0.84 95% confidence interval 0.78‐0.90 and homozygotes 0.64 0.50‐0.79). Conclusion: This GWAS replicates six known NAFLD‐susceptibility loci and confirms that the ϵ4 allele of APOE is associated with protection against NAFLD. The results are consistent with published GWAS using histological and radiological measures of NAFLD, confirming that NAFLD identified through diagnostic codes from consensus guidelines is a valid alternative to more invasive and costly approaches.
Mortality risk is known to be associated with many physiological or biochemical risk factors, and polygenic risk scores (PRSs) may offer an additional or alternative approach to risk stratification. ...We have compared the predictive value of common biochemical tests, PRSs and information on parental survival in a cohort of twins and their families. Common biochemical test results were available for up to 13,365 apparently healthy men and women, aged 17−93 years (mean 49.0, standard deviation SD 13.7) at blood collection. PRSs for longevity were available for 14,169 study participants and reported parental survival for 25,784 participants. A search for information on date and cause of death was conducted through the Australian National Death Index, with median follow-up of 11.3 years. Cox regression was used to evaluate associations with mortality from all causes, cancers, cardiovascular diseases and other causes. Linear relationships with all-cause mortality were strongest for C-reactive protein, gamma-glutamyl transferase, glucose and alkaline phosphatase, with hazard ratios (HRs) of 1.16 (95% CI 1.07, 1.24), 1.15 (95% CI 1.04–1.21), 1.13 (95% CI 1.08, 1.19) and 1.11 (95% CI 1.05, 1.88) per SD difference, respectively. Significant nonlinear effects were found for urea, uric acid and butyrylcholinesterase. Lipid risk factors were not statistically significant for mortality in our cohort. Family history and PRS showed weaker but significant associations with survival, with HR in the range 1.05 to 1.09 per SD difference. In conclusion, biochemical tests currently predict long-term mortality more strongly than genetic scores based on genotyping or on reported parental survival.
ObjectivesIdentify causes and future trends underpinning Scottish mortality improvements and quantify the relative contributions of disease incidence and survival.DesignPopulation-based ...study.SettingLinked secondary care and mortality records across Scotland.Participants1 967 130 individuals born between 1905 and 1965 and resident in Scotland from 2001 to 2016.Main outcome measuresHospital admission rates and survival within 5 years postadmission for 28 diseases, stratified by sex and socioeconomic status.Results‘Influenza and pneumonia’, ‘Symptoms and signs involving circulatory and respiratory systems’ and ‘Malignant neoplasm of respiratory and intrathoracic organs’ were the hospital diagnosis groupings associated with most excess deaths, being both common and linked to high postadmission mortality. Using disease trends, we modelled a mean mortality HR of 0.737 (95% CI 0.730 to 0.745) from one decade of birth to the next, equivalent to a life extension of ~3 years per decade. This improvement was 61% (30%–93%) accounted for by improved disease survival after hospitalisation (principally cancer) with the remainder accounted for by lowered hospitalisation incidence (principally heart disease and cancer). In contrast, deteriorations in infectious disease incidence and survival increased mortality by 9% (~3.3 months per decade). Disease-driven mortality improvements were slightly greater for men than women (due to greater falls in disease incidence), and generally similar across socioeconomic deciles. We project mortality improvements will continue over the next decade but slow by 21% because much progress in disease survival has already been achieved.ConclusionMorbidity improvements broadly explain observed mortality improvements, with progress on prevention and treatment of heart disease and cancer contributing the most. The male–female health gaps are closing, but those between socioeconomic groups are not. Slowing improvements in morbidity may explain recent stalling in improvements of UK period life expectancies. However, these could be offset if we accelerate improvements in the diseases accounting for most deaths and counteract recent deteriorations in infectious disease.