Affinity-based proteomics has enabled scalable quantification of thousands of protein targets in blood, enhancing biomarker discovery, understanding of disease mechanisms, and genetic evaluation of drug targets in humans through protein quantitative trait loci (pQTLs). Here, we integrate two partly complementary techniques, the aptamer-based SomaScan v4 assay and the antibody-based Olink assays, to systematically assess phenotypic consequences of hundreds of pQTLs discovered for 871 protein targets across both platforms. We create a genetically anchored cross-platform proteome-phenome network comprising 547 protein-phenotype connections, 36.3% of which were only seen with one of the two platforms, suggesting that both techniques capture distinct aspects of protein biology. We further highlight discordance of genetically predicted effect directions between assays, such as for PILRA and Alzheimer's disease. Our results showcase the synergistic nature of these technologies to better understand and identify disease mechanisms and provide a benchmark for future cross-platform discoveries.
Mapping the proteo-genomic convergence of human diseases
Pietzner, Maik; Wheeler, Eleanor; Carrasco-Zanini, Julia ...
Science (American Association for the Advancement of Science), 2021-11-12, Volume 374, Issue 6569
Journal Article
Peer reviewed
Open access
Characterization of the genetic regulation of proteins is essential for understanding disease etiology and developing therapies. We identified 10,674 genetic associations for 3892 plasma proteins to create a cis-anchored gene-protein-disease map of 1859 connections that highlights strong cross-disease biological convergence. This proteo-genomic map provides a framework to connect etiologically related diseases, to provide biological context for new or emerging disorders, and to integrate different biological domains to establish mechanisms for known gene-disease links. Our results identify proteo-genomic connections within and between diseases and establish the value of cis-protein variants for annotation of likely causal disease genes at loci identified in genome-wide association studies, thereby addressing a major barrier to experimental validation and clinical translation of genetic discoveries.
The melanocortin 4 receptor (MC4R) is a G protein-coupled receptor whose disruption causes obesity. We functionally characterized 61 MC4R variants identified in 0.5 million people from UK Biobank and examined their associations with body mass index (BMI) and obesity-related cardiometabolic diseases. We found that the maximal efficacy of β-arrestin recruitment to MC4R, rather than canonical Gαs-mediated cyclic adenosine-monophosphate production, explained 88% of the variance in the association of MC4R variants with BMI. While most MC4R variants caused loss of function, a subset caused gain of function; these variants were associated with significantly lower BMI and lower odds of obesity, type 2 diabetes, and coronary artery disease. Protective associations were driven by MC4R variants exhibiting signaling bias toward β-arrestin recruitment and increased mitogen-activated protein kinase pathway activation. Harnessing β-arrestin-biased MC4R signaling may represent an effective strategy for weight loss and the treatment of obesity-related cardiometabolic diseases.
• 61 variants in the Melanocortin-4 Receptor gene were found in 0.5 million people
• Variants causing a gain of function were associated with protection from obesity
• Variants biased toward β-arrestin signaling mediated the protective effects
Gain-of-function genetic variants in the Melanocortin-4 Receptor associated with protection against obesity exhibit signaling bias for the recruitment of β-arrestin rather than canonical Gαs-mediated cAMP production.
The variation in weight within a shared environment is largely attributable to genetic factors. Whilst many genes/loci confer susceptibility to obesity, little is known about the genetic architecture of healthy thinness. Here, we characterise the heritability of thinness, which we found was comparable to that of severe obesity (h² = 28.07% vs 32.33%, respectively), although with incomplete genetic overlap (r = -0.49, 95% CI -0.17, -0.82, p = 0.003). In a genome-wide association analysis of thinness (n = 1,471) vs severe obesity (n = 1,456), we identified 10 loci previously associated with obesity and demonstrated enrichment for established BMI-associated loci (p_binomial = 3.05 × 10⁻⁵). Simulation analyses showed that different association results between the extremes were likely in agreement with additive effects across the BMI distribution, suggesting different effects on thinness and obesity could be due to their different degrees of extremeness. In further analyses, we detected a novel obesity and BMI-associated locus at PKHD1 (rs2784243, obese vs. thin p = 5.99 × 10⁻⁶, obese vs. controls p = 2.13 × 10⁻⁶, p_BMI = 2.3 × 10⁻¹³), associations at loci recently discovered with much larger sample sizes (e.g. FAM150B and PRDM6-CEP120), and novel variants driving associations at previously established signals (e.g. rs205262 at the SNRPC/C6orf106 locus and rs112446794 at the PRDM6-CEP120 locus). Our ability to replicate loci found with much larger sample sizes demonstrates the value of clinical extremes and suggests that characterisation of the genetics of thinness may provide a more nuanced understanding of the genetic architecture of body weight regulation and may inform the identification of potential anti-obesity targets.
Understanding the genetic architecture of host proteins interacting with SARS-CoV-2 or mediating the maladaptive host response to COVID-19 can help to identify new or repurpose existing drugs targeting those proteins. We present a genetic discovery study of 179 such host proteins among 10,708 individuals using an aptamer-based technique. We identify 220 host DNA sequence variants acting in cis (MAF 0.01-49.9%) and explaining 0.3-70.9% of the variance of 97 of these proteins, including 45 with no previously known protein quantitative trait loci (pQTL) and 38 encoding current drug targets. Systematic characterization of pQTLs across the phenome identified protein-drug-disease links and evidence that putative viral interaction partners such as MARK3 affect immune response. Our results accelerate the evaluation and prioritization of new drug development programmes and repurposing of trials to prevent, treat or reduce adverse outcomes. Rapid sharing and detailed interrogation of results is facilitated through an interactive webserver (https://omicscience.org/apps/covidpgwas/).
Higher cardiorespiratory fitness is associated with lower risk of type 2 diabetes. However, the causality of this relationship and the biological mechanisms that underlie it are unclear. Here, we examine genetic determinants of cardiorespiratory fitness in 450k European-ancestry individuals in UK Biobank, by leveraging the genetic overlap between fitness measured by an exercise test and resting heart rate. We identified 160 fitness-associated loci which we validated in an independent cohort, the Fenland study. Gene-based analyses prioritised candidate genes, such as CACNA1C, SCN10A, MYH11 and MYH6, that are enriched in biological processes related to cardiac muscle development and muscle contractility. In a Mendelian Randomisation framework, we demonstrate that higher genetically predicted fitness is causally associated with lower risk of type 2 diabetes independent of adiposity. Integration with proteomic data identified N-terminal pro B-type natriuretic peptide, hepatocyte growth factor-like protein and sex hormone-binding globulin as potential mediators of this relationship. Collectively, our findings provide insights into the biological mechanisms underpinning cardiorespiratory fitness and highlight the importance of improving fitness for diabetes prevention.
Despite the increasing global burden of neurological disorders, there is a lack of effective diagnostic and therapeutic biomarkers. Proteins are often dysregulated in disease and have a strong genetic component. Here, we carry out a protein quantitative trait locus analysis of 184 neurologically-relevant proteins, using whole genome sequencing data from two isolated population-based cohorts (N = 2893). In doing so, we elucidate the genetic landscape of the circulating proteome and its connection to neurological disorders. We detect 214 independently-associated variants for 107 proteins, the majority of which (76%) are cis-acting, including 114 variants that have not been previously identified. Using two-sample Mendelian randomisation, we identify causal associations between serum CD33 and Alzheimer's disease, GPNMB and Parkinson's disease, and MSR1 and schizophrenia, describing their clinical potential and highlighting drug repurposing opportunities.
Structural maintenance of chromosomes (SMC) complexes are essential for maintaining chromatin structure and regulating gene expression. Two of the three known SMC complexes, cohesin and condensin, are important for sister chromatid cohesion and condensation, respectively; however, the function of the third complex, SMC5-6, which includes the E3 SUMO-ligase NSMCE2 (also widely known as MMS21), is less clear. Here, we characterized 2 patients with primordial dwarfism, extreme insulin resistance, and gonadal failure and identified compound heterozygous frameshift mutations in NSMCE2. Both mutations reduced NSMCE2 expression in patient cells. Primary cells from one patient showed increased micronucleus and nucleoplasmic bridge formation, delayed recovery of DNA synthesis, and reduced formation of foci containing Bloom syndrome helicase (BLM) after hydroxyurea-induced replication fork stalling. These nuclear abnormalities in patient dermal fibroblasts were restored by expression of WT NSMCE2, but not a mutant form lacking SUMO-ligase activity. Furthermore, in zebrafish, knockdown of the NSMCE2 ortholog produced dwarfism, which was ameliorated by reexpression of WT, but not SUMO-ligase-deficient, NSMCE2. Collectively, these findings support a role for NSMCE2 in recovery from DNA damage and raise the possibility that loss of its function produces dwarfism through reduced tolerance of replicative stress.
Mosaic loss of chromosome Y (LOY) in leukocytes is the most common form of clonal mosaicism, caused by dysregulation in cell-cycle and DNA damage response pathways. Previous genetic studies have focussed on identifying common variants associated with LOY, which we now extend to rarer, protein-coding variation using exome sequences from 82,277 male UK Biobank participants. We find that loss of function of two genes, CHEK2 and GIGYF1, reaches exome-wide significance. Rare alleles in GIGYF1 have not previously been implicated in any complex trait, but here loss-of-function carriers exhibit six-fold higher susceptibility to LOY (OR = 5.99 [3.04-11.81], p = 1.3 × 10^(…)). These same alleles are also associated with adverse metabolic health, including higher susceptibility to Type 2 Diabetes (OR = 6.10 [3.51-10.61], p = 1.8 × 10^(…)), 4 kg higher fat mass (p = 1.3 × 10^(…)), 2.32 nmol/L lower serum IGF1 levels (p = 1.5 × 10^(…)) and 4.5 kg lower handgrip strength (p = 4.7 × 10^(…)), consistent with proposed GIGYF1 enhancement of insulin and IGF-1 receptor signalling. These associations are mirrored by a common variant nearby associated with the expression of GIGYF1. Our observations highlight a potential direct connection between clonal mosaicism and metabolic health.
Despite two years of intense global research activity, host genetic factors that predispose to a poorer prognosis of COVID-19 infection remain poorly understood. Here, we prioritise eight robust (e.g., ELF5) or suggestive but unreported (e.g., RAB2A) candidate protein mediators of COVID-19 outcomes by integrating results from the COVID-19 Host Genetics Initiative with population-based plasma proteomics using statistical colocalisation. The transcription factor ELF5 (ELF5) shows robust and directionally consistent associations across different outcome definitions, including a >4-fold higher risk (odds ratio: 4.88; 95%-CI: 2.47-9.63; p-value < 5.0 × 10^(…)) for severe COVID-19 per 1 s.d. higher genetically predicted plasma ELF5. We show that ELF5 is specifically expressed in epithelial cells of the respiratory system, such as secretory and alveolar type 2 cells, using single-cell RNA sequencing and immunohistochemistry. These cells are also likely targets of SARS-CoV-2 by colocalisation with key host factors, including ACE2 and TMPRSS2. In summary, large-scale human genetic studies together with gene expression at single-cell resolution highlight ELF5 as a risk gene for severe COVID-19, supporting a role of epithelial cells of the respiratory system in the adverse host response to SARS-CoV-2.
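As a rough illustration of how an odds ratio and its 95% confidence interval relate to a test statistic, the sketch below back-calculates a log-odds ratio, standard error, z-score and two-sided p-value from the figures quoted for ELF5 (odds ratio 4.88; 95% CI 2.47-9.63). It assumes the interval is a symmetric Wald interval on the log-odds scale; the abstract does not state how the interval or p-value were derived, so this is a plausibility check under that assumption, not the authors' method. The function name `wald_from_or_ci` is hypothetical.

```python
import math


def wald_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Back-calculate Wald statistics from an odds ratio and its 95% CI.

    Assumption (not from the source): the CI is exp(beta +/- z_crit * se),
    i.e. symmetric on the log-odds scale.
    """
    beta = math.log(odds_ratio)  # log-odds ratio
    # The CI width on the log scale equals 2 * z_crit * se
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = beta / se
    # Two-sided p-value from the standard normal distribution
    p = math.erfc(abs(z) / math.sqrt(2))
    return beta, se, z, p


# Figures quoted in the abstract for severe COVID-19 per 1 s.d. higher ELF5
beta, se, z, p = wald_from_or_ci(4.88, 2.47, 9.63)
print(f"log-OR={beta:.3f}  SE={se:.3f}  z={z:.2f}  p={p:.1e}")
```

The same function can be applied to any other odds ratio reported with a 95% interval, provided the symmetric log-scale assumption is reasonable for that estimate.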