Mendelian randomization (MR) is an epidemiological technique that uses genetic variants to distinguish correlation from causation in observational data. The reliability of a MR investigation depends ...on the validity of the genetic variants as instrumental variables (IVs). We develop the contamination mixture method, a method for MR with two modalities. First, it identifies groups of genetic variants with similar causal estimates, which may represent distinct mechanisms by which the risk factor influences the outcome. Second, it performs MR robustly and efficiently in the presence of invalid IVs. Compared to other robust methods, it has the lowest mean squared error across a range of realistic scenarios. The method identifies 11 variants associated with increased high-density lipoprotein-cholesterol, decreased triglyceride levels, and decreased coronary heart disease risk that have the same directions of associations with various blood cell traits, suggesting a shared mechanism linking lipids and coronary heart disease risk mediated via platelet aggregation.
Two inflammatory disorders, type 1 diabetes and celiac disease, cosegregate in populations, suggesting a common genetic origin. Since both diseases are associated with the HLA class II genes on ...chromosome 6p21, we tested whether non-HLA loci are shared.
We evaluated the association between type 1 diabetes and eight loci related to the risk of celiac disease by genotyping and statistical analyses of DNA samples from 8064 patients with type 1 diabetes, 9339 control subjects, and 2828 families providing 3064 parent-child trios (consisting of an affected child and both biologic parents). We also investigated 18 loci associated with type 1 diabetes in 2560 patients with celiac disease and 9339 control subjects.
Three celiac disease loci--RGS1 on chromosome 1q31, IL18RAP on chromosome 2q12, and TAGAP on chromosome 6q25--were associated with type 1 diabetes (P<1.00x10(-4)). The 32-bp insertion-deletion variant on chromosome 3p21 was newly identified as a type 1 diabetes locus (P=1.81x10(-8)) and was also associated with celiac disease, along with PTPN2 on chromosome 18p11 and CTLA4 on chromosome 2q33, bringing the total number of loci with evidence of a shared association to seven, including SH2B3 on chromosome 12q24. The effects of the IL18RAP and TAGAP alleles confer protection in type 1 diabetes and susceptibility in celiac disease. Loci with distinct effects in the two diseases included INS on chromosome 11p15, IL2RA on chromosome 10p15, and PTPN22 on chromosome 1p13 in type 1 diabetes and IL12A on 3q25 and LPP on 3q28 in celiac disease.
A genetic susceptibility to both type 1 diabetes and celiac disease shares common alleles. These data suggest that common biologic mechanisms, such as autoimmunity-related tissue damage and intolerance to dietary antigens, may be etiologic features of both diseases.
Cerebral small vessel disease is a major cause of stroke and dementia, but its genetic basis is incompletely understood. We perform a genetic study of three MRI markers of the disease in UK Biobank ...imaging data and other sources: white matter hyperintensities (N = 42,310), fractional anisotropy (N = 17,663) and mean diffusivity (N = 17,467). Our aim is to better understand the disease pathophysiology. Across the three traits, we identify 31 loci, of which 21 were previously unreported. We perform a transcriptome-wide association study to identify associations with gene expression in relevant tissues, identifying 66 associated genes across the three traits. This genetic study provides insights into the understanding of the biological mechanisms underlying small vessel disease.
Genome-wide association studies (GWAS) have identified thousands of genomic regions affecting complex diseases. The next challenge is to elucidate the causal genes and mechanisms involved. One ...approach is to use statistical colocalization to assess shared genetic aetiology across multiple related traits (e.g. molecular traits, metabolic pathways and complex diseases) to identify causal pathways, prioritize causal variants and evaluate pleiotropy. We propose HyPrColoc (Hypothesis Prioritisation for multi-trait Colocalization), an efficient deterministic Bayesian algorithm using GWAS summary statistics that can detect colocalization across vast numbers of traits simultaneously (e.g. 100 traits can be jointly analysed in around 1 s). We perform a genome-wide multi-trait colocalization analysis of coronary heart disease (CHD) and fourteen related traits, identifying 43 regions in which CHD colocalized with ≥1 trait, including 5 previously unknown CHD loci. Across the 43 loci, we further integrate gene and protein expression quantitative trait loci to identify candidate causal genes.
Recent genome-wide association studies in stroke have enabled the generation of genomic risk scores (GRS) but their predictive power has been modest compared to established stroke risk factors. Here, ...using a meta-scoring approach, we develop a metaGRS for ischaemic stroke (IS) and analyse this score in the UK Biobank (n = 395,393; 3075 IS events by age 75). The metaGRS hazard ratio for IS (1.26, 95% CI 1.22-1.31 per metaGRS standard deviation) doubles that of a previous GRS, identifying a subset of individuals at monogenic levels of risk: the top 0.25% of metaGRS have three-fold risk of IS. The metaGRS is similarly or more predictive compared to several risk factors, such as family history, blood pressure, body mass index, and smoking. We estimate the reductions needed in modifiable risk factors for individuals with different levels of genomic risk and suggest that, for individuals with high metaGRS, achieving risk factor levels recommended by current guidelines may be insufficient to mitigate risk.
The genetic basis of autoantibody production is largely unknown outside of associations located in the major histocompatibility complex (MHC) human leukocyte antigen (HLA) region. The aim of this ...study is the discovery of new genetic associations with autoantibody positivity using genome-wide association scan single nucleotide polymorphism (SNP) data in type 1 diabetes (T1D) patients with autoantibody measurements. We measured two anti-islet autoantibodies, glutamate decarboxylase (GADA, n = 2,506), insulinoma-associated antigen 2 (IA-2A, n = 2,498), antibodies to the autoimmune thyroid (Graves') disease (AITD) autoantigen thyroid peroxidase (TPOA, n = 8,300), and antibodies against gastric parietal cells (PCA, n = 4,328) that are associated with autoimmune gastritis. Two loci passed a stringent genome-wide significance level (p<10(-10)): 1q23/FCRL3 with IA-2A and 9q34/ABO with PCA. Eleven of 52 non-MHC T1D loci showed evidence of association with at least one autoantibody at a false discovery rate of 16%: 16p11/IL27-IA-2A, 2q24/IFIH1-IA-2A and PCA, 2q32/STAT4-TPOA, 10p15/IL2RA-GADA, 6q15/BACH2-TPOA, 21q22/UBASH3A-TPOA, 1p13/PTPN22-TPOA, 2q33/CTLA4-TPOA, 4q27/IL2/TPOA, 15q14/RASGRP1/TPOA, and 12q24/SH2B3-GADA and TPOA. Analysis of the TPOA-associated loci in 2,477 cases with Graves' disease identified two new AITD loci (BACH2 and UBASH3A).
Mitochondrial DNA (mtDNA) variation in common diseases has been underexplored, partly due to a lack of genotype calling and quality-control procedures. Developing an at-scale workflow for mtDNA ...variant analyses, we show correlations between nuclear and mitochondrial genomic structures within subpopulations of Great Britain and establish a UK Biobank reference atlas of mtDNA-phenotype associations. A total of 260 mtDNA-phenotype associations were new (P < 1 × 10
), including rs2853822 /m.8655 C>T (MT-ATP6) with type 2 diabetes, rs878966690 /m.13117 A>G (MT-ND5) with multiple sclerosis, 6 mtDNA associations with adult height, 24 mtDNA associations with 2 liver biomarkers and 16 mtDNA associations with parameters of renal function. Rare-variant gene-based tests implicated complex I genes modulating mean corpuscular volume and mean corpuscular hemoglobin. Seven traits had both rare and common mtDNA associations, where rare variants tended to have larger effects than common variants. Our work illustrates the value of studying mtDNA variants in common complex diseases and lays foundations for future large-scale mtDNA association studies.
Inflammation, which is directly regulated by interleukin-6 (IL-6) signaling, is implicated in the etiology of several chronic diseases. Although a common, non-synonymous variant in the IL-6 receptor ...gene (IL6R Asp358Ala; rs2228145 A>C) is associated with the risk of several common diseases, with the 358Ala allele conferring protection from coronary heart disease (CHD), rheumatoid arthritis (RA), atrial fibrillation (AF), abdominal aortic aneurysm (AAA), and increased susceptibility to asthma, the variant's effect on IL-6 signaling is not known. Here we provide evidence for the association of this non-synonymous variant with the risk of type 1 diabetes (T1D) in two independent populations and confirm that rs2228145 is the major determinant of the concentration of circulating soluble IL-6R (sIL-6R) levels (34.6% increase in sIL-6R per copy of the minor allele 358Ala; rs2228145 C). To further investigate the molecular mechanism of this variant, we analyzed expression of IL-6R in peripheral blood mononuclear cells (PBMCs) in 128 volunteers from the Cambridge BioResource. We demonstrate that, although 358Ala increases transcription of the soluble IL6R isoform (P = 8.3×10⁻²²) and not the membrane-bound isoform, 358Ala reduces surface expression of IL-6R on CD4+ T cells and monocytes (up to 28% reduction per allele; P≤5.6×10⁻²²). Importantly, reduced expression of membrane-bound IL-6R resulted in impaired IL-6 responsiveness, as measured by decreased phosphorylation of the transcription factors STAT3 and STAT1 following stimulation with IL-6 (P≤5.2×10⁻⁷). Our findings elucidate the regulation of IL-6 signaling by IL-6R, which is causally relevant to several complex diseases, identify mechanisms for new approaches to target the IL-6/IL-6R axis, and anticipate differences in treatment response to IL-6 therapies based on this common IL6R variant.
Stroke is the second leading cause of death with substantial unmet therapeutic needs. To identify potential stroke therapeutic targets, we estimate the causal effects of 308 plasma proteins on stroke ...outcomes in a two-sample Mendelian randomization framework and assess mediation effects by stroke risk factors. We find associations between genetically predicted plasma levels of six proteins and stroke (P ≤ 1.62 × 10
). The genetic associations with stroke colocalize (Posterior Probability >0.7) with the genetic associations of four proteins (TFPI, TMPRSS5, CD6, CD40). Mendelian randomization supports atrial fibrillation, body mass index, smoking, blood pressure, white matter hyperintensities and type 2 diabetes as stroke risk factors (P ≤ 0.0071). Body mass index, white matter hyperintensity and atrial fibrillation appear to mediate the TFPI, IL6RA, TMPRSS5 associations with stroke. Furthermore, thirty-six proteins are associated with one or more of these risk factors using Mendelian randomization. Our results highlight causal pathways and potential therapeutic targets for stroke.
Variation in the human leukocyte antigen (HLA) genes accounts for one-half of the genetic risk in type 1 diabetes (T1D). Amino acid changes in the HLA-DR and HLA-DQ molecules mediate most of the ...risk, but extensive linkage disequilibrium complicates the localization of independent effects. Using 18,832 case-control samples, we localized the signal to 3 amino acid positions in HLA-DQ and HLA-DR. HLA-DQβ1 position 57 (previously known; P = 1 × 10(-1,355)) by itself explained 15.2% of the total phenotypic variance. Independent effects at HLA-DRβ1 positions 13 (P = 1 × 10(-721)) and 71 (P = 1 × 10(-95)) increased the proportion of variance explained to 26.9%. The three positions together explained 90% of the phenotypic variance in the HLA-DRB1-HLA-DQA1-HLA-DQB1 locus. Additionally, we observed significant interactions for 11 of 21 pairs of common HLA-DRB1-HLA-DQA1-HLA-DQB1 haplotypes (P = 1.6 × 10(-64)). HLA-DRβ1 positions 13 and 71 implicate the P4 pocket in the antigen-binding groove, thus pointing to another critical protein structure for T1D risk, in addition to the HLA-DQ P9 pocket.