We integrate comeasured gene expression and DNA methylation (DNAme) in 265 human skeletal muscle biopsies from the FUSION study with >7 million genetic variants and eight physiological traits: ...height,waist,weight,waist–hip ratio, body mass index, fasting serum insulin, fasting plasma glucose, and type 2 diabetes. We find hundreds of genes and DNAme sites associated with fasting insulin, waist, and body mass index, as well as thousands of DNAme sites associated with gene expression (eQTM). We find that controlling for heterogeneity in tissue/muscle fiber type reduces the number of physiological trait associations, and that long-range eQTMs (>1 Mb) are reduced when controlling for tissue/muscle fiber type or latent factors. We map genetic regulators (quantitative trait loci; QTLs) of expression (eQTLs) and DNAme (mQTLs). Using Mendelian randomization (MR) and mediation techniques, we leverage these genetic maps to predict 213 causal relationships between expression and DNAme, approximately two-thirds of which predict methylation to causally influence expression. We use MR to integrate FUSION mQTLs, FUSION eQTLs, and GTEx eQTLs for 48 tissues with genetic associations for 534 diseases and quantitative traits. We identify hundreds of genes and thousands of DNAme sites that may drive the reported disease/quantitative trait genetic associations. We identify 300 gene expression MR associations that are present in both FUSION and GTEx skeletal muscle and that show stronger evidence of MR association in skeletal muscle than other tissues, which may partially reflect differences in power across tissues. As one example, we find that increased RXRA muscle expression may decrease lean tissue mass.
Subcutaneous adipose tissue stores excess lipids and maintains energy balance. We performed expression quantitative trait locus (eQTL) analyses by using abdominal subcutaneous adipose tissue of 770 ...extensively phenotyped participants of the METSIM study. We identified cis-eQTLs for 12,400 genes at a 1% false-discovery rate. Among an approximately 680 known genome-wide association study (GWAS) loci for cardio-metabolic traits, we identified 140 coincident cis-eQTLs at 109 GWAS loci, including 93 eQTLs not previously described. At 49 of these 140 eQTLs, gene expression was nominally associated (p < 0.05) with levels of the GWAS trait. The size of our dataset enabled identification of five loci associated (p < 5 × 10−8) with at least five genes located >5 Mb away. These trans-eQTL signals confirmed and extended the previously reported KLF14-mediated network to 55 target genes, validated the CIITA regulation of class II MHC genes, and identified ZNF800 as a candidate master regulator. Finally, we observed similar expression-clinical trait correlations of genes associated with GWAS loci in both humans and a panel of genetically diverse mice. These results provide candidate genes for further investigation of their potential roles in adipose biology and in regulating cardio-metabolic traits.
Lipid and lipoprotein subclasses are associated with metabolic and cardiovascular diseases, yet the genetic contributions to variability in subclass traits are not fully understood. We conducted ...single-variant and gene-based association tests between 15.1M variants from genome-wide and exome array and imputed genotypes and 72 lipid and lipoprotein traits in 8,372 Finns. After accounting for 885 variants at 157 previously identified lipid loci, we identified five novel signals near established loci at HIF3A, ADAMTS3, PLTP, LCAT, and LIPG. Four of the signals were identified with a low-frequency (0.005<minor allele frequency MAF<0.05) or rare (MAF<0.005) variant, including Arg123His in LCAT. Gene-based associations (P<10-10) support a role for coding variants in LIPC and LIPG with lipoprotein subclass traits. 30 established lipid-associated loci had a stronger association for a subclass trait than any conventional trait. These novel association signals provide further insight into the molecular basis of dyslipidemia and the etiology of metabolic disorders.
A glutamate-to-lysine variant (rs58542926-T) in transmembrane 6 superfamily member 2 (TM6SF2) is associated with increased fatty liver disease and diabetes in conjunction with decreased ...cardiovascular disease risk. To identify mediators of the effects of TM6SF2, we tested for associations between rs58542926-T and serum lipoprotein/metabolite measures in cross-sectional data from nondiabetic statin-naïve participants. We identified independent associations between rs58542926-T and apoB-100 particles (β = −0.057 g/l, P = 1.99 × 10−14) and tyrosine levels (β = 0.0020 mmol/l, P = 1.10 × 10−8), controlling for potential confounders, in 6,929 Finnish men. The association between rs58542926-T and apoB-100 was confirmed in an independent sample of 2,196 Finnish individuals from the FINRISK study (βreplication = −0.029, Preplication = 0.029). Secondary analyses demonstrated an rs58542926-T dose-dependent decrease in particle concentration, cholesterol, and triglyceride (TG) content for VLDL and LDL particles (P < 0.001 for all). No significant associations between rs58542926-T and HDL measures were observed. TM6SF2 SNP rs58542926-T and tyrosine levels were associated with increased incident T2D risk in both METSIM and FINRISK. Decreased liver production/secretion of VLDL, decreased cholesterol and TGs in VLDL/LDL particles in serum, and increased tyrosine levels identify possible mechanisms by which rs58542926-T exerts its effects on increasing risk of fatty liver disease, decreasing cardiovascular disease, and increasing diabetes risk, respectively.
Molecular mechanisms remain unknown for most type 2 diabetes genome-wide association study identified loci. Variants associated with type 2 diabetes and fasting glucose levels reside in introns of
, ...a gene that encodes adenylate cyclase 5. Adenylate cyclase 5 catalyzes the production of cyclic AMP, which is a second messenger molecule involved in cell signaling and pancreatic β-cell insulin secretion. We demonstrated that type 2 diabetes risk alleles are associated with decreased
expression in human islets and examined candidate variants for regulatory function. rs11708067 overlaps a predicted enhancer region in pancreatic islets. The type 2 diabetes risk rs11708067-A allele showed fewer H3K27ac ChIP-seq reads in human islets, lower transcriptional activity in reporter assays in rodent β-cells (rat 832/13 and mouse MIN6), and increased nuclear protein binding compared with the rs11708067-G allele. Homozygous deletion of the orthologous enhancer region in 832/13 cells resulted in a 64% reduction in expression level of
, but not adjacent gene
, and a 39% reduction in insulin secretion. Together, these data suggest that rs11708067-A risk allele contributes to type 2 diabetes by disrupting an islet enhancer, which results in reduced
expression and impaired insulin secretion.
Genome-wide association studies have shown that variation in MTNR1B (melatonin receptor 1B) is associated with insulin and glucose concentrations. Here we show that the risk genotype of this SNP ...predicts future type 2 diabetes (T2D) in two large prospective studies. Specifically, the risk genotype was associated with impairment of early insulin response to both oral and intravenous glucose and with faster deterioration of insulin secretion over time. We also show that the MTNR1B mRNA is expressed in human islets, and immunocytochemistry confirms that it is primarily localized in beta cells in islets. Nondiabetic individuals carrying the risk allele and individuals with T2D showed increased expression of the receptor in islets. Insulin release from clonal beta cells in response to glucose was inhibited in the presence of melatonin. These data suggest that the circulating hormone melatonin, which is predominantly released from the pineal gland in the brain, is involved in the pathogenesis of T2D. Given the increased expression of MTNR1B in individuals at risk of T2D, the pathogenic effects are likely exerted via a direct inhibitory effect on beta cells. In view of these results, blocking the melatonin ligand-receptor system could be a therapeutic avenue in T2D.
Few studies have explored the impact of rare variants (minor allele frequency < 1%) on highly heritable plasma metabolites identified in metabolomic screens. The Finnish population provides an ideal ...opportunity for such explorations, given the multiple bottlenecks and expansions that have shaped its history, and the enrichment for many otherwise rare alleles that has resulted. Here, we report genetic associations for 1391 plasma metabolites in 6136 men from the late-settlement region of Finland. We identify 303 novel association signals, more than one third at variants rare or enriched in Finns. Many of these signals identify genes not previously implicated in metabolite genome-wide association studies and suggest mechanisms for diseases and disease-related traits.
Genome-wide association studies (GWASs) have identified more than 150 loci associated with blood lipid and cholesterol levels; however, the functional and molecular mechanisms for many associations ...are unknown. We examined the functional regulatory effects of candidate variants at the GALNT2 locus associated with high-density lipoprotein cholesterol (HDL-C). Fine-mapping and conditional analyses in the METSIM study identified a single locus harboring 25 noncoding variants (r2 > 0.7 with the lead GWAS variants) strongly associated with total cholesterol in medium-sized HDL (e.g., rs17315646, p = 3.5 × 10−12). We used luciferase reporter assays in HepG2 cells to test all 25 variants for allelic differences in regulatory enhancer activity. rs2281721 showed allelic differences in transcriptional activity (75-fold T versus 27-fold C more than the empty-vector control), as did a separate 780-bp segment containing rs4846913, rs2144300, and rs6143660 (49-fold AT– haplotype versus 16-fold CC+ haplotype more). Using electrophoretic mobility shift assays, we observed differential CEBPB binding to rs4846913, and we confirmed this binding in a native chromatin context by performing chromatin-immunoprecipitation (ChIP) assays in HepG2 and Huh-7 cell lines of differing genotypes. Additionally, sequence reads in HepG2 DNase-I-hypersensitivity and CEBPB ChIP-seq signals spanning rs4846913 showed significant allelic imbalance. Allelic-expression-imbalance assays performed with RNA from primary human hepatocyte samples and expression-quantitative-trait-locus (eQTL) data in human subcutaneous adipose tissue samples confirmed that alleles associated with increased HDL-C are associated with a modest increase in GALNT2 expression. Together, these data suggest that at least rs4846913 and rs2281721 play key roles in influencing GALNT2 expression at this HDL-C locus.
Abstract
Integration of genome-wide association study (GWAS) signals with expression quantitative trait loci (eQTL) studies enables identification of candidate genes. However, evaluating whether ...nearby signals may share causal variants, termed colocalization, is affected by the presence of allelic heterogeneity, different variants at the same locus impacting the same phenotype. We previously identified eQTL in subcutaneous adipose tissue from 770 participants in the Metabolic Syndrome in Men (METSIM) study and detected 15 eQTL signals that colocalized with GWAS signals for waist–hip ratio adjusted for body mass index (WHRadjBMI) from the Genetic Investigation of Anthropometric Traits consortium. Here, we reevaluated evidence of colocalization using two approaches, conditional analysis and the Bayesian test COLOC, and show that providing COLOC with approximate conditional summary statistics at multi-signal GWAS loci can reconcile disagreements in colocalization classification between the two tests. Next, we performed conditional analysis on the METSIM subcutaneous adipose tissue data to identify conditionally distinct or secondary eQTL signals. We used the two approaches to test for colocalization with WHRadjBMI GWAS signals and evaluated the differences in colocalization classification between the two tests. Through these analyses, we identified four GWAS signals colocalized with secondary eQTL signals for FAM13A, SSR3, GRB14 and FMO1. Thus, at loci with multiple eQTL and/or GWAS signals, analyzing each signal independently enabled additional candidate genes to be identified.