The influence of genetic variation on complex diseases is potentially mediated through a range of highly dynamic epigenetic processes exhibiting temporal variation during development and later life. ...Here we present a catalogue of the genetic influences on DNA methylation (methylation quantitative trait loci (mQTL)) at five different life stages in human blood: children at birth, childhood, adolescence and their mothers during pregnancy and middle age.
We show that genetic effects on methylation are highly stable across the life course and that developmental change in the genetic contribution to variation in methylation occurs primarily through increases in environmental or stochastic effects. Though we map a large proportion of the cis-acting genetic variation, a much larger component of genetic effects influencing methylation are acting in trans. However, only 7 % of discovered mQTL are trans-effects, suggesting that the trans component is highly polygenic. Finally, we estimate the contribution of mQTL to variation in complex traits and infer that methylation may have a causal role consistent with an infinitesimal model in which many methylation sites each have a small influence, amounting to a large overall contribution.
DNA methylation contains a significant heritable component that remains consistent across the lifespan. Our results suggest that the genetic component of methylation may have a causal role in complex traits. The database of mQTL presented here provide a rich resource for those interested in investigating the role of methylation in disease.
Loci discovered by genome-wide association studies predominantly map outside protein-coding genes. The interpretation of the functional consequences of non-coding variants can be greatly enhanced by ...catalogs of regulatory genomic regions in cell lines and primary tissues. However, robust and readily applicable methods are still lacking by which to systematically evaluate the contribution of these regions to genetic variation implicated in diseases or quantitative traits. Here we propose a novel approach that leverages genome-wide association studies' findings with regulatory or functional annotations to classify features relevant to a phenotype of interest. Within our framework, we account for major sources of confounding not offered by current methods. We further assess enrichment of genome-wide association studies for 19 traits within Encyclopedia of DNA Elements- and Roadmap-derived regulatory regions. We characterize unique enrichment patterns for traits and annotations driving novel biological insights. The method is implemented in standalone software and an R package, to facilitate its application by the research community.
Upper- and lower-body fat depots exhibit opposing associations with obesity-related metabolic disease. We defined the relationship between DEXA-quantified fat depots and diabetes/cardiovascular risk ...factors in a healthy population-based cohort (n = 3,399). Gynoid fat mass correlated negatively with insulin resistance after total fat mass adjustment, whereas the opposite was seen for abdominal fat. Paired transcriptomic analysis of gluteal subcutaneous adipose tissue (GSAT) and abdominal subcutaneous adipose tissue (ASAT) was performed across the BMI spectrum (n = 49; 21.4-45.5 kg/m(2)). In both depots, energy-generating metabolic genes were negatively associated and inflammatory genes were positively associated with obesity. However, associations were significantly weaker in GSAT. At the systemic level, arteriovenous release of the proinflammatory cytokine interleukin-6 (n = 34) was lower from GSAT than ASAT. Isolated preadipocytes retained a depot-specific transcriptional "memory" of embryonic developmental genes and exhibited differential promoter DNA methylation of selected genes (HOTAIR, TBX5) between GSAT and ASAT. Short hairpin RNA-mediated silencing identified TBX5 as a regulator of preadipocyte proliferation and adipogenic differentiation in ASAT. In conclusion, intrinsic differences in the expression of developmental genes in regional adipocytes provide a mechanistic basis for diversity in adipose tissue (AT) function. The less inflammatory nature of lower-body AT offers insight into the opposing metabolic disease risk associations between upper- and lower-body obesity.
Acne vulgaris is a highly heritable skin disorder that primarily impacts facial skin. Severely inflamed lesions may leave permanent scars that have been associated with long-term psychosocial ...consequences. Here, we perform a GWAS meta-analysis comprising 20,165 individuals with acne from nine independent European ancestry cohorts. We identify 29 novel genome-wide significant loci and replicate 14 of the 17 previously identified risk loci, bringing the total number of reported acne risk loci to 46. Using fine-mapping and eQTL colocalisation approaches, we identify putative causal genes at several acne susceptibility loci that have previously been implicated in Mendelian hair and skin disorders, including pustular psoriasis. We identify shared genetic aetiology between acne, hormone levels, hormone-sensitive cancers and psychiatric traits. Finally, we show that a polygenic risk score calculated from our results explains up to 5.6% of the variance in acne liability in an independent cohort.
GWASs for atopic dermatitis have identified 25 reproducible loci. We attempt to prioritize the candidate causal genes at these loci using extensive molecular resources compiled into a bioinformatics ...pipeline. We identified a list of 103 molecular resources for atopic dermatitis etiology, including expression, protein, and DNA methylation quantitative trait loci datasets in the skin or immune-relevant tissues, which were tested for overlap with GWAS signals. This was combined with functional annotation using regulatory variant prediction and features such as promoter‒enhancer interactions, expression studies, and variant fine mapping. For each gene at each locus, we condensed the evidence into a prioritization score. Across the investigated loci, we detected significant enrichment of genes with adaptive immune regulatory function and epidermal barrier formation among the top-prioritized genes. At eight loci, we were able to prioritize a single candidate gene (IL6R, ADO, PRR5L, IL7R, ETS1, INPP5D, MDM1, TRAF3). In addition, at 6 of the 25 loci, our analysis prioritizes less familiar candidates (SLC22A5, IL2RA, MDM1, DEXI, ADO, STMN3). Our analysis provides support for previously implicated genes at several atopic dermatitis GWAS loci as well as evidence for plausible additional candidates at others, which may represent potential targets for drug discovery.
Pigmentation characteristics are well‐known risk factors for skin cancer. Polymorphisms in pigmentation genes have been associated with these traits and with the risk of malignancy. However, the ...functional relationship between genetic variation and disease is still unclear. This study aims to assess whether pigmentation SNPs are associated with pigmentary traits and skin cancer via DNA methylation (DNAm). Using a meta‐GWAS of whole‐blood DNAm from 36 European cohorts (N = 27,750; the Genetics of DNA Methylation Consortium, GoDMC), we found that 19 out of 27 SNPs in 10 pigmentation genes were associated with 391 DNAm sites across 30 genomic regions. We examined the effect of 25 selected DNAm sites on pigmentation traits, sun exposure phenotypes and skin cancer and on gene expression in whole blood. We uncovered an association of DNAm site cg07402062 with red hair in the Avon Longitudinal Study of Parents and Children (ALSPAC). We also found that the expression of ASIP and CDK10 was associated with hair colour, melanoma and basal cell carcinoma. Our results indicate that DNAm and expression of pigmentation genes may play a role as potential mediators of the relationship between genetic variants, pigmentation phenotypes and skin cancer and thus deserve further scrutiny.
Sequence-based variation in gene expression is a key driver of disease risk. Common variants regulating expression in cis have been mapped in many expression quantitative trait locus (eQTL) studies, ...typically in single tissues from unrelated individuals. Here, we present a comprehensive analysis of gene expression across multiple tissues conducted in a large set of mono- and dizygotic twins that allows systematic dissection of genetic (cis and trans) and non-genetic effects on gene expression. Using identity-by-descent estimates, we show that at least 40% of the total heritable cis effect on expression cannot be accounted for by common cis variants, a finding that reveals the contribution of low-frequency and rare regulatory variants with respect to both transcriptional regulation and complex trait susceptibility. We show that a substantial proportion of gene expression heritability is trans to the structural gene, and we identify several replicating trans variants that act predominantly in a tissue-restricted manner and may regulate the transcription of many genes.
Epigenetic modifications such as DNA methylation play a key role in gene regulation and disease susceptibility. However, little is known about the genome-wide frequency, localization, and function of ...methylation variation and how it is regulated by genetic and environmental factors. We utilized the Multiple Tissue Human Expression Resource (MuTHER) and generated Illumina 450K adipose methylome data from 648 twins. We found that individual CpGs had low variance and that variability was suppressed in promoters. We noted that DNA methylation variation was highly heritable (h2median = 0.34) and that shared environmental effects correlated with metabolic phenotype-associated CpGs. Analysis of methylation quantitative-trait loci (metQTL) revealed that 28% of CpGs were associated with nearby SNPs, and when overlapping them with adipose expression quantitative-trait loci (eQTL) from the same individuals, we found that 6% of the loci played a role in regulating both gene expression and DNA methylation. These associations were bidirectional, but there were pronounced negative associations for promoter CpGs. Integration of metQTL with adipose reference epigenomes and disease associations revealed significant enrichment of metQTL overlapping metabolic-trait or disease loci in enhancers (the strongest effects were for high-density lipoprotein cholesterol and body mass index BMI). We followed up with the BMI SNP rs713586, a cg01884057 metQTL that overlaps an enhancer upstream of ADCY3, and used bisulphite sequencing to refine this region. Our results showed widespread population invariability yet sequence dependence on adipose DNA methylation but that incorporating maps of regulatory elements aid in linking CpG variation to gene regulation and disease risk in a tissue-dependent manner.
Readily accessible samples such as peripheral blood or cell lines are increasingly being used in large cohorts to characterise gene expression differences between a patient group and healthy ...controls. However, cell and RNA isolation procedures and the variety of cell types that make up whole blood can affect gene expression measurements. We therefore systematically investigated global gene expression profiles in peripheral blood from six individuals collected during two visits by comparing five of the following cell and RNA isolation methods: whole blood (PAXgene), peripheral blood mononuclear cells (PBMCs), lymphoblastoid cell lines (LCLs), CD19 and CD20 specific B-cell subsets.
Gene expression measurements were clearly discriminated by isolation method although the reproducibility was high for all methods (range rho = 0.90-1.00). The PAXgene samples showed a decrease in the number of expressed genes (P < 1*10(-16)) with higher variability (P < 1*10(-16)) compared to the other methods. Differentially expressed probes between PAXgene and PBMCs were correlated with the number of monocytes, lymphocytes, neutrophils or erythrocytes. The correlations (rho = 0.83; rho = 0.79) of the expression levels of detected probes between LCLs and B-cell subsets were much lower compared to the two B-cell isolation methods (rho = 0.98). Gene ontology analysis of detected genes showed that genes involved in inflammatory responses are enriched in B-cells CD19 and CD20 whereas genes involved in alcohol metabolic process and the cell cycle were enriched in LCLs.
Gene expression profiles in blood-based samples are strongly dependent on the predominant constituent cell type(s) and RNA isolation method. It is crucial to understand the differences and variability of gene expression measurements between cell and RNA isolation procedures, and their relevance to disease processes, before application in large clinical studies.
Depression is a disabling and highly prevalent condition where genetic and epigenetic, such as DNA methylation (DNAm), differences contribute to disease risk. DNA methylation is influenced by genetic ...variation but the association between polygenic risk of depression and DNA methylation is unknown.
We investigated the association between polygenic risk scores (PRS) for depression and DNAm by conducting a methylome-wide association study (MWAS) in Generation Scotland (N = 8898, mean age = 49.8 years) with replication in the Lothian Birth Cohorts of 1921 and 1936 and adults in the Avon Longitudinal Study of Parents and Children (ALSPAC) (N
= 2049, mean age = 79.1, 69.6 and 47.2 years, respectively). We also conducted a replication MWAS in the ALSPAC children (N = 423, mean age = 17.1 years). Gene ontology analysis was conducted for the cytosine-guanine dinucleotide (CpG) probes significantly associated with depression PRS, followed by Mendelian randomisation (MR) analysis to infer the causal relationship between depression and DNAm.
Widespread associations (N
= 71, p
< 0.05, p < 6.3 × 10
) were found between PRS constructed using genetic risk variants for depression and DNAm in CpG probes that localised to genes involved in immune responses and neural development. The effect sizes for the significant associations were highly correlated between the discovery and replication samples in adults (r = 0.79) and in adolescents (r = 0.82). Gene Ontology analysis showed that significant CpG probes are enriched in immunological processes in the human leukocyte antigen system. Additional MWAS was conducted for each lead genetic risk variant. Over 47.9% of the independent genetic risk variants included in the PRS showed associations with DNAm in CpG probes located in both the same (cis) and distal (trans) locations to the genetic loci (p
< 0.045). Subsequent MR analysis showed that there are a greater number of causal effects found from DNAm to depression than vice versa (DNAm to depression: p
ranged from 0.024 to 7.45 × 10
; depression to DNAm: p
ranged from 0.028 to 0.003).
PRS for depression, especially those constructed from genome-wide significant genetic risk variants, showed methylome-wide differences associated with immune responses. Findings from MR analysis provided evidence for causal effect of DNAm to depression.