Previous studies have reported associations between attention-deficit/hyperactivity disorder symptoms and DNA methylation in children. We report the first epigenome-wide association study ...meta-analysis of adult attention-deficit/hyperactivity disorder symptoms, based on peripheral blood DNA methylation (Infinium HumanMethylation450K array) in three population-based adult cohorts.
An epigenome-wide association study was performed in the Netherlands Twin Register (N = 2258, mean age 37 years), Dunedin Multidisciplinary Health and Development Study (N = 800, age 38 years), and Environmental Risk Longitudinal Twin Study (N = 1631, age 18 years), and results were combined through meta-analysis (total sample size N = 4689). Region-based analyses accounting for the correlation between nearby methylation sites were also performed.
One epigenome-wide significant differentially methylated position was detected in the Dunedin study, but meta-analysis did not detect differentially methylated positions that were robustly associated across cohorts. In region-based analyses, six significant differentially methylation regions (DMRs) were identified in the Netherlands Twin Register, 19 in the Dunedin study, and none in the Environmental Risk Longitudinal Twin Study. Of these DMRs, 92% were associated with methylation quantitative trait loci, and 68% showed moderate to large blood-brain correlations for DNA methylation levels. DMRs included six nonoverlapping DMRs (three in the Netherlands Twin Register, three in the Dunedin study) in the major histocompatibility complex, which were associated with expression of genes in the major histocompatibility complex, including C4A and C4B, previously implicated in schizophrenia.
Our findings point at new candidate loci involved in immune and neuronal functions that await further replication. Our work also illustrates the need for further research to examine to what extent epigenetic associations with psychiatric traits depend on characteristics such as age, comorbidities, exposures, and genetic background.
Identification of causal drivers behind regulatory gene networks is crucial in understanding gene function. Here, we develop a method for the large-scale inference of gene-gene interactions in ...observational population genomics data that are both directed (using local genetic instruments as causal anchors, akin to Mendelian Randomization) and specific (by controlling for linkage disequilibrium and pleiotropy). Analysis of genotype and whole-blood RNA-sequencing data from 3072 individuals identified 49 genes as drivers of downstream transcriptional changes (Wald P < 7 × 10
), among which transcription factors were overrepresented (Fisher's P = 3.3 × 10
). Our analysis suggests new gene functions and targets, including for SENP7 (zinc-finger genes involved in retroviral repression) and BCL2A1 (target genes possibly involved in auditory dysfunction). Our work highlights the utility of population genomics data in deriving directed gene expression networks. A resource of trans-effects for all 6600 genes with a genetic instrument can be explored individually using a web-based browser.
We show that epigenome- and transcriptome-wide association studies (EWAS and TWAS) are prone to significant inflation and bias of test statistics, an unrecognized phenomenon introducing spurious ...findings if left unaddressed. Neither GWAS-based methodology nor state-of-the-art confounder adjustment methods completely remove bias and inflation. We propose a Bayesian method to control bias and inflation in EWAS and TWAS based on estimation of the empirical null distribution. Using simulations and real data, we demonstrate that our method maximizes power while properly controlling the false positive rate. We illustrate the utility of our method in large-scale EWAS and TWAS meta-analyses of age and smoking.
Most disease-associated genetic variants are noncoding, making it challenging to design experiments to understand their functional consequences. Identification of expression quantitative trait loci ...(eQTLs) has been a powerful approach to infer the downstream effects of disease-associated variants, but most of these variants remain unexplained. The analysis of DNA methylation, a key component of the epigenome, offers highly complementary data on the regulatory potential of genomic regions. Here we show that disease-associated variants have widespread effects on DNA methylation in trans that likely reflect differential occupancy of trans binding sites by cis-regulated transcription factors. Using multiple omics data sets from 3,841 Dutch individuals, we identified 1,907 established trait-associated SNPs that affect the methylation levels of 10,141 different CpG sites in trans (false discovery rate (FDR) < 0.05). These included SNPs that affect both the expression of a nearby transcription factor (such as NFKB1, CTCF and NKX2-3) and methylation of its respective binding site across the genome. Trans methylation QTLs effectively expose the downstream effects of disease-associated variants.
Genetic risk factors often localize to noncoding regions of the genome with unknown effects on disease etiology. Expression quantitative trait loci (eQTLs) help to explain the regulatory mechanisms ...underlying these genetic associations. Knowledge of the context that determines the nature and strength of eQTLs may help identify cell types relevant to pathophysiology and the regulatory networks underlying disease. Here we generated peripheral blood RNA-seq data from 2,116 unrelated individuals and systematically identified context-dependent eQTLs using a hypothesis-free strategy that does not require previous knowledge of the identity of the modifiers. Of the 23,060 significant cis-regulated genes (false discovery rate (FDR) ≤ 0.05), 2,743 (12%) showed context-dependent eQTL effects. The majority of these effects were influenced by cell type composition. A set of 145 cis-eQTLs depended on type I interferon signaling. Others were modulated by specific transcription factors binding to the eQTL SNPs.
The methylome is subject to genetic and environmental effects. Their impact may depend on sex and age, resulting in sex- and age-related physiological variation and disease susceptibility. Here we ...estimate the total heritability of DNA methylation levels in whole blood and estimate the variance explained by common single nucleotide polymorphisms at 411,169 sites in 2,603 individuals from twin families, to establish a catalogue of between-individual variation in DNA methylation. Heritability estimates vary across the genome (mean=19%) and interaction analyses reveal thousands of sites with sex-specific heritability as well as sites where the environmental variance increases with age. Integration with previously published data illustrates the impact of genome and environment across the lifespan at methylation sites associated with metabolic traits, smoking and ageing. These findings demonstrate that our catalogue holds valuable information on locations in the genome where methylation variation between people may reflect disease-relevant environmental exposures or genetic variation.
We tested whether DNA-methylation profiles account for inter-individual variation in body mass index (BMI) and height and whether they predict these phenotypes over and above genetic factors. Genetic ...predictors were derived from published summary results from the largest genome-wide association studies on BMI (n ∼ 350,000) and height (n ∼ 250,000) to date. We derived methylation predictors by estimating probe-trait effects in discovery samples and tested them in external samples. Methylation profiles associated with BMI in older individuals from the Lothian Birth Cohorts (LBCs, n = 1,366) explained 4.9% of the variation in BMI in Dutch adults from the LifeLines DEEP study (n = 750) but did not account for any BMI variation in adolescents from the Brisbane Systems Genetic Study (BSGS, n = 403). Methylation profiles based on the Dutch sample explained 4.9% and 3.6% of the variation in BMI in the LBCs and BSGS, respectively. Methylation profiles predicted BMI independently of genetic profiles in an additive manner: 7%, 8%, and 14% of variance of BMI in the LBCs were explained by the methylation predictor, the genetic predictor, and a model containing both, respectively. The corresponding percentages for LifeLines DEEP were 5%, 9%, and 13%, respectively, suggesting that the methylation profiles represent environmental effects. The differential effects of the BMI methylation profiles by age support previous observations of age modulation of genetic contributions. In contrast, methylation profiles accounted for almost no variation in height, consistent with a mainly genetic contribution to inter-individual variation. The BMI results suggest that combining genetic and epigenetic information might have greater utility for complex-trait prediction.
Although it is assumed that epigenetic mechanisms, such as changes in DNA methylation (DNAm), underlie the relationship between adverse intrauterine conditions and adult metabolic health, evidence ...from human studies remains scarce. Therefore, we evaluated whether DNAm in whole blood mediated the association between prenatal famine exposure and metabolic health in 422 individuals exposed to famine in utero and 463 (sibling) controls. We implemented a two-step analysis, namely, a genome-wide exploration across 342,596 cytosine-phosphate-guanine dinucleotides (CpGs) for potential mediators of the association between prenatal famine exposure and adult body mass index (BMI), serum triglycerides (TG), or glucose concentrations, which was followed by formal mediation analysis. DNAm mediated the association of prenatal famine exposure with adult BMI and TG but not with glucose. DNAm at
(cg09349128), a gene involved in energy metabolism, mediated 13.4% 95% confidence interval (CI), 5 to 28% of the association between famine exposure and BMI. DNAm at six CpGs, including
(cg19693031), influencing β cell function, and
(cg07397296), affecting lipid metabolism, together mediated 80% (95% CI, 38.5 to 100%) of the association between famine exposure and TG. Analyses restricted to those exposed to famine during early gestation identified additional CpGs mediating the relationship with TG near
(glycolysis) and
(adipogenesis). DNAm at the CpGs involved was associated with gene expression in an external data set and correlated with DNAm levels in fat depots in additional postmortem data. Our data are consistent with the hypothesis that epigenetic mechanisms mediate the influence of transient adverse environmental factors in early life on long-term metabolic health. The specific mechanism awaits elucidation.
We have generated a next-generation whole-exome sequencing data set of 2628 participants of the population-based Rotterdam Study cohort, comprising 669 737 single-nucleotide variants and 24 019 short ...insertions and deletions. Because of broad and deep longitudinal phenotyping of the Rotterdam Study, this data set permits extensive interpretation of genetic variants on a range of clinically relevant outcomes, and is accessible as a control data set. We show that next-generation sequencing data sets yield a large degree of population-specific variants, which are not captured by other available large sequencing efforts, being ExAC, ESP, 1000G, UK10K, GoNL and DECODE.
Recent genome-wide association studies (GWASs) identified the first genetic loci associated with attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorder (ASD). The next step is ...to use these results to increase our understanding of the biological mechanisms involved. Most of the identified variants likely influence gene regulation. The aim of the current study is to shed light on the mechanisms underlying the genetic signals and prioritize genes by integrating GWAS results with gene expression and DNA methylation (DNAm) levels.
We applied summary-data–based Mendelian randomization to integrate ADHD and ASD GWAS data with fetal brain expression and methylation quantitative trait loci, given the early onset of these disorders. We also analyzed expression and methylation quantitative trait loci datasets of adult brain and blood, as these provide increased statistical power. We subsequently used summary-data–based Mendelian randomization to investigate if the same variant influences both DNAm and gene expression levels.
We identified multiple gene expression and DNAm levels in fetal brain at chromosomes 1 and 17 that were associated with ADHD and ASD, respectively, through pleiotropy at shared genetic variants. The analyses in brain and blood showed additional associated gene expression and DNAm levels at the same and additional loci, likely because of increased statistical power. Several of the associated genes have not been identified in ADHD and ASD GWASs before.
Our findings identified the genetic variants associated with ADHD and ASD that likely act through gene regulation. This facilitates prioritization of candidate genes for functional follow-up studies.