DNA methylation plays an important role in the regulation of transcription. Genetic control of DNA methylation is a potential candidate for explaining the many identified SNP associations with ...disease that are not found in coding regions. We replicated 52,916 cis and 2,025 trans DNA methylation quantitative trait loci (mQTL) using methylation from whole blood measured on Illumina HumanMethylation450 arrays in the Brisbane Systems Genetics Study (n = 614 from 177 families) and the Lothian Birth Cohorts of 1921 and 1936 (combined n = 1366). The trans mQTL SNPs were found to be over-represented in 1 Mbp subtelomeric regions, and on chromosomes 16 and 19. There was a significant increase in trans mQTL DNA methylation sites in upstream and 5' UTR regions. The genetic heritability of a number of complex traits and diseases was partitioned into components due to mQTL and the remainder of the genome. Significant enrichment was observed for height (p = 2.1 × 10
), ulcerative colitis (p = 2 × 10
), Crohn's disease (p = 6 × 10
) and coronary artery disease (p = 5.5 × 10
) when compared to a random sample of SNPs with matched minor allele frequency, although this enrichment is explained by the genomic location of the mQTL SNPs.
Microarray technology has been used to measure genome-wide DNA methylation in thousands of individuals. These studies typically test the associations between individual DNA methylation sites ...("probes") and complex traits or diseases. The results can be used to generate methylation profile scores (MPS) to predict outcomes in independent data sets. Although there are many parallels between MPS and polygenic (risk) scores (PGS), there are key differences. Here, we review motivations, methods, and applications of DNA methylation-based trait prediction, with a focus on common diseases. We contrast MPS with PGS, highlighting where assumptions made in genetic modeling may not hold in epigenetic data.
DNA methylation changes with age. Chronological age predictors built from DNA methylation are termed 'epigenetic clocks'. The deviation of predicted age from the actual age ('age acceleration ...residual', AAR) has been reported to be associated with death. However, it is currently unclear how a better prediction of chronological age affects such association.
In this study, we build multiple predictors based on training DNA methylation samples selected from 13,661 samples (13,402 from blood and 259 from saliva). We use the Lothian Birth Cohorts of 1921 (LBC1921) and 1936 (LBC1936) to examine whether the association between AAR (from these predictors) and death is affected by (1) improving prediction accuracy of an age predictor as its training sample size increases (from 335 to 12,710) and (2) additionally correcting for confounders (i.e., cellular compositions). In addition, we investigated the performance of our predictor in non-blood tissues.
We found that in principle, a near-perfect age predictor could be developed when the training sample size is sufficiently large. The association between AAR and mortality attenuates as prediction accuracy increases. AAR from our best predictor (based on Elastic Net, https://github.com/qzhang314/DNAm-based-age-predictor ) exhibits no association with mortality in both LBC1921 (hazard ratio = 1.08, 95% CI 0.91-1.27) and LBC1936 (hazard ratio = 1.00, 95% CI 0.79-1.28). Predictors based on small sample size are prone to confounding by cellular compositions relative to those from large sample size. We observed comparable performance of our predictor in non-blood tissues with a multi-tissue-based predictor.
This study indicates that the epigenetic clock can be improved by increasing the training sample size and that its association with mortality attenuates with increased prediction of chronological age.
Genome-wide DNA methylation (DNAm) profiling has allowed for the development of molecular predictors for a multitude of traits and diseases. Such predictors may be more accurate than the ...self-reported phenotypes and could have clinical applications.
Here, penalized regression models are used to develop DNAm predictors for ten modifiable health and lifestyle factors in a cohort of 5087 individuals. Using an independent test cohort comprising 895 individuals, the proportion of phenotypic variance explained in each trait is examined for DNAm-based and genetic predictors. Receiver operator characteristic curves are generated to investigate the predictive performance of DNAm-based predictors, using dichotomized phenotypes. The relationship between DNAm scores and all-cause mortality (n = 212 events) is assessed via Cox proportional hazards models. DNAm predictors for smoking, alcohol, education, and waist-to-hip ratio are shown to predict mortality in multivariate models. The predictors show moderate discrimination of obesity, alcohol consumption, and HDL cholesterol. There is excellent discrimination of current smoking status, poorer discrimination of college-educated individuals and those with high total cholesterol, LDL with remnant cholesterol, and total:HDL cholesterol ratios.
DNAm predictors correlate with lifestyle factors that are associated with health and mortality. They may supplement DNAm-based predictors of age to identify the lifestyle profiles of individuals and predict disease risk.
DNA methylation age is an accurate biomarker of chronological age and predicts lifespan, but its underlying molecular mechanisms are unknown. In this genome-wide association study of 9907 ...individuals, we find gene variants mapping to five loci associated with intrinsic epigenetic age acceleration (IEAA) and gene variants in three loci associated with extrinsic epigenetic age acceleration (EEAA). Mendelian randomization analysis suggests causal influences of menarche and menopause on IEAA and lipoproteins on IEAA and EEAA. Variants associated with longer leukocyte telomere length (LTL) in the telomerase reverse transcriptase gene (TERT) paradoxically confer higher IEAA (P < 2.7 × 10
). Causal modeling indicates TERT-specific and independent effects on LTL and IEAA. Experimental hTERT-expression in primary human fibroblasts engenders a linear increase in DNA methylation age with cell population doubling number. Together, these findings indicate a critical role for hTERT in regulating the epigenetic clock, in addition to its established role of compensating for cell replication-dependent telomere shortening.
An improved understanding of etiological mechanisms in Parkinson's disease (PD) is urgently needed because the number of affected individuals is projected to increase rapidly as populations age. We ...present results from a blood-based methylome-wide association study of PD involving meta-analysis of 229 K CpG probes in 1,132 cases and 999 controls from two independent cohorts. We identify two previously unreported epigenome-wide significant associations with PD, including cg06690548 on chromosome 4. We demonstrate that cg06690548 hypermethylation in PD is associated with down-regulation of the SLC7A11 gene and show this is consistent with an environmental exposure, as opposed to medications or genetic factors with effects on DNA methylation or gene expression. These findings are notable because SLC7A11 codes for a cysteine-glutamate anti-porter regulating levels of the antioxidant glutathione, and it is a known target of the environmental neurotoxin β-methylamino-L-alanine (BMAA). Our study identifies the SLC7A11 gene as a plausible biological target in PD.
Migraine is a common heritable neurovascular disorder typically characterised by episodic attacks of severe pulsating headache and nausea, often accompanied by visual, auditory or other sensory ...symptoms. Although genome-wide association studies have identified over 40 single nucleotide polymorphisms associated with migraine, there remains uncertainty about the casual genes involved in disease pathogenesis and how their function is regulated.
We performed an epigenome-wide association study, quantifying genome-wide patterns of DNA methylation in 67 migraine cases and 67 controls with a matching age and sex distribution. Association analyses between migraine and methylation probe expression, after adjustment for cell type proportions, indicated an excess of small P values, but there was no significant single-probe association after correction for multiple testing (P < 1.09 × 10
). However, utilising a 1 kb sliding window approach to combine adjacent migraine-methylation association P values, we identified 62 independent differentially methylated regions (DMRs) underlying migraine (false discovery rate < 0.05). Migraine association signals were subtle but consistent in effect direction across the length of each DMR. Subsequent analyses showed that the migraine-associated DMRs were enriched in regulatory elements of the genome and were in close proximity to genes involved in solute transportation and haemostasis.
This study represents the first genome-wide analysis of DNA methylation in migraine. We have identified DNA methylation in the whole blood of subjects associated with migraine, highlighting novel loci that provide insight into the biological pathways and mechanisms underlying migraine pathogenesis.
Quantitative genetics theory predicts that X-chromosome dosage compensation (DC) will have a detectable effect on the amount of genetic and therefore phenotypic trait variances at associated loci in ...males and females. Here, we systematically examine the role of DC in humans in 20 complex traits in a sample of more than 450,000 individuals from the UK Biobank and 1600 gene expression traits from a sample of 2000 individuals as well as across-tissue gene expression from the GTEx resource. We find approximately twice as much X-linked genetic variation across the UK Biobank traits in males (mean h
= 0.63%) compared to females (mean h
= 0.30%), confirming the predicted DC effect. Our DC estimates for complex traits and gene expression are consistent with a small proportion of genes escaping X-inactivation in a trait- and tissue-dependent manner. Finally, we highlight examples of biologically relevant X-linked heterogeneity between the sexes that bias DC estimates if unaccounted for.
Twin studies have provided the basis for genetic and epidemiological studies in human complex traits. As epigenetic factors can contribute to phenotypic outcomes, we conducted a DNA methylation ...analysis in white blood cells (WBC), buccal epithelial cells and gut biopsies of 114 monozygotic (MZ) twins as well as WBC and buccal epithelial cells of 80 dizygotic (DZ) twins using 12K CpG island microarrays. Here we provide the first annotation of epigenetic metastability of ∼6,000 unique genomic regions in MZ twins. An intraclass correlation (ICC)-based comparison of matched MZ and DZ twins showed significantly higher epigenetic difference in buccal cells of DZ co-twins (P = 1.2 × 10−294). Although such higher epigenetic discordance in DZ twins can result from DNA sequence differences, our in silico SNP analyses and animal studies favor the hypothesis that it is due to epigenomic differences in the zygotes, suggesting that molecular mechanisms of heritability may not be limited to DNA sequence differences.