DNA methylation plays an important role in both normal human development and risk of disease. The most utilized method of assessing DNA methylation uses BeadChips, generating an epigenome-wide ...“snapshot” of >450,000 observations (probe measurements) per assay. However, the reliability of each of these measurements is not equal, and little consideration is paid to consequences for research. We correlated repeat measurements of the same DNA samples using the Illumina HumanMethylation450K and the Infinium MethylationEPIC BeadChips in 350 blood DNA samples. Probes that were reliably measured were more heritable and showed consistent associations with environmental exposures, gene expression, and greater cross-tissue concordance. Unreliable probes were less replicable and generated an unknown volume of false negatives. This serves as a lesson for working with DNA methylation data, but the lessons are equally applicable to working with other data: as we advance toward generating increasingly greater volumes of data, failure to document reliability risks harming reproducibility.
▪
•Measurements of DNA methylation made using BeadChip probes are differentially reliable•Unreliable probes were less heritable, less replicable, and less functionally relevant•This has serious implications for reporting and evaluating DNA methylation findings•Reliability joins replicability and reproducibility to make three fundamental Rs of STEM
Although DNA methylation data are used widely by researchers in many fields, the reliability of these data are surprisingly variable. Our findings remind us that, in an age of increasingly big data, research is only as robust as its foundations. We hope that our findings will improve the integrity of DNA methylation studies. We also hope that our findings serve as a cautionary reminder for those generating and implementing big data of any type: reliability is a fundamental aspect of replicability. Conducting analysis with reliable data will improve chances of replicable findings, which might lead to more actionable targets for further research. To the extent that reliable data improve replicability, the knock-on effect will be more public confidence in research and less effort spent trying to replicate findings that are bound to fail.
DNA methylation is an important mechanism of gene regulation. The most popular method to measure methylation is to use BeadChips that contain probes to index hundreds of thousands of methylation sites at once. However, these probes are not equally reliable. In blood DNA, unreliable probes were less heritable and less likely to index gene expression, and associations were less replicable. This has serious downstream consequences for reproducible science and should serve as a caution for all data scientists regardless of discipline.
A growing number of epigenome-wide association studies have demonstrated a role for DNA methylation in the brain in Alzheimer's disease. With the aim of exploring peripheral biomarker potential, we ...have examined DNA methylation patterns in whole blood collected from 284 individuals in the AddNeuroMed study, which included 89 nondemented controls, 86 patients with Alzheimer's disease, and 109 individuals with mild cognitive impairment, including 38 individuals who progressed to Alzheimer's disease within 1 year. We identified significant differentially methylated regions, including 12 adjacent hypermethylated probes in the HOXB6 gene in Alzheimer's disease, which we validated using pyrosequencing. Using weighted gene correlation network analysis, we identified comethylated modules of genes that were associated with key variables such as APOE genotype and diagnosis. In summary, this study represents the first large-scale epigenome-wide association study of Alzheimer's disease and mild cognitive impairment using blood. We highlight the differences in various loci and pathways in early disease, suggesting that these patterns relate to cognitive decline at an early stage.
•We performed an epigenome-wide assessment of DNA methylation in Alzheimer's disease, mild cognitive impairment, and control whole blood.•We observed hypermethylation of HOXB6 in AD, which was validated via pyrosequencing.•Network analysis (weighted gene correlation network analysis) showed differences in immune system pathways in disease.
DNA methylation plays an important role in both normal human development and risk of disease. The most utilized method of assessing DNA methylation uses BeadChips, generating an epigenome-wide ..."snapshot" of >450,000 observations (probe measurements) per assay. However, the reliability of each of these measurements is not equal, and little consideration is paid to consequences for research. We correlated repeat measurements of the same DNA samples using the Illumina HumanMethylation450K and the Infinium MethylationEPIC BeadChips in 350 blood DNA samples. Probes that were reliably measured were more heritable and showed consistent associations with environmental exposures, gene expression, and greater cross-tissue concordance. Unreliable probes were less replicable and generated an unknown volume of false negatives. This serves as a lesson for working with DNA methylation data, but the lessons are equally applicable to working with other data: as we advance toward generating increasingly greater volumes of data, failure to document reliability risks harming reproducibility.
Large-scale epigenome-wide association meta-analyses have identified multiple 'signatures'' of smoking. Drawing on these findings, we describe the construction of a polyepigenetic DNA methylation ...score that indexes smoking behavior and that can be utilized for multiple purposes in population health research. To validate the score, we use data from two birth cohort studies: The Dunedin Longitudinal Study, followed to age-38 years, and the Environmental Risk Study, followed to age-18 years. Longitudinal data show that changes in DNA methylation accumulate with increased exposure to tobacco smoking and attenuate with quitting. Data from twins discordant for smoking behavior show that smoking influences DNA methylation independently of genetic and environmental risk factors. Physiological data show that changes in DNA methylation track smoking-related changes in lung function and gum health over time. Moreover, DNA methylation changes predict corresponding changes in gene expression in pathways related to inflammation, immune response, and cellular trafficking. Finally, we present prospective data about the link between adverse childhood experiences (ACEs) and epigenetic modifications; these findings document the importance of controlling for smoking-related DNA methylation changes when studying biological embedding of stress in life-course research. We introduce the polyepigenetic DNA methylation score as a tool both for discovery and theory-guided research in epigenetic epidemiology.
Abstract
Illumina DNA methylation arrays are a widely used tool for performing genome-wide DNA methylation analyses. However, measurements obtained from these arrays may be affected by technical ...artefacts that result in spurious associations if left unchecked. Cross-reactivity represents one of the major challenges, meaning that probes may map to multiple regions in the genome. Although several studies have reported on this issue, few studies have empirically examined the impact of cross-reactivity in an epigenome-wide association study (EWAS). In this paper, we report on cross-reactivity issues that we discovered in a large EWAS on the presence of the C9orf72 repeat expansion in ALS patients. Specifically, we found that that the majority of the significant probes inadvertently cross-hybridized to the C9orf72 locus. Importantly, these probes were not flagged as cross-reactive in previous studies, leading to novel insights into the extent to which cross-reactivity can impact EWAS. Our findings are particularly relevant for epigenetic studies into diseases associated with repeat expansions and other types of structural variation. More generally however, considering that most spurious associations were not excluded based on pre-defined sets of cross-reactive probes, we believe that the presented data-driven flag and consider approach is relevant for any type of EWAS.
Measures to quantify changes in the pace of biological aging in response to intervention are needed to evaluate geroprotective interventions for humans. Previously, we showed that quantification of ...the pace of biological aging from a DNA-methylation blood test was possible (Belsky et al., 2020). Here, we report a next-generation DNA-methylation biomarker of Pace of Aging, DunedinPACE (for Pace of Aging Calculated from the Epigenome).
We used data from the Dunedin Study 1972-1973 birth cohort tracking within-individual decline in 19 indicators of organ-system integrity across four time points spanning two decades to model Pace of Aging. We distilled this two-decade Pace of Aging into a single-time-point DNA-methylation blood-test using elastic-net regression and a DNA-methylation dataset restricted to exclude probes with low test-retest reliability. We evaluated the resulting measure, named DunedinPACE, in five additional datasets.
DunedinPACE showed high test-retest reliability, was associated with morbidity, disability, and mortality, and indicated faster aging in young adults with childhood adversity. DunedinPACE effect-sizes were similar to GrimAge Clock effect-sizes. In analysis of incident morbidity, disability, and mortality, DunedinPACE and added incremental prediction beyond GrimAge.
DunedinPACE is a novel blood biomarker of the pace of aging for gerontology and geroscience.
This research was supported by US-National Institute on Aging grants AG032282, AG061378, AG066887, and UK Medical Research Council grant MR/P005918/1.
There has been a steady increase in the number of studies aiming to identify DNA methylation differences associated with complex phenotypes. Many of the challenges of epigenetic epidemiology ...regarding study design and interpretation have been discussed in detail, however there are analytical concerns that are outstanding and require further exploration. In this study we seek to address three analytical issues. First, we quantify the multiple testing burden and propose a standard statistical significance threshold for identifying DNA methylation sites that are associated with an outcome. Second, we establish whether linear regression, the chosen statistical tool for the majority of studies, is appropriate and whether it is biased by the underlying distribution of DNA methylation data. Finally, we assess the sample size required for adequately powered DNA methylation association studies.
We quantified DNA methylation in the Understanding Society cohort (n = 1175), a large population based study, using the Illumina EPIC array to assess the statistical properties of DNA methylation association analyses. By simulating null DNA methylation studies, we generated the distribution of p-values expected by chance and calculated the 5% family-wise error for EPIC array studies to be 9 × 10
. Next, we tested whether the assumptions of linear regression are violated by DNA methylation data and found that the majority of sites do not satisfy the assumption of normal residuals. Nevertheless, we found no evidence that this bias influences analyses by increasing the likelihood of affected sites to be false positives. Finally, we performed power calculations for EPIC based DNA methylation studies, demonstrating that existing studies with data on ~ 1000 samples are adequately powered to detect small differences at the majority of sites.
We propose that a significance threshold of P < 9 × 10
adequately controls the false positive rate for EPIC array DNA methylation studies. Moreover, our results indicate that linear regression is a valid statistical methodology for DNA methylation studies, despite the fact that the data do not always satisfy the assumptions of this test. These findings have implications for epidemiological-based studies of DNA methylation and provide a framework for the interpretation of findings from current and future studies.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Epigenome-wide association studies of Alzheimer's disease have highlighted neuropathology-associated DNA methylation differences, although existing studies have been limited in sample size and ...utilized different brain regions. Here, we combine data from six DNA methylomic studies of Alzheimer's disease (N = 1453 unique individuals) to identify differential methylation associated with Braak stage in different brain regions and across cortex. We identify 236 CpGs in the prefrontal cortex, 95 CpGs in the temporal gyrus and ten CpGs in the entorhinal cortex at Bonferroni significance, with none in the cerebellum. Our cross-cortex meta-analysis (N = 1408 donors) identifies 220 CpGs associated with neuropathology, annotated to 121 genes, of which 84 genes have not been previously reported at this significance threshold. We have replicated our findings using two further DNA methylomic datasets consisting of a further >600 unique donors. The meta-analysis summary statistics are available in our online data resource ( www.epigenomicslab.com/ad-meta-analysis/ ).
Characterizing the complex relationship between genetic, epigenetic, and transcriptomic variation has the potential to increase understanding about the mechanisms underpinning health and disease ...phenotypes. We undertook a comprehensive analysis of common genetic variation on DNA methylation (DNAm) by using the Illumina EPIC array to profile samples from the UK Household Longitudinal study. We identified 12,689,548 significant DNA methylation quantitative trait loci (mQTL) associations (p < 6.52 × 10−14) occurring between 2,907,234 genetic variants and 93,268 DNAm sites, including a large number not identified by previous DNAm-profiling methods. We demonstrate the utility of these data for interpreting the functional consequences of common genetic variation associated with > 60 human traits by using summary-data-based Mendelian randomization (SMR) to identify 1,662 pleiotropic associations between 36 complex traits and 1,246 DNAm sites. We also use SMR to characterize the relationship between DNAm and gene expression and thereby identify 6,798 pleiotropic associations between 5,420 DNAm sites and the transcription of 1,702 genes. Our mQTL database and SMR results are available via a searchable online database as a resource to the research community.
Epigenetic processes play a key role in orchestrating transcriptional regulation during development. The importance of DNA methylation in fetal brain development is highlighted by the dynamic ...expression of de novo DNA methyltransferases during the perinatal period and neurodevelopmental deficits associated with mutations in the methyl-CpG binding protein 2 (MECP2) gene. However, our knowledge about the temporal changes to the epigenome during fetal brain development has, to date, been limited. We quantified genome-wide patterns of DNA methylation at ∼ 400,000 sites in 179 human fetal brain samples (100 male, 79 female) spanning 23 to 184 d post-conception. We identified highly significant changes in DNA methylation across fetal brain development at >7% of sites, with an enrichment of loci becoming hypomethylated with fetal age. Sites associated with developmental changes in DNA methylation during fetal brain development were significantly underrepresented in promoter regulatory regions but significantly overrepresented in regions flanking CpG islands (shores and shelves) and gene bodies. Highly significant differences in DNA methylation were observed between males and females at a number of autosomal sites, with a small number of regions showing sex-specific DNA methylation trajectories across brain development. Weighted gene comethylation network analysis (WGCNA) revealed discrete modules of comethylated loci associated with fetal age that are significantly enriched for genes involved in neurodevelopmental processes. This is, to our knowledge, the most extensive study of DNA methylation across human fetal brain development to date, confirming the prenatal period as a time of considerable epigenomic plasticity.