Human aging cannot be fully understood in terms of the constrained genetic setting. Epigenetic drift is an alternative means of explaining age-associated alterations. To address this issue, we ...performed whole-genome bisulfite sequencing (WGBS) of newborn and centenarian genomes. The centenarian DNA had a lower DNA methylation content and a reduced correlation in the methylation status of neighboring cytosine—phosphate—guanine (CpGs) throughout the genome in comparison with the more homogeneously methylated newborn DNA. The more hypomethylated CpGs observed in the centenarian DNA compared with the neonate covered all genomic compartments, such as promoters, exonic, intronic, and intergenic regions. For regulatory regions, the most hypomethylated sequences in the centenarian DNA were present mainly at CpG-poor promoters and in tissue-specific genes, whereas a greater level of DNA methylation was observed in CpG island promoters. We extended the study to a larger cohort of newborn and nonagenarian samples using a 450,000 CpG-site DNA methylation microarray that reinforced the observation of more hypomethylated DNA sequences in the advanced age group. WGBS and 450,000 analyses of middle-age individuals demonstrated DNA methylomes in the crossroad between the newborn and the nonagenarian/centenarian groups. Our study constitutes a unique DNA methylation analysis of the extreme points of human life at a single-nucleotide resolution level.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK
Abstract
Motivation
DNA methylation is essential for normal embryogenesis and development in mammals and can be captured at single base pair resolution by whole genome bisulfite sequencing (WGBS). ...Current available analysis tools are becoming rapidly outdated as they lack sensible functionality and efficiency to handle large amounts of data now commonly created.
Results
We developed gemBS, a fast high-throughput bioinformatics pipeline specifically designed for large scale BS-Seq analysis that combines a high performance BS-mapper (GEM3) and a variant caller specifically for BS-Seq data (BScall). gemBS provides genotype information and methylation estimates for all genomic cytosines in different contexts (CpG and non-CpG) and a set of quality reports for comprehensive and reproducible analysis. gemBS is highly modular and can be easily automated, while producing robust and accurate results.
Availability and implementation
gemBS is released under the GNU GPLv3+ license. Source code and documentation are freely available from www.statgen.cat/gemBS.
Supplementary information
Supplementary data are available at Bioinformatics online.
Mammalian Y chromosomes are often neglected from genomic analysis. Due to their inherent assembly difficulties, high repeat content, and large ampliconic regions, only a handful of species have their ...Y chromosome properly characterized. To date, just a single human reference quality Y chromosome, of European ancestry, is available due to a lack of accessible methodology. To facilitate the assembly of such complicated genomic territory, we developed a novel strategy to sequence native, unamplified flow sorted DNA on a MinION nanopore sequencing device. Our approach yields a highly continuous assembly of the first human Y chromosome of African origin. It constitutes a significant improvement over comparable previous methods, increasing continuity by more than 800%. Sequencing native DNA also allows to take advantage of the nanopore signal data to detect epigenetic modifications in situ. This approach is in theory generalizable to any species simplifying the assembly of extremely large and repetitive genomes.
Here, we report DNA methylation and hydroxymethylation dynamics at nucleotide resolution using C/EBPα-enhanced reprogramming of B cells into induced pluripotent cells (iPSCs). We observed successive ...waves of hydroxymethylation at enhancers, concomitant with a decrease in DNA methylation, suggesting active demethylation. Consistent with this finding, ablation of the DNA demethylase Tet2 almost completely abolishes reprogramming. C/EBPα, Klf4, and Tfcp2l1 each interact with Tet2 and recruit the enzyme to specific DNA sites. During reprogramming, some of these sites maintain high levels of 5hmC, and enhancers and promoters of key pluripotency factors become demethylated as early as 1 day after Yamanaka factor induction. Surprisingly, methylation changes precede chromatin opening in distinct chromatin regions, including Klf4 bound sites, revealing a pioneer factor activity associated with alteration in DNA methylation. Rapid changes in hydroxymethylation similar to those in B cells were also observed during compound-accelerated reprogramming of fibroblasts into iPSCs, highlighting the generality of our observations.
Display omitted
•Base-resolution profiling of DNA (hydroxy)methylation during iPSC reprogramming•Major contribution of Tet2-mediated demethylation throughout reprogramming•C/EBPα, Klf4, and Tfcp2l1 drive Tet2-mediated enhancer demethylation and activation•Klf4 induces enhancer demethylation in the absence of nucleosome repositioning
Using a highly efficient reprogramming system, Sardina et al. examined the dynamics of DNA methylation and hydroxymethylation. They found that throughout the process several transcription factors can recruit Tet2 to specific sites, leading to demethylation. Some of these sites became demethylated before chromatin opening.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
Using a novel gene chip, investigators identified single-nucleotide polymorphisms (SNPs) from three chromosomal regions, including the
LPA
locus, that were associated with the risk of coronary ...disease. Two SNPs in
LPA
were strongly associated with both the level of Lp(a) lipoprotein and the risk of coronary disease. After adjustment for the Lp(a) lipoprotein level, the association with the risk of coronary disease was abolished. These findings support a causal role of an increased Lp(a) lipoprotein level in the risk of coronary disease.
Two SNPs in the
LPA
locus were strongly associated with both the level of Lp(a) lipoprotein and the risk of coronary disease. These findings support a causal role of an increased Lp(a) lipoprotein level in the risk of coronary disease.
Genomewide association studies have identified several novel susceptibility loci for coronary artery disease,
1
–
4
but it is likely that only common variants can be detected in this way.
5
,
6
Moreover, loci that are identified with the use of genomewide association studies explain only a small amount of the expected contribution to the risk of coronary disease. The use of arrays of high-density single-nucleotide polymorphisms (SNPs) in candidate genes for cardiovascular disease may help elucidate the genetic contribution to the risk of coronary disease.
A recent genomewide association study showed that a cluster of genes — solute carrier family 22 member . . .
Natural killer (NK) cells are lymphocytes involved in antimicrobial and antitumoral immune responses. Using N-ethyl-N-nitrosourea mutagenesis in mice, we identified a mutant with increased resistance ...to viral infections because of the presence of hyperresponsive NK cells. Whole-genome sequencing and functional analysis revealed a loss-of-function mutation in the Ncr1 gene encoding the activating receptor NKp46. The down-regulation of NK cell activity by NKp46 was associated with the silencing of the Helios transcription factor in NK cells. NKp46 was critical for the subsequent development of antiviral and antibacterial T cell responses, which suggests that the regulation of NK cell function by NKp46 allows for the optimal development of adaptive immune responses. NKp46 blockade enhanced NK cell reactivity in vivo, which could enable the design of immunostimulation strategies in humans.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK
F cells measure the presence of fetal hemoglobin, a heritable quantitative trait in adults that accounts for substantial phenotypic diversity of sickle cell disease and β thalassemia. We applied a ...genome-wide association mapping strategy to individuals with contrasting extreme trait values and mapped a new F cell quantitative trait locus to BCL11A, which encodes a zinc-finger protein, on chromosome 2p15. The 2p15 BCL11A quantitative trait locus accounts for 15.1% of the trait variance.
Full text
Available for:
DOBA, IJS, IZUM, KILJ, NUK, PILJ, PNG, SAZU, UILJ, UKNU, UL, UM, UPUK
The cost of whole-genome bisulfite sequencing (WGBS) remains a bottleneck for many studies and it is therefore imperative to extract as much information as possible from a given dataset. This is ...particularly important because even at the recommend 30X coverage for reference methylomes, up to 50% of high-resolution features such as differentially methylated positions (DMPs) cannot be called with current methods as determined by saturation analysis. To address this limitation, we have developed a tool that dynamically segments WGBS methylomes into blocks of comethylation (COMETs) from which lost information can be recovered in the form of differentially methylated COMETs (DMCs). Using this tool, we demonstrate recovery of ∼30% of the lost DMP information content as DMCs even at very low (5X) coverage. This constitutes twice the amount that can be recovered using an existing method based on differentially methylated regions (DMRs). In addition, we explored the relationship between COMETs and haplotypes in lymphoblastoid cell lines of African and European origin. Using best fit analysis, we show COMETs to be correlated in a population-specific manner, suggesting that this type of dynamic segmentation may be useful for integrated (epi)genome-wide association studies in the future.
The molecular classification of glioblastoma (GBM) based on gene expression might better explain outcome and response to treatment than clinical factors. Whole transcriptome sequencing using ...next-generation sequencing platforms is rapidly becoming accepted as a tool for measuring gene expression for both research and clinical use. Fresh frozen (FF) tissue specimens of GBM are difficult to obtain since tumor tissue obtained at surgery is often scarce and necrotic and diagnosis is prioritized over freezing. After diagnosis, leftover tissue is usually stored as formalin-fixed paraffin-embedded (FFPE) tissue. However, RNA from FFPE tissues is usually degraded, which could hamper gene expression analysis. We compared RNA-Seq data obtained from matched pairs of FF and FFPE GBM specimens. Only three FFPE out of eleven FFPE-FF matched samples yielded informative results. Several quality-control measurements showed that RNA from FFPE samples was highly degraded but maintained transcriptomic similarities to RNA from FF samples. Certain issues regarding mutation analysis and subtype prediction were detected. Nevertheless, our results suggest that RNA-Seq of FFPE GBM specimens provides reliable gene expression data that can be used in molecular studies of GBM if the RNA is sufficiently preserved.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK