Bisulfite sequencing is a powerful technique to study DNA cytosine methylation. Bisulfite treatment followed by PCR amplification specifically converts unmethylated cytosines to thymine. Coupled with ...next generation sequencing technology, it is able to detect the methylation status of every cytosine in the genome. However, mapping high-throughput bisulfite reads to the reference genome remains a great challenge due to the increased searching space, reduced complexity of bisulfite sequence, asymmetric cytosine to thymine alignments, and multiple CpG heterogeneous methylation.
We developed an efficient bisulfite reads mapping algorithm BSMAP to address the above issues. BSMAP combines genome hashing and bitwise masking to achieve fast and accurate bisulfite mapping. Compared with existing bisulfite mapping approaches, BSMAP is faster, more sensitive and more flexible.
BSMAP is the first general-purpose bisulfite mapping software. It is able to map high-throughput bisulfite reads at whole genome level with feasible memory and CPU usage. It is freely available under GPL v3 license at http://code.google.com/p/bsmap/.
The recognition of modified histones by “reader” proteins constitutes a key mechanism regulating gene expression in the chromatin context. Compared with the great variety of readers for histone ...methylation, few protein modules that recognize histone acetylation are known. Here, we show that the AF9 YEATS domain binds strongly to histone H3K9 acetylation and, to a lesser extent, H3K27 and H3K18 acetylation. Crystal structural studies revealed that AF9 YEATS adopts an eight-stranded immunoglobin fold and utilizes a serine-lined aromatic “sandwiching” cage for acetyllysine readout, representing a novel recognition mechanism that is distinct from that of known acetyllysine readers. ChIP-seq experiments revealed a strong colocalization of AF9 and H3K9 acetylation genome-wide, which is important for the chromatin recruitment of the H3K79 methyltransferase DOT1L. Together, our studies identified the evolutionarily conserved YEATS domain as a novel acetyllysine-binding module and established a direct link between histone acetylation and DOT1L-mediated H3K79 methylation in transcription control.
Display omitted
•The YEATS domains constitute a novel family of readers for histone acetylation•AF9 YEATS binds to histone H3K9 acetylation via a novel recognition mechanism•AF9 colocalizes with H3K9 acetylation genome-wide•AF9 recruits DOT1L to deposit H3K79 methylation on active chromatin
The evolutionarily conserved YEATS domain is a novel acetyllysine-binding module and binds strongly to histone H3K9 acetylation. It serves as a direct link between histone acetylation and DOT1L-mediated H3K79 methylation in transcription control.
Recent developments in next-generation sequencing have enabled whole-genome profiling of nucleosome organizations. Although several algorithms for inferring nucleosome position from a single ...experimental condition have been available, it remains a challenge to accurately define dynamic nucleosomes associated with environmental changes. Here, we report a comprehensive bioinformatics pipeline, DANPOS, explicitly designed for dynamic nucleosome analysis at single-nucleotide resolution. Using both simulated and real nucleosome data, we demonstrated that bias correction in preliminary data processing and optimal statistical testing significantly enhances the functional interpretation of dynamic nucleosomes. The single-nucleotide resolution analysis of DANPOS allows us to detect all three categories of nucleosome dynamics, such as position shift, fuzziness change, and occupancy change, using a uniform statistical framework. Pathway analysis indicates that each category is involved in distinct biological functions. We also analyzed the influence of sequencing depth and suggest that even 200-fold coverage is probably not enough to identify all the dynamic nucleosomes. Finally, based on nucleosome data from the human hematopoietic stem cells (HSCs) and mouse embryonic stem cells (ESCs), we demonstrated that DANPOS is also robust in defining functional dynamic nucleosomes, not only in promoters, but also in distal regulatory regions in the mammalian genome.
Loss of the de novo DNA methyltransferases Dnmt3a and Dnmt3b in embryonic stem cells obstructs differentiation; however, the role of these enzymes in somatic stem cells is largely unknown. Using ...conditional ablation, we show that Dnmt3a loss progressively impairs hematopoietic stem cell (HSC) differentiation over serial transplantation, while simultaneously expanding HSC numbers in the bone marrow. Dnmt3a-null HSCs show both increased and decreased methylation at distinct loci, including substantial CpG island hypermethylation. Dnmt3a-null HSCs upregulate HSC multipotency genes and downregulate differentiation factors, and their progeny exhibit global hypomethylation and incomplete repression of HSC-specific genes. These data establish Dnmt3a as a critical participant in the epigenetic silencing of HSC regulatory genes, thereby enabling efficient differentiation.
In the post-natal mammalian brain perivascular astrocytes (PAs) ensheath blood vessels to regulate their unique permeability properties known as the blood-brain barrier (BBB). Very little is known ...about PA-expressed genes and signaling pathways that mediate contact and communication with endothelial cells (ECs) to regulate BBB physiology. This is due, in part, to lack of suitable models to distinguish PAs from other astrocyte sub-populations in the brain. To decipher the unique biology of PAs, we used in vivo gene knock-in technology to fluorescently label these cells in the adult mouse brain followed by fractionation and quantitative single cell RNA sequencing. In addition, PAs and non-PAs were also distinguished with transgenic fluorescent reporters followed by gene expression comparisons using bulk RNA sequencing. These efforts have identified several genes and pathways in PAs with potential roles in contact and communication with brain ECs. These genes encode various extracellular matrix (ECM) proteins and adhesion receptors, secreted growth factors, and intracellular signaling enzymes. Collectively, our experimental data reveal a set of genes that are expressed in PAs with putative roles in BBB physiology.
Sirtuin proteins regulate diverse cellular pathways that influence genomic stability, metabolism and ageing. SIRT7 is a mammalian sirtuin whose biochemical activity, molecular targets and ...physiological functions have been unclear. Here we show that SIRT7 is an NAD(+)-dependent H3K18Ac (acetylated lysine 18 of histone H3) deacetylase that stabilizes the transformed state of cancer cells. Genome-wide binding studies reveal that SIRT7 binds to promoters of a specific set of gene targets, where it deacetylates H3K18Ac and promotes transcriptional repression. The spectrum of SIRT7 target genes is defined in part by its interaction with the cancer-associated E26 transformed specific (ETS) transcription factor ELK4, and comprises numerous genes with links to tumour suppression. Notably, selective hypoacetylation of H3K18Ac has been linked to oncogenic transformation, and in patients is associated with aggressive tumour phenotypes and poor prognosis. We find that deacetylation of H3K18Ac by SIRT7 is necessary for maintaining essential features of human cancer cells, including anchorage-independent growth and escape from contact inhibition. Moreover, SIRT7 is necessary for a global hypoacetylation of H3K18Ac associated with cellular transformation by the viral oncoprotein E1A. Finally, SIRT7 depletion markedly reduces the tumorigenicity of human cancer cell xenografts in mice. Together, our work establishes SIRT7 as a highly selective H3K18Ac deacetylase and demonstrates a pivotal role for SIRT7 in chromatin regulation, cellular transformation programs and tumour formation in vivo.
The structural complexity of nucleosomes underlies their functional versatility. Here we report a new type of complexity-nucleosome fragility, manifested as high sensitivity to micrococcal nuclease, ...in contrast to the common presumption that nucleosomes are similar in resistance to MNase digestion. Using differential MNase digestion of chromatin and high-throughput sequencing, we have identified a special group of nucleosomes termed "fragile nucleosomes" throughout the yeast genome, nearly 1000 of which were at previously determined "nucleosome-free" loci. Nucleosome fragility is broadly implicated in multiple chromatin processes, including transcription, translocation, and replication, in correspondence to specific physiological states of cells. In the environmental-stress-response genes, the presence of fragile nucleosomes prior to the occurrence of environmental changes suggests that nucleosome fragility poises genes for swift up-regulation in response to the environmental changes. We propose that nucleosome fragility underscores distinct functional statuses of the chromatin and provides a new dimension for portraying the landscape of genome organization.
Noncoding transcription is a defining feature of active enhancers, linking transcription factor (TF) binding to the molecular mechanisms controlling gene expression. To determine the relationship ...between enhancer activity and biological outcomes in breast cancers, we profiled the transcriptomes (using GRO-seq and RNA-seq) and epigenomes (using ChIP-seq) of 11 different human breast cancer cell lines representing five major molecular subtypes of breast cancer, as well as two immortalized ("normal") human breast cell lines. In addition, we developed a robust and unbiased computational pipeline that simultaneously identifies putative subtype-specific enhancers and their cognate TFs by integrating the magnitude of enhancer transcription, TF mRNA expression levels, TF motif
-values, and enrichment of H3K4me1 and H3K27ac. When applied across the 13 different cell lines noted above, the Total Functional Score of Enhancer Elements (TFSEE) identified key breast cancer subtype-specific TFs that act at transcribed enhancers to dictate gene expression patterns determining growth outcomes, including Forkhead TFs, FOSL1, and PLAG1. FOSL1, a Fos family TF, (1) is highly enriched at the enhancers of triple negative breast cancer (TNBC) cells, (2) acts as a key regulator of the proliferation and viability of TNBC cells, but not Luminal A cells, and (3) is associated with a poor prognosis in TNBC breast cancer patients. Taken together, our results validate our enhancer identification pipeline and reveal that enhancers transcribed in breast cancer cells direct critical gene regulatory networks that promote pathogenesis.
The histone lysine methyltransferase NSD2 (MMSET/WHSC1) is implicated in diverse diseases and commonly overexpressed in multiple myeloma due to a recurrent t(4;14) chromosomal translocation. However, ...the precise catalytic activity of NSD2 is obscure, preventing progress in understanding how this enzyme influences chromatin biology and myeloma pathogenesis. Here, we show that dimethylation of histone H3 at lysine 36 (H3K36me2) is the principal chromatin-regulatory activity of NSD2. Catalysis of H3K36me2 by NSD2 is sufficient for gene activation. In t(4;14)-positive myeloma cells, the normal genome-wide and gene-specific distribution of H3K36me2 is obliterated, creating a chromatin landscape that selects for a transcription profile favorable for myelomagenesis. Catalytically active NSD2 confers xenograft tumor formation upon t(4;14)-negative cells and promotes oncogenic transformation of primary cells in an H3K36me2-dependent manner. Together, our findings establish H3K36me2 as the primary product generated by NSD2 and demonstrate that genomic disorganization of this canonical chromatin mark by NSD2 initiates oncogenic programming.
► Dimethylation of H3K36 is the principal chromatin-regulatory activity of NSD2 ► NSD2, via H3K36me2 catalysis, promotes transcription and cell transformation ► NSD2 links genomic disorganization of H3K36me2 to oncogenic programming ► NSD2 catalytic activity is required for t(4;14)+ myeloma cell tumorigenicity