HiC-Pro is an optimized and flexible pipeline for processing Hi-C data from raw reads to normalized contact maps. HiC-Pro maps reads, detects valid ligation products, performs quality controls and ...generates intra- and inter-chromosomal contact maps. It includes a fast implementation of the iterative correction method and is based on a memory-efficient data format for Hi-C contact maps. In addition, HiC-Pro can use phased genotype data to build allele-specific contact maps. We applied HiC-Pro to different Hi-C datasets, demonstrating its ability to easily process large data in a reasonable time. Source code and documentation are available at http://github.com/nservant/HiC-Pro .
Paternal and maternal epigenomes undergo marked changes after fertilization
. Recent epigenomic studies have revealed the unusual chromatin landscapes that are present in oocytes, sperm and early ...preimplantation embryos, including atypical patterns of histone modifications
and differences in chromosome organization and accessibility, both in gametes
and after fertilization
. However, these studies have led to very different conclusions: the global absence of local topological-associated domains (TADs) in gametes and their appearance in the embryo
versus the pre-existence of TADs and loops in the zygote
. The questions of whether parental structures can be inherited in the newly formed embryo and how these structures might relate to allele-specific gene regulation remain open. Here we map genomic interactions for each parental genome (including the X chromosome), using an optimized single-cell high-throughput chromosome conformation capture (HiC) protocol
, during preimplantation in the mouse. We integrate chromosome organization with allelic expression states and chromatin marks, and reveal that higher-order chromatin structure after fertilization coincides with an allele-specific enrichment of methylation of histone H3 at lysine 27. These early parental-specific domains correlate with gene repression and participate in parentally biased gene expression-including in recently described, transiently imprinted loci
. We also find TADs that arise in a non-parental-specific manner during a second wave of genome assembly. These de novo domains are associated with active chromatin. Finally, we obtain insights into the relationship between TADs and gene expression by investigating structural changes to the paternal X chromosome before and during X chromosome inactivation in preimplantation female embryos
. We find that TADs are lost as genes become silenced on the paternal X chromosome but linger in regions that escape X chromosome inactivation. These findings demonstrate the complex dynamics of three-dimensional genome organization and gene expression during early development.
Long non‐coding RNAs (lncRNAs) play diverse roles in physiological and pathological processes. Several lncRNAs have been suggested to modulate gene expression by guiding chromatin‐modifying complexes ...to specific sites in the genome. However, besides the example of Xist, clear‐cut evidence demonstrating this novel mode of regulation remains sparse. Here, we focus on HOTAIR, a lncRNA that is overexpressed in several tumor types and previously proposed to play a key role in gene silencing through direct recruitment of Polycomb Repressive Complex 2 (PRC2) to defined genomic loci. Using genetic tools and a novel RNA‐tethering system, we investigated the interplay between HOTAIR and PRC2 in gene silencing. Surprisingly, we observed that forced overexpression of HOTAIR in breast cancer cells leads to subtle transcriptomic changes that appear to be independent of PRC2. Mechanistically, we found that artificial tethering of HOTAIR to chromatin causes transcriptional repression, but that this effect does not require PRC2. Instead, PRC2 recruitment appears to be a consequence of gene silencing. We propose that PRC2 binding to RNA might serve functions other than chromatin targeting.
Synopsis
The ability of lncRNA HOTAIR to recruit chromatin‐modifying factors has served as a paradigm for lncRNA‐mediated repression of target genes. Contrary to expectation, HOTAIR can exert repressive effects independent of PRC2, whose recruitment may rather be a downstream consequence of gene silencing.
Overexpression of lncRNA HOTAIR in breast cancer cells has limited transcriptional consequences.
Artificial tethering of HOTAIR at a reporter transgene leads to transcriptional repression.
The transcriptional repression mediated by HOTAIR is independent of PRC2.
Recruitment of chromatin‐modifying factors may not be the sought‐after mechanism for gene silencing by HOTAIR lncRNA, but rather its downstream consequence.
During the last 3 years, a number of approaches for the normalization of RNA sequencing data have emerged in the literature, differing both in the type of bias adjustment and in the statistical ...strategy adopted. However, as data continue to accumulate, there has been no clear consensus on the appropriate normalization method to be used or the impact of a chosen method on the downstream analysis. In this work, we focus on a comprehensive comparison of seven recently proposed normalization methods for the differential analysis of RNA-seq data, with an emphasis on the use of varied real and simulated datasets involving different species and experimental designs to represent data characteristics commonly observed in practice. Based on this comparison study, we propose practical recommendations on the appropriate normalization method to be used and its impact on the differential analysis of RNA-seq data.
DNA methylation is essential for protecting the mammalian germline against transposons. When DNA methylation-based transposon control is defective, meiotic chromosome pairing is consistently impaired ...during spermatogenesis: How and why meiosis is vulnerable to transposon activity is unknown. Using two DNA methylation-deficient backgrounds, the Dnmt3L and Miwi2 mutant mice, we reveal that DNA methylation is largely dispensable for silencing transposons before meiosis onset. After this, it becomes crucial to back up to a developmentally programmed H3K9me2 loss. Massive retrotransposition does not occur following transposon derepression, but the meiotic chromatin landscape is profoundly affected. Indeed, H3K4me3 marks gained over transcriptionally active transposons correlate with formation of SPO11-dependent double-strand breaks and recruitment of the DMC1 repair enzyme in Dnmt3L(-/-) meiotic cells, whereas these features are normally exclusive to meiotic recombination hot spots. Here, we demonstrate that DNA methylation restrains transposons from adopting chromatin characteristics amenable to meiotic recombination, which we propose prevents the occurrence of erratic chromosomal events.
Sequencing technologies give access to a precise picture of the molecular mechanisms acting upon genome regulation. One of the biggest technical challenges with sequencing data is to map millions of ...reads to a reference genome. This problem is exacerbated when dealing with repetitive sequences such as transposable elements that occupy half of the mammalian genome mass. Sequenced reads coming from these regions introduce ambiguities in the mapping step. Therefore, applying dedicated parameters and algorithms has to be taken into consideration when transposable elements regulation is investigated with sequencing datasets.
Here, we used simulated reads on the mouse and human genomes to define the best parameters for aligning transposable element-derived reads on a reference genome. The efficiency of the most commonly used aligners was compared and we further evaluated how transposable element representation should be estimated using available methods. The mappability of the different transposon families in the mouse and the human genomes was calculated giving an overview into their evolution.
Based on simulated data, we provided recommendations on the alignment and the quantification steps to be performed when transposon expression or regulation is studied, and identified the limits in detecting specific young transposon families of the mouse and human genomes. These principles may help the community to adopt standard procedures and raise awareness of the difficulties encountered in the study of transposable elements.
Polycomb Group (PcG) proteins maintain transcriptional repression throughout development, mostly by regulating chromatin structure. Polycomb Repressive Complex 2 (PRC2), a component of the Polycomb ...machinery, is responsible for the methylation of histone H3 lysine 27 (H3K27me2/3). Jarid2 was previously identified as a cofactor of PRC2, regulating PRC2 targeting to chromatin and its enzymatic activity. Deletion of Jarid2 leads to impaired orchestration of gene expression during cell lineage commitment. Here, we reveal an unexpected crosstalk between Jarid2 and PRC2, with Jarid2 being methylated by PRC2. This modification is recognized by the Eed core component of PRC2 and triggers an allosteric activation of PRC2’s enzymatic activity. We show that Jarid2 methylation is important to promote PRC2 activity at a locus devoid of H3K27me3 and for the correct deposition of this mark during cell differentiation. Our results uncover a regulation loop where Jarid2 methylation fine-tunes PRC2 activity depending on the chromatin context.
Display omitted
•PRC2 methylates Jarid2 on K116•Jarid2 methylation promotes PRC2 activity•H3K27me3 and Jarid2-K116me3 bind to the aromatic cage of Eed•Jarid2 methylation regulates H3K27me3 deposition during ESC differentiation
Methylation of H3K27 by the Polycomb complex PRC2 is essential for maintaining transcriptional silencing. Sanulli et al. report that Jarid2, a cofactor of this complex is also methylated by PRC2. Reciprocally, methylated Jarid2 regulates PRC2 enzymatic activity creating a regulatory loop.
In Drosophila, a complex consisting of Calypso and ASX catalyzes H2A deubiquitination and has been reported to act as part of the Polycomb machinery in transcriptional silencing. The mammalian ...homologs of these proteins (BAP1 and ASXL1/2/3, respectively), are frequently mutated in various cancer types, yet their precise functions remain unclear. Using an integrative approach based on isogenic cell lines generated with CRISPR/Cas9, we uncover an unanticipated role for BAP1 in gene activation. This function requires the assembly of an enzymatically active BAP1-associated core complex (BAP1.com) containing one of the redundant ASXL proteins. We investigate the mechanism underlying BAP1.com-mediated transcriptional regulation and show that it does not participate in Polycomb-mediated silencing. Instead, our results establish that the function of BAP1.com is to safeguard transcriptionally active genes against silencing by the Polycomb Repressive Complex 1.
Normalization is essential to ensure accurate analysis and proper interpretation of sequencing data, and chromosome conformation capture data such as Hi-C have particular challenges. Although several ...methods have been proposed, the most widely used type of normalization of Hi-C data usually casts estimation of unwanted effects as a matrix balancing problem, relying on the assumption that all genomic regions interact equally with each other.
In order to explore the effect of copy-number variations on Hi-C data normalization, we first propose a simulation model that predict the effects of large copy-number changes on a diploid Hi-C contact map. We then show that the standard approaches relying on equal visibility fail to correct for unwanted effects in the presence of copy-number variations. We thus propose a simple extension to matrix balancing methods that model these effects. Our approach can either retain the copy-number variation effects (LOIC) or remove them (CAIC). We show that this leads to better downstream analysis of the three-dimensional organization of rearranged genomes.
Taken together, our results highlight the importance of using dedicated methods for the analysis of Hi-C cancer data. Both CAIC and LOIC methods perform well on simulated and real Hi-C data sets, each fulfilling different needs.
The Polycomb group of proteins is required for the proper orchestration of gene expression due to its role in maintaining transcriptional silencing. It is composed of several chromatin modifying ...complexes, including Polycomb Repressive Complex 2 (PRC2), which deposits H3K27me2/3. Here, we report the identification of a cofactor of PRC2, EZHIP (EZH1/2 Inhibitory Protein), expressed predominantly in the gonads. EZHIP limits the enzymatic activity of PRC2 and lessens the interaction between the core complex and its accessory subunits, but does not interfere with PRC2 recruitment to chromatin. Deletion of Ezhip in mice leads to a global increase in H3K27me2/3 deposition both during spermatogenesis and at late stages of oocyte maturation. This does not affect the initial number of follicles but is associated with a reduction of follicles in aging. Our results suggest that mature oocytes Ezhip-/- might not be fully functional and indicate that fertility is strongly impaired in Ezhip-/- females. Altogether, our study uncovers EZHIP as a regulator of chromatin landscape in gametes.