Naïve human pluripotent stem cells (hPSCs) provide a unique experimental platform of cell fate decisions during pre-implantation development, but their lineage potential remains incompletely ...characterized. As naïve hPSCs share transcriptional and epigenomic signatures with trophoblast cells, it has been proposed that the naïve state may have enhanced predisposition for differentiation along this extraembryonic lineage. Here we examined the trophoblast potential of isogenic naïve and primed hPSCs. We found that naïve hPSCs can directly give rise to human trophoblast stem cells (hTSCs) and undergo further differentiation into both extravillous and syncytiotrophoblast. In contrast, primed hPSCs do not support hTSC derivation, but give rise to non-self-renewing cytotrophoblasts in response to BMP4. Global transcriptome and chromatin accessibility analyses indicate that hTSCs derived from naïve hPSCs are similar to blastocyst-derived hTSCs and acquire features of post-implantation trophectoderm. The derivation of hTSCs from naïve hPSCs will enable elucidation of early mechanisms that govern normal human trophoblast development and associated pathologies.
Transposable elements (TEs) have been shown to contain functional binding sites for certain transcription factors (TFs). However, the extent to which TEs contribute to the evolution of TF binding ...sites is not well known. We comprehensively mapped binding sites for 26 pairs of orthologous TFs in two pairs of human and mouse cell lines (representing two cell lineages), along with epigenomic profiles, including DNA methylation and six histone modifications. Overall, we found that 20% of binding sites were embedded within TEs. This number varied across different TFs, ranging from 2% to 40%. We further identified 710 TF-TE relationships in which genomic copies of a TE subfamily contributed a significant number of binding peaks for a TF, and we found that LTR elements dominated these relationships in human. Importantly, TE-derived binding peaks were strongly associated with open and active chromatin signatures, including reduced DNA methylation and increased enhancer-associated histone marks. On average, 66% of TE-derived binding events were cell type-specific with a cell type-specific epigenetic landscape. Most of the binding sites contributed by TEs were species-specific, but we also identified binding sites conserved between human and mouse, the functional relevance of which was supported by a signature of purifying selection on DNA sequences of these TEs. Interestingly, several TFs had significantly expanded binding site landscapes only in one species, which were linked to species-specific gene functions, suggesting that TEs are an important driving force for regulatory innovation. Taken together, our data suggest that TEs have significantly and continuously shaped gene regulatory networks during mammalian evolution.
The recent derivation of human trophoblast stem cells (hTSCs) provides a scalable in vitro model system of human placental development, but the molecular regulators of hTSC identity have not been ...systematically explored thus far. Here, we utilize a genome-wide CRISPR-Cas9 knockout screen to comprehensively identify essential and growth-restricting genes in hTSCs. By cross-referencing our data to those from similar genetic screens performed in other cell types, as well as gene expression data from early human embryos, we define hTSC-specific and -enriched regulators. These include both well-established and previously uncharacterized trophoblast regulators, such as ARID3A, GATA2, and TEAD1 (essential), and GCM1, PTPN14, and TET2 (growth-restricting). Integrated analysis of chromatin accessibility, gene expression, and genome-wide location data reveals that the transcription factor TEAD1 regulates the expression of many trophoblast regulators in hTSCs. In the absence of TEAD1, hTSCs fail to complete faithful differentiation into extravillous trophoblast (EVT) cells and instead show a bias towards syncytiotrophoblast (STB) differentiation, thus indicating that this transcription factor safeguards the bipotent lineage potential of hTSCs. Overall, our study provides a valuable resource for dissecting the molecular regulation of human placental development and diseases.
ATAC-seq is widely used to measure chromatin accessibility and identify open chromatin regions (OCRs). OCRs usually indicate active regulatory elements in the genome and are directly associated with ...the gene regulatory network. The identification of differential accessibility regions (DARs) between different biological conditions is critical in determining the differential activity of regulatory elements. Differential analysis of ATAC-seq shares many similarities with differential expression analysis of RNA-seq data. However, the distribution of ATAC-seq signal intensity is different from that of RNA-seq data, and higher sensitivity is required for DARs identification. Many different tools can be used to perform differential analysis of ATAC-seq data, but a comprehensive comparison and benchmarking of these methods is still lacking. Here, we used simulated datasets to systematically measure the sensitivity and specificity of six different methods. We further discussed the statistical and signal density cut-offs in the differential analysis of ATAC-seq by applying them to real data. Batch effects are very common in high-throughput sequencing experiments. We illustrated that batch-effect correction can dramatically improve sensitivity in the differential analysis of ATAC-seq data. Finally, we developed a user-friendly package, BeCorrect, to perform batch effect correction and visualization of corrected ATAC-seq signals in a genome browser.
Transposable elements (TEs) are major contributors of genetic material in mammalian genomes. These often include binding sites for architectural proteins, including the multifarious master protein, ...CTCF, which shapes the 3D genome by creating loops, domains, compartment borders, and RNA-DNA interactions. These play a role in the compact packaging of DNA and have the potential to facilitate regulatory function. In this study, we explore the widespread contribution of TEs to mammalian 3D genomes by quantifying the extent to which they give rise to loops and domain border differences across various cell types and species using several 3D genome mapping technologies. We show that specific families and subfamilies of TEs have contributed to lineage-specific 3D chromatin structures across mammalian species. In many cases, these loops may facilitate sustained interaction between distant cis-regulatory elements and target genes, and domains may segregate chromatin state to impact gene expression in a lineage-specific manner. An experimental validation of our analytical findings using CRISPR-Cas9 to delete a candidate TE resulted in disruption of species-specific 3D chromatin structure. Taken together, we comprehensively quantify and selectively validate our finding that TEs contribute to shaping 3D genome organization and may, in some cases, impact gene regulation during the course of mammalian evolution.
We investigated maternal obesity in inbred SM/J mice by assigning females to a high-fat diet or a low-fat diet at weaning, mating them to low-fat-fed males, cross-fostering the offspring to ...low-fat-fed SM/J nurses at birth, and weaning the offspring onto a high-fat or low-fat diet. A maternal high-fat diet exacerbated obesity in the high-fat-fed daughters, causing them to weigh more, have more fat, and have higher serum levels of leptin as adults, accompanied by dozens of gene expression changes and thousands of DNA methylation changes in their livers and hearts. Maternal diet particularly affected genes involved in RNA processing, immune response, and mitochondria. Between one-quarter and one-third of differentially expressed genes contained a differentially methylated region associated with maternal diet. An offspring high-fat diet reduced overall variation in DNA methylation, increased body weight and organ weights, increased long bone lengths and weights, decreased insulin sensitivity, and changed the expression of 3,908 genes in the liver. Although the offspring were more affected by their own diet, their maternal diet had epigenetic effects lasting through adulthood, and in the daughters these effects were accompanied by phenotypic changes relevant to obesity and diabetes.
Understanding the molecular underpinnings of pluripotency is a prerequisite for optimal maintenance and application of embryonic stem cells (ESCs). While the protein-protein interactions of core ...pluripotency factors have been identified in mouse ESCs, their interactome in human ESCs (hESCs) has not to date been explored. Here we mapped the OCT4 interactomes in naïve and primed hESCs, revealing extensive connections to mammalian ATP-dependent nucleosome remodeling complexes. In naïve hESCs, OCT4 is associated with both BRG1 and BRM, the two paralog ATPases of the BAF complex. Genome-wide location analyses and genetic studies reveal that these two enzymes cooperate in a functionally redundant manner in the transcriptional regulation of blastocyst-specific genes. In contrast, in primed hESCs, OCT4 cooperates with BRG1 and SOX2 to promote chromatin accessibility at ectodermal genes. This work reveals how a common transcription factor utilizes differential BAF complexes to control distinct transcriptional programs in naïve and primed hESCs.
The zebrafish (Danio rerio) has been widely used in the study of human disease and development, and about 70% of the protein-coding genes are conserved between the two species
. However, studies in ...zebrafish remain constrained by the sparse annotation of functional control elements in the zebrafish genome. Here we performed RNA sequencing, assay for transposase-accessible chromatin using sequencing (ATAC-seq), chromatin immunoprecipitation with sequencing, whole-genome bisulfite sequencing, and chromosome conformation capture (Hi-C) experiments in up to eleven adult and two embryonic tissues to generate a comprehensive map of transcriptomes, cis-regulatory elements, heterochromatin, methylomes and 3D genome organization in the zebrafish Tübingen reference strain. A comparison of zebrafish, human and mouse regulatory elements enabled the identification of both evolutionarily conserved and species-specific regulatory sequences and networks. We observed enrichment of evolutionary breakpoints at topologically associating domain boundaries, which were correlated with strong histone H3 lysine 4 trimethylation (H3K4me3) and CCCTC-binding factor (CTCF) signals. We performed single-cell ATAC-seq in zebrafish brain, which delineated 25 different clusters of cell types. By combining long-read DNA sequencing and Hi-C, we assembled the sex-determining chromosome 4 de novo. Overall, our work provides an additional epigenomic anchor for the functional annotation of vertebrate genomes and the study of evolutionarily conserved elements of 3D genome organization.
Although it is known that the methylation of DNA in 5' promoters suppresses gene expression, the role of DNA methylation in gene bodies is unclear. In mammals, tissue- and cell type-specific ...methylation is present in a small percentage of 5' CpG island (CGI) promoters, whereas a far greater proportion occurs across gene bodies, coinciding with highly conserved sequences. Tissue-specific intragenic methylation might reduce, or, paradoxically, enhance transcription elongation efficiency. Capped analysis of gene expression (CAGE) experiments also indicate that transcription commonly initiates within and between genes. To investigate the role of intragenic methylation, we generated a map of DNA methylation from the human brain encompassing 24.7 million of the 28 million CpG sites. From the dense, high-resolution coverage of CpG islands, the majority of methylated CpG islands were shown to be in intragenic and intergenic regions, whereas less than 3% of CpG islands in 5' promoters were methylated. The CpG islands in all three locations overlapped with RNA markers of transcription initiation, and unmethylated CpG islands also overlapped significantly with trimethylation of H3K4, a histone modification enriched at promoters. The general and CpG-island-specific patterns of methylation are conserved in mouse tissues. An in-depth investigation of the human SHANK3 locus and its mouse homologue demonstrated that this tissue-specific DNA methylation regulates intragenic promoter activity in vitro and in vivo. These methylation-regulated, alternative transcripts are expressed in a tissue- and cell type-specific manner, and are expressed differentially within a single cell type from distinct brain regions. These results support a major role for intragenic methylation in regulating cell context-specific alternative promoters in gene bodies.
BTB domain And CNC Homolog 2 (Bach2) is a transcription repressor that actively participates in T and B lymphocyte development, but it is unknown if Bach2 is also involved in the development of ...innate immune cells, such as natural killer (NK) cells. Here, we followed the expression of Bach2 during murine NK cell development, finding that it peaked in immature CD27
CD11b
cells and decreased upon further maturation. Bach2 showed an organ and tissue-specific expression pattern in NK cells. Bach2 expression positively correlated with the expression of transcription factor TCF1 and negatively correlated with genes encoding NK effector molecules and those involved in the cell cycle. Lack of Bach2 expression caused changes in chromatin accessibility of corresponding genes. In the end, Bach2 deficiency resulted in increased proportions of terminally differentiated NK cells with increased production of granzymes and cytokines. NK cell-mediated control of tumor metastasis was also augmented in the absence of Bach2. Therefore, Bach2 is a key checkpoint protein regulating NK terminal maturation.