Central nervous system-expressed long non-coding RNAs (lncRNAs) are often located in the genome close to protein coding genes involved in transcriptional control. Such lncRNA-protein coding gene ...pairs are frequently temporally and spatially co-expressed in the nervous system and are predicted to act together to regulate neuronal development and function. Although some of these lncRNAs also bind and modulate the activity of the encoded transcription factors, the regulatory mechanisms controlling co-expression of neighbouring lncRNA-protein coding genes remain unclear. Here, we used high resolution NG Capture-C to map the cis-regulatory interaction landscape of the key neuro-developmental Paupar-Pax6 lncRNA-mRNA locus. The results define chromatin architecture changes associated with high Paupar-Pax6 expression in neurons and identify both promoter selective as well as shared cis-regulatory-promoter interactions involved in regulating Paupar-Pax6 co-expression. We discovered that the TCF7L2 transcription factor, a regulator of chromatin architecture and major effector of the Wnt signalling pathway, binds to a subset of these candidate cis-regulatory elements to coordinate Paupar and Pax6 co-expression. We describe distinct roles for Paupar in Pax6 expression control and show that the Paupar DNA locus contains a TCF7L2 bound transcriptional silencer whilst the Paupar transcript can act as an activator of Pax6. Our work provides important insights into the chromatin interactions, signalling pathways and transcription factors controlling co-expression of adjacent lncRNAs and protein coding genes in the brain.
Gene expression during development and differentiation is regulated in a cell- and stage-specific manner by complex networks of intergenic and intragenic cis-regulatory elements whose numbers and ...representation in the genome far exceed those of structural genes. Using chromosome conformation capture, it is now possible to analyze in detail the interaction between enhancers, silencers, boundary elements and promoters at individual loci, but these techniques are not readily scalable. Here we present a high-throughput approach (Capture-C) to analyze cis interactions, interrogating hundreds of specific interactions at high resolution in a single experiment. We show how this approach will facilitate detailed, genome-wide analysis to elucidate the general principles by which cis-acting sequences control gene expression. In addition, we show how Capture-C will expedite identification of the target genes and functional effects of SNPs that are associated with complex diseases, which most frequently lie in intergenic cis-acting regulatory elements.
Abstract
The ubiquitous family of dimeric transcription factors AP-1 is made up of Fos and Jun family proteins. It has long been thought to operate principally at gene promoters and how it controls ...transcription is still ill-understood. The Fos family protein Fra-1 is overexpressed in triple negative breast cancers (TNBCs) where it contributes to tumor aggressiveness. To address its transcriptional actions in TNBCs, we combined transcriptomics, ChIP-seqs, machine learning and NG Capture-C. Additionally, we studied its Fos family kin Fra-2 also expressed in TNBCs, albeit much less. Consistently with their pleiotropic effects, Fra-1 and Fra-2 up- and downregulate individually, together or redundantly many genes associated with a wide range of biological processes. Target gene regulation is principally due to binding of Fra-1 and Fra-2 at regulatory elements located distantly from cognate promoters where Fra-1 modulates the recruitment of the transcriptional co-regulator p300/CBP and where differences in AP-1 variant motif recognition can underlie preferential Fra-1- or Fra-2 bindings. Our work also shows no major role for Fra-1 in chromatin architecture control at target gene loci, but suggests collaboration between Fra-1-bound and -unbound enhancers within chromatin hubs sometimes including promoters for other Fra-1-regulated genes. Our work impacts our view of AP-1.
The severe acute respiratory syndrome coronavirus 2 (SARS‑CoV‑2) disease (COVID-19) pandemic has caused millions of deaths worldwide. Genome-wide association studies identified the 3p21.31 region as ...conferring a twofold increased risk of respiratory failure. Here, using a combined multiomics and machine learning approach, we identify the gain-of-function risk A allele of an SNP, rs17713054G>A, as a probable causative variant. We show with chromosome conformation capture and gene-expression analysis that the rs17713054-affected enhancer upregulates the interacting gene, leucine zipper transcription factor like 1 (LZTFL1). Selective spatial transcriptomic analysis of lung biopsies from patients with COVID-19 shows the presence of signals associated with epithelial-mesenchymal transition (EMT), a viral response pathway that is regulated by LZTFL1. We conclude that pulmonary epithelial cells undergoing EMT, rather than immune cells, are likely responsible for the 3p21.31-associated risk. Since the 3p21.31 effect is conferred by a gain-of-function, LZTFL1 may represent a therapeutic target.
Methods for analyzing chromosome conformation in mammalian cells are either low resolution or low throughput and are technically challenging. In next-generation (NG) Capture-C, we have redesigned the ...Capture-C method to achieve unprecedented levels of sensitivity and reproducibility. NG Capture-C can be used to analyze many genetic loci and samples simultaneously. High-resolution data can be produced with as few as 100,000 cells, and single-nucleotide polymorphisms can be used to generate allele-specific tracks. The method is straightforward to perform and should greatly facilitate the investigation of many questions related to gene regulation as well as the functional dissection of traits examined in genome-wide association studies.
Histone H3.3 is a replication-independent histone variant which replaces histones that are turned over throughout the entire cell cycle. H3.3 deposition at euchromatin is dependent on HIRA while ...ATRX/Daxx deposits H3.3 at pericentric heterochromatin and telomeres. The role of H3.3 at heterochromatic regions is unknown but mutations in the ATRX/Daxx/H3.3 pathway are linked to aberrant telomere lengthening in certain cancers. In this study, we show that ATRX-dependent deposition of H3.3 is not limited to pericentric heterochromatin and telomeres but also occurs at heterochromatic sites throughout the genome. Notably, ATRX/H3.3 specifically localises to silenced imprinted alleles in mouse ES cells. ATRX KO cells failed to deposit H3.3 at these sites leading to loss of the H3K9me3 heterochromatin modification, loss of repression and aberrant allelic expression. We propose a model whereby ATRX-dependent deposition of H3.3 into heterochromatin is normally required to maintain the epigenetic memory of silencing at imprinted loci.
Display omitted
-ATRX deposits H3.3 at heterochromatin throughout the genome.-ATRX and H3.3 preferentially bind the methylated allele of imprinted DMRs.-H3.3 deposition at imprinted DMRs is dependent on ATRX.-Loss of ATRX/H3.3 leads to loss of H3K9me3 modification at imprinted DMRs.
The genome is organized via CTCF-cohesin-binding sites, which partition chromosomes into 1-5 megabase (Mb) topologically associated domains (TADs), and further into smaller sub-domains (sub-TADs). ...Here we examined in vivo an ∼80 kb sub-TAD, containing the mouse α-globin gene cluster, lying within a ∼1 Mb TAD. We find that the sub-TAD is flanked by predominantly convergent CTCF-cohesin sites that are ubiquitously bound by CTCF but only interact during erythropoiesis, defining a self-interacting erythroid compartment. Whereas the α-globin regulatory elements normally act solely on promoters downstream of the enhancers, removal of a conserved upstream CTCF-cohesin boundary extends the sub-TAD to adjacent upstream CTCF-cohesin-binding sites. The α-globin enhancers now interact with the flanking chromatin, upregulating expression of genes within this extended sub-TAD. Rather than acting solely as a barrier to chromatin modification, CTCF-cohesin boundaries in this sub-TAD delimit the region of chromatin to which enhancers have access and within which they interact with receptive promoters.
Long non-coding (lnc) RNAs can regulate gene expression and protein functions. However, the proportion of lncRNAs with biological activities among the thousands expressed in mammalian cells is ...controversial. We studied Lockd (lncRNA downstream of Cdkn1b), a 434-nt polyadenylated lncRNA originating 4 kb 3′ to the Cdkn1b gene. Deletion of the 25-kb Lockd locus reduced Cdkn1b transcription by approximately 70% in an erythroid cell line. In contrast, homozygous insertion of a polyadenylation cassette 80 bp downstream of the Lockd transcription start site reduced the entire lncRNA transcript level by >90% with no effect on Cdkn1b transcription. The Lockd promoter contains a DNase-hypersensitive site, binds numerous transcription factors, and physically associates with the Cdkn1b promoter in chromosomal conformation capture studies. Therefore, the Lockd gene positively regulates Cdkn1b transcription through an enhancer-like cis element, whereas the lncRNA itself is dispensable, which may be the case for other lncRNAs.
Display omitted
•The 5′ region of the Lockd lncRNA gene “loops” to the adjacent Cdkn1b gene promoter•Deletion of the Lockd gene impairs Cdkn1b transcription•Lockd RNA was truncated by insertion of a polyA signal•Lockd truncation has no effect on Cdkn1b transcription
Whether loci encoding lncRNAs function via their lncRNA transcripts or DNA elements is often unclear. Paralkar et al. provide a model for dissecting these contributions and show that the 5′ region of the Lockd lncRNA gene contains an enhancer for the neighboring Cdkn1b gene, whereas Lockd lncRNA is dispensable for Cdkn1b expression.
A substantial amount of organismal complexity is thought to be encoded by enhancers which specify the location, timing, and levels of gene expression. In mammals there are more enhancers than ...promoters which are distributed both between and within genes. Here we show that activated, intragenic enhancers frequently act as alternative tissue-specific promoters producing a class of abundant, spliced, multiexonic poly(A)+ RNAs (meRNAs) which reflect the host gene's structure. meRNAs make a substantial and unanticipated contribution to the complexity of the transcriptome, appearing as alternative isoforms of the host gene. The low protein-coding potential of meRNAs suggests that many meRNAs may be byproducts of enhancer activation or underlie as-yet-unidentified RNA-encoded functions. Distinguishing between meRNAs and mRNAs will transform our interpretation of dynamic changes in transcription both at the level of individual genes and of the genome as a whole.
Display omitted
► Intragenic enhancers act as unidirectional, cell-specific, alternative promoters ► Activated intragenic enhancers produce a class of multiexonic poly(A)+ RNAs, meRNAs ► meRNAs appear as isoforms of the host gene ► meRNAs add significantly to transcriptome complexity
Specific communication between gene promoters and enhancers is critical for accurate regulation of gene expression. However, it remains unclear how specific interactions between multiple regulatory ...elements contained within a single chromatin domain are coordinated. Recent technological advances which can detect multi-way chromatin interactions at single alleles can provide insights into how multiple regulatory elements cooperate or compete for transcriptional activation. Here, we use such an approach to investigate how interactions of the α-globin enhancers are distributed between multiple promoters in a mouse model in which the α-globin domain is extended to include several additional genes. Our data show that gene promoters do not form mutually exclusive interactions with enhancers, but all interact simultaneously in a single complex. These findings suggest that promoters do not structurally compete for interactions with enhancers, but form a regulatory hub structure, which is consistent with recent models of transcriptional activation occurring in non-membrane bound nuclear compartments.