Eukaryotic genomes are extensively transcribed, forming both messenger RNAs (mRNAs) and noncoding RNAs (ncRNAs). ncRNAs made by RNA polymerase II often initiate from bidirectional promoters (nucleosome-depleted chromatin) that synthesize mRNA and ncRNA in opposite directions. We demonstrate that, by adopting a gene-loop conformation, actively transcribed mRNA-encoding genes restrict divergent transcription of ncRNAs. Because gene-loop formation depends on a protein factor (Ssu72) that coassociates with both the promoter and the terminator, the inactivation of Ssu72 leads to increased synthesis of promoter-associated divergent ncRNAs, referred to as Ssu72-restricted transcripts (SRTs). Similarly, inactivation of individual gene loops by gene mutation enhances SRT synthesis. We demonstrate that gene-loop conformation enforces transcriptional directionality on otherwise bidirectional promoters.
The genetic code, the binding specificity of all transfer RNAs, defines how protein primary structure is determined by DNA sequence. DNA also dictates when and where proteins are expressed, and this information is encoded in a pattern of specific sequence motifs that are recognized by transcription factors. However, the DNA-binding specificity is known for only a small fraction of the approximately 1400 human transcription factors (TFs). We describe here a high-throughput method for analyzing transcription factor binding specificity that is based on systematic evolution of ligands by exponential enrichment (SELEX) and massively parallel sequencing. The method is optimized for analysis of large numbers of TFs in parallel through the use of affinity-tagged proteins, barcoded selection oligonucleotides, and multiplexed sequencing. Data are analyzed by a new bioinformatic platform that uses the hundreds of thousands of sequencing reads obtained to control the quality of the experiments and to generate binding motifs for the TFs. The described technology allows higher throughput and identification of much longer binding profiles than current microarray-based methods. In addition, as our method is based on proteins expressed in mammalian cells, it can also be used to characterize DNA-binding preferences of full-length proteins or proteins requiring post-translational modifications. We validate the method by determining binding specificities of 14 different classes of TFs and by confirming the specificities for NFATC1 and RFX3 using ChIP-seq. Our results reveal unexpected dimeric modes of binding for several factors that were thought to preferentially bind DNA as monomers.
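The step from enriched sequencing reads to a binding motif can be illustrated with a minimal sketch. It assumes the selected sites have already been extracted and aligned to a common length, and it builds a simple position frequency matrix. The function names and the toy sequences are hypothetical and are not taken from the study, whose bioinformatic platform is not described here.

```python
from collections import Counter

BASES = "ACGT"

def position_frequency_matrix(sites):
    """Count each base at each position across equal-length, pre-aligned sites."""
    if not sites:
        raise ValueError("need at least one site")
    length = len(sites[0])
    if any(len(site) != length for site in sites):
        raise ValueError("all sites must have the same length")
    matrix = []
    for i in range(length):
        counts = Counter(site[i] for site in sites)
        matrix.append({base: counts.get(base, 0) for base in BASES})
    return matrix

def consensus(matrix):
    """Most frequent base per position (ties resolved in A, C, G, T order)."""
    return "".join(max(BASES, key=lambda b: column[b]) for column in matrix)

# Toy, made-up sites used purely for illustration.
toy_sites = ["TGGAAA", "TGGAAT", "AGGAAA", "TGGAAA"]
pfm = position_frequency_matrix(toy_sites)
print(consensus(pfm))
```

A real analysis would also need to handle barcode demultiplexing, background correction and motif discovery on unaligned reads, none of which this sketch attempts.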
The Polycomb repressive complexes PRC1 and PRC2 maintain embryonic stem cell (ESC) pluripotency by silencing lineage-specifying developmental regulator genes. Emerging evidence suggests that Polycomb complexes act through controlling spatial genome organization. We show that PRC1 functions as a master regulator of mouse ESC genome architecture by organizing genes in three-dimensional interaction networks. The strongest spatial network is composed of the four Hox gene clusters and early developmental transcription factor genes, the majority of which contact poised enhancers. Removal of Polycomb repression leads to disruption of promoter-promoter contacts in the Hox gene network. In contrast, promoter-enhancer contacts are maintained in the absence of Polycomb repression, with accompanying widespread acquisition of active chromatin signatures at network enhancers and pronounced transcriptional upregulation of network genes. Thus, PRC1 physically constrains developmental transcription factor genes and their enhancers in a silenced but poised spatial network. We propose that the selective release of genes from this spatial network underlies cell fate specification during early embryonic development.
Motor neurons (MNs) and astrocytes (ACs) are implicated in the pathogenesis of amyotrophic lateral sclerosis (ALS), but their interaction and the sequence of molecular events leading to MN death remain unresolved. Here, we optimized directed differentiation of induced pluripotent stem cells (iPSCs) into highly enriched (>85%) functional populations of spinal cord MNs and ACs. We identify significantly increased cytoplasmic TDP-43 and ER stress as primary pathogenic events in patient-specific valosin-containing protein (VCP)-mutant MNs, with secondary mitochondrial dysfunction and oxidative stress. Cumulatively, these cellular stresses result in synaptic pathology and cell death in VCP-mutant MNs. We additionally identify a cell-autonomous VCP-mutant AC survival phenotype, which is not attributable to the same molecular pathology occurring in VCP-mutant MNs. Finally, through iterative co-culture experiments, we uncover non-cell-autonomous effects of VCP-mutant ACs on both control and mutant MNs. This work elucidates molecular events and cellular interplay that could guide future therapeutic strategies in ALS.
• Robust and enriched motor neurogenesis and astrogliogenesis from human iPSCs
• VCP-mutant motor neurons show TDP-43 mislocalization and ER stress as early pathogenic events
• VCP-mutant astrocytes exhibit a cell-autonomous survival phenotype
• VCP mutations perturb the ability of astrocytes to support motor neuron survival
Hall et al. use iPSCs to examine the sequence of events by which motor neurons degenerate in a genetic form of ALS. They find that astrocytes, a type of supportive cell, also degenerate under these conditions. The ALS-causing mutation disrupts the ability of astrocytes to promote survival of motor neurons.
The regulatory interactions between transcription factors and their target genes can be conceptualised as a directed graph. At a global level, these regulatory networks display a scale-free topology, indicating the presence of regulatory hubs. At a local level, substructures such as motifs and modules can be discerned in these networks. Despite the general organisational similarity of networks across the phylogenetic spectrum, there are interesting qualitative differences among the network components, such as the transcription factors. Although the DNA-binding domains of the transcription factors encoded by a given organism are drawn from a small set of ancient conserved superfamilies, their relative abundance often shows dramatic variation among different phylogenetic groups. Large portions of these networks appear to have evolved through extensive duplication of transcription factors and targets, often with inheritance of regulatory interactions from the ancestral gene. Interactions are conserved to varying degrees among genomes. Insights from the structure and evolution of these networks can be translated into predictions and used for engineering of the regulatory networks of different organisms.
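The directed-graph view of a regulatory network can be made concrete with a small sketch. It assumes an edge list of (transcription factor, target gene) pairs and flags as hubs those factors that regulate at least a chosen number of targets. The gene names, the threshold and the function names are hypothetical and serve only as illustration.

```python
from collections import Counter

# Hypothetical regulatory edges: (transcription factor, target gene).
edges = [
    ("TF_A", "gene1"), ("TF_A", "gene2"), ("TF_A", "gene3"), ("TF_A", "TF_B"),
    ("TF_B", "gene3"), ("TF_B", "gene4"),
    ("TF_C", "gene4"),
]

def out_degrees(edge_list):
    """Number of targets regulated by each transcription factor."""
    return Counter(source for source, _ in edge_list)

def in_degrees(edge_list):
    """Number of regulators acting on each target gene."""
    return Counter(target for _, target in edge_list)

def hubs(edge_list, min_targets):
    """Regulators with at least `min_targets` outgoing edges, sorted by name."""
    degrees = out_degrees(edge_list)
    return sorted(tf for tf, k in degrees.items() if k >= min_targets)

print(out_degrees(edges))
print(hubs(edges, min_targets=3))
```

In a scale-free network most regulators would have few targets and a small number would have very many, so the out-degree distribution would be heavy-tailed. A seven-edge toy graph like this one cannot show that property; it shows only the bookkeeping.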
Abstract
Reactive astrocytes are implicated in amyotrophic lateral sclerosis (ALS), although the mechanisms controlling reactive transformation are unknown. We show that decreased intron retention (IR) is common to human induced pluripotent stem cell (hiPSC)-derived astrocytes carrying ALS-causing mutations in VCP, SOD1 and C9orf72. Notably, transcripts with decreased IR and increased expression are overrepresented in reactivity processes including cell adhesion, stress response and immune activation. This was recapitulated in public datasets for (i) hiPSC-derived astrocytes stimulated with cytokines to undergo reactive transformation and (ii) in vivo astrocytes following selective deletion of TDP-43. We also re-examined public translatome sequencing (TRAP-seq) of astrocytes from a SOD1 mouse model, which revealed that transcripts upregulated in translation significantly overlap with transcripts exhibiting decreased IR. Using nucleocytoplasmic fractionation of VCP mutant astrocytes coupled with mRNA sequencing and proteomics, we identify that decreased IR in nuclear transcripts is associated with enhanced nonsense-mediated decay and increased cytoplasmic expression of transcripts and proteins regulating reactive transformation. These findings are consistent with a molecular model for reactive transformation in astrocytes whereby poised nuclear reactivity-related IR transcripts are spliced, undergo nuclear-to-cytoplasmic translocation and translation. Our study therefore provides new insights into the molecular regulation of reactive transformation in astrocytes.
Graphical Abstract
Healthy astrocytes (left, blue) have prevalent nuclear intron retention in reactivity transcripts, promoting their nuclear confinement (introns shown as red rectangles). Conversely, ALS astrocytes (right, red) exhibit decreased intron retention in reactivity transcripts, enabling cytoplasmic translocation and translation upon engaging ribosomes, underlying their reactive transformation. A small number of intron-retaining transcripts escape to the cytoplasm, where they are degraded by nonsense-mediated decay (NMD), which is enhanced in ALS astrocytes.
Abstract
RNA-binding proteins (RBPs) play diverse roles in regulating co-transcriptional RNA-processing and chromatin functions, but our knowledge of the repertoire of chromatin-associated RBPs (caRBPs) and their interactions with chromatin remains limited. Here, we developed SPACE (Silica Particle Assisted Chromatin Enrichment) to isolate global and regional chromatin components with high specificity and sensitivity, and SPACEmap to identify the chromatin-contact regions in proteins. Applied to mouse embryonic stem cells, SPACE identified 1459 chromatin-associated proteins, ∼48% of which are annotated as RBPs, indicating their dual roles in chromatin and RNA-binding. Additionally, SPACEmap stringently verified chromatin-binding of 403 RBPs and identified their chromatin-contact regions. Notably, SPACEmap showed that about 40% of the caRBPs bind chromatin by intrinsically disordered regions (IDRs). Studying SPACE and total proteome dynamics from mES cells grown in 2iL and serum medium indicates significant correlation (R = 0.62). One of the most dynamic caRBPs is Dazl, which we find co-localized with PRC2 at transcription start sites of genes that are distinct from Dazl mRNA binding. Dazl and other PRC2-colocalised caRBPs are rich in intrinsically disordered regions (IDRs), which could contribute to the formation and regulation of phase-separated PRC condensates. Together, our approach provides an unprecedented insight into IDR-mediated interactions and caRBPs with moonlighting functions in native chromatin.
Graphical Abstract
SPACE, SPACEmap and ChIP-SPACE are highly sensitive and stringent approaches for identification of chromatin-associated proteins and their chromatin-contact regions.
Abstract
We recently described aberrantly increased cytoplasmic SFPQ intron-retaining transcripts (IRTs) and concurrent SFPQ protein mislocalization as new hallmarks of amyotrophic lateral sclerosis (ALS). However, the generalizability and potential roles of cytoplasmic IRTs in health and disease remain unclear. Here, using time-resolved deep sequencing of nuclear and cytoplasmic fractions of human induced pluripotent stem cells undergoing motor neurogenesis, we reveal that ALS-causing VCP gene mutations lead to compartment-specific aberrant accumulation of IRTs. Specifically, we identify >100 IRTs with increased cytoplasmic abundance in ALS samples. Furthermore, these aberrant cytoplasmic IRTs possess sequence-specific attributes and differential predicted binding affinity to RNA binding proteins. Remarkably, TDP-43, SFPQ and FUS (RNA binding proteins known for nuclear-to-cytoplasmic mislocalization in ALS) abundantly and specifically bind to this aberrant cytoplasmic pool of IRTs. Our data are therefore consistent with a novel role for cytoplasmic IRTs in regulating compartment-specific protein abundance. This study provides new molecular insight into potential pathomechanisms underlying ALS and highlights aberrant cytoplasmic IRTs as potential therapeutic targets.
The mammalian genome harbors up to one million regulatory elements often located at great distances from their target genes. Long-range elements control genes through physical contact with promoters and can be recognized by the presence of specific histone modifications and transcription factor binding. Linking regulatory elements to specific promoters genome-wide is currently impeded by the limited resolution of high-throughput chromatin interaction assays. Here we apply a sequence capture approach to enrich Hi-C libraries for >22,000 annotated mouse promoters to identify statistically significant, long-range interactions at restriction fragment resolution, assigning long-range interacting elements to their target genes genome-wide in embryonic stem cells and fetal liver cells. The distal sites contacting active genes are enriched in active histone modifications and transcription factor occupancy, whereas inactive genes contact distal sites with repressive histone marks, demonstrating the regulatory potential of the distal elements identified. Furthermore, we find that coregulated genes cluster nonrandomly in spatial interaction networks correlated with their biological function and expression level. Interestingly, we find the strongest gene clustering in ES cells between transcription factor genes that control key developmental processes in embryogenesis. The results provide the first genome-wide catalog linking gene promoters to their long-range interacting elements and highlight the complex spatial regulatory circuitry controlling mammalian gene expression.
A central tenet in evolutionary theory is that mutations occur randomly with respect to their value to an organism; selection then governs whether they are fixed in a population. This principle has been challenged by long-standing theoretical models predicting that selection could modulate the rate of mutation itself. However, our understanding of how the mutation rate varies between different sites within a genome has been hindered by technical difficulties in measuring it. Here we present a study that overcomes previous limitations by combining phylogenetic and population genetic techniques. Upon comparing 34 Escherichia coli genomes, we observe that the neutral mutation rate varies by more than an order of magnitude across 2,659 genes, with mutational hot and cold spots spanning several kilobases. Importantly, the variation is not random: we detect a lower rate in highly expressed genes and in those undergoing stronger purifying selection. Our observations suggest that the mutation rate has been evolutionarily optimized to reduce the risk of deleterious mutations. Current knowledge of factors influencing the mutation rate, including transcription-coupled repair and context-dependent mutagenesis, does not explain these observations, indicating that additional mechanisms must be involved. The findings have important implications for our understanding of evolution and the control of mutations.