Immortalized cell lines are widely used model systems whose genomes are often highly rearranged and polyploid. However, their genome structure is seldom deciphered and is thus not accounted for ...during analyses. We therefore used linked short- and long-read sequencing to perform haplotype-level reconstruction of the genome of a Drosophila melanogaster cell line (S2-DRSC) with a complex genome structure.
Using a custom implementation (that is designed to use ultra-long reads in complex genomes with nested rearrangements) to call structural variants (SVs), we found that the most common SV was repetitive sequence insertion or deletion (> 80% of SVs), with Gypsy retrotransposon insertions dominating. The second most common SV was local sequence duplication. SNPs and other SVs were rarer, but several large chromosomal translocations and mitochondrial genome insertions were observed. Haplotypes were highly similar at the nucleotide level but structurally very different. Insertion SVs existed at various haplotype frequencies and were unlinked on chromosomes, demonstrating that haplotypes have different structures and suggesting the existence of a mechanism that allows SVs to propagate across haplotypes. Finally, using public short-read data, we found that transposable element insertions and local duplications are common in other D. melanogaster cell lines.
The S2-DRSC cell line evolved through retrotransposon activity and vast local sequence duplications, that we hypothesize were the products of DNA re-replication events. Additionally, mutations can propagate across haplotypes (possibly explained by mitotic recombination), which enables fine-tuning of mutational impact and prevents accumulation of deleterious events, an inherent problem of clonal reproduction. We conclude that traditional linear homozygous genome representation conceals the complexity when dealing with rearranged and heterozygous clonal cells.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Transcription activation involves RNA polymerase II (Pol II) recruitment and release from the promoter into productive elongation, but how specific chromatin regulators control these steps is ...unclear. Here, we identify a novel activity of the histone acetyltransferase p300/CREB-binding protein (CBP) in regulating promoter-proximal paused Pol II. We find that Drosophila CBP inhibition results in “dribbling” of Pol II from the pause site to positions further downstream but impedes transcription through the +1 nucleosome genome-wide. Promoters strongly occupied by CBP and GAGA factor have high levels of paused Pol II, a unique chromatin signature, and are highly expressed regardless of cell type. Interestingly, CBP activity is rate limiting for Pol II recruitment to these highly paused promoters through an interaction with TFIIB but for transit into elongation by histone acetylation at other genes. Thus, CBP directly stimulates both Pol II recruitment and the ability to traverse the first nucleosome, thereby promoting transcription of most genes.
Display omitted
•Magnitude of CBP promoter occupancy correlates with RNA polymerase II pausing•Paused Pol II dribbles to more downstream positions upon CBP inhibition•A CBP-TFIIB interaction influences Pol II recruitment to promoters•CBP facilitates transcription through the +1 nucleosome by histone acetylation
CBP and p300 have known functions at transcriptional enhancers. Boija et al. identify a regulatory role for the Drosophila p300/CBP homolog also at promoters. Drosophila CBP maintains RNA Pol II at the promoter-proximal pause site, recruits Pol II through an interaction with TFIIB, and facilitates transcription through the +1 nucleosome.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPUK, ZAGLJ, ZRSKP
Mammalian DNA folds into 3D structures that facilitate and regulate genetic processes such as transcription, DNA repair, and epigenetics. Several insights derive from chromosome capture methods, such ...as Hi-C, which allow researchers to construct contact maps depicting 3D interactions among all DNA segment pairs. These maps show a complex cross-scale organization spanning megabase-pair compartments to short-ranged DNA loops. To better understand the organizing principles, several groups analyzed Hi-C data assuming a Russian-doll-like nested hierarchy where DNA regions of similar sizes merge into larger and larger structures. Apart from being a simple and appealing description, this model explains, e.g., the omnipresent chequerboard pattern seen in Hi-C maps, known as A/B compartments, and foreshadows the co-localization of some functionally similar DNA regions. However, while successful, this model is incompatible with the two competing mechanisms that seem to shape a significant part of the chromosomes' 3D organization: loop extrusion and phase separation. This paper aims to map out the chromosome's actual folding hierarchy from empirical data. To this end, we take advantage of Hi-C experiments and treat the measured DNA-DNA interactions as a weighted network. From such a network, we extract 3D communities using the generalized Louvain algorithm. This algorithm has a resolution parameter that allows us to scan seamlessly through the community size spectrum, from A/B compartments to topologically associated domains (TADs). By constructing a hierarchical tree connecting these communities, we find that chromosomes are more complex than a perfect hierarchy. Analyzing how communities nest relative to a simple folding model, we found that chromosomes exhibit a significant portion of nested and non-nested community pairs alongside considerable randomness. In addition, by examining nesting and chromatin types, we discovered that nested parts are often associated with active chromatin. These results highlight that cross-scale relationships will be essential components in models aiming to reach a deep understanding of the causal mechanisms of chromosome folding.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
In specific cases, chromatin clearly forms long-range loops that place distant regulatory elements in close proximity to transcription start sites, but we have limited understanding of many loops ...identified by Chromosome Conformation Capture (such as Hi-C) analyses. In efforts to elucidate their characteristics and functions, we have identified highly interacting regions (HIRs) using intra-chromosomal Hi-C datasets with a new computational method based on looking at the eigenvector that corresponds to the smallest eigenvalue (here unity). Analysis of these regions using ENCODE data shows that they are in general enriched in bound factors involved in DNA damage repair and have actively transcribed genes. However, both highly transcribed regions as well as transcriptionally inactive regions can form HIRs. The results also indicate that enhancers and super-enhancers in particular form long-range interactions within the same chromosome. The accumulation of DNA repair factors in most identified HIRs suggests that protection from DNA damage in these regions is essential for avoidance of detrimental rearrangements.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Polycomb (PcG) regulation has been thought to produce stable long-term gene silencing. Genomic analyses in Drosophila and mammals, however, have shown that it targets many genes, which can switch ...state during development. Genetic evidence indicates that critical for the active state of PcG target genes are the histone methyltransferases Trithorax (TRX) and ASH1. Here we analyze the repertoire of alternative states in which PcG target genes are found in different Drosophila cell lines and the role of PcG proteins TRX and ASH1 in controlling these states. Using extensive genome-wide chromatin immunoprecipitation analysis, RNAi knockdowns, and quantitative RT-PCR, we show that, in addition to the known repressed state, PcG targets can reside in a transcriptionally active state characterized by formation of an extended domain enriched in ASH1, the N-terminal, but not C-terminal moiety of TRX and H3K27ac. ASH1/TRX N-ter domains and transcription are not incompatible with repressive marks, sometimes resulting in a "balanced" state modulated by both repressors and activators. Often however, loss of PcG repression results instead in a "void" state, lacking transcription, H3K27ac, or binding of TRX or ASH1. We conclude that PcG repression is dynamic, not static, and that the propensity of a target gene to switch states depends on relative levels of PcG, TRX, and activators. N-ter TRX plays a remarkable role that antagonizes PcG repression and preempts H3K27 methylation by acetylation. This role is distinct from that usually attributed to TRX/MLL proteins at the promoter. These results have important implications for Polycomb gene regulation, the "bivalent" chromatin state of embryonic stem cells, and gene expression in development.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Several experiments show that the three dimensional (3D) organization of chromosomes affects genetic processes such as transcription and gene regulation. To better understand this connection, ...researchers developed the Hi-C method that is able to detect the pairwise physical contacts of all chromosomal loci. The Hi-C data show that chromosomes are composed of 3D compartments that range over a variety of scales. However, it is challenging to systematically detect these cross-scale structures. Most studies have therefore designed methods for specific scales to study foremost topologically associated domains (TADs) and A/B compartments. To go beyond this limitation, we tailor a network community detection method that finds communities in compact fractal globule polymer systems. Our method allows us to continuously scan through all scales with a single resolution parameter. We found: (i) polymer segments belonging to the same 3D community do not have to be in consecutive order along the polymer chain. In other words, several TADs may belong to the same 3D community. (ii) CTCF proteins-a loop-stabilizing protein that is ascribed a big role in TAD formation-are well correlated with community borders only at one level of organization. (iii) TADs and A/B compartments are traditionally treated as two weakly related 3D structures and detected with different algorithms. With our method, we detect both by simply adjusting the resolution parameter. We therefore argue that they represent two specific levels of a continuous spectrum 3D communities, rather than seeing them as different structural entities.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Polycomb Group (PcG) proteins are epigenetic repressors that control metazoan development and cell differentiation. In Drosophila, PcG proteins form five distinct complexes targeted to genes by ...Polycomb Response Elements (PREs). Of all PcG complexes PhoRC is the only one that contains a sequence-specific DNA binding subunit (PHO or PHOL), which led to a model that places PhoRC at the base of the recruitment hierarchy. Here we demonstrate that in vivo PHO is preferred to PHOL as a subunit of PhoRC and that PHO and PHOL associate with PREs and a subset of transcriptionally active promoters. Although the binding to the promoter sites depends on the quality of recognition sequences, the binding to PREs does not. Instead, the efficient recruitment of PhoRC to PREs requires the SFMBT subunit and crosstalk with Polycomb Repressive Complex 1. We find that human YY1 protein, the ortholog of PHO, binds sites at active promoters in the human genome but does not bind most PcG target genes, presumably because the interactions involved in the targeting to Drosophila PREs are lost in the mammalian lineage. We conclude that the recruitment of PhoRC to PREs is based on combinatorial interactions and propose that such a recruitment strategy is important to attenuate the binding of PcG proteins when the target genes are transcriptionally active. Our findings allow the appropriate placement of PhoRC in the PcG recruitment hierarchy and provide a rationale to explain why YY1 is unlikely to serve as a general recruiter of mammalian Polycomb complexes despite its reported ability to participate in PcG repression in flies.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Several processes in the cell, such as gene regulation, start when key proteins recognize and bind to short DNA sequences. However, as these sequences can be hundreds of million times shorter than ...the genome, they are hard to find by simple diffusion: diffusion-limited association rates may underestimate in vitro measurements up to several orders of magnitude. Moreover, the rates increase if the DNA is coiled rather than straight. Here we model how this works in vivo in mammalian cells. We use chromatin-chromatin contact data from Hi-C experiments to map the protein target-search onto a network problem. The nodes represent DNA segments and the weight of the links are proportional to measured contact probabilities. We then put forward a diffusion-reaction equation for the density of searching protein that allows us to calculate the association rates across the genome analytically. For segments where the rates are high, we find that they are enriched with active gene starts and have high RNA expression levels. This paper suggests that the DNA's 3D conformation is important for protein search times in vivo and offers a method to interpret protein-binding profiles in eukaryotes that cannot be explained by the DNA sequence itself.
Full text
Available for:
CMK, CTK, FMFMET, IJS, NUK, PNG, UL, UM, UPUK
Long non-coding RNAs contribute to dosage compensation in both mammals and Drosophila by inducing changes in the chromatin structure of the X-chromosome. In Drosophila melanogaster, roX1 and roX2 are ...long non-coding RNAs that together with proteins form the male-specific lethal (MSL) complex, which coats the entire male X-chromosome and mediates dosage compensation by increasing its transcriptional output. Studies on polytene chromosomes have demonstrated that when both roX1 and roX2 are absent, the MSL-complex becomes less abundant on the male X-chromosome and is relocated to the chromocenter and the 4th chromosome. Here we address the role of roX RNAs in MSL-complex targeting and the evolution of dosage compensation in Drosophila. We performed ChIP-seq experiments which showed that MSL-complex recruitment to high affinity sites (HAS) on the X-chromosome is independent of roX and that the HAS sequence motif is conserved in D. simulans. Additionally, a complete and enzymatically active MSL-complex is recruited to six specific genes on the 4th chromosome. Interestingly, our sequence analysis showed that in the absence of roX RNAs, the MSL-complex has an affinity for regions enriched in Hoppel transposable elements and repeats in general. We hypothesize that roX mutants reveal the ancient targeting of the MSL-complex and propose that the role of roX RNAs is to prevent the binding of the MSL-complex to heterochromatin.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Microorganisms are essential constituents of ecosystems. To improve our understanding of how various factors shape microbial diversity and composition in nature it is important to study how ...microorganisms vary in space and time. Factors shaping microbial communities in ground level air have been surveyed in a limited number of studies, indicating that geographic location, season and local climate influence the microbial communities. However, few have surveyed more than one location, at high latitude or continuously over more than a year. We surveyed the airborne microbial communities over two full consecutive years in Kiruna, in the Arctic boreal zone, and Ljungbyhed, in the Southern nemoral zone of Sweden, by using a unique collection of archived air filters. We mapped both geographic and seasonal differences in bacterial and fungal communities and evaluated environmental factors that may contribute to these differences and found that location, season and weather influence the airborne communities. Location had stronger influence on the bacterial community composition compared to season, while location and season had equal influence on the fungal community composition. However, the airborne bacterial and fungal diversity showed overall the same trend over the seasons, regardless of location, with a peak during the warmer parts of the year, except for the fungal seasonal trend in Ljungbyhed, which fluctuated more within season. Interestingly, the diversity and evenness of the airborne communities were generally lower in Ljungbyhed. In addition, both bacterial and fungal communities varied significantly within and between locations, where orders like Rhizobiales, Rhodospirillales and Agaricales dominated in Kiruna, whereas Bacillales, Clostridiales and Sordariales dominated in Ljungbyhed. These differences are a likely reflection of the landscape surrounding the sampling sites where the landscape in Ljungbyhed is more homogenous and predominantly characterized by artificial and agricultural surroundings. Our results further indicate that local landscape, as well as seasonal variation, shapes microbial communities in air.