The transition from naive to primed pluripotency is accompanied by an extensive reorganisation of transcriptional and epigenetic programmes. However, the role of transcriptional enhancers and ...three-dimensional chromatin organisation in coordinating these developmental programmes remains incompletely understood. Here, we generate a high-resolution atlas of gene regulatory interactions, chromatin profiles and transcription factor occupancy in naive and primed human pluripotent stem cells, and develop a network-graph approach to examine the atlas at multiple spatial scales. We uncover highly connected promoter hubs that change substantially in interaction frequency and in transcriptional co-regulation between pluripotent states. Small hubs frequently merge to form larger networks in primed cells, often linked by newly-formed Polycomb-associated interactions. We identify widespread state-specific differences in enhancer activity and interactivity that correspond with an extensive reconfiguration of OCT4, SOX2 and NANOG binding and target gene expression. These findings provide multilayered insights into the chromatin-based gene regulatory control of human pluripotent states.
Eukaryotic genomes are folded into loops. It is thought that these are formed by cohesin complexes via extrusion, either until loop expansion is arrested by CTCF or until cohesin is removed from DNA ...by WAPL. Although WAPL limits cohesin’s chromatin residence time to minutes, it has been reported that some loops exist for hours. How these loops can persist is unknown. We show that during G1-phase, mammalian cells contain acetylated cohesinSTAG1 which binds chromatin for hours, whereas cohesinSTAG2 binds chromatin for minutes. Our results indicate that CTCF and the acetyltransferase ESCO1 protect a subset of cohesinSTAG1 complexes from WAPL, thereby enable formation of long and presumably long-lived loops, and that ESCO1, like CTCF, contributes to boundary formation in chromatin looping. Our data are consistent with a model of nested loop extrusion, in which acetylated cohesinSTAG1 forms stable loops between CTCF sites, demarcating the boundaries of more transient cohesinSTAG2 extrusion activity.
Children with cancer are frequently immunocompromised. While children are generally thought to be at less risk of severe SARS-CoV-2 infection than adults, comprehensive population-based evidence for ...the risk in children with cancer is unavailable. We aimed to produce evidence of the incidence and outcomes from SARS-CoV-2 in children with cancer attending all hospitals treating this population across the UK.
Retrospective and prospective observational study of all children in the UK under 16 diagnosed with cancer through data collection from all hospitals providing cancer care to this population. Eligible patients tested positive for SARS-CoV-2 on reverse transcription polymerase chain reaction (RT-PCR). The primary end-point was death, discharge or end of active care for COVID-19 for those remaining in hospital.
Between 12 March 2020 and 31 July 2020, 54 cases were identified: 15 (28%) were asymptomatic, 34 (63%) had mild infections and 5 (10%) moderate, severe or critical infections. No patients died and only three patients required intensive care support due to COVID-19. Estimated incidence of hospital identified SARS-CoV-2 infection in children with cancer under 16 was 3%.
Children with cancer with SARS-CoV-2 infection do not appear at increased risk of severe infection compared to the general paediatric population. This is reassuring and supports the continued delivery of standard treatment.
Long-range
regulatory elements such as enhancers coordinate cell-specific transcriptional programmes by engaging in DNA looping interactions with target promoters. Deciphering the interplay between ...the promoter connectivity and activity of
regulatory elements during lineage commitment is crucial for understanding developmental transcriptional control. Here, we use Promoter Capture Hi-C to generate a high-resolution atlas of chromosomal interactions involving ~22,000 gene promoters in human pluripotent and lineage-committed cells, identifying putative target genes for known and predicted enhancer elements. We reveal extensive dynamics of
-regulatory contacts upon lineage commitment, including the acquisition and loss of promoter interactions. This spatial rewiring occurs preferentially with predicted changes in the activity of
regulatory elements and is associated with changes in target gene expression. Our results provide a global and integrated view of promoter interactome dynamics during lineage commitment of human pluripotent cells.
We present reaction free energy calculations using the adaptive buffered force mixing quantum mechanics/molecular mechanics (bf-QM/MM) method. The bf-QM/MM method combines nonadaptive electrostatic ...embedding QM/MM calculations with extended and reduced QM regions to calculate accurate forces on all atoms, which can be used in free energy calculation methods that require only the forces and not the energy. We calculate the free energy profiles of two reactions in aqueous solution: the nucleophilic substitution reaction of methyl chloride with a chloride anion and the deprotonation reaction of the tyrosine side chain. We validate the bf-QM/MM method against a full QM simulation, and show that it correctly reproduces both geometrical properties and free energy profiles of the QM model, while the electrostatic embedding QM/MM method using a static QM region comprising only the solute is unable to do so. The bf-QM/MM method is not explicitly dependent on the details of the QM and MM methods, so long as it is possible to compute QM forces in a small region and MM forces in the rest of the system, as in a conventional QM/MM calculation. It is simple, with only a few parameters needed to control the QM calculation sizes, and allows (but does not require) a varying and adapting QM region which is necessary for simulating solutions.
Chromosomes in proliferating metazoan cells undergo marked structural metamorphoses every cell cycle, alternating between highly condensed mitotic structures that facilitate chromosome segregation, ...and decondensed interphase structures that accommodate transcription, gene silencing and DNA replication. Here we use single-cell Hi-C (high-resolution chromosome conformation capture) analysis to study chromosome conformations in thousands of individual cells, and discover a continuum of cis-interaction profiles that finely position individual cells along the cell cycle. We show that chromosomal compartments, topological-associated domains (TADs), contact insulation and long-range loops, all defined by bulk Hi-C maps, are governed by distinct cell-cycle dynamics. In particular, DNA replication correlates with a build-up of compartments and a reduction in TAD insulation, while loops are generally stable from G1 to S and G2 phase. Whole-genome three-dimensional structural models reveal a radial architecture of chromosomal compartments with distinct epigenomic signatures. Our single-cell data therefore allow re-interpretation of chromosome conformation maps through the prism of the cell cycle.
Evolutionary information stored in multiple sequence alignments (MSAs) has been used to identify the interaction interface of protein complexes, by measuring either co-conservation or co-mutation of ...amino acid residues across the interface. Recently, maximum entropy related correlated mutation measures (CMMs) such as direct information, decoupling direct from indirect interactions, have been developed to identify residue pairs interacting across the protein complex interface. These studies have focussed on carefully selected protein complexes with large, good-quality MSAs. In this work, we study protein complexes with a more typical MSA consisting of fewer than 400 sequences, using a set of 79 intramolecular protein complexes. Using a maximum entropy based CMM at the residue level, we develop an interface level CMM score to be used in re-ranking docking decoys. We demonstrate that our interface level CMM score compares favourably to the complementarity trace score, an evolutionary information-based score measuring co-conservation, when combined with the number of interface residues, a knowledge-based potential and the variability score of individual amino acid sites. We also demonstrate, that, since co-mutation and co-complementarity in the MSA contain orthogonal information, the best prediction performance using evolutionary information can be achieved by combining the co-mutation information of the CMM with co-conservation information of a complementarity trace score, predicting a near-native structure as the top prediction for 41% of the dataset. The method presented is not restricted to small MSAs, and will likely improve interface prediction also for complexes with large and good-quality MSAs.
Maximum Likelihood (ML) optimization schemes are widely used for parameter inference. They maximize the likelihood of some experimentally observed data, with respect to the model parameters ...iteratively, following the gradient of the logarithm of the likelihood. Here, we employ a ML inference scheme to infer a generalizable, physics-based coarse-grained protein model (which includes Go̅-like biasing terms to stabilize secondary structure elements in room-temperature simulations), using native conformations of a training set of proteins as the observed data. Contrastive divergence, a novel statistical machine learning technique, is used to efficiently approximate the direction of the gradient ascent, which enables the use of a large training set of proteins. Unlike previous work, the generalizability of the protein model allows the folding of peptides and a protein (protein G) which are not part of the training set. We compare the same force field with different van der Waals (vdW) potential forms: a hard cutoff model, and a Lennard-Jones (LJ) potential with vdW parameters inferred or adopted from the CHARMM or AMBER force fields. Simulations of peptides and protein G show that the LJ model with inferred parameters outperforms the hard cutoff potential, which is consistent with previous observations. Simulations using the LJ potential with inferred vdW parameters also outperforms the protein models with adopted vdW parameter values, demonstrating that model parameters generally cannot be used with force fields with different energy functions. The software is available at https://sites.google.com/site/crankite/.
Mammalian genomes are spatially organized into compartments, topologically associating domains (TADs), and loops to facilitate gene regulation and other chromosomal functions. How compartments, TADs, ...and loops are generated is unknown. It has been proposed that cohesin forms TADs and loops by extruding chromatin loops until it encounters CTCF, but direct evidence for this hypothesis is missing. Here, we show that cohesin suppresses compartments but is required for TADs and loops, that CTCF defines their boundaries, and that the cohesin unloading factor WAPL and its PDS5 binding partners control the length of loops. In the absence of WAPL and PDS5 proteins, cohesin forms extended loops, presumably by passing CTCF sites, accumulates in axial chromosomal positions (vermicelli), and condenses chromosomes. Unexpectedly, PDS5 proteins are also required for boundary function. These results show that cohesin has an essential genome‐wide function in mediating long‐range chromatin interactions and support the hypothesis that cohesin creates these by loop extrusion, until it is delayed by CTCF in a manner dependent on PDS5 proteins, or until it is released from DNA by WAPL.
Synopsis
Spatial organization of mammalian genomes facilitates gene regulation and replication. Hi‐C data reveal that cohesin binding at CTCF sites controls long‐range chromatin interactions, supporting a model in which chromatin loop extrusion drives genome 3D architecture.
Cohesin suppresses compartments but is required for TADs and loops.
CTCF and PDS5 proteins define TAD boundaries and loop anchors.
WAPL and PDS5 proteins define the length of loops.
Cohesin appears to form long‐range contacts by loop extrusion.
Hi‐C data show that cohesin is required for long‐range chromatin interactions, supporting a model of loop extrusion driving the spatial organization of mammalian genomes.
Capture Hi-C (CHi-C) is a method for profiling chromosomal interactions involving targeted regions of interest, such as gene promoters, globally and at high resolution. Signal detection in CHi-C data ...involves a number of statistical challenges that are not observed when using other Hi-C-like techniques. We present a background model and algorithms for normalisation and multiple testing that are specifically adapted to CHi-C experiments. We implement these procedures in CHiCAGO ( http://regulatorygenomicsgroup.org/chicago ), an open-source package for robust interaction detection in CHi-C. We validate CHiCAGO by showing that promoter-interacting regions detected with this method are enriched for regulatory features and disease-associated SNPs.