Spatial genome organization and its effect on transcription remains a fundamental question. We applied an advanced chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) strategy to ...comprehensively map higher-order chromosome folding and specific chromatin interactions mediated by CCCTC-binding factor (CTCF) and RNA polymerase II (RNAPII) with haplotype specificity and nucleotide resolution in different human cell lineages. We find that CTCF/cohesin-mediated interaction anchors serve as structural foci for spatial organization of constitutive genes concordant with CTCF-motif orientation, whereas RNAPII interacts within these structures by selectively drawing cell-type-specific genes toward CTCF foci for coordinated transcription. Furthermore, we show that haplotype variants and allelic interactions have differential effects on chromosome configuration, influencing gene expression, and may provide mechanistic insights into functions associated with disease susceptibility. 3D genome simulation suggests a model of chromatin folding around chromosomal axes, where CTCF is involved in defining the interface between condensed and open compartments for structural regulation. Our 3D genome strategy thus provides unique insights in the topological mechanism of human variations and diseases.
Display omitted
•ChIA-PET is inclusive in mapping 3D genome at multi-scale and nucleotide resolution•CTCF foci spatially arrange RNAPII transcription concordant in CTCF-motif direction•SNPs alter haplotype chromatin topology and function that link to disease risks•3D genome models elucidate topological framework for transcriptional regulation
Advanced ChIA-PET shows that CTCF/cohesin and RNA polymerase II arrange spatial organization for coordinated transcription. Haplotype variants exhibit allelic effects on chromatin topology and transcription that link disease susceptibility.
The tandem duplicator phenotype (TDP) is a genome-wide instability configuration primarily observed in breast, ovarian, and endometrial carcinomas. Here, we stratify TDP tumors by classifying their ...tandem duplications (TDs) into three span intervals, with modal values of 11 kb, 231 kb, and 1.7 Mb, respectively. TDPs with ∼11 kb TDs feature loss of TP53 and BRCA1. TDPs with ∼231 kb and ∼1.7 Mb TDs associate with CCNE1 pathway activation and CDK12 disruptions, respectively. We demonstrate that p53 and BRCA1 conjoint abrogation drives TDP induction by generating short-span TDP mammary tumors in genetically modified mice lacking them. Lastly, we show how TDs in TDP tumors disrupt heterogeneous combinations of tumor suppressors and chromatin topologically associating domains while duplicating oncogenes and super-enhancers.
Display omitted
•Abundant and distributed tandem duplications form a distinct chromotype in cancer•Six recurrent tandem duplicator phenotypes (TDPs) are characterized by TD span size•Conjoint abrogation of BRCA1 and TP53 causes TDPs with ∼11 kb TDs•CCNE1 pathway activation and CDK12 mutations associate with ∼231 kb and ∼1.7 Mb TDs
Menghi et al. stratify tandem duplicator phenotype tumors by classifying their tandem duplications (TDs) into three span sizes associated with different pathway alterations and show how TDs disrupt tumor suppressors and chromatin topologically associating domains while duplicating oncogenes and super-enhancers.
Chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) is a robust method for capturing genome-wide chromatin interactions. Unlike other 3C-based methods, it includes a chromatin ...immunoprecipitation (ChIP) step that enriches for interactions mediated by specific target proteins. This unique feature allows ChIA-PET to provide the functional specificity and higher resolution needed to detect chromatin interactions, which chromosome conformation capture (3C)/Hi-C approaches have not achieved. The original ChIA-PET protocol generates short paired-end tags (2 × 20 base pairs (bp)) to detect two genomic loci that are far apart on linear chromosomes but are in spatial proximity in the folded genome. We have improved the original approach by developing long-read ChIA-PET, in which the length of the paired-end tags is increased (up to 2 × 250 bp). The longer PET reads not only improve the tag-mapping efficiency but also increase the probability of covering phased single-nucleotide polymorphisms (SNPs), which allows haplotype-specific chromatin interactions to be identified. Here, we provide the detailed protocol for long-read ChIA-PET that includes cell fixation and lysis, chromatin fragmentation by sonication, ChIP, proximity ligation with a bridge linker, Tn5 tagmentation, PCR amplification and high-throughput sequencing. For a well-trained molecular biologist, it typically takes 6 d from cell harvesting to the completion of library construction, up to a further 36 h for DNA sequencing and <20 h for processing of raw sequencing reads.
Cohesin extrusion is thought to play a central role in establishing the architecture of mammalian genomes. However, extrusion has not been visualized in vivo, and thus, its functional impact and ...energetics are unknown. Using ultra-deep Hi-C, we show that loop domains form by a process that requires cohesin ATPases. Once formed, however, loops and compartments are maintained for hours without energy input. Strikingly, without ATP, we observe the emergence of hundreds of CTCF-independent loops that link regulatory DNA. We also identify architectural “stripes,” where a loop anchor interacts with entire domains at high frequency. Stripes often tether super-enhancers to cognate promoters, and in B cells, they facilitate Igh transcription and recombination. Stripe anchors represent major hotspots for topoisomerase-mediated lesions, which promote chromosomal translocations and cancer. In plasmacytomas, stripes can deregulate Igh-translocated oncogenes. We propose that higher organisms have coopted cohesin extrusion to enhance transcription and recombination, with implications for tumor development.
Display omitted
•Cohesin extrusion requires cohesin’s intrinsic ATPase activity•Extensive loading of cohesin near CTCF anchors creates architectural stripes•Architectural stripes facilitate cognate promoter-enhancer interactions•Stripe anchors are prime sites of tumor-inducing TOP2β DNA breaks
Cohesin continually extrudes loops of chromatin in vivo, relying on ATP to fuel the process.
The number of reported examples of chromatin architecture alterations involved in the regulation of gene transcription and in disease is increasing. However, no genome-wide testing has been performed ...to assess the abundance of these events and their importance relative to other factors affecting genome regulation. This is particularly interesting given that a vast majority of genetic variations identified in association studies are located outside coding sequences. This study attempts to address this lack by analyzing the impact on chromatin spatial organization of genetic variants identified in individuals from 26 human populations and in genome-wide association studies.
We assess the tendency of structural variants to accumulate in spatially interacting genomic segments and design an algorithm to model chromatin conformational changes caused by structural variations. We show that differential gene transcription is closely linked to the variation in chromatin interaction networks mediated by RNA polymerase II. We also demonstrate that CTCF-mediated interactions are well conserved across populations, but enriched with disease-associated SNPs. Moreover, we find boundaries of topological domains as relatively frequent targets of duplications, which suggest that these duplications can be an important evolutionary mechanism of genome spatial organization.
This study assesses the critical impact of genetic variants on the higher-order organization of chromatin folding and provides insight into the mechanisms regulating gene transcription at the population scale, of which local arrangement of chromatin loops seems to be the most significant. It provides the first insight into the variability of the human 3D genome at the population scale.
The male sterility of thermosensitive genic male sterile (TGMS) lines of wheat (Triticum aestivum) is strictly controlled by temperature. The early phase of anther development is especially ...susceptible to cold stress. MicroRNAs (miRNAs) play an important role in plant development and in responses to environmental stress. In this study, deep sequencing of small RNA (smRNA) libraries obtained from spike tissues of the TGMS line under cold and control conditions identified a total of 78 unique miRNA sequences from 30 families and trans-acting small interfering RNAs (tasiRNAs) derived from two TAS3 genes. To identify smRNA targets in the wheat TGMS line, we applied the degradome sequencing method, which globally and directly identifies the remnants of smRNA-directed target cleavage. We identified 26 targets of 16 miRNA families and three targets of tasiRNAs. Comparing smRNA sequencing data sets and TaqMan quantitative polymerase chain reaction results, we identified six miRNAs and one tasiRNA (tasiRNA-ARF for Auxin-Responsive Factor) as cold stress-responsive smRNAs in spike tissues of the TGMS line. We also determined the expression profiles of target genes that encode transcription factors in response to cold stress. Interestingly, the expression of cold stress-responsive smRNAs integrated in the auxin-signaling pathway and their target genes was largely noncorrelated. We investigated the tissue-specific expression of smRNAs using a tissue microarray approach. Our data indicated that miR167 and tasiRNA-ARF play roles in regulating the auxin-signaling pathway and possibly in the developmental response to cold stress. These data provide evidence that smRNA regulatory pathways are linked with male sterility in the TGMS line during cold stress.
Abstract Despite the advent of genomic sequencing, molecular diagnosis remains unsolved in approximately half of patients with Mendelian disorders, largely due to unclarified functions of noncoding ...regions and the difficulty in identifying complex structural variations. In this study, we map a unique form of central iris hypoplasia in a large family to 6q15-q23.3 and 18p11.31-q12.1 using a genome-wide linkage scan. Long-read sequencing reveals a balanced translocation t(6;18)(q22.31;p11.22) with intergenic breakpoints. By performing Hi-C on induced pluripotent stem cells from a patient, we identify two chromatin topologically associating domains spanning across the breakpoints. These alterations lead the ectopic chromatin interactions between APCDD1 on chromosome 18 and enhancers on chromosome 6, resulting in upregulation of APCDD1 . Notably, APCDD1 is specifically localized in the iris of human eyes. Our findings demonstrate that noncoding structural variations can lead to Mendelian diseases by disrupting the 3D genome structure and resulting in altered gene expression.
Computational methods have been proposed to leverage spatially resolved transcriptomic data, pinpointing genes with spatial expression patterns and delineating tissue domains. However, existing ...approaches fall short in uniformly quantifying spatially variable genes (SVGs). Moreover, from a methodological viewpoint, while SVGs are naturally associated with depicting spatial domains, they are technically dissociated in most methods. Here, we present a framework (PROST) for the quantitative recognition of spatial transcriptomic patterns, consisting of (i) quantitatively characterizing spatial variations in gene expression patterns through the PROST Index; and (ii) unsupervised clustering of spatial domains via a self-attention mechanism. We demonstrate that PROST performs superior SVG identification and domain segmentation with various spatial resolutions, from multicellular to cellular levels. Importantly, PROST Index can be applied to prioritize spatial expression variations, facilitating the exploration of biological insights. Together, our study provides a flexible and robust framework for analyzing diverse spatial transcriptomic data.
Epstein-Barr virus (EBV) immortalization of resting B lymphocytes (RBLs) to lymphoblastoid cell lines (LCLs) models human DNA tumor virus oncogenesis. RBL and LCL chromatin interaction maps are ...compared to identify the spatial and temporal genome architectural changes during EBV B cell transformation. EBV induces global genome reorganization where contact domains frequently merge or subdivide during transformation. Repressed B compartments in RBLs frequently switch to active A compartments in LCLs. LCLs gain 40% new contact domain boundaries. Newly gained LCL boundaries have strong CTCF binding at their borders while in RBLs, the same sites have much less CTCF binding. Some LCL CTCF sites also have EBV nuclear antigen (EBNA) leader protein EBNALP binding. LCLs have more local interactions than RBLs at LCL dependency factors and super-enhancer targets. RNA Pol II HiChIP and FISH of RBL and LCL further validate the Hi-C results. EBNA3A inactivation globally alters LCL genome interactions. EBNA3A inactivation reduces CTCF and RAD21 DNA binding. EBNA3C inactivation rewires the looping at the CDKN2A/B and AICDA loci. Disruption of a CTCF site at AICDA locus increases AICDA expression. These data suggest that EBV controls lymphocyte growth by globally reorganizing host genome architecture to facilitate the expression of key oncogenes.