DNA replication initiates from replication origins firing throughout S phase. Debate remains about whether origins are a fixed set of loci, or a loose agglomeration of potential sites used ...stochastically in individual cells, and about how consistent their firing time is. We develop an approach to profile DNA replication from whole-genome sequencing of thousands of single cells, which includes in silico flow cytometry, a method for discriminating replicating and non-replicating cells. Using two microfluidic platforms, we analyze up to 2437 replicating cells from a single sample. The resolution and scale of the data allow focused analysis of replication initiation sites, demonstrating that most occur in confined genomic regions. While initiation order is remarkably similar across cells, we unexpectedly identify several subtypes of initiation regions in late-replicating regions. Taken together, high throughput, high resolution sequencing of individual cells reveals previously underappreciated variability in replication initiation and progression.
Cancer is a disease potentiated by mutations in somatic cells. Cancer mutations are not distributed uniformly along the human genome. Instead, different human genomic regions vary by up to fivefold ...in the local density of cancer somatic mutations, posing a fundamental problem for statistical methods used in cancer genomics. Epigenomic organization has been proposed as a major determinant of the cancer mutational landscape. However, both somatic mutagenesis and epigenomic features are highly cell-type-specific. We investigated the distribution of mutations in multiple independent samples of diverse cancer types and compared them to cell-type-specific epigenomic features. Here we show that chromatin accessibility and modification, together with replication timing, explain up to 86% of the variance in mutation rates along cancer genomes. The best predictors of local somatic mutation density are epigenomic features derived from the most likely cell type of origin of the corresponding malignancy. Moreover, we find that cell-of-origin chromatin features are much stronger determinants of cancer mutation profiles than chromatin features of matched cancer cell lines. Furthermore, we show that the cell type of origin of a cancer can be accurately determined based on the distribution of mutations along its genome. Thus, the DNA sequence of a cancer genome encompasses a wealth of information about the identity and epigenomic features of its cell of origin.
The spatiotemporal organization of DNA replication produces a highly robust and reproducible replication timing profile. Sequencing-based methods for assaying replication timing genome-wide have ...become commonplace, but regions of high repeat content in the human genome have remained refractory to analysis. Here, we report the first nearly-gapless telomere-to-telomere replication timing profiles in human, using the T2T-CHM13 genome assembly and sequencing data for five cell lines. We find that replication timing can be successfully assayed in centromeres and large blocks of heterochromatin. Centromeric regions replicate in mid-to-late S-phase and contain replication-timing peaks at a similar density to other genomic regions, while distinct families of heterochromatic satellite DNA differ in their bias for replicating in late S-phase. The high degree of consistency in centromeric replication timing across chromosomes within each cell line prompts further investigation into the mechanisms dictating that some cell lines replicate their centromeres earlier than others, and what the consequences of this variation are.
Genomic DNA replicates according to a defined temporal program in which early-replicating loci are associated with open chromatin, higher gene density, and increased gene expression levels, while ...late-replicating loci tend to be heterochromatic and show higher rates of genomic instability. The ability to measure DNA replication dynamics at genome scale has proven crucial for understanding the mechanisms and cellular consequences of DNA replication timing. Several methods, such as quantification of nucleotide analog incorporation and DNA copy number analyses, can accurately reconstruct the genomic replication timing profiles of various species and cell types. More recent developments have expanded the DNA replication genomic toolkit to assays that directly measure the activity of replication origins, while single-cell replication timing assays are beginning to reveal a new level of replication timing regulation. The combination of these methods, applied on a genomic scale and in multiple biological systems, promises to resolve many open questions and lead to a holistic understanding of how eukaryotic cells replicate their genomes accurately and efficiently.
DNA replication follows a strict spatiotemporal program that intersects with chromatin structure but has a poorly understood genetic basis. To systematically identify genetic regulators of ...replication timing, we exploited inter-individual variation in human pluripotent stem cells from 349 individuals. We show that the human genome's replication program is broadly encoded in DNA and identify 1,617 cis-acting replication timing quantitative trait loci (rtQTLs) - sequence determinants of replication initiation. rtQTLs function individually, or in combinations of proximal and distal regulators, and are enriched at sites of histone H3 trimethylation of lysines 4, 9, and 36 together with histone hyperacetylation. H3 trimethylation marks are individually repressive yet synergistically associate with early replication. We identify pluripotency-related transcription factors and boundary elements as positive and negative regulators of replication timing, respectively. Taken together, human replication timing is controlled by a multi-layered mechanism with dozens of effectors working combinatorially and following principles analogous to transcription regulation.
In eukaryotic cells, genomic DNA replicates in a defined temporal order. The inactive X chromosome (Xi), the most extensive instance of facultative heterochromatin in mammals, replicates later than ...the active X chromosome (Xa), but the replication dynamics of inactive chromatin are not known. By profiling human DNA replication in an allele-specific, chromosomally phased manner, we determined for the first time the replication timing along the active and inactive chromosomes (Xa and Xi) separately. Replication of the Xi was different from that of the Xa, varied among individuals, and resembled a random, unstructured process. The Xi replicated rapidly and at a time largely separable from that of the euchromatic genome. Late-replicating, transcriptionally inactive regions on the autosomes also replicated in an unstructured manner, similar to the Xi. We conclude that DNA replication follows two strategies: slow, ordered replication associated with transcriptional activity, and rapid, random replication of silent chromatin. The two strategies coexist in the same cell, yet are segregated in space and time.
Mutational processes constantly shape the somatic genome, leading to immunity, aging, cancer, and other diseases. When cancer is the outcome, we are afforded a glimpse into these processes by the ...clonal expansion of the malignant cell. Here, we characterize a less explored layer of the mutational landscape of cancer: mutational asymmetries between the two DNA strands. Analyzing whole-genome sequences of 590 tumors from 14 different cancer types, we reveal widespread asymmetries across mutagenic processes, with transcriptional (“T-class”) asymmetry dominating UV-, smoking-, and liver-cancer-associated mutations and replicative (“R-class”) asymmetry dominating POLE-, APOBEC-, and MSI-associated mutations. We report a striking phenomenon of transcription-coupled damage (TCD) on the non-transcribed DNA strand and provide evidence that APOBEC mutagenesis occurs on the lagging-strand template during DNA replication. As more genomes are sequenced, studying and classifying their asymmetries will illuminate the underlying biological mechanisms of DNA damage and repair.
Display omitted
•Replicative and transcriptional mutational asymmetries are widespread across cancer•APOBEC mutagenesis in humans primarily occurs on the lagging-strand template•Mismatch repair balances asymmetric replication errors•Transcription-coupled damage (TCD) introduces sense-strand mutations in liver cancer
Using an approach that distinguishes whether mutations in cancer genomes occurred on the transcribed or non-transcribed DNA strand with respect to transcription and on the leading or lagging strand with respect to replication, the predominant mutational mechanisms associated with different types of cancers and mutational patterns can be inferred.
Alteration of normal ploidy (aneuploidy) can have a number of opposing effects, such as unbalancing protein abundances and inhibiting cell growth but also accelerating genetic diversification and ...rapid adaptation. The interplay of these detrimental and beneficial effects remains puzzling. Here, to understand how cells develop tolerance to aneuploidy, we subject disomic (i.e. with an extra chromosome copy) strains of yeast to long-term experimental evolution under strong selection, by forcing disomy maintenance and daily population dilution. We characterize mutations, karyotype alterations and gene expression changes, and dissect the associated molecular strategies. Cells with different extra chromosomes accumulated mutations at distinct rates and displayed diverse adaptive events. They tended to evolve towards normal ploidy through chromosomal DNA loss and gene expression changes. We identify genes with recurrent mutations and altered expression in multiple lines, revealing a variant that improves growth under genotoxic stresses. These findings support rapid evolvability of disomic strains that can be used to characterize fitness effects of mutations under different stress conditions.
Human genetic variation is distributed nonrandomly across the genome, though the principles governing its distribution are only partially known. DNA replication creates opportunities for mutation, ...and the timing of DNA replication correlates with the density of SNPs across the human genome. To enable deeper investigation of how DNA replication timing relates to human mutation and variation, we generated a high-resolution map of the human genome’s replication timing program and analyzed its relationship to point mutations, copy number variations, and the meiotic recombination hotspots utilized by males and females. DNA replication timing associated with point mutations far more strongly than predicted from earlier analyses and showed a stronger relationship to transversion than transition mutations. Structural mutations arising from recombination-based mechanisms and recombination hotspots used more extensively by females were enriched in early-replicating parts of the genome, though these relationships appeared to relate more strongly to the genomic distribution of causative sequence features. These results indicate differential and sex-specific relationship of DNA replication timing to different forms of mutation and recombination.