DNA replication initiates from replication origins firing throughout S phase. Debate remains about whether origins are a fixed set of loci, or a loose agglomeration of potential sites used ...stochastically in individual cells, and about how consistent their firing time is. We develop an approach to profile DNA replication from whole-genome sequencing of thousands of single cells, which includes in silico flow cytometry, a method for discriminating replicating and non-replicating cells. Using two microfluidic platforms, we analyze up to 2437 replicating cells from a single sample. The resolution and scale of the data allow focused analysis of replication initiation sites, demonstrating that most occur in confined genomic regions. While initiation order is remarkably similar across cells, we unexpectedly identify several subtypes of initiation regions in late-replicating regions. Taken together, high throughput, high resolution sequencing of individual cells reveals previously underappreciated variability in replication initiation and progression.
The spatiotemporal organization of DNA replication produces a highly robust and reproducible replication timing profile. Sequencing-based methods for assaying replication timing genome-wide have ...become commonplace, but regions of high repeat content in the human genome have remained refractory to analysis. Here, we report the first nearly-gapless telomere-to-telomere replication timing profiles in human, using the T2T-CHM13 genome assembly and sequencing data for five cell lines. We find that replication timing can be successfully assayed in centromeres and large blocks of heterochromatin. Centromeric regions replicate in mid-to-late S-phase and contain replication-timing peaks at a similar density to other genomic regions, while distinct families of heterochromatic satellite DNA differ in their bias for replicating in late S-phase. The high degree of consistency in centromeric replication timing across chromosomes within each cell line prompts further investigation into the mechanisms dictating that some cell lines replicate their centromeres earlier than others, and what the consequences of this variation are.
Abstract
Motivation
Genomic DNA replicates according to a reproducible spatiotemporal program, with some loci replicating early in S phase while others replicate late. Despite being a central ...cellular process, DNA replication timing studies have been limited in scale due to technical challenges.
Results
We present TIGER (Timing Inferred from Genome Replication), a computational approach for extracting DNA replication timing information from whole genome sequence data obtained from proliferating cell samples. The presence of replicating cells in a biological specimen leads to non-uniform representation of genomic DNA that depends on the timing of replication of different genomic loci. Replication dynamics can hence be observed in genome sequence data by analyzing DNA copy number along chromosomes while accounting for other sources of sequence coverage variation. TIGER is applicable to any species with a contiguous genome assembly and rivals the quality of experimental measurements of DNA replication timing. It provides a straightforward approach for measuring replication timing and can readily be applied at scale.
Availability and implementation
TIGER is available at https://github.com/TheKorenLab/TIGER.
Supplementary information
Supplementary data are available at Bioinformatics online.
Induced pluripotent stem cells (iPSCs) are the foundation of cell therapy. Differences in gene expression, DNA methylation, and chromatin conformation, which could affect differentiation capacity, ...have been identified between iPSCs and embryonic stem cells (ESCs). Less is known about whether DNA replication timing, a process linked to both genome regulation and genome stability, is efficiently reprogrammed to the embryonic state. To answer this, we compare genome-wide replication timing between ESCs, iPSCs, and cells reprogrammed by somatic cell nuclear transfer (NT-ESCs). While NT-ESCs replicate their DNA in a manner indistinguishable from ESCs, a subset of iPSCs exhibits delayed replication at heterochromatic regions containing genes downregulated in iPSCs with incompletely reprogrammed DNA methylation. DNA replication delays are not the result of gene expression or DNA methylation aberrations and persist after cells differentiate to neuronal precursors. Thus, DNA replication timing can be resistant to reprogramming and influence the quality of iPSCs.
Display omitted
•Genome-wide comparison of DNA replication timing between stem cell types•Aberrant replication timing in a subset of induced pluripotent stem cell lines•Delayed replication tends to occur near centromeres and telomeres•Aberrant replication timing is maintained following stem cell differentiation
Edwards et al. compare DNA replication timing between human embryonic stem cells and stem cells reprogrammed by defined factors or through nuclear transfer. Induced pluripotent stem cells incur aberrant replication timing at specific genomic regions in a subset of cell lines. These aberrations are carried through stem cell differentiation.
Human cleavage-stage embryos frequently acquire chromosomal aneuploidies during mitosis due to unknown mechanisms. Here, we show that S phase at the 1-cell stage shows replication fork stalling, low ...fork speed, and DNA synthesis extending into G2 phase. DNA damage foci consistent with collapsed replication forks, DSBs, and incomplete replication form in G2 in an ATR- and MRE11-dependent manner, followed by spontaneous chromosome breakage and segmental aneuploidies. Entry into mitosis with incomplete replication results in chromosome breakage, whole and segmental chromosome errors, micronucleation, chromosome fragmentation, and poor embryo quality. Sites of spontaneous chromosome breakage are concordant with sites of DNA synthesis in G2 phase, locating to gene-poor regions with long neural genes, which are transcriptionally silent at this stage of development. Thus, DNA replication stress in mammalian preimplantation embryos predisposes gene-poor regions to fragility, and in particular in the human embryo, to the formation of aneuploidies, impairing developmental potential.
Display omitted
•1-cell embryos show replication fork stalling, with replication extending into G2 phase•Incompletely replicated DNA is converted to chromosome breaks and aneuploidy in mitosis•Spontaneous chromosome breaks and G2 DNA synthesis occur in congruent gene-poor regions•Chromosome fragility in human embryos occurs independently of embryonic genome activation
In human preimplantation embryos, DNA replication in G2 phase results in chromosome breakage, segmental aneuploidies, and poor embryo quality.
Genomic DNA replicates according to a defined temporal program in which early-replicating loci are associated with open chromatin, higher gene density, and increased gene expression levels, while ...late-replicating loci tend to be heterochromatic and show higher rates of genomic instability. The ability to measure DNA replication dynamics at genome scale has proven crucial for understanding the mechanisms and cellular consequences of DNA replication timing. Several methods, such as quantification of nucleotide analog incorporation and DNA copy number analyses, can accurately reconstruct the genomic replication timing profiles of various species and cell types. More recent developments have expanded the DNA replication genomic toolkit to assays that directly measure the activity of replication origins, while single-cell replication timing assays are beginning to reveal a new level of replication timing regulation. The combination of these methods, applied on a genomic scale and in multiple biological systems, promises to resolve many open questions and lead to a holistic understanding of how eukaryotic cells replicate their genomes accurately and efficiently.
Cancer somatic mutations are the product of multiple mutational and repair processes, both of which are tightly associated with DNA replication. Distinctive patterns of somatic mutation accumulation, ...termed mutational signatures, are indicative of processes sustained within tumors. However, the association of various mutational processes with replication timing (RT) remains an open question. In this study, we systematically analyzed the mutational landscape of 2,787 tumors from 32 tumor types separately for early and late replicating regions using sequence context normalization and chromatin data to account for sequence and chromatin accessibility differences. To account for sequence differences between various genomic regions, an artificial genome-based approach was developed to expand the signature analyses to doublet base substitutions and small insertions and deletions. The association of mutational processes and RT was signature specific: Some signatures were associated with early or late replication (such as SBS7b and SBS7a, respectively), and others had no association. Most associations existed even after normalizing for genome accessibility. A focused mutational signature identification approach was also developed that uses RT information to improve signature identification; this approach found that SBS16, which is biased toward early replication, is strongly associated with better survival rates in liver cancer. Overall, this novel and comprehensive approach provides a better understanding of the etiology of mutational signatures, which may lead to improved cancer prevention, diagnosis, and treatment. SIGNIFICANCE: Many mutational processes associate with early or late replication timing regions independently of chromatin accessibility, enabling development of a focused identification approach to improve mutational signature detection.
Centromeres serve a critical function in preserving genome integrity across sequential cell divisions, by mediating symmetric chromosome segregation. The repetitive, heterochromatic nature of ...centromeres is thought to be inhibitory to DNA replication, but has also led to their underrepresentation in human reference genome assemblies. Consequently, centromeres have been excluded from genomic replication timing analyses, leaving their time of replication unresolved. However, the most recent human reference genome, hg38, included models of centromere sequences. To establish the experimental requirements for achieving replication timing profiles for centromeres, we sequenced G₁- and S-phase cells from five human cell lines, and aligned the sequence reads to hg38. We were able to infer DNA replication timing profiles for the centromeres in each of the five cell lines, which showed that centromere replication occurs in mid-to-late S phase. Furthermore, we found that replication timing was more variable between cell lines in the centromere regions than expected, given the distribution of variation in replication timing genome-wide. These results suggest the potential of these, and future, sequence models to enable high-resolution studies of replication in centromeres and other heterochromatic regions.
Summary
Aging is characterized by genome instability, which contributes to cancer formation and cell lethality leading to organismal decline. The high levels of DNA double‐strand breaks (DSBs) ...observed in old cells and premature aging syndromes are likely a primary source of genome instability, but the underlying cause of their formation is still unclear. DSBs might result from higher levels of damage or repair defects emerging with advancing age, but repair pathways in old organisms are still poorly understood. Here, we show that premeiotic germline cells of young and old flies have distinct differences in their ability to repair DSBs by the error‐free pathway homologous recombination (HR). Repair of DSBs induced by either ionizing radiation (IR) or the endonuclease I‐SceI is markedly defective in older flies. This correlates with a remarkable reduction in HR repair measured with the DR‐white DSB repair reporter assay. Strikingly, most of this repair defect is already present at 8 days of age. Finally, HR defects correlate with increased expression of early HR components and increased recruitment of Rad51 to damage in older organisms. Thus, we propose that the defect in the HR pathway for germ cells in older flies occurs following Rad51 recruitment. These data reveal that DSB repair defects arise early in the aging process and suggest that HR deficiencies are a leading cause of genome instability in germ cells of older animals.