An integrated account of the molecular changes occurring during the process of cellular aging is crucial towards understanding the underlying mechanisms. Here, using novel culturing and computational ...methods as well as latest analytical techniques, we mapped the proteome and transcriptome during the replicative lifespan of budding yeast. With age, we found primarily proteins involved in protein biogenesis to increase relative to their transcript levels. Exploiting the dynamic nature of our data, we reconstructed high-level directional networks, where we found the same protein biogenesis-related genes to have the strongest ability to predict the behavior of other genes in the system. We identified metabolic shifts and the loss of stoichiometry in protein complexes as being consequences of aging. We propose a model whereby the uncoupling of protein levels of biogenesis-related genes from their transcript levels is causal for the changes occurring in aging yeast. Our model explains why targeting protein synthesis, or repairing the downstream consequences, can serve as interventions in aging.
As whole-genome sequencing (WGS) becomes the gold standard tool for studying population genomics and medical applications, data on diverse non-European and admixed individuals are still scarce. Here, ...we present a high-coverage WGS dataset of 1,171 highly admixed elderly Brazilians from a census-based cohort, providing over 76 million variants, of which ~2 million are absent from large public databases. WGS enables identification of ~2,000 previously undescribed mobile element insertions without previous description, nearly 5 Mb of genomic segments absent from the human genome reference, and over 140 alleles from HLA genes absent from public resources. We reclassify and curate pathogenicity assertions for nearly four hundred variants in genes associated with dominantly-inherited Mendelian disorders and calculate the incidence for selected recessive disorders, demonstrating the clinical usefulness of the present study. Finally, we observe that whole-genome and HLA imputation could be significantly improved compared to available datasets since rare variation represents the largest proportion of input from WGS. These results demonstrate that even smaller sample sizes of underrepresented populations bring relevant data for genomic studies, especially when exploring analyses allowed only by WGS.
A variety of mutational mechanisms shape the dynamic architecture of human genomes and occasionally result in congenital defects and disease. Here, we used genome-wide long mate-pair sequencing to ...systematically screen for inherited and de novo structural variation in a trio including a child with severe congenital abnormalities. We identified 4321 inherited structural variants and 17 de novo rearrangements. We characterized the de novo structural changes to the base-pair level revealing a complex series of balanced inter- and intra-chromosomal rearrangements consisting of 12 breakpoints involving chromosomes 1, 4 and 10. Detailed inspection of breakpoint regions indicated that a series of simultaneous double-stranded DNA breaks caused local shattering of chromosomes. Fusion of the resulting chromosomal fragments involved non-homologous end joining, since junction points displayed limited or no homology and small insertions and deletions. The pattern of random joining of chromosomal fragments that we observe here strongly resembles the somatic rearrangement patterns--termed chromothripsis--that have recently been described in deranged cancer cells. We conclude that a similar mechanism may also drive the formation of de novo structural variation in the germline.
Sister chromatid exchanges (SCEs) are products of joint DNA molecule resolution, and are considered to form through homologous recombination (HR). Indeed, SCE induction upon irradiation requires the ...canonical HR factors BRCA1, BRCA2 and RAD51. In contrast, replication-blocking agents, including PARP inhibitors, induce SCEs independently of BRCA1, BRCA2 and RAD51. PARP inhibitor-induced SCEs are enriched at difficult-to-replicate genomic regions, including common fragile sites (CFSs). PARP inhibitor-induced replication lesions are transmitted into mitosis, suggesting that SCEs can originate from mitotic processing of under-replicated DNA. Proteomics analysis reveals mitotic recruitment of DNA polymerase theta (POLQ) to synthetic DNA ends. POLQ inactivation results in reduced SCE numbers and severe chromosome fragmentation upon PARP inhibition in HR-deficient cells. Accordingly, analysis of CFSs in cancer genomes reveals frequent allelic deletions, flanked by signatures of POLQ-mediated repair. Combined, we show PARP inhibition generates under-replicated DNA, which is processed into SCEs during mitosis, independently of canonical HR factors.
Small insertions and deletions (indels) and large structural variations (SVs) are major contributors to human genetic diversity and disease. However, mutation rates and characteristics of de novo ...indels and SVs in the general population have remained largely unexplored. We report 332 validated de novo structural changes identified in whole genomes of 250 families, including complex indels, retrotransposon insertions, and interchromosomal events. These data indicate a mutation rate of 2.94 indels (1-20 bp) and 0.16 SVs (>20 bp) per generation. De novo structural changes affect on average 4.1 kbp of genomic sequence and 29 coding bases per generation, which is 91 and 52 times more nucleotides than de novo substitutions, respectively. This contrasts with the equal genomic footprint of inherited SVs and substitutions. An excess of structural changes originated on paternal haplotypes. Additionally, we observed a nonuniform distribution of de novo SVs across offspring. These results reveal the importance of different mutational mechanisms to changes in human genome structure across generations.
Homologous recombination involving sister chromatids is the most accurate, and thus most frequently used, form of recombination-mediated DNA repair. Despite its importance, sister chromatid ...recombination is not easily studied because it does not result in a change in DNA sequence, making recombination between sister chromatids difficult to detect. We have previously developed a novel DNA template strand sequencing technique, called Strand-seq, that can be used to map sister chromatid exchange (SCE) events genome-wide in single cells. An increase in the rate of SCE is an indicator of elevated recombination activity and of genome instability, which is a hallmark of cancer. In this study, we have adapted Strand-seq to detect SCE in the yeast
. We provide the first quantifiable evidence that most spontaneous SCE events in wild-type cells are not due to the repair of DNA double-strand breaks.
To safeguard genetic integrity, cells have evolved an accurate but not failsafe mechanism of DNA replication. Not all DNA sequences tolerate DNA replication equally well
1. Also, genomic regions that ...impose structural barriers to the DNA replication fork are a potential source of genetic instability
2, 3. Here, we demonstrate that G4 DNA—a sequence motif that folds into quadruplex structures in vitro
4, 5—is highly mutagenic in vivo and is removed from genomes that lack
dog-1, the
C. elegans ortholog of mammalian FANCJ
6, 7, which is mutated in Fanconi anemia patients
8–11. We show that sequences that match the G4 DNA signature G
3-5N
1-3G
3-5N
1-3G
3-5N
1-3G
3-5 are deleted in germ and somatic tissues of
dog-1 animals. Unbiased aCGH analyses of
dog-1 genomes that were allowed to accumulate mutations in >100 replication cycles indicate that deletions are found exclusively at G4 DNA; deletion frequencies can reach 4% per site per animal generation. We found that deletion sizes fall short of Okazaki fragment lengths
12, and no significant microhomology was observed at deletion junctions. The existence of 376,000 potentially mutagenic G4 DNA sites in the human genome could have major implications for the etiology of hereditary FancJ and nonhereditary cancers.
Regeneration-capable flatworms are informative research models to study the mechanisms of stem cell regulation, regeneration, and tissue patterning. However, the lack of transgenesis methods ...considerably hampers their wider use. Here we report development of a transgenesis method for Macrostomum lignano, a basal flatworm with excellent regeneration capacity. We demonstrate that microinjection of DNA constructs into fertilized one-cell stage eggs, followed by a low dose of irradiation, frequently results in random integration of the transgene in the genome and its stable transmission through the germline. To facilitate selection of promoter regions for transgenic reporters, we assembled and annotated the M. lignano genome, including genome-wide mapping of transcription start regions, and show its utility by generating multiple stable transgenic lines expressing fluorescent proteins under several tissue-specific promoters. The reported transgenesis method and annotated genome sequence will permit sophisticated genetic studies on stem cells and regeneration using M. lignano as a model organism.
Cell type-specific differential gene expression analyses based on single-cell transcriptome datasets are sensitive to the presence of cell-free mRNA in the droplets containing single cells. This ...so-called ambient RNA contamination may differ between samples obtained from patients and healthy controls. Current ambient RNA correction methods were not developed specifically for single-cell differential gene expression (sc-DGE) analyses and might therefore not sufficiently correct for ambient RNA-derived signals. Here, we show that ambient RNA levels are highly sample-specific. We found that without ambient RNA correction, sc-DGE analyses erroneously identify transcripts originating from ambient RNA as cell type-specific disease-associated genes. We therefore developed a computationally lean and intuitive correction method, Fast Correction for Ambient RNA (FastCAR), optimized for sc-DGE analysis of scRNA-Seq datasets generated by droplet-based methods including the 10XGenomics Chromium platform. FastCAR uses the profile of transcripts observed in libraries that likely represent empty droplets to determine the level of ambient RNA in each individual sample, and then corrects for these ambient RNA gene expression values. FastCAR can be applied as part of the data pre-processing and QC in sc-DGE workflows comparing scRNA-Seq data in a health versus disease experimental design. We compared FastCAR with two methods previously developed to remove ambient RNA, SoupX and CellBender. All three methods identified additional genes in sc-DGE analyses that were not identified in the absence of ambient RNA correction. However, we show that FastCAR performs better at correcting gene expression values attributed to ambient RNA, resulting in a lower frequency of false-positive observations. Moreover, the use of FastCAR in a sc-DGE workflow increases the cell-type specificity of sc-DGE analyses across disease conditions.
Accurate reference genome sequences provide the foundation for modern molecular biology and genomics as the interpretation of sequence data to study evolution, gene expression, and epigenetics ...depends heavily on the quality of the genome assembly used for its alignment. Correctly organising sequenced fragments such as contigs and scaffolds in relation to each other is a critical and often challenging step in the construction of robust genome references. We previously identified misoriented regions in the mouse and human reference assemblies using Strand-seq, a single cell sequencing technique that preserves DNA directionality Here we demonstrate the ability of Strand-seq to build and correct full-length chromosomes by identifying which scaffolds belong to the same chromosome and determining their correct order and orientation, without the need for overlapping sequences. We demonstrate that Strand-seq exquisitely maps assembly fragments into large related groups and chromosome-sized clusters without using new assembly data. Using template strand inheritance as a bi-allelic marker, we employ genetic mapping principles to cluster scaffolds that are derived from the same chromosome and order them within the chromosome based solely on directionality of DNA strand inheritance. We prove the utility of our approach by generating improved genome assemblies for several model organisms including the ferret, pig, Xenopus, zebrafish, Tasmanian devil and the Guinea pig.