Despite extraordinary efforts to profile cancer genomes, interpreting the vast amount of genomic data in the light of cancer evolution remains challenging. Here we demonstrate that neutral tumor ...evolution results in a power-law distribution of the mutant allele frequencies reported by next-generation sequencing of tumor bulk samples. We find that the neutral power law fits with high precision 323 of 904 cancers from 14 types and from different cohorts. In malignancies identified as evolving neutrally, all clonal selection seemingly occurred before the onset of cancer growth and not in later-arising subclones, resulting in numerous passenger mutations that are responsible for intratumoral heterogeneity. Reanalyzing cancer sequencing data within the neutral framework allowed the measurement, in each patient, of both the in vivo mutation rate and the order and timing of mutations. This result provides a new way to interpret existing cancer genomic data and to discriminate between functional and non-functional intratumoral heterogeneity.
Subclonal architectures are prevalent across cancer types. However, the temporal evolutionary dynamics that produce tumor subclones remain unknown. Here we measure clone dynamics in human cancers by ...using computational modeling of subclonal selection and theoretical population genetics applied to high-throughput sequencing data. Our method determined the detectable subclonal architecture of tumor samples and simultaneously measured the selective advantage and time of appearance of each subclone. We demonstrate the accuracy of our approach and the extent to which evolutionary dynamics are recorded in the genome. Application of our method to high-depth sequencing data from breast, gastric, blood, colon and lung cancer samples, as well as metastatic deposits, showed that detectable subclones under selection, when present, consistently emerged early during tumor growth and had a large fitness advantage (>20%). Our quantitative framework provides new insight into the evolutionary trajectories of human cancers and facilitates predictive measurements in individual tumors from widely available sequencing data.
Most cancer genomic data are generated from bulk samples composed of mixtures of cancer subpopulations, as well as normal cells. Subclonal reconstruction methods based on machine learning aim to ...separate those subpopulations in a sample and infer their evolutionary history. However, current approaches are entirely data driven and agnostic to evolutionary theory. We demonstrate that systematic errors occur in the analysis if evolution is not accounted for, and this is exacerbated with multi-sampling of the same tumor. We present a novel approach for model-based tumor subclonal reconstruction, called MOBSTER, which combines machine learning with theoretical population genetics. Using public whole-genome sequencing data from 2,606 samples from different cohorts, new data and synthetic validation, we show that this method is more robust and accurate than current techniques in single-sample, multiregion and longitudinal data. This approach minimizes the confounding factors of nonevolutionary methods, thus leading to more accurate recovery of the evolutionary history of human cancers.
Quantification of the effect of spatial tumour sampling on the patterns of mutations detected in next-generation sequencing data is largely lacking. Here we use a spatial stochastic cellular ...automaton model of tumour growth that accounts for somatic mutations, selection, drift and spatial constraints, to simulate multi-region sequencing data derived from spatial sampling of a neoplasm. We show that the spatial structure of a solid cancer has a major impact on the detection of clonal selection and genetic drift from both bulk and single-cell sequencing data. Our results indicate that spatial constrains can introduce significant sampling biases when performing multi-region bulk sampling and that such bias becomes a major confounding factor for the measurement of the evolutionary dynamics of human tumours. We also propose a statistical inference framework that incorporates spatial effects within a growing tumour and so represents a further step forwards in the inference of evolutionary dynamics from genomic data. Our analysis shows that measuring cancer evolution using next-generation sequencing while accounting for the numerous confounding factors remains challenging. However, mechanistic model-based approaches have the potential to capture the sources of noise and better interpret the data.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Cancers originate from somatic cells in the human body that have accumulated genetic alterations. These mutations modify the phenotype of the cells, allowing them to escape the homeostatic regulation ...that maintains normal cell number. Viewed through the lens of evolutionary biology, the transformation of normal cells into malignant cells is evolution in action. Evolution continues throughout cancer growth, progression, treatment resistance, and disease relapse, driven by adaptation to changes in the cancer's environment, and intratumor heterogeneity is an inevitable consequence of this evolutionary process. Genomics provides a powerful means to characterize tumor evolution, enabling quantitative measurement of evolving clones across space and time. In this review, we discuss concepts and approaches to quantify and measure this evolutionary process in cancer using genomics.
To establish whether 4-nitroquinoline N-oxide-induced carcinogenesis mirrors the heterogeneity of human oral squamous cell carcinoma (OSCC), we have performed genomic analysis of mouse tongue ...lesions. The mutational signatures of human and mouse OSCC overlap extensively. Mutational burden is higher in moderate dysplasias and invasive SCCs than in hyperplasias and mild dysplasias, although mutations in p53, Notch1 and Fat1 occur in early lesions. Laminin-α3 mutations are associated with tumour invasiveness and Notch1 mutant tumours have an increased immune infiltrate. Computational modelling of clonal dynamics indicates that high genetic heterogeneity may be a feature of those mild dysplasias that are likely to progress to more aggressive tumours. These studies provide a foundation for exploring OSCC evolution, heterogeneity and progression.
Subclonal copy number alterations are a prevalent feature in tumors with high chromosomal instability and result in heterogeneous cancer cell populations with distinct phenotypes. However, the extent ...to which subclonal copy number alterations contribute to clone-specific phenotypes remains poorly understood. We develop TreeAlign, which computationally integrates independently sampled single-cell DNA and RNA sequencing data from the same cell population. TreeAlign accurately encodes dosage effects from subclonal copy number alterations, the impact of allelic imbalance on allele-specific transcription, and obviates the need to define genotypic clones from a phylogeny a priori, leading to highly granular definitions of clones with distinct expression programs. These improvements enable clone-clone gene expression comparisons with higher resolution and identification of expression programs that are genomically independent. Our approach sets the stage for dissecting the relative contribution of fixed genomic alterations and dynamic epigenetic processes on gene expression programs in cancer.
IBD confers an increased lifetime risk of developing colorectal cancer (CRC), and colitis-associated CRC (CA-CRC) is molecularly distinct from sporadic CRC (S-CRC). Here we have dissected the ...evolutionary history of CA-CRC using multiregion sequencing.
Exome sequencing was performed on fresh-frozen multiple regions of carcinoma, adjacent non-cancerous mucosa and blood from 12 patients with CA-CRC (n=55 exomes), and key variants were validated with orthogonal methods. Genome-wide copy number profiling was performed using single nucleotide polymorphism arrays and low-pass whole genome sequencing on archival non-dysplastic mucosa (n=9), low-grade dysplasia (LGD; n=30), high-grade dysplasia (HGD; n=13), mixed LGD/HGD (n=7) and CA-CRC (n=19). Phylogenetic trees were reconstructed, and evolutionary analysis used to reveal the temporal sequence of events leading to CA-CRC.
10/12 tumours were microsatellite stable with a median mutation burden of 3.0 single nucleotide alterations (SNA) per Mb, ~20% higher than S-CRC (2.5 SNAs/Mb), and consistent with elevated ageing-associated mutational processes. Non-dysplastic mucosa had considerable mutation burden (median 47 SNAs), including mutations shared with the neighbouring CA-CRC, indicating a precancer mutational field. CA-CRCs were often near triploid (40%) or near tetraploid (20%) and phylogenetic analysis revealed that copy number alterations (CNAs) began to accrue in non-dysplastic bowel, but the LGD/HGD transition often involved a punctuated 'catastrophic' CNA increase.
Evolutionary genomic analysis revealed precancer clones bearing extensive SNAs and CNAs, with progression to cancer involving a dramatic accrual of CNAs at HGD. Detection of the cancerised field is an encouraging prospect for surveillance, but punctuated evolution may limit the window for early detection.
Mitochondrial DNA (mtDNA) variants influence the risk of late-onset human diseases, but the reasons for this are poorly understood. Undertaking a hypothesis-free analysis of 5,689 blood-derived ...biomarkers with mtDNA variants in 16,220 healthy donors, here we show that variants defining mtDNA haplogroups Uk and H4 modulate the level of circulating N-formylmethionine (fMet), which initiates mitochondrial protein translation. In human cytoplasmic hybrid (cybrid) lines, fMet modulated both mitochondrial and cytosolic proteins on multiple levels, through transcription, post-translational modification and proteolysis by an N-degron pathway, abolishing known differences between mtDNA haplogroups. In a further 11,966 individuals, fMet levels contributed to all-cause mortality and the disease risk of several common cardiovascular disorders. Together, these findings indicate that fMet plays a key role in common age-related disease through pleiotropic effects on cell proteostasis.
Both normal tissue development and cancer growth are driven by a branching process of cell division and mutation accumulation that leads to intra-tissue genetic heterogeneity. However, quantifying ...somatic evolution in humans remains challenging. Here, we show that multi-sample genomic data from a single time point of normal and cancer tissues contains information on single-cell divisions. We present a new theoretical framework that, applied to whole-genome sequencing data of healthy tissue and cancer, allows inferring the mutation rate and the cell survival/death rate per division. On average, we found that cells accumulate 1.14 mutations per cell division in healthy haematopoiesis and 1.37 mutations per division in brain development. In both tissues, cell survival was maximal during early development. Analysis of 131 biopsies from 16 tumours showed 4 to 100 times increased mutation rates compared to healthy development and substantial inter-patient variation of cell survival/death rates.