We report the first large-scale exome-wide analysis of the combined germline-somatic landscape in ovarian cancer. Here we analyse germline and somatic alterations in 429 ovarian carcinoma cases and ...557 controls. We identify 3,635 high confidence, rare truncation and 22,953 missense variants with predicted functional impact. We find germline truncation variants and large deletions across Fanconi pathway genes in 20% of cases. Enrichment of rare truncations is shown in BRCA1, BRCA2 and PALB2. In addition, we observe germline truncation variants in genes not previously associated with ovarian cancer susceptibility (NF1, MAP3K4, CDKN2B and MLL3). Evidence for loss of heterozygosity was found in 100 and 76% of cases with germline BRCA1 and BRCA2 truncations, respectively. Germline-somatic interaction analysis combined with extensive bioinformatics annotation identifies 222 candidate functional germline truncation and missense variants, including two pathogenic BRCA1 and 1 TP53 deleterious variants. Finally, integrated analyses of germline and somatic variants identify significantly altered pathways, including the Fanconi, MAPK and MLL pathways.
Medulloblastoma is a malignant childhood brain tumour comprising four discrete subgroups. Here, to identify mutations that drive medulloblastoma, we sequenced the entire genomes of 37 tumours and ...matched normal blood. One-hundred and thirty-six genes harbouring somatic mutations in this discovery set were sequenced in an additional 56 medulloblastomas. Recurrent mutations were detected in 41 genes not yet implicated in medulloblastoma; several target distinct components of the epigenetic machinery in different disease subgroups, such as regulators of H3K27 and H3K4 trimethylation in subgroups 3 and 4 (for example, KDM6A and ZMYM3), and CTNNB1-associated chromatin re-modellers in WNT-subgroup tumours (for example, SMARCA4 and CREBBP). Modelling of mutations in mouse lower rhombic lip progenitors that generate WNT-subgroup tumours identified genes that maintain this cell lineage (DDX3X), as well as mutated genes that initiate (CDH1) or cooperate (PIK3CA) in tumorigenesis. These data provide important new insights into the pathogenesis of medulloblastoma subgroups and highlight targets for therapeutic development.
Helicobacter pylori produces acute superficial gastritis in nearly all of its human hosts. However, a subset of individuals develops chronic atrophic gastritis (ChAG), a condition characterized in ...part by diminished numbers of acid-producing parietal cells and increased risk for development of gastric adenocarcinoma. Previously, we used a gnotobiotic transgenic mouse model with an engineered ablation of parietal cells to show that loss of parietal cells provides an opportunity for a H. pylori isolate from a patient with ChAG (HPAG1) to bind to, enter, and persist within gastric stem cells. This finding raises the question of how ChAG influences H. pylori genome evolution, physiology, and tumorigenesis. Here we describe the 1,596,366-bp HPAG1 genome. Custom HPAG1 Affymetrix GeneChips, representing 99.6% of its predicted ORFs, were used for whole-genome genotyping of additional H. pylori ChAG isolates obtained from Swedish patients enrolled in a casecontrol study of gastric cancer, as well as ChAG- and cancerassociated isolates from an individual who progressed from ChAG to gastric adenocarcinoma. The results reveal a shared gene signature among ChAG strains, as well as genes that may have been lost or gained during progression to adenocarcinoma. Wholegenome transcriptional profiling of HPAG1's response to acid during in vitro growth indicates that genes encoding components of metal uptake and utilization pathways, outer membrane proteins, and virulence factors are among those associated with H. pylori's adaptation to ChAG.
Retinoblastoma is a rare childhood cancer of the developing retina. Most retinoblastomas initiate with biallelic inactivation of the RB1 gene through diverse mechanisms including point mutations, ...nucleotide insertions, deletions, loss of heterozygosity and promoter hypermethylation. Recently, a novel mechanism of retinoblastoma initiation was proposed. Gallie and colleagues discovered that a small proportion of retinoblastomas lack RB1 mutations and had MYCN amplification 1. In this study, we identified recurrent chromosomal, regional and focal genomic lesions in 94 primary retinoblastomas with their matched normal DNA using SNP 6.0 chips. We also analyzed the RB1 gene mutations and compared the mechanism of RB1 inactivation to the recurrent copy number variations in the retinoblastoma genome. In addition to the previously described focal amplification of MYCN and deletions in RB1 and BCOR, we also identified recurrent focal amplification of OTX2, a transcription factor required for retinal photoreceptor development. We identified 10 retinoblastomas in our cohort that lacked RB1 point mutations or indels. We performed whole genome sequencing on those 10 tumors and their corresponding germline DNA. In one of the tumors, the RB1 gene was unaltered, the MYCN gene was amplified and RB1 protein was expressed in the nuclei of the tumor cells. In addition, several tumors had complex patterns of structural variations and we identified 3 tumors with chromothripsis at the RB1 locus. This is the first report of chromothripsis as a mechanism for RB1 gene inactivation in cancer.
A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high ...quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly.
Oil palm is the most productive oil-bearing crop. Although it is planted on only 5% of the total world vegetable oil acreage, palm oil accounts for 33% of vegetable oil and 45% of edible oil ...worldwide, but increased cultivation competes with dwindling rainforest reserves. We report the 1.8-gigabase (Gb) genome sequence of the African oil palm Elaeis guineensis, the predominant source of worldwide oil production. A total of 1.535 Gb of assembled sequence and transcriptome data from 30 tissue types were used to predict at least 34,802 genes, including oil biosynthesis genes and homologues of WRINKLED1 (WRI1), and other transcriptional regulators, which are highly expressed in the kernel. We also report the draft sequence of the South American oil palm Elaeis oleifera, which has the same number of chromosomes (2n = 32) and produces fertile interspecific hybrids with E. guineensis but seems to have diverged in the New World. Segmental duplications of chromosome arms define the palaeotetraploid origin of palm trees. The oil palm sequence enables the discovery of genes for important traits as well as somaclonal epigenetic alterations that restrict the use of clones in commercial plantings, and should therefore help to achieve sustainability for biofuels and edible oils, reducing the rainforest footprint of this tropical plantation crop.
The human X and Y chromosomes evolved from an ordinary pair of autosomes during the past 200-300 million years. The human MSY (male-specific region of Y chromosome) retains only three percent of the ...ancestral autosomes' genes owing to genetic decay. This evolutionary decay was driven by a series of five 'stratification' events. Each event suppressed X-Y crossing over within a chromosome segment or 'stratum', incorporated that segment into the MSY and subjected its genes to the erosive forces that attend the absence of crossing over. The last of these events occurred 30 million years ago, 5 million years before the human and Old World monkey lineages diverged. Although speculation abounds regarding ongoing decay and looming extinction of the human Y chromosome, remarkably little is known about how many MSY genes were lost in the human lineage in the 25 million years that have followed its separation from the Old World monkey lineage. To investigate this question, we sequenced the MSY of the rhesus macaque, an Old World monkey, and compared it to the human MSY. We discovered that during the last 25 million years MSY gene loss in the human lineage was limited to the youngest stratum (stratum 5), which comprises three percent of the human MSY. In the older strata, which collectively comprise the bulk of the human MSY, gene loss evidently ceased more than 25 million years ago. Likewise, the rhesus MSY has not lost any older genes (from strata 1-4) during the past 25 million years, despite its major structural differences to the human MSY. The rhesus MSY is simpler, with few amplified gene families or palindromes that might enable intrachromosomal recombination and repair. We present an empirical reconstruction of human MSY evolution in which each stratum transitioned from rapid, exponential loss of ancestral genes to strict conservation through purifying selection.
To characterize patient-derived xenografts (PDXs) for functional studies, we made whole-genome comparisons with originating breast cancers representative of the major intrinsic subtypes. Structural ...and copy number aberrations were found to be retained with high fidelity. However, at the single-nucleotide level, variable numbers of PDX-specific somatic events were documented, although they were only rarely functionally significant. Variant allele frequencies were often preserved in the PDXs, demonstrating that clonal representation can be transplantable. Estrogen-receptor-positive PDXs were associated with ESR1 ligand-binding-domain mutations, gene amplification, or an ESR1/YAP1 translocation. These events produced different endocrine-therapy-response phenotypes in human, cell line, and PDX endocrine-response studies. Hence, deeply sequenced PDX models are an important resource for the search for genome-forward treatment options and capture endocrine-drug-resistance etiologies that are not observed in standard cell lines. The originating tumor genome provides a benchmark for assessing genetic drift and clonal representation after transplantation.
Display omitted
•Genome-wide structural variants are highly preserved in breast cancer PDX models•PDXs harbor rarely functionally significant single-nucleotide variants•PDXs harbor ESR1 mutations and translocations associated with endocrine therapy resistance•ESR1 gene amplification in PDX correlated with paradoxical estradiol-induced regression
In this study, Ellis and colleagues compare whole-tumor genomes from drug-resistant breast cancers with paired xenografts. Genomic fidelity upon transplantation was high for structural variants but variable at the single-nucleotide level. Therefore, tumor and xenograft whole-genome comparisons critically assess genetic drift and clonal representation. Additional analysis revealed ESR1 mutations, amplification, and translocations associated with endocrine resistance in lumenal xenografts. Sequenced patient-derived xenografts are an important resource for functional genomics and capture treatment-resistance etiologies that are not observed in standard cell lines.
There is interest in using leukemia-gene panels and next-generation sequencing to assess acute myelogenous leukemia (AML) response to induction chemotherapy. Studies have shown that patients with AML ...in morphologic remission may continue to have clonal hematopoiesis with populations closely related to the founding AML clone and that this confers an increased risk of relapse. However, it remains unknown how induction chemotherapy influences the clonal evolution of a patient's nonleukemic hematopoietic population. Here, we report that 5 of 15 patients with genetic clearance of their founding AML clone after induction chemotherapy had a concomitant expansion of a hematopoietic population unrelated to the initial AML. These populations frequently harbored somatic mutations in genes recurrently mutated in AML or myelodysplastic syndromes and were detectable at very low frequencies at the time of AML diagnosis. These results suggest that nonleukemic hematopoietic stem and progenitor cells, harboring specific aging-acquired mutations, may have a competitive fitness advantage after induction chemotherapy, expand, and persist long after the completion of chemotherapy. Although the clinical importance of these “rising” clones remains to be determined, it will be important to distinguish them from leukemia-related populations when assessing for molecular responses to induction chemotherapy.
•Hematopoietic populations unrelated to the AML founding clone often expand after induction therapy, resulting in oligoclonal hematopoiesis.
The human Y chromosome began to evolve from an autosome hundreds of millions of years ago, acquiring a sex-determining function and undergoing a series of inversions that suppressed crossing over ...with the X chromosome. Little is known about the recent evolution of the Y chromosome because only the human Y chromosome has been fully sequenced. Prevailing theories hold that Y chromosomes evolve by gene loss, the pace of which slows over time, eventually leading to a paucity of genes, and stasis. These theories have been buttressed by partial sequence data from newly emergent plant and animal Y chromosomes, but they have not been tested in older, highly evolved Y chromosomes such as that of humans. Here we finished sequencing of the male-specific region of the Y chromosome (MSY) in our closest living relative, the chimpanzee, achieving levels of accuracy and completion previously reached for the human MSY. By comparing the MSYs of the two species we show that they differ radically in sequence structure and gene content, indicating rapid evolution during the past 6 million years. The chimpanzee MSY contains twice as many massive palindromes as the human MSY, yet it has lost large fractions of the MSY protein-coding genes and gene families present in the last common ancestor. We suggest that the extraordinary divergence of the chimpanzee and human MSYs was driven by four synergistic factors: the prominent role of the MSY in sperm production, 'genetic hitchhiking' effects in the absence of meiotic crossing over, frequent ectopic recombination within the MSY, and species differences in mating behaviour. Although genetic decay may be the principal dynamic in the evolution of newly emergent Y chromosomes, wholesale renovation is the paramount theme in the continuing evolution of chimpanzee, human and perhaps other older MSYs.