Genome-sequencing studies indicate that all humans carry many genetic variants predicted to cause loss of function (LoF) of protein-coding genes, suggesting unexpected redundancy in the human genome. ...Here we apply stringent filters to 2951 putative LoF variants obtained from 185 human genomes to determine their true prevalence and properties. We estimate that human genomes typically contain ~100 genuine LoF variants with ~20 genes completely inactivated. We identify rare and likely deleterious LoF alleles, including 26 known and 21 predicted severe disease—causing variants, as well as common LoF variants in nonessential genes. We describe functional and evolutionary differences between LoF-tolerant and recessive disease genes and a method for using these differences to prioritize candidate genes found in clinical sequencing studies.
Personal-genomics endeavors, such as the 1000 Genomes project, are generating maps of genomic structural variants by analyzing ends of massively sequenced genome fragments. To process these we ...developed Paired-End Mapper (PEMer; http://sv.gersteinlab.org/pemer). This comprises an analysis pipeline, compatible with several next-generation sequencing platforms; simulation-based error models, yielding confidence-values for each structural variant; and a back-end database. The simulations demonstrated high structural variant reconstruction efficiency for PEMer's coverage-adjusted multi-cutoff scoring-strategy and showed its relative insensitivity to base-calling errors.
The phase III JAVELIN Renal 101 trial demonstrated prolonged progression-free survival (PFS) in patients (N = 886) with advanced renal cell carcinoma treated with first-line avelumab + axitinib ...(A+Ax) versus sunitinib. We report novel findings from integrated analyses of longitudinal blood samples and baseline tumor tissue. PFS was associated with elevated lymphocyte levels in the sunitinib arm and an abundance of innate immune subsets in the A+Ax arm. Treatment with A+Ax led to greater T-cell repertoire modulation and less change in T-cell numbers versus sunitinib. In the A+Ax arm, patients with tumors harboring mutations in ≥2 of 10 previously identified PFS-associated genes (double mutants) had distinct circulating and tumor-infiltrating immunologic profiles versus those with wild-type or single-mutant tumors, suggesting a role for non-T-cell-mediated and non-natural killer cell-mediated mechanisms in double-mutant tumors. We provide evidence for different immunomodulatory mechanisms based on treatment (A+Ax vs. sunitinib) and tumor molecular subtypes.
Our findings provide novel insights into the different immunomodulatory mechanisms governing responses in patients treated with avelumab (PD-L1 inhibitor) + axitinib or sunitinib (both VEGF inhibitors), highlighting the contribution of tumor biology to the complexity of the roles and interactions of infiltrating immune cells in response to these treatment regimens. This article is featured in Selected Articles from This Issue, p. 384.
Molecular profiling of cancer is increasingly common as part of routine care in oncology, and germline and somatic profiling may provide insights and actionable targets for men with metastatic ...prostate cancer. However, all reported cases are of deidentified individuals without full medical and genomic data available in the public domain.
We present a case of whole-genome tumor and germline sequencing in a patient with advanced prostate cancer, who has agreed to make his genomic and clinical data publicly available.
We describe an 84-year-old Caucasian male with a Gleason 10 oligometastastic hormone-sensitive prostate cancer. Whole-genome sequencing provided insights into his tumor's underlying mutational processes and the development of an SPOP mutation. It also revealed an androgen-receptor dependency of his cancer which was reflected in his durable response to radiation and hormonal therapy. Potentially actionable genomic lesions in the tumor were identified through a personalized medicine approach for potential future therapy, but at the moment, he remains in remission, illustrating the hormonal sensitivity of his SPOP-driven prostate cancer. We also placed this patient in the context of a large prostate-cancer cohort from the PCAWG (Pan-cancer Analysis of Whole Genomes) group. In this comparison, the patient's cancer appears typical in terms of the number and type of somatic mutations, but it has a somewhat larger contribution from the mutational process associated with aging.
We combined the expertise of medical oncology and genomics approaches to develop a molecular tumor board to integrate the care and study of this patient, who continues to have an outstanding response to his combined modality treatment. This identifiable case potentially helps overcome barriers to clinical and genomic data sharing.
BACKGROUND: Pseudogenes have long been considered as nonfunctional genomic sequences. However, recent evidence suggests that many of them might have some form of biological activity, and the ...possibility of functionality has increased interest in their accurate annotation and integration with functional genomics data. RESULTS: As part of the GENCODE annotation of the human genome, we present the first genome-wide pseudogene assignment for protein-coding genes, based on both large-scale manual annotation and in silico pipelines. A key aspect of this coupled approach is that it allows us to identify pseudogenes in an unbiased fashion as well as untangle complex events through manual evaluation. We integrate the pseudogene annotations with the extensive ENCODE functional genomics information. In particular, we determine the expression level, transcription-factor and RNA polymerase II binding, and chromatin marks associated with each pseudogene. Based on their distribution, we develop simple statistical models for each type of activity, which we validate with large-scale RT-PCR-Seq experiments. Finally, we compare our pseudogenes with conservation and variation data from primate alignments and the 1000 Genomes project, producing lists of pseudogenes potentially under selection. CONCLUSIONS: At one extreme, some pseudogenes possess conventional characteristics of functionality; these may represent genes that have recently died. On the other hand, we find interesting patterns of partial activity, which may suggest that dead genes are being resurrected as functioning non-coding RNAs. The activity data of each pseudogene are stored in an associated resource, psiDR, which will be useful for the initial identification of potentially functional pseudogenes.
Open source and open data have been driving forces in bioinformatics in the past. However, privacy concerns may soon change the landscape, limiting future access to important data sets, including ...personal genomics data. Here we survey this situation in some detail, describing, in particular, how the large scale of the data from personal genomic sequencing makes it especially hard to share data, exacerbating the privacy problem. We also go over various aspects of genomic privacy: first, there is basic identifiability of subjects having their genome sequenced. However, even for individuals who have consented to be identified, there is the prospect of very detailed future characterization of their genotype, which, unanticipated at the time of their consent, may be more personal and invasive than the release of their medical records. We go over various computational strategies for dealing with the issue of genomic privacy. One can "slice" and reformat datasets to allow them to be partially shared while securing the most private variants. This is particularly applicable to functional genomics information, which can be largely processed without variant information. For handling the most private data there are a number of legal and technological approaches--for example, modifying the informed consent procedure to acknowledge that privacy cannot be guaranteed, and/or employing a secure cloud computing environment. Cloud computing in particular may allow access to the data in a more controlled fashion than the current practice of downloading and computing on large datasets. Furthermore, it may be particularly advantageous for small labs, given that the burden of many privacy issues falls disproportionately on them in comparison to large corporations and genome centers. Finally, we discuss how education of future genetics researchers will be important, with curriculums emphasizing privacy and data security. However, teaching personal genomics with identifiable subjects in the university setting will, in turn, create additional privacy issues and social conundrums.