Droplet-based single-cell RNA sequence analyses assume that all acquired RNAs are endogenous to cells. However, any cell-free RNAs contained within the input solution are also captured by these ...assays. This sequencing of cell-free RNA constitutes a background contamination that confounds the biological interpretation of single-cell transcriptomic data.
We demonstrate that contamination from this "soup" of cell-free RNAs is ubiquitous, with experiment-specific variations in composition and magnitude. We present a method, SoupX, for quantifying the extent of the contamination and estimating "background-corrected" cell expression profiles that seamlessly integrate with existing downstream analysis tools. Applying this method to several datasets using multiple droplet sequencing technologies, we demonstrate that its application improves biological interpretation of otherwise misleading data, as well as improving quality control metrics.
We present SoupX, a tool for removing ambient RNA contamination from droplet-based single-cell RNA sequencing experiments. This tool has broad applicability, and its application can improve the biological utility of existing and future datasets.
We present GOseq, an application for performing Gene Ontology (GO) analysis on RNA-seq data. GO analysis is widely used to reduce complexity and highlight biological processes in genome-wide ...expression studies, but standard methods give biased results on RNA-seq data due to over-detection of differential expression for long and highly expressed transcripts. Application of GOseq to a prostate cancer data set shows that GOseq dramatically changes the results, highlighting categories more consistent with the known biology.
Summary Background Nivolumab has shown improved survival in the treatment of advanced non-small-cell lung cancer (NSCLC) previously treated with chemotherapy. We assessed the safety and activity of ...combination nivolumab plus ipilimumab as first-line therapy for NSCLC. Methods The open-label, phase 1, multicohort study (CheckMate 012) cohorts reported here were enrolled at eight US academic centres. Eligible patients were aged 18 years or older with histologically or cytologically confirmed recurrent stage IIIb or stage IV, chemotherapy-naive NSCLC. Patients were randomly assigned (1:1:1) by an interactive voice response system to receive nivolumab 1 mg/kg every 2 weeks plus ipilimumab 1 mg/kg every 6 weeks, nivolumab 3 mg/kg every 2 weeks plus ipilimumab 1 mg/kg every 12 weeks, or nivolumab 3 mg/kg every 2 weeks plus ipilimumab 1 mg/kg every 6 weeks until disease progression, unacceptable toxicities, or withdrawal of consent. Data from the latter two cohorts, which were considered potentially suitable for further clinical development, are presented in this report; data from the other cohort (as well as several earlier cohorts) are described in the appendix . The primary outcome was safety and tolerability, assessed in all treated patients. This ongoing study is registered with ClinicalTrials.gov , number NCT01454102. Findings Between May 15, 2014, and March 25, 2015, 78 patients were randomly assigned to receive nivolumab every 2 weeks plus ipilimumab every 12 weeks (n=38) or nivolumab every 2 weeks plus ipilimumab every 6 weeks (n=40). One patient in the ipilimumab every-6-weeks cohort was excluded before treatment; therefore 77 patients actually received treatment (38 in the ipilimumab every-12-weeks cohort; 39 in the ipilimumab every-6-weeks cohort). At data cut-off on Jan 7, 2016, 29 (76%) patients in the ipilimumab every-12-weeks cohort and 32 (82%) in the ipilimumab every-6-weeks cohort had discontinued treatment. Grade 3–4 treatment-related adverse events occurred in 14 (37%) patients in the ipilimumab every-12-weeks cohort and 13 (33%) patients in the every-6-weeks cohort; the most commonly reported grade 3 or 4 treatment-related adverse events were increased lipase (three 8% and no patients), pneumonitis (two 5% and one 3% patients), adrenal insufficiency (one 3% and two 5% patients), and colitis (one 3% and two 5% patients). Treatment-related serious adverse events were reported in 12 (32%) patients in the ipilimumab every-12-weeks cohort and 11 (28%) patients in the every-6-weeks cohort. Treatment-related adverse events (any grade) prompted treatment discontinuation in four (11%) patients in the every-12-weeks cohort and five (13%) patients in the every-6-weeks cohort. No treatment-related deaths occurred. Confirmed objective responses were achieved in 18 (47% 95% CI 31–64) patients in the ipilimumab every-12-weeks cohort and 15 (38% 95% CI 23–55) patients in the ipilimumab every-6-weeks cohort; median duration of response was not reached in either cohort, with median follow-up times of 12·8 months (IQR 9·3–15·5) in the ipilimumab every-12-weeks cohort and 11·8 months (6·7–15·9) in the ipilimumab every-6-weeks cohort. In patients with PD-L1 of 1% or greater, confirmed objective responses were achieved in 12 (57%) of 21 patients in the ipilimumab every-12-weeks cohort and 13 (57%) of 23 patients in the ipilimumab every-6-weeks cohort. Interpretation In NSCLC, first-line nivolumab plus ipilimumab had a tolerable safety profile and showed encouraging clinical activity characterised by a high response rate and durable response. To our knowledge, the results of this study are the first suggestion of improved benefit compared with anti-PD-1 monotherapy in patients with NSCLC, supporting further assessment of this combination in a phase 3 study. Funding Bristol-Myers Squibb.
Germ cell tumours (GCTs) are a collection of benign and malignant neoplasms derived from primordial germ cells. They are uniquely able to recapitulate embryonic and extraembryonic tissues, which ...carries prognostic and therapeutic significance. The developmental pathways underpinning GCT initiation and histogenesis are incompletely understood. Here, we study the relationship of histogenesis and clonal diversification in GCTs by analysing the genomes and transcriptomes of 547 microdissected histological units. We find no correlation between genomic and histological heterogeneity. However, we identify unifying features including the retention of fetal developmental transcripts across tissues, expression changes on chromosome 12p, and a conserved somatic evolutionary sequence of whole genome duplication followed by clonal diversification. While this pattern is preserved across all GCTs, the developmental timing of the duplication varies between prepubertal and postpubertal cases. In addition, tumours of younger children exhibit distinct substitution signatures which may lend themselves as potential biomarkers for risk stratification. Our findings portray the extensive diversification of GCT tissues and genetic subclones as randomly distributed, while identifying overarching transcriptional and genomic features.
Abstract
Motivation
Increasing numbers of large scale single cell RNA-Seq projects are leading to a data explosion, which can only be fully exploited through data integration. A number of methods ...have been developed to combine diverse datasets by removing technical batch effects, but most are computationally intensive. To overcome the challenge of enormous datasets, we have developed BBKNN, an extremely fast graph-based data integration algorithm. We illustrate the power of BBKNN on large scale mouse atlasing data, and favourably benchmark its run time against a number of competing methods.
Availability and implementation
BBKNN is available at https://github.com/Teichlab/bbknn, along with documentation and multiple example notebooks, and can be installed from pip.
Supplementary information
Supplementary data are available at Bioinformatics online.
Placentas can exhibit chromosomal aberrations that are absent from the fetus
. The basis of this genetic segregation, which is known as confined placental mosaicism, remains unknown. Here we ...investigated the phylogeny of human placental cells as reconstructed from somatic mutations, using whole-genome sequencing of 86 bulk placental samples (with a median weight of 28 mg) and of 106 microdissections of placental tissue. We found that every bulk placental sample represents a clonal expansion that is genetically distinct, and exhibits a genomic landscape akin to that of childhood cancer in terms of mutation burden and mutational imprints. To our knowledge, unlike any other healthy human tissue studied so far, the placental genomes often contained changes in copy number. We reconstructed phylogenetic relationships between tissues from the same pregnancy, which revealed that developmental bottlenecks genetically isolate placental tissues by separating trophectodermal lineages from lineages derived from the inner cell mass. Notably, there were some cases with full segregation-within a few cell divisions of the zygote-of placental lineages and lineages derived from the inner cell mass. Such early embryonic bottlenecks may enable the normalization of zygotic aneuploidy. We observed direct evidence for this in a case of mosaic trisomic rescue. Our findings reveal extensive mutagenesis in placental tissues and suggest that mosaicism is a typical feature of placental development.
Post-translational protein modifications such as phosphorylation and ubiquitinylation are common molecular targets of conflict between viruses and their hosts. However, the role of other ...post-translational modifications, such as ADP-ribosylation, in host-virus interactions is less well characterized. ADP-ribosylation is carried out by proteins encoded by the PARP (also called ARTD) gene family. The majority of the 17 human PARP genes are poorly characterized. However, one PARP protein, PARP13/ZAP, has broad antiviral activity and has evolved under positive (diversifying) selection in primates. Such evolution is typical of domains that are locked in antagonistic 'arms races' with viral factors. To identify additional PARP genes that may be involved in host-virus interactions, we performed evolutionary analyses on all primate PARP genes to search for signatures of rapid evolution. Contrary to expectations that most PARP genes are involved in 'housekeeping' functions, we found that nearly one-third of PARP genes are evolving under strong recurrent positive selection. We identified a >300 amino acid disordered region of PARP4, a component of cytoplasmic vault structures, to be rapidly evolving in several mammalian lineages, suggesting this region serves as an important host-pathogen specificity interface. We also found positive selection of PARP9, 14 and 15, the only three human genes that contain both PARP domains and macrodomains. Macrodomains uniquely recognize, and in some cases can reverse, protein mono-ADP-ribosylation, and we observed strong signatures of recurrent positive selection throughout the macro-PARP macrodomains. Furthermore, PARP14 and PARP15 have undergone repeated rounds of gene birth and loss during vertebrate evolution, consistent with recurrent gene innovation. Together with previous studies that implicated several PARPs in immunity, as well as those that demonstrated a role for virally encoded macrodomains in host immune evasion, our evolutionary analyses suggest that addition, recognition and removal of ADP-ribosylation is a critical, underappreciated currency in host-virus conflicts.
Colonoscopy examination does not always detect colorectal cancer (CRC)— some patients develop CRC after negative findings from an examination. When this occurs before the next recommended ...examination, it is called interval cancer. From a colonoscopy quality assurance perspective, that term is too restrictive, so the term post-colonoscopy colorectal cancer (PCCRC) was created in 2010. However, PCCRC definitions and methods for calculating rates vary among studies, making it impossible to compare results. We aimed to standardize the terminology, identification, analysis, and reporting of PCCRCs and CRCs detected after other whole-colon imaging evaluations (post-imaging colorectal cancers PICRCs).
A 20-member international team of gastroenterologists, pathologists, and epidemiologists; a radiologist; and a non-medical professional met to formulate a series of recommendations, standardize definitions and categories (to align with interval cancer terminology), develop an algorithm to determine most-plausible etiologies, and develop standardized methodology to calculate rates of PCCRC and PICRC. The team followed the Appraisal of Guidelines for Research and Evaluation II tool. A literature review provided 401 articles to support proposed statements; evidence was rated using the GRADE (Grading of Recommendations Assessment, Development and Evaluation) system. The statements were voted on anonymously by team members, using a modified Delphi approach.
The team produced 21 statements that provide comprehensive guidance on PCCRCs and PICRCs. The statements present standardized definitions and terms, as well as methods for qualitative review, determination of etiology, calculation of PCCRC rates, and non-colonoscopic imaging of the colon.
A 20-member international team has provided standardized methods for analysis of etiologies of PCCRCs and PICRCs and defines its use as a quality indicator. The team provides recommendations for clinicians, organizations, researchers, policy makers, and patients.