Single-cell sequencing technologies that simultaneously generate multimodal cellular profiles present opportunities for improved understanding of cell heterogeneity in tissues. How the multimodal ...information can be integrated to obtain a common cell type identification, however, poses a computational challenge. Multilayer graphs provide a natural representation of multi-omic single-cell sequencing datasets, and finding cell clusters may be understood as a multilayer graph partition problem.
We introduce two spectral algorithms on multilayer graphs, spectral clustering on multilayer graphs and the weighted locally linear (WLL) method, to cluster cells in multi-omic single-cell sequencing datasets. We connect these algorithms through a unifying mathematical framework that represents each layer using a Hamiltonian operator and a mixture of its eigenstates to integrate the multiple graph layers, demonstrating in the process that the WLL method is a rigorous multilayer spectral graph theoretic reformulation of the popular Seurat weighted nearest neighbor (WNN) algorithm. Implementing our algorithms and applying them to a CITE-seq dataset of cord blood mononuclear cells yields results similar to the Seurat WNN analysis. Our work thus extends spectral methods to multimodal single-cell data analysis.
The code used in this study can be found at https://github.com/jssong-lab/sc-spectrum. All public data used in the article are accurately cited and described in Materials and Methods and in Supplementary Information.
Supplementary data are available at Bioinformatics online.
Immature teratoma is a subtype of malignant germ cell tumor of the ovary that occurs most commonly in the first three decades of life, frequently with bilateral ovarian disease. Despite being the ...second most common malignant germ cell tumor of the ovary, little is known about its genetic underpinnings. Here we performed multiregion whole-exome sequencing to interrogate the genetic zygosity, clonal relationship, DNA copy number, and mutational status of 52 pathologically distinct tumor components from ten females with ovarian immature teratomas, with bilateral tumors present in five cases and peritoneal dissemination in seven cases. We found that ovarian immature teratomas are genetically characterized by 2N near-diploid genomes with extensive loss of heterozygosity and an absence of genes harboring recurrent somatic mutations or known oncogenic variants. All components within a single ovarian tumor (immature teratoma, mature teratoma with different histologic patterns of differentiation, and yolk sac tumor) were found to harbor an identical pattern of loss of heterozygosity across the genome, indicating a shared clonal origin. In contrast, the four analyzed bilateral teratomas showed distinct patterns of zygosity changes in the right versus left sided tumors, indicating independent clonal origins. All disseminated teratoma components within the peritoneum (including gliomatosis peritonei) shared a clonal pattern of loss of heterozygosity with either the right or left primary ovarian tumor. The observed genomic loss of heterozygosity patterns indicate that diverse meiotic errors contribute to the formation of ovarian immature teratomas, with 11 out of the 15 genetically distinct clones determined to result from nondisjunction errors during meiosis I or II. Overall, these findings suggest that copy-neutral loss of heterozygosity resulting from meiotic abnormalities may be sufficient to generate ovarian immature teratomas from germ cells.
First-generation molecular profiles for human breast cancers have enabled the identification of features that can predict therapeutic response; however, little is known about how the various data ...types can best be combined to yield optimal predictors. Collections of breast cancer cell lines mirror many aspects of breast cancer molecular pathobiology, and measurements of their omic and biological therapeutic responses are well-suited for development of strategies to identify the most predictive molecular feature sets.
We used least squares-support vector machines and random forest algorithms to identify molecular features associated with responses of a collection of 70 breast cancer cell lines to 90 experimental or approved therapeutic agents. The datasets analyzed included measurements of copy number aberrations, mutations, gene and isoform expression, promoter methylation and protein expression. Transcriptional subtype contributed strongly to response predictors for 25% of compounds, and adding other molecular data types improved prediction for 65%. No single molecular dataset consistently out-performed the others, suggesting that therapeutic response is mediated at multiple levels in the genome. Response predictors were developed and applied to TCGA data, and were found to be present in subsets of those patient samples.
These results suggest that matching patients to treatments based on transcriptional subtype will improve response rates, and inclusion of additional features from other profiling data types may provide additional benefit. Further, we suggest a systems biology strategy for guiding clinical trials so that patient cohorts most likely to respond to new therapies may be more efficiently identified.
While the patient's thin, scaly lesions resembled an eczematous dermatitis, the moderate demarcation of some plaques suggested psoriatic skin disease, so he was initiated on the tumour necrosis ...factor-α (TNFα) inhibitor adalimumab; no improvement was noted at 5 months and it was stopped (appendix). To gain further understanding of our patient's disease, a single-cell RNA sequencing profile of his skin-resident T cells was performed—part of an ongoing molecular study of complex cases of skin disease (appendix). The patient also reported substantial improvement in joint mobility and regained his ability to walk independently and to type on a keyboard. ...we believe that in addition to gout, the patient had long-standing, undiagnosed psoriatic joint disease, which was improved by IL-23 blockade.
Timely intervention for cancer requires knowledge of its earliest genetic aberrations. Sequencing of tumors and their metastases reveals numerous abnormalities occurring late in progression. A means ...to temporally order aberrations in a single cancer, rather than inferring them from serially acquired samples, would define changes preceding even clinically evident disease. We integrate DNA sequence and copy number information to reconstruct the order of abnormalities as individual tumors evolve for 2 separate cancer types. We detect vast, unreported expansion of simple mutations sharply demarcated by recombinative loss of the second copy of TP53 in cutaneous squamous cell carcinomas (cSCC) and serous ovarian adenocarcinomas, in the former surpassing 50 mutations per megabase. In cSCCs, we also report diverse secondary mutations in known and novel oncogenic pathways, illustrating how such expanded mutagenesis directly promotes malignant progression. These results reframe paradigms in which TP53 mutation is required later, to bypass senescence induced by driver oncogenes.
Although melanoma is initiated by acquisition of point mutations and limited focal copy number alterations in melanocytes-of-origin, the nature of genetic changes that characterise lethal metastatic ...disease is poorly understood. Here, we analyze the evolution of human melanoma progressing from early to late disease in 13 patients by sampling their tumours at multiple sites and times. Whole exome and genome sequencing data from 88 tumour samples reveals only limited gain of point mutations generally, with net mutational loss in some metastases. In contrast, melanoma evolution is dominated by whole genome doubling and large-scale aneuploidy, in which widespread loss of heterozygosity sculpts the burden of point mutations, neoantigens and structural variants even in treatment-naïve and primary cutaneous melanomas in some patients. These results imply that dysregulation of genomic integrity is a key driver of selective clonal advantage during melanoma progression.
Identifying genetic variation underlying human diseases establishes targets for therapeutic development and helps tailor treatments to individual patients. Large-scale transcriptomic profiling has ...extended the study of such molecular heterogeneity between patients to somatic tissues. However, the lower resolution of bulk RNA profiling, especially in a complex, composite tissue such as the skin, has limited its success. Here we demonstrate approaches to interrogate patient-level molecular variance in a chronic skin inflammatory disease, psoriasis vulgaris, leveraging single-cell RNA-sequencing of CD45
+
cells isolated from active lesions. Highly psoriasis-specific transcriptional abnormalities display greater than average inter-individual variance, nominating them as potential sources of clinical heterogeneity. We find that one of these chemokines,
CXCL13
, demonstrates significant correlation with severity of lesions within our patient series. Our analyses also establish that genes elevated in psoriatic skin-resident memory T cells are enriched for programs orchestrating chromatin and CDC42-dependent cytoskeleton remodeling, specific components of which are distinctly correlated with and against Th17 identity on a single-cell level. Collectively, these analyses describe systematic means to dissect cell type- and patient-level differences in cutaneous psoriasis using high-resolution transcriptional profiles of human inflammatory disease.
IMPORTANCE: Multiple hereditary infundibulocystic basal cell carcinoma syndrome (MHIBCC) is a rare genodermatosis in which numerous indolent, well-differentiated basal cell carcinomas develop ...primarily on the face and genitals, without other features characteristic of basal cell nevus syndrome. The cause is unknown. The purpose of the study was to identify a genetic basis for the syndrome and a mechanism by which the associated tumors develop. OBSERVATIONS: Whole-exome sequencing of 5 tumors and a normal buccal mucosal sample from a patient with MHIBCC was performed. A conserved splice-site mutation in 1 copy of the suppressor of fused gene (SUFU) was identified in all tumor and normal tissue samples. Additional distinct deletions of the trans SUFU allele were identified in all tumor samples, none of which were present in the normal sample. CONCLUSIONS AND RELEVANCE: A germline SUFU mutation was present in a patient with MHIBCC, and additional acquired SUFU mutations underlie the development of infundibulocystic basal cell carcinomas. The downstream location of the SUFU gene within the sonic hedgehog pathway may explain why its loss is associated with relatively well-differentiated tumors and suggests that MHIBCC will not respond to therapeutic strategies, such as smoothened inhibitors, that target upstream components of this pathway.
Single-cell RNA sequencing (scRNA-seq) and spatial transcriptomics (ST) are two emerging research technologies that uniquely characterize gene expression microenvironments on a cellular or ...subcellular level. The skin, a clinically accessible tissue composed of diverse, essential cell populations, serves as an ideal target for these high-resolution investigative approaches. Using these tools, researchers are assembling a compendium of data and discoveries in healthy skin as well as a range of dermatologic pathophysiologies, including atopic dermatitis, psoriasis, and cutaneous malignancies. The ongoing advancement of single-cell approaches, coupled with anticipated decreases in cost with increased adoption, will reshape dermatologic research, profoundly influencing disease characterization, prognosis, and ultimately clinical practice.