Interfacing Seurat with the R tidy universe Mangiola, Stefano; Doyle, Maria A; Papenfuss, Anthony T
Bioinformatics (Oxford, England),
11/2021, Letnik:
37, Številka:
22
Journal Article
Recenzirano
Odprti dostop
Abstract
Motivation
Seurat is one of the most popular software suites for the analysis of single-cell RNA sequencing data. Considering the popularity of the tidyverse ecosystem, which offers a large ...set of data display, query, manipulation, integration and visualization utilities, a great opportunity exists to interface the Seurat object with the tidyverse. This interface gives the large data science community of tidyverse users the possibility to operate with familiar grammar.
Results
To provide Seurat with a tidyverse-oriented interface without compromising efficiency, we developed tidyseurat, a lightweight adapter to the tidyverse. Tidyseurat displays cell information as a tibble abstraction, allowing intuitively interfacing Seurat with dplyr, tidyr, ggplot2 and plotly packages powering efficient data manipulation, integration and visualization. Iterative analyses on data subsets are enabled by interfacing with the popular nest-map framework.
Availability and implementation
The software is freely available at cran.r-project.org/web/packages/tidyseurat and github.com/stemangiola/tidyseurat.
Supplementary information
Supplementary data are available at Bioinformatics online.
To analyze the reasons for patient dissatisfaction after phacoemulsification with multifocal intraocular lens (IOL) implantation and the outcomes after intervention.
Emory Eye Center, Atlanta, ...Georgia, USA.
This retrospective review comprised eyes of patients dissatisfied with visual outcomes after multifocal IOL implantation. Outcomes analyzed included type of visual complaint, treatment modality for each complaint, and degree of clinical improvement after intervention.
Thirty-two patients (43 eyes) reported unwanted visual symptoms after multifocal IOL implantation, including in 28 eyes (65%) with an AcrySof ReSTOR IOL and 15 (35%) with a ReZoom IOL. Thirty patients (41 eyes) reported blurred vision, 15 (18 eyes) reported photic phenomena, and 13 (16 eyes) reported both. Causes of blurred vision included ametropia (12 eyes, 29%), dry eye syndrome (6 eyes, 15%), posterior capsule opacification (PCO) (22 eyes, 54%), and unexplained etiology (1 eye, 2%). Causes of photic phenomena included IOL decentration (2 eyes, 12%), retained lens fragment (1 eye, 6%), PCO (12 eyes, 66%), dry-eye syndrome (1 eye, 2%), and unexplained etiology (2 eyes, 11%). Photic phenomena attributed to PCO also caused blurred vision. Thirty-five eyes (81%) had improvement with conservative treatment. Five eyes (12%) did not have improvement despite treatment combinations. Three eyes (7%) required IOL exchange.
Complaints of blurred vision and photic phenomena after multifocal IOL implantation were effectively managed with appropriate treatment. Few eyes (7%) required IOL exchange. Neodymium:YAG capsulotomy should be delayed until it has been determined that IOL exchange will not be necessary.
RNA-Seq Data Analysis in Galaxy Batut, Bérénice; van den Beek, Marius; Doyle, Maria A ...
Methods in molecular biology (Clifton, N.J.),
01/2021, Letnik:
2284
Journal Article
Odprti dostop
A complete RNA-Seq analysis involves the use of several different tools, with substantial software and computational requirements. The Galaxy platform simplifies the execution of such bioinformatics ...analyses by embedding the needed tools in its web interface, while also providing reproducibility. Here, we describe how to perform a reference-based RNA-Seq analysis using Galaxy, from data upload to visualization and functional enrichment analysis of differentially expressed genes.
Recently, efforts have been made toward the harmonization of transcriptomic data structures and workflows using the concept of data tidiness, to facilitate modularisation. We present tidybulk, a ...modular framework for bulk transcriptional analyses that introduces a tidy transcriptomic data structure paradigm and analysis grammar. Tidybulk covers a wide variety of analysis procedures and integrates a large ecosystem of publicly available analysis algorithms under a common framework. Tidybulk decreases coding burden, facilitates reproducibility, increases efficiency for expert users, lowers the learning curve for inexperienced users, and bridges transcriptional data analysis with the tidyverse. Tidybulk is available at R/Bioconductor bioconductor.org/packages/tidybulk .
In light of the increasing adoption of targeted resequencing (TR) as a cost-effective strategy to identify disease-causing variants, a robust method for copy number variation (CNV) analysis is needed ...to maximize the value of this promising technology.
We present a method for CNV detection for TR data, including whole-exome capture data. Our method calls copy number gains and losses for each target region based on normalized depth of coverage. Our key strategies include the use of base-level log-ratios to remove GC-content bias, correction for an imbalanced library size effect on log-ratios, and the estimation of log-ratio variations via binning and interpolation. Our methods are made available via CONTRA (COpy Number Targeted Resequencing Analysis), a software package that takes standard alignment formats (BAM/SAM) and outputs in variant call format (VCF4.0), for easy integration with other next-generation sequencing analysis packages. We assessed our methods using samples from seven different target enrichment assays, and evaluated our results using simulated data and real germline data with known CNV genotypes.
Despite intensive efforts using linkage and candidate gene approaches, the genetic etiology for the majority of families with a multi-generational breast cancer predisposition is unknown. In this ...study, we used whole-exome sequencing of thirty-three individuals from 15 breast cancer families to identify potential predisposing genes. Our analysis identified families with heterozygous, deleterious mutations in the DNA repair genes FANCC and BLM, which are responsible for the autosomal recessive disorders Fanconi Anemia and Bloom syndrome. In total, screening of all exons in these genes in 438 breast cancer families identified three with truncating mutations in FANCC and two with truncating mutations in BLM. Additional screening of FANCC mutation hotspot exons identified one pathogenic mutation among an additional 957 breast cancer families. Importantly, none of the deleterious mutations were identified among 464 healthy controls and are not reported in the 1,000 Genomes data. Given the rarity of Fanconi Anemia and Bloom syndrome disorders among Caucasian populations, the finding of multiple deleterious mutations in these critical DNA repair genes among high-risk breast cancer families is intriguing and suggestive of a predisposing role. Our data demonstrate the utility of intra-family exome-sequencing approaches to uncover cancer predisposition genes, but highlight the major challenge of definitively validating candidates where the incidence of sporadic disease is high, germline mutations are not fully penetrant, and individual predisposition genes may only account for a tiny proportion of breast cancer families.
Next-generation sequencing (NGS) is arguably one of the most significant technological advances in the biological sciences of the last 30 years. The second generation sequencing platforms have ...advanced rapidly to the point that several genomes can now be sequenced simultaneously in a single instrument run in under two weeks. Targeted DNA enrichment methods allow even higher genome throughput at a reduced cost per sample. Medical research has embraced the technology and the cancer field is at the forefront of these efforts given the genetic aspects of the disease. World-wide efforts to catalogue mutations in multiple cancer types are underway and this is likely to lead to new discoveries that will be translated to new diagnostic, prognostic and therapeutic targets. NGS is now maturing to the point where it is being considered by many laboratories for routine diagnostic use. The sensitivity, speed and reduced cost per sample make it a highly attractive platform compared to other sequencing modalities. Moreover, as we identify more genetic determinants of cancer there is a greater need to adopt multi-gene assays that can quickly and reliably sequence complete genes from individual patient samples. Whilst widespread and routine use of whole genome sequencing is likely to be a few years away, there are immediate opportunities to implement NGS for clinical use. Here we review the technology, methods and applications that can be immediately considered and some of the challenges that lie ahead.
Patients with high-grade serous ovarian cancer (HGSC) have experienced little improvement in overall survival, and standard treatment has not advanced beyond platinum-based combination chemotherapy, ...during the past 30 years. To understand the drivers of clinical phenotypes better, here we use whole-genome sequencing of tumour and germline DNA samples from 92 patients with primary refractory, resistant, sensitive and matched acquired resistant disease. We show that gene breakage commonly inactivates the tumour suppressors RB1, NF1, RAD51B and PTEN in HGSC, and contributes to acquired chemotherapy resistance. CCNE1 amplification was common in primary resistant and refractory disease. We observed several molecular events associated with acquired resistance, including multiple independent reversions of germline BRCA1 or BRCA2 mutations in individual patients, loss of BRCA1 promoter methylation, an alteration in molecular subtype, and recurrent promoter fusion associated with overexpression of the drug efflux pump MDR1.
RNA has become one of the major research topics in molecular biology. As a central player in key processes regulating gene expression, RNA is in the focus of many efforts to decipher the pathways ...that govern the transition of genetic information to a fully functional cell. As more and more researchers join this endeavour, there is a rapidly growing demand for comprehensive collections of tools that cover the diverse layers of RNA-related research. However, increasing amounts of data, from diverse types of experiments, addressing different aspects of biological questions need to be consolidated and integrated into a single framework. Only then is it possible to connect findings from e.g. RNA-Seq experiments and methods for e.g. target predictions. To address these needs, we present the RNA Workbench 2.0 , an updated online resource for RNA related analysis. With the RNA Workbench we created a comprehensive set of analysis tools and workflows that enables researchers to analyze their data without the need for sophisticated command-line skills. This update takes the established framework to the next level, providing not only a containerized infrastructure for analysis, but also a ready-to-use platform for hands-on training, analysis, data exploration, and visualization. The new framework is available at https://rna.usegalaxy.eu , and login is free and open to all users. The containerized version can be found at https://github.com/bgruening/galaxy-rna-workbench.
E proteins and their antagonists, the Id proteins, are transcriptional regulators important for normal hematopoiesis. We found that Id2 acts as a key regulator of leukemia stem cell (LSC) potential ...in MLL-rearranged acute myeloid leukemia (AML). Low endogenous Id2 expression is associated with LSC enrichment while Id2 overexpression impairs MLL-AF9-leukemia initiation and growth. Importantly, MLL-AF9 itself controls the E-protein pathway by suppressing Id2 while directly activating E2-2 expression, and E2-2 depletion phenocopies Id2 overexpression in MLL-AF9-AML cells. Remarkably, Id2 tumor-suppressive function is conserved in t(8;21) AML. Low expression of Id2 and its associated gene signature are associated with poor prognosis in MLL-rearranged and t(8;21) AML patients, identifying the Id2/E-protein axis as a promising new therapeutic target in AML.
Display omitted
•MLL-AF9 co-ordinately downregulates Id2 and upregulates E2-2 expression•Id2 functions as a negative regulator of LSC potential in MLL-rearranged AML•Id2 overexpression or E2-2 knockdown suppresses MLL-rearranged and t(8;21) AML•Id2-associated gene signature correlates with prognosis in AML patients
Ghisi et al. show that MLL-AF9 reduces Id2 and increases E2-2 expression to drive and sustain leukemia stem cell potential in MLL-rearranged acute myeloid leukemia (AML). Low expression of Id2 or of an Id2 gene signature is associated with poor prognosis in not only MLL-rearranged but also t(8;21) AML patients.