Next Generation Sequencing (NGS) experiments produce millions of short sequences that, mapped to a reference genome, provide biological insights at genomic, transcriptomic and epigenomic level. ...Typically the amount of reads that correctly maps to the reference genome ranges between 70% and 90%, leaving in some cases a consistent fraction of unmapped sequences. This 'misalignment' can be ascribed to low quality bases or sequence differences between the sample reads and the reference genome. Investigating the source of the unmapped reads is definitely important to better assess the quality of the whole experiment and to check for possible downstream or upstream 'contamination' from exogenous nucleic acids.
Here we propose DecontaMiner, a tool to unravel the presence of contaminating sequences among the unmapped reads. It uses a subtraction approach to identify bacteria, fungi and viruses genome contamination. DecontaMiner generates several output files to track all the processed reads, and to provide a complete report of their characteristics. The good quality matches on microorganism genomes are counted and compared among samples. DecontaMiner builds an offline HTML page containing summary statistics and plots. The latter are obtained using the state-of-the-art D3 javascript libraries. DecontaMiner has been mainly used to detect contamination in human RNA-Seq data. The software is freely available at http://www-labgtp.na.icar.cnr.it/decontaminer .
DecontaMiner is a tool designed and developed to investigate the presence of contaminating sequences in unmapped NGS data. It can suggest the presence of contaminating organisms in sequenced samples, that might derive either from laboratory contamination or from their biological source, and in both cases can be considered as worthy of further investigation and experimental validation. The novelty of DecontaMiner is mainly represented by its easy integration with the standard procedures of NGS data analysis, while providing a complete, reliable, and automatic pipeline.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Prostate cancer (PCa) is characterised by androgen dependency. Unfortunately, under anti-androgen treatment pressure, castration-resistant prostate cancer (CRPC) emerges, characterised by ...heterogeneous cell populations that, over time, lead to the development of different androgen-dependent or -independent phenotypes. Despite important advances in therapeutic strategies, CRPC remains incurable. Context-specific essential genes represent valuable candidates for targeted anti-cancer therapies. Through the investigation of gene and protein annotations and the integration of published transcriptomic data, we identified two consensus lists to stratify PCa patients' risk and discriminate CRPC phenotypes based on androgen receptor activity. ROC and Kaplan-Meier survival analyses were used for gene set validation in independent datasets. We further evaluated these genes for their association with cancer dependency. The deregulated expression of the PCa-related genes was associated with overall and disease-specific survival, metastasis and/or high recurrence risk, while the CRPC-related genes clearly discriminated between adeno and neuroendocrine phenotypes. Some of the genes showed context-specific essentiality. We further identified candidate drugs through a computational repositioning approach for targeting these genes and treating lethal variants of PCa. This work provides a proof-of-concept for the use of an integrative approach to identify candidate biomarkers involved in PCa progression and CRPC pathogenesis within the goal of precision medicine.
Obesity is a complex disorder associated with an increased risk of developing several comorbid chronic diseases, including postmenopausal breast cancer. Although many studies have investigated this ...issue, the link between body weight and either risk or poor outcome of breast cancer is still to characterize. Systems biology approaches, based on the integration of multiscale models and data from a wide variety of sources, are particularly suitable for investigating the underlying molecular mechanisms of complex diseases. In this scenario, GEnome-scale metabolic Models (GEMs) are a valuable tool, since they represent the metabolic structure of cells and provide a functional scaffold for simulating and quantifying metabolic fluxes in living organisms through constraint-based mathematical methods. The integration of omics data into the structural information described by GEMs allows to build more accurate descriptions of metabolic states.
In this work, we exploited gene expression data of postmenopausal breast cancer obese and lean patients to simulate a curated GEM of the human adipocyte, available in the Human Metabolic Atlas database. To this aim, we used a published algorithm which exploits a data-driven approach to overcome the limitation of defining a single objective function to simulate the model. The flux solutions were used to build condition-specific graphs to visualise and investigate the reaction networks and their properties. In particular, we performed a network topology differential analysis to search for pattern differences and identify the principal reactions associated with significant changes across the two conditions under study.
Metabolic network models represent an important source to study the metabolic phenotype of an organism in different conditions. Here we demonstrate the importance of exploiting Next Generation Sequencing data to perform condition-specific GEM analyses. In particular, we show that the qualitative and quantitative assessment of metabolic fluxes modulated by gene expression data provides a valuable method for investigating the mechanisms associated with the phenotype under study, and can foster our interpretation of biological phenomena.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Biological networks are representative of the diverse molecular interactions that occur within cells. Some of the commonly studied biological networks are modeled through protein-protein ...interactions, gene regulatory, and metabolic pathways. Among these, metabolic networks are probably the most studied, as they directly influence all physiological processes. Exploration of biochemical pathways using multigraph representation is important in understanding complex regulatory mechanisms. Feature extraction and clustering of these networks enable grouping of samples obtained from different biological specimens. Clustering techniques separate networks depending on their mutual similarity.
We present a clustering analysis on tissue-specific metabolic networks for single samples from three primary tumor sites: breast, lung, and kidney cancer. The metabolic networks were obtained by integrating genome scale metabolic models with gene expression data. We performed network simplification to reduce the computational time needed for the computation of network distances. We empirically proved that networks clustering can characterize groups of patients in multiple conditions.
We provide a computational methodology to explore and characterize the metabolic landscape of tumors, thus providing a general methodology to integrate analytic metabolic models with gene expression data. This method represents a first attempt in clustering large scale metabolic networks. Moreover, this approach gives the possibility to get valuable information on what are the effects of different conditions on the overall metabolism.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Gene essentiality is a genetic concept crucial for a comprehensive understanding of life and evolution. In the last decade, many essential genes (EGs) have been determined using different ...experimental and computational approaches, and this information has been used to reduce the genomes of model organisms. A growing amount of evidence highlights that essentiality is a property that depends on the context. Because of their importance in vital biological processes, recognising context-specific EGs (csEGs) could help for identifying new potential pharmacological targets and to improve precision therapeutics. Since most of the computational procedures proposed to identify and predict EGs neglect their context-specificity, we focused on this aspect, providing a theoretical and experimental overview of the literature, data and computational methods dedicated to recognising csEGs. To this end, we adapted existing computational methods to exploit a specific context (the kidney tissue) and experimented with four different prediction methods using the labels provided by four different identification approaches. The considerations derived from the analysis of the obtained results, confirmed and validated also by further experiments for a different tissue context, provide the reader with guidance on exploiting existing tools for achieving csEGs identification and prediction.
Neural development is accomplished by differentiation events leading to metabolic reprogramming. Glycosphingolipid metabolism is reprogrammed during neural development with a switch from globo‐ to ...ganglio‐series glycosphingolipid production. Failure to execute this glycosphingolipid switch leads to neurodevelopmental disorders in humans, indicating that glycosphingolipids are key players in this process. Nevertheless, both the molecular mechanisms that control the glycosphingolipid switch and its function in neurodevelopment are poorly understood. Here, we describe a self‐contained circuit that controls glycosphingolipid reprogramming and neural differentiation. We find that globo‐series glycosphingolipids repress the epigenetic regulator of neuronal gene expression AUTS2. AUTS2 in turn binds and activates the promoter of the first and rate‐limiting ganglioside‐producing enzyme GM3 synthase, thus fostering the synthesis of gangliosides. By this mechanism, the globo–AUTS2 axis controls glycosphingolipid reprogramming and neural gene expression during neural differentiation, which involves this circuit in neurodevelopment and its defects in neuropathology.
Synopsis
Schematic representation of glycosphingolipid reprogramming circuit in neural differentiation.
Globo‐series glycosphingolipids inhibit the production of ganglio‐series glycosphingolipids.
AUTS2 expression is repressed by globo‐series glycosphingolipids.
AUTS2 activates the promoter of the first and rate limiting enzyme involved in ganglio‐series glycosphingolipids production i.e., GM3 synthase by inducing histone acetylation.
The globo‐AUTS2 axis regulates the expression of neuronal genes during neural differentiation.
The decrease of globo‐series glycosphingolipids is required for AUTS2 induction and for stem cell differentiation to neural cells.
The switch from globo‐ to ganglio‐series glycophospholipids during neurodevelopment involves a self‐contained regulatory circuit controlling expression of both neuronal and ganglioside‐producing genes.
Colorectal cancer (CRC) is one of the most common malignancies in the Western world and intestinal dysbiosis might contribute to its pathogenesis. The mucosal colon microbiome and C-C motif chemokine ...2 (CCL2) were investigated in 20 healthy controls (HC) and 20 CRC patients using 16S rRNA sequencing and immunoluminescent assay, respectively. A total of 10 HC subjects were classified as overweight/obese (OW/OB_HC) and 10 subjects were normal weight (NW_HC); 15 CRC patients were classified as OW/OB_CRC and 5 patients were NW_CRC. Results: Fusobacterium nucleatum and Escherichia coli were more abundant in OW/OB_HC than in NW_HC microbiomes. Globally, Streptococcus intermedius, Gemella haemolysans, Fusobacterium nucleatum, Bacteroides fragilis and Escherichia coli were significantly increased in CRC patient tumor/lesioned tissue (CRC_LT) and CRC patient unlesioned tissue (CRC_ULT) microbiomes compared to HC microbiomes. CCL2 circulating levels were associated with tumor presence and with the abundance of Fusobacterium nucleatum, Bacteroides fragilis and Gemella haemolysans. Our data suggest that mucosal colon dysbiosis might contribute to CRC pathogenesis by inducing inflammation. Notably, Fusobacterium nucleatum, which was more abundant in the OW/OB_HC than in the NW_HC microbiomes, might represent a putative link between obesity and increased CRC risk.
The neural crest (NC) is an important multipotent embryonic cell population and its impaired specification leads to various developmental defects, often in an anteroposterior (A-P) axial ...level-specific manner. The mechanisms underlying the correct A-P regionalisation of human NC cells remain elusive. Recent studies have indicated that trunk NC cells, the presumed precursors of childhood tumour neuroblastoma, are derived from neuromesodermal-potent progenitors of the postcranial body. Here we employ human embryonic stem cell differentiation to define how neuromesodermal progenitor (NMP)-derived NC cells acquire a posterior axial identity. We show that TBXT, a pro-mesodermal transcription factor, mediates early posterior NC/spinal cord regionalisation together with WNT signalling effectors. This occurs by TBXT-driven chromatin remodelling via its binding in key enhancers within HOX gene clusters and other posterior regulator-associated loci. This initial posteriorisation event is succeeded by a second phase of trunk HOX gene control that marks the differentiation of NMPs toward their TBXT-negative NC/spinal cord derivatives and relies predominantly on FGF signalling. Our work reveals a previously unknown role of TBXT in influencing posterior NC fate and points to the existence of temporally discrete, cell type-dependent modes of posterior axial identity control.
The neural crest (NC) is a multipotent embryonic cell population that generates distinct cell types in an axial position-dependent manner. The production of NC cells from human pluripotent stem cells ...(hPSCs) is a valuable approach to study human NC biology. However, the origin of human trunk NC remains undefined and current in vitro differentiation strategies induce only a modest yield of trunk NC cells. Here we show that hPSC-derived axial progenitors, the posteriorly-located drivers of embryonic axis elongation, give rise to trunk NC cells and their derivatives. Moreover, we define the molecular signatures associated with the emergence of human NC cells of distinct axial identities in vitro. Collectively, our findings indicate that there are two routes toward a human post-cranial NC state: the birth of cardiac and vagal NC is facilitated by retinoic acid-induced posteriorisation of an anterior precursor whereas trunk NC arises within a pool of posterior axial progenitors.
New arylthioindole derivatives having different cyclic substituents at position 2 of the indole were synthesized as anticancer agents. Several compounds inhibited tubulin polymerization at ...submicromolar concentration and inhibited cell growth at low nanomolar concentrations. Compounds 18 and 57 were superior to the previously synthesized 5. Compound 18 was exceptionally potent as an inhibitor of cell growth: it showed IC50 = 1.0 nM in MCF-7 cells, and it was uniformly active in the whole panel of cancer cells and superior to colchicine and combretastatin A-4. Compounds 18, 20, 55, and 57 were notably more potent than vinorelbine, vinblastine, and paclitaxel in the NCI/ADR-RES and Messa/Dx5 cell lines, which overexpress P-glycoprotein. Compounds 18 and 57 showed initial vascular disrupting effects in a tumor model of liver rhabdomyosarcomas at 15 mg/kg intravenous dosage. Derivative 18 showed water solubility and higher metabolic stability than 5 in human liver microsomes.