MicroRNA (miRNA) profiles have been evaluated in several biospecimens in relation to common diseases for which diet may have a considerable impact. We aimed at characterising how specific diets are ...associated with the miRNome in stool of vegans, vegetarians and omnivores and how this is reflected in the gut microbial composition, as this is still poorly explored.
We performed small RNA and shotgun metagenomic sequencing in faecal samples and dietary recording from 120 healthy volunteers, equally distributed for the different diets and matched for sex and age.
We found 49 miRNAs differentially expressed among vegans, vegetarians and omnivores (adj. p <0.05) and confirmed trends of expression levels of such miRNAs in vegans and vegetarians compared with an independent cohort of 45 omnivores. Two miRNAs related to lipid metabolism, miR-636 and miR-4739, were inversely correlated to the non-omnivorous diet duration, independently of subject age. Seventeen miRNAs correlated (|rho|>0.22, adj. p <0.05) with the estimated intake of nutrients, particularly animal proteins, phosphorus and, interestingly, lipids. In omnivores, higher
and
and lower
abundances than in vegans and vegetarians were observed. Lipid metabolism-related miR-425-3p and miR-638 expression levels were associated with increased abundances of microbial species, such as
sp. CAG 182 and
specific of different diets. An integrated analysis identified 25 miRNAs, 25 taxa and 7 dietary nutrients that clearly discriminated (area under the receiver operating characteristic curve=0.89) the three diets.
Stool miRNA profiles are associated with specific diets and support the role of lipids as a driver of epigenetic changes and host-microbial molecular interactions in the gut.
The development of molecularly targeted anticancer agents relies on large panels of tumour-specific preclinical models closely recapitulating the molecular heterogeneity observed in patients. Here we ...describe the mutational and gene expression analyses of 151 colorectal cancer (CRC) cell lines. We find that the whole spectrum of CRC molecular and transcriptional subtypes, previously defined in patients, is represented in this cell line compendium. Transcriptional outlier analysis identifies RAS/BRAF wild-type cells, resistant to EGFR blockade, functionally and pharmacologically addicted to kinase genes including ALK, FGFR2, NTRK1/2 and RET. The same genes are present as expression outliers in CRC patient samples. Genomic rearrangements (translocations) involving the ALK and NTRK1 genes are associated with the overexpression of the corresponding proteins in CRC specimens. The approach described here can be used to pinpoint CRCs with exquisite dependencies to individual kinases for which clinically approved drugs are already available.
Reproducibility of a research is a key element in the modern science and it is mandatory for any industrial application. It represents the ability of replicating an experiment independently by the ...location and the operator. Therefore, a study can be considered reproducible only if all used data are available and the exploited computational analysis workflow is clearly described. However, today for reproducing a complex bioinformatics analysis, the raw data and the list of tools used in the workflow could be not enough to guarantee the reproducibility of the results obtained. Indeed, different releases of the same tools and/or of the system libraries (exploited by such tools) might lead to sneaky reproducibility issues.
To address this challenge, we established the Reproducible Bioinformatics Project (RBP), which is a non-profit and open-source project, whose aim is to provide a schema and an infrastructure, based on docker images and R package, to provide reproducible results in Bioinformatics. One or more Docker images are then defined for a workflow (typically one for each task), while the workflow implementation is handled via R-functions embedded in a package available at github repository. Thus, a bioinformatician participating to the project has firstly to integrate her/his workflow modules into Docker image(s) exploiting an Ubuntu docker image developed ad hoc by RPB to make easier this task. Secondly, the workflow implementation must be realized in R according to an R-skeleton function made available by RPB to guarantee homogeneity and reusability among different RPB functions. Moreover she/he has to provide the R vignette explaining the package functionality together with an example dataset which can be used to improve the user confidence in the workflow utilization.
Reproducible Bioinformatics Project provides a general schema and an infrastructure to distribute robust and reproducible workflows. Thus, it guarantees to final users the ability to repeat consistently any analysis independently by the used UNIX-like architecture.
Multiple Sclerosis (MS) represents nowadays in Europe the leading cause of non-traumatic disabilities in young adults, with more than 700,000 EU cases. Although huge strides have been made over the ...years, MS etiology remains partially unknown. Furthermore, the presence of various endogenous and exogenous factors can greatly influence the immune response of different individuals, making it difficult to study and understand the disease. This becomes more evident in a personalized-fashion when medical doctors have to choose the best therapy for patient well-being. In this optics, the use of stochastic models, capable of taking into consideration all the fluctuations due to unknown factors and individual variability, is highly advisable.
We propose a new model to study the immune response in relapsing remitting MS (RRMS), the most common form of MS that is characterized by alternate episodes of symptom exacerbation (relapses) with periods of disease stability (remission). In this new model, both the peripheral lymph node/blood vessel and the central nervous system are explicitly represented. The model was created and analysed using Epimod, our recently developed general framework for modeling complex biological systems. Then the effectiveness of our model was shown by modeling the complex immunological mechanisms characterizing RRMS during its course and under the DAC administration.
Simulation results have proven the ability of the model to reproduce in silico the immune T cell balance characterizing RRMS course and the DAC effects. Furthermore, they confirmed the importance of a timely intervention on the disease course.
Among the several mechanisms accounting for endocrine resistance in breast cancer, autophagy has emerged as an important player. Previous reports have evidenced that tamoxifen (Tam) induces autophagy ...and activates transcription factor EB (TFEB), which regulates the expression of genes controlling autophagy and lysosomal biogenesis. However, the mechanisms by which this occurs have not been elucidated as yet. This investigation aims at dissecting how TFEB is activated and contributes to Tam resistance in luminal A breast cancer cells. TFEB was overexpressed and prominently nuclear in Tam-resistant MCF7 cells (MCF7-TamR) compared with their parental counterpart, and this was not dependent on alterations of its nucleo-cytoplasmic shuttling. Tam promoted the release of lysosomal Ca
through the major transient receptor potential cation channel mucolipin subfamily member 1 (TRPML1) and two-pore channels (TPCs), which caused the nuclear translocation and activation of TFEB. Consistently, inhibiting lysosomal calcium release restored the susceptibility of MCF7-TamR cells to Tam. Our findings demonstrate that Tam drives the nuclear relocation and transcriptional activation of TFEB by triggering the release of Ca
from the acidic compartment, and they suggest that lysosomal Ca
channels may represent new druggable targets to counteract the onset of autophagy-mediated endocrine resistance in luminal A breast cancer cells.
Biological processes are based on complex networks of cells and molecules. Single cell multi-omics is a new tool aiming to provide new incites in the complex network of events controlling the ...functionality of the cell.
Since single cell technologies provide many sample measurements, they are the ideal environment for the application of Deep Learning and Machine Learning approaches. An autoencoder is composed of an encoder and a decoder sub-model. An autoencoder is a very powerful tool in data compression and noise removal. However, the decoder model remains a black box from which is impossible to depict the contribution of the single input elements. We have recently developed a new class of autoencoders, called Sparsely Connected Autoencoders (SCA), which have the advantage of providing a controlled association among the input layer and the decoder module. This new architecture has the benefit that the decoder model is not a black box anymore and can be used to depict new biologically interesting features from single cell data.
Here, we show that SCA hidden layer can grab new information usually hidden in single cell data, like providing clustering on meta-features difficult, i.e. transcription factors expression, or not technically not possible, i.e. miRNA expression, to depict in single cell RNAseq data. Furthermore, SCA representation of cell clusters has the advantage of simulating a conventional bulk RNAseq, which is a data transformation allowing the identification of similarity among independent experiments.
In our opinion, SCA represents the bioinformatics version of a universal "Swiss-knife" for the extraction of hidden knowledgeable features from single cell omics data.
This preface introduces the content of the BioMed Central Bioinformatics journal Supplement related to the 15th annual meeting of the Bioinformatics Italian Society, BITS2018. The Conference was held ...in Torino, Italy, from June 27th to 29th, 2018.
Bladder cancer (BC) is the most frequent malignancy of the urinary tract with a high incidence in men and smokers. Currently, there are no non-invasive markers useful for BC diagnosis and subtypes ...classification that could overcome invasive procedures such as cystoscopy. Dysregulated miRNA profiles have been associated with numerous cancers, including BC. Cell-free miRNAs are abundantly present in a variety of biofluids including urine and make them promising candidates in cancer biomarker discovery. In the present study, the identification of miRNA fingerprints associated with different BC status was performed by next-generation sequencing on urine samples from 66 BC and 48 controls. Three signatures based on dysregulated miRNAs have been identified by regression models, assessing the power to discriminate different BC subtypes. Altered miRNAs according to invasiveness and grade were validated by qPCR on 112 cases and 65 controls (among which 46 cases and 16 controls were an independent group of subjects while the rest were replica samples). The area under the curve (AUC) computed including three miRNAs (miR-30a-5p, let-7c-5p and miR-486-5p) altered in all BC subtypes showed a significantly increased accuracy in the discrimination of cases and controls (AUC model = 0.70;
-value = 0.01). In conclusions, the non-invasive detection in urine of a selected number of miRNAs altered in different BC subtypes could lead to an accurate early diagnosis of cancer and stratification of patients.
High-Throughput technologies provide genomic and trascriptomic data that are suitable for biomarker detection for classification purposes. However, the high dimension of the output of such ...technologies and the characteristics of the data sets analysed represent an issue for the classification task. Here we present a new feature selection method based on three steps to detect class-specific biomarkers in case of high-dimensional data sets. The first step detects the differentially expressed genes according to the experimental conditions tested in the experimental design, the second step filters out the features with low discriminative power and the third step detects the class-specific features and defines the final biomarker as the union of the class-specific features. The proposed procedure is tested on two microarray datasets, one characterized by a strong imbalance between the size of classes and the other one where the size of classes is perfectly balanced. We show that, using the proposed feature selection procedure, the classification performances of a Support Vector Machine on the imbalanced data set reach a 82% whereas other methods do not exceed 73%. Furthermore, in case of perfectly balanced dataset, the classification performances are comparable with other methods. Finally, the Gene Ontology enrichments performed on the signatures selected with the proposed pipeline, confirm the biological relevance of our methodology. The download of the package with the implementation of Peculiar Genes Selection, 'PGS', is available for R users at: http://github.com/mbeccuti/PGS.
Abstract
Summary
Short reads sequencing technology has been used for more than a decade now. However, the analysis of RNAseq and ChIPseq data is still computational demanding and the simple access to ...raw data does not guarantee results reproducibility between laboratories. To address these two aspects, we developed SeqBox, a cheap, efficient and reproducible RNAseq/ChIPseq hardware/software solution based on NUC6I7KYK mini-PC (an Intel consumer game computer with a fast processor and a high performance SSD disk), and Docker container platform. In SeqBox the analysis of RNAseq and ChIPseq data is supported by a friendly GUI. This allows access to fast and reproducible analysis also to scientists with/without scripting experience.
Availability and implementation
Docker container images, docker4seq package and the GUI are available at http://www.bioinformatica.unito.it/reproducibile.bioinformatics.html.
Supplementary information
Supplementary data are available at Bioinformatics online.