The majority of the previous methods for identifying cancer driver modules output nonoverlapping modules. This assumption is biologically inaccurate as genes can participate in multiple molecular ...pathways. This is particularly true for cancer-associated genes as many of them are network hubs connecting functionally distinct set of genes. It is important to provide combinatorial optimization problem definitions modeling this biological phenomenon and to suggest efficient algorithms for its solution. We provide a formal definition of the Overlapping Driver Module Identification in Cancer (ODMIC) problem. We show that the problem is NP-hard. We propose a seed-and-extend based heuristic named DriveWays that identifies overlapping cancer driver modules from the graph built from the IntAct PPI network. DriveWays incorporates mutual exclusivity, coverage, and the network connectivity information of the genes. We show that DriveWays outperforms the state-of-the-art methods in recovering well-known cancer driver genes performed on TCGA pan-cancer data. Additionally, DriveWay's output modules show a stronger enrichment for the reference pathways in almost all cases. Overall, we show that enabling modules to overlap improves the recovery of functional pathways filtered with known cancer drivers, which essentially constitute the reference set of cancer-related pathways.
Recent cancer genomic studies have generated detailed molecular data on a large number of cancer patients. A key remaining problem in cancer genomics is the identification of driver genes.
We propose ...BetweenNet, a computational approach that integrates genomic data with a protein-protein interaction network to identify cancer driver genes. BetweenNet utilizes a measure based on betweenness centrality on patient specific networks to identify the so-called outlier genes that correspond to dysregulated genes for each patient. Setting up the relationship between the mutated genes and the outliers through a bipartite graph, it employs a random-walk process on the graph, which provides the final prioritization of the mutated genes. We compare BetweenNet against state-of-the art cancer gene prioritization methods on lung, breast, and pan-cancer datasets.
Our evaluations show that BetweenNet is better at recovering known cancer genes based on multiple reference databases. Additionally, we show that the GO terms and the reference pathways enriched in BetweenNet ranked genes and those that are enriched in known cancer genes overlap significantly when compared to the overlaps achieved by the rankings of the alternative methods.
In this study, the life cycle assessment of cotton woven shirt production, from cotton cultivation to the final product, has been done. In this scope, four alternative production scenarios were ...developed and evaluated with GaBi 8.0 software with CML 2001—January 2016 methodology. These scenarios include conventional cotton woven shirt production, organic cotton cultivation incorporated with renewable energy use in production phase, evaluation of natural dyeing in manufacturing process and using recovered cotton as the raw material. For each of these scenarios, several environmental impact categories including global warming, acidification and eutrophication potentials were evaluated. The functional unit was determined as 1000 pcs of shirts. In the assessment of conventional cotton woven shirt production, pesticide and synthetic fertilizer usage during cotton cultivation as well as the energy supply for the production phases were found to be the major factors increasing environmental impacts. Using organic cotton cultivation and renewable energy sources instead of the traditional techniques, decreased eutrophication potential, acidification potential and global warming potential by 48%, 52% and 70%, respectively. Using recovered cotton fibers as the raw material decreased eutrophication potential, acidification potential, abiotic depletion potential and global warming potential by 96%, 90%, 69% and 47%, respectively, by eliminating the environmental impacts that originate from cotton cultivation stage. Moreover, as these recovered cotton fibers are already colored, additional dyeing is not required. Alternatively, natural dyeing process could be a good alternative to synthetic dyeing and decline environmental impacts by minimizing the use of chemicals and decreasing the required heat for dyeing.
Graphic abstract
Combination antibiotic therapies are being increasingly used in the clinic to enhance potency and counter drug resistance. However, the large search space of candidate drugs and dosage regimes makes ...the identification of effective combinations highly challenging. Here, we present a computational approach called INDIGO, which uses chemogenomics data to predict antibiotic combinations that interact synergistically or antagonistically in inhibiting bacterial growth. INDIGO quantifies the influence of individual chemical–genetic interactions on synergy and antagonism and significantly outperforms existing approaches based on experimental evaluation of novel predictions in Escherichia coli. Our analysis revealed a core set of genes and pathways (e.g. central metabolism) that are predictive of antibiotic interactions. By identifying the interactions that are associated with orthologous genes, we successfully estimated drug‐interaction outcomes in the bacterial pathogens Mycobacterium tuberculosis and Staphylococcus aureus, using the E. coli INDIGO model. INDIGO thus enables the discovery of effective combination therapies in less‐studied pathogens by leveraging chemogenomics data in model organisms.
Synopsis
Novel combination therapies are needed to counter antibiotic resistance and reduce treatment times. The INDIGO algorithm enables the discovery of effective antibiotic combinations in less‐studied pathogens by leveraging chemogenomics data in model organisms.
INDIGO approach identifies antibiotic combinations that interact synergistically or antagonistically using chemogenomics.
The analysis reveals a small set of genes in E. coli that are predictive of interaction outcomes and are surprisingly conserved between distant bacterial species.
INDIGO can estimate the interaction outcomes in pathogens such as M. tuberculosis and S. aureus, based on the conservation of drug‐interaction‐related genes in E. coli.
Novel predictions were experimentally validated in E. coli (66 combinations) and S. aureus (45 combinations).
Novel combination therapies are needed to counter antibiotic resistance and reduce treatment times. The INDIGO algorithm enables the discovery of effective antibiotic combinations in less‐studied pathogens by leveraging chemogenomics data in model organisms.
RBPmotif web server (http://www.rnamotif.org) implements tools to identify binding preferences of RNA-binding proteins (RBPs). Given a set of sequences that are known to be bound and unbound by the ...RBP of interest, RBPmotif provides two types of analysis: (i) de novo motif finding when there is no a priori knowledge on RBP's binding preferences and (ii) analysis of structure preferences when there is a previously identified sequence motif for the RBP. De novo motif finding is performed with the previously published RNAcontext algorithm that learns discriminative motif models to identify both sequence and structure preferences. The results of this analysis include the inferred binding preferences of the RBP and the added predictive value of incorporating structure preferences. Second type of analysis investigates whether the instances of the previously identified sequence motif are enriched in a particular structure context in bound sequences, relative to its instances in unbound sequences. On completion, the results page shows the comparison of structure contexts of the motif instances between bound and unbound sequences and an assessment of statistical significance of detected preferences. In summary, RBPmotif web server enables the concurrent analysis of sequence and structure preferences of RBPs through a user-friendly interface.
Macroautophagy (autophagy) is an evolutionarily conserved recycling and stress response mechanism. Active at basal levels in eukaryotes, autophagy is upregulated under stress providing cells with ...building blocks such as amino acids. A lysosome-integrated sensor system composed of RRAG GTPases and MTOR complex 1 (MTORC1) regulates lysosome biogenesis and autophagy in response to amino acid availability. Stress-mediated inhibition of MTORC1 results in the dephosphorylation and nuclear translocation of the TFE/MITF family of transcriptional factors, and triggers an autophagy- and lysosomal-related gene transcription program. The role of family members TFEB and TFE3 have been studied in detail, but the importance of MITF proteins in autophagy regulation is not clear so far. Here we introduce for the first time a specific role for MITF in autophagy control that involves upregulation of MIR211. We show that, under stress conditions including starvation and MTOR inhibition, a MITF-MIR211 axis constitutes a novel feed-forward loop that controls autophagic activity in cells. Direct targeting of the MTORC2 component RICTOR by MIR211 led to the inhibition of the MTORC1 pathway, further stimulating MITF translocation to the nucleus and completing an autophagy amplification loop. In line with a ubiquitous function, MITF and MIR211 were co-expressed in all tested cell lines and human tissues, and the effects on autophagy were observed in a cell-type independent manner. Thus, our study provides direct evidence that MITF has rate-limiting and specific functions in autophagy regulation. Collectively, the MITF-MIR211 axis constitutes a novel and universal autophagy amplification system that sustains autophagic activity under stress conditions.
Abbreviations: ACTB: actin beta; AKT: AKT serine/threonine kinase; AKT1S1/PRAS40: AKT1 substrate 1; AMPK: AMP-activated protein kinase; ATG: autophagy-related; BECN1: beclin 1; DEPTOR: DEP domain containing MTOR interacting protein; GABARAP: GABA type A receptor-associated protein; HIF1A: hypoxia inducible factor 1 subunit alpha; LAMP1: lysosomal associated membrane protein 1; MAP1LC3B/LC3B: microtubule associated protein 1 light chain 3 beta; MAPKAP1/SIN1: mitogen-activated protein kinase associated protein 1; MITF: melanogenesis associated transcription factor; MLST8: MTOR associated protein, LST8 homolog; MRE: miRNA response element; MTOR: mechanistic target of rapamycin kinase; MTORC1: MTOR complex 1; MTORC2: MTOR complex 2; PRR5/Protor 1: proline rich 5; PRR5L/Protor 2: proline rich 5 like; RACK1: receptor for activated C kinase 1; RPTOR: regulatory associated protein of MTOR complex 1; RICTOR: RPTOR independent companion of MTOR complex 2; RPS6KB/p70S6K: ribosomal protein S6 kinase; RT-qPCR: quantitative reverse transcription-polymerase chain reaction; SQSTM1: sequestosome 1; STK11/LKB1: serine/threonine kinase 11; TFE3: transcription factor binding to IGHM enhancer 3; TFEB: transcription factor EB; TSC1/2: TSC complex subunit 1/2; ULK1: unc-51 like autophagy activating kinase 1; UVRAG: UV radiation resistance associated; VIM: vimentin; VPS11: VPS11, CORVET/HOPS core subunit; VPS18: VPS18, CORVET/HOPS core subunit; WIPI1: WD repeat domain, phosphoinositide interacting 1
Metazoan genomes encode hundreds of RNA-binding proteins (RBPs) but RNA-binding preferences for relatively few RBPs have been well defined. Current techniques for determining RNA targets, including ...in vitro selection and RNA co-immunoprecipitation, require significant time and labor investment. Here we introduce RNAcompete, a method for the systematic analysis of RNA binding specificities that uses a single binding reaction to determine the relative preferences of RBPs for short RNAs that contain a complete range of k-mers in structured and unstructured RNA contexts. We tested RNAcompete by analyzing nine diverse RBPs (HuR, Vts1, FUSIP1, PTB, U1A, SF2/ASF, SLM2, RBM4 and YB1). RNAcompete identified expected and previously unknown RNA binding preferences. Using in vitro and in vivo binding data, we demonstrate that preferences for individual 7-mers identified by RNAcompete are a more accurate representation of binding activity than are conventional motif models. We anticipate that RNAcompete will be a valuable tool for the study of RNA-protein interactions.
RNA-binding proteins (RBPs) play key roles in post-transcriptional regulation of mRNAs. Dysregulations in RBP-mediated mechanisms have been found to be associated with many steps of cancer initiation ...and progression. Despite this, previous studies of gene expression in cancer have ignored the effect of RBPs. To this end, we developed a lasso regression model that predicts gene expression in cancer by incorporating RBP-mediated regulation as well as the effects of other well-studied factors such as copy-number variation, DNA methylation, TFs and miRNAs. As a case study, we applied our model to Lung squamous cell carcinoma (LUSC) data as we found that there are several RBPs differentially expressed in LUSC. Including RBP-mediated regulatory effects in addition to the other features significantly increased the Spearman rank correlation between predicted and measured expression of held-out genes. Using a feature selection procedure that accounts for the adaptive search employed by lasso regularization, we identified the candidate regulators in LUSC. Remarkably, several of these candidate regulators are RBPs. Furthermore, majority of the candidate regulators have been previously found to be associated with lung cancer. To investigate the mechanisms that are controlled by these regulators, we predicted their target gene sets based on our model. We validated the target gene sets by comparing against experimentally verified targets. Our results suggest that the future studies of gene expression in cancer must consider the effect of RBP-mediated regulation.
Matrix factorization methods are linear models, with limited capability to model complex relations. In our work, we use tropical semiring to introduce non-linearity into matrix factorization models. ...We propose a method called Sparse Tropical Matrix Factorization (STMF) for the estimation of missing (unknown) values in sparse data.
We evaluate the efficiency of the STMF method on both synthetic data and biological data in the form of gene expression measurements downloaded from The Cancer Genome Atlas (TCGA) database. Tests on unique synthetic data showed that STMF approximation achieves a higher correlation than non-negative matrix factorization (NMF), which is unable to recover patterns effectively. On real data, STMF outperforms NMF on six out of nine gene expression datasets. While NMF assumes normal distribution and tends toward the mean value, STMF can better fit to extreme values and distributions.
STMF is the first work that uses tropical semiring on sparse data. We show that in certain cases semirings are useful because they consider the structure, which is different and simpler to understand than it is with standard linear algebra.
Metazoan genomes encode hundreds of RNA-binding proteins (RBPs). These proteins regulate post-transcriptional gene expression and have critical roles in numerous cellular processes including mRNA ...splicing, export, stability and translation. Despite their ubiquity and importance, the binding preferences for most RBPs are not well characterized. In vitro and in vivo studies, using affinity selection-based approaches, have successfully identified RNA sequence associated with specific RBPs; however, it is difficult to infer RBP sequence and structural preferences without specifically designed motif finding methods. In this study, we introduce a new motif-finding method, RNAcontext, designed to elucidate RBP-specific sequence and structural preferences with greater accuracy than existing approaches. We evaluated RNAcontext on recently published in vitro and in vivo RNA affinity selected data and demonstrate that RNAcontext identifies known binding preferences for several control proteins including HuR, PTB, and Vts1p and predicts new RNA structure preferences for SF2/ASF, RBM4, FUSIP1 and SLM2. The predicted preferences for SF2/ASF are consistent with its recently reported in vivo binding sites. RNAcontext is an accurate and efficient motif finding method ideally suited for using large-scale RNA-binding affinity datasets to determine the relative binding preferences of RBPs for a wide range of RNA sequences and structures.