Essential proteins/genes are indispensable to the survival or reproduction of an organism, and the deletion of such essential proteins will result in lethality or infertility. The identification of ...essential genes is very important not only for understanding the minimal requirements for survival of an organism, but also for finding human disease genes and new drug targets. Experimental methods for identifying essential genes are costly, time-consuming, and laborious. With the accumulation of sequenced genomes data and high-throughput experimental data, many computational methods for identifying essential proteins are proposed, which are useful complements to experimental methods. In this review, we show the state-of-the-art methods for identifying essential genes and proteins based on machine learning and network topological features, point out the progress and limitations of current methods, and discuss the challenges and directions for further research.
Chronic inflammatory diseases (CIDs), including inflammatory bowel disease (IBD), rheumatoid arthritis (RA) and systemic lupus erythematosus (SLE) are thought to emerge from an impaired complex ...network of inter- and intracellular biochemical interactions among several proteins and small chemical compounds under strong influence of genetic and environmental factors. CIDs are characterised by shared and disease-specific processes, which is reflected by partially overlapping genetic risk maps and pathogenic cells (e.g., T cells). Their pathogenesis involves a plethora of intracellular pathways. The translation of the research findings on CIDs molecular mechanisms into effective treatments is challenging and may explain the low remission rates despite modern targeted therapies. Modelling CID-related causal interactions as networks allows us to tackle the complexity at a systems level and improve our understanding of the interplay of key pathways. Here we report the construction, description, and initial applications of the SYSCID map (https://syscid.elixir-luxembourg.org/), a mechanistic causal interaction network covering the molecular crosstalk between IBD, RA and SLE. We demonstrate that the map serves as an interactive, graphical review of IBD, RA and SLE molecular mechanisms, and helps to understand the complexity of omics data. Examples of such application are illustrated using transcriptome data from time-series gene expression profiles following anti-TNF treatment and data from genome-wide associations studies that enable us to suggest potential effects to altered pathways and propose possible mechanistic biomarkers of treatment response.
The transcription process is crucial to life and the enzyme RNA polymerase (RNAP) is the major component of the transcription machinery. The development of single-molecule techniques, such as ...magnetic and optical tweezers, atomic-force microscopy and single-molecule fluorescence, increased our understanding of the transcription process and complements traditional biochemical studies. Based on these studies, theoretical models have been proposed to explain and predict the kinetics of the RNAP during the polymerization, highlighting the results achieved by models based on the thermodynamic stability of the transcription elongation complex. However, experiments showed that if more than one RNAP initiates from the same promoter, the transcription behavior slightly changes and new phenomenona are observed. We proposed and implemented a theoretical model that considers collisions between RNAPs and predicts their cooperative behavior during multi-round transcription generalizing the Bai et al. stochastic sequence-dependent model. In our approach, collisions between elongating enzymes modify their transcription rate values. We performed the simulations in Mathematica® and compared the results of the single and the multiple-molecule transcription with experimental results and other theoretical models. Our multi-round approach can recover several expected behaviors, showing that the transcription process for the studied sequences can be accelerated up to 48% when collisions are allowed: the dwell times on pause sites are reduced as well as the distance that the RNAPs backtracked from backtracking sites.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Cancer has been increasingly recognized as a systems biology disease since many investigators have demonstrated that this malignant phenotype emerges from abnormal protein-protein, regulatory and ...metabolic interactions induced by simultaneous structural and regulatory changes in multiple genes and pathways. Therefore, the identification of oncogenic interactions and cancer-related signaling networks is crucial for better understanding cancer. As experimental techniques for determining such interactions and signaling networks are labor-intensive and time-consuming, the development of a computational approach capable to accomplish this task would be of great value. For this purpose, we present here a novel computational approach based on network topology and machine learning capable to predict oncogenic interactions and extract relevant cancer-related signaling subnetworks from an integrated network of human genes interactions (INHGI). This approach, called graph2sig, is twofold: first, it assigns oncogenic scores to all interactions in the INHGI and then these oncogenic scores are used as edge weights to extract oncogenic signaling subnetworks from INHGI. Regarding the prediction of oncogenic interactions, we showed that graph2sig is able to recover 89% of known oncogenic interactions with a precision of 77%. Moreover, the interactions that received high oncogenic scores are enriched in genes for which mutations have been causally implicated in cancer. We also demonstrated that graph2sig is potentially useful in extracting oncogenic signaling subnetworks: more than 80% of constructed subnetworks contain more than 50% of original interactions in their corresponding oncogenic linear pathways present in the KEGG PATHWAY database. In addition, the potential oncogenic signaling subnetworks discovered by graph2sig are supported by experimental evidence. Taken together, these results suggest that graph2sig can be a useful tool for investigators involved in cancer research interested in detecting signaling networks most prone to contribute with the emergence of malignant phenotype.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
The Epstein-Barr virus (EBV) is a ubiquitous γ herpesvirus strongly associated with nasopharyngeal carcinomas, and the viral oncogenicity in part relies on cellular effects of the viral latent ...membrane protein 1 (LMP1). It was previously described that EBV strains B95.8 and M81 differ in cell tropism and the activation of the lytic cycle. Nonetheless, it is unknown whether LMP1 from these strains have different effects when expressed in nasopharyngeal cells. Thus, herein we evaluated the effects of EBV LMP1 derived from viral strains B95.8 and M81 and expressed in immortalized nasopharyngeal cells NP69
in the regulation of 91 selected cellular miRNAs. We found that cells expressing either LMP1 behave similarly in terms of NF-kB activation and cell migration. Nonetheless, the miRs 100-5p, 192-5p, and 574-3p were expressed at higher levels in cells expressing LMP1 B95.8 compared to M81. Additionally, results generated by in silico pathway enrichment analysis indicated that LMP1 M81 distinctly regulate genes involved in cell cycle (i.e.,
), mRNA processing (i.e.,
), and mitochondrial biogenesis (i.e.,
). In conclusion, LMP1 M81 was found to distinctively regulate miRs 100-5p, 192-5p, and 574-3p, and the in silico analysis provided valuable clues to dissect the molecular effects of EBV LMP1 expressed in nasopharyngeal cells.
Mitochondrial inner membrane uncoupling proteins (UCP) dissipate the proton electrochemical gradient established by the respiratory chain, thus affecting the yield of ATP synthesis. UCP ...overexpression in plants has been correlated with oxidative stress tolerance, improved photosynthetic efficiency and increased mitochondrial biogenesis. This study reports the main transcriptomic responses associated with the overexpression of an UCP (AtUCP1) in tobacco seedlings. Compared to wild-type (WT), AtUCP1 transgenic seedlings showed unaltered ATP levels and higher accumulation of serine. By using RNA-sequencing, a total of 816 differentially expressed genes between the investigated overexpressor lines and the untransformed WT control were identified. Among them, 239 were up-regulated and 577 were down-regulated. As a general response to AtUCP1 overexpression, noticeable changes in the expression of genes involved in energy metabolism and redox homeostasis were detected. A substantial set of differentially expressed genes code for products targeted to the chloroplast and mainly involved in photosynthesis. The overall results demonstrate that the alterations in mitochondrial function provoked by AtUCP1 overexpression require important transcriptomic adjustments to maintain cell homeostasis. Moreover, the occurrence of an important cross-talk between chloroplast and mitochondria, which culminates in the transcriptional regulation of several genes involved in different pathways, was evidenced.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
As a conceptual model of disease mechanisms, a disease map integrates available knowledge and is applied for data interpretation, predictions and hypothesis generation. It is possible to model ...disease mechanisms on different levels of granularity and adjust the approach to the goals of a particular project. This rich environment together with requirements for high-quality network reconstruction makes it challenging for new curators and groups to be quickly introduced to the development methods. In this review, we offer a step-by-step guide for developing a disease map within its mainstream pipeline that involves using the CellDesigner tool for creating and editing diagrams and the MINERVA Platform for online visualisation and exploration. We also describe how the Neo4j graph database environment can be used for managing and querying efficiently such a resource. For assessing the interoperability and reproducibility we apply FAIR principles.
Solanum paniculatum L. belongs to the Solanaceae family and has the ability to grow and develop under unfavorable environmental conditions such as drought and salt stress, acid soils and soils poor ...in nutrients. The present work aimed to analyze S. paniculatum seed transcriptome associated with induced tolerance to drought stress by osmopriming. Seeds subjected to osmopriming (−1.0 MPa) displayed a higher germination and normal seedling percentage under drought stress when compared with unprimed seeds. RNA-seq transcriptome profiles of osmoprimed and unprimed seeds were determined and the potential proteins involved in the drought tolerance of S. paniculatum were identified. From the 34,640 assembled transcripts for both osmoprimed and unprimed seeds, only 235 were differentially expressed and, among these, 23 (10%) transcripts were predicted to code for proteins potentially involved in response to stress, response to abiotic stimulus and response to chemical. The possible mechanisms by which these stress-associated genes may confer tolerance to osmoprimed Solanum paniculatum seeds to germinate under water deficit was discussed and may help to find markers for the selection of new materials belonging to the Solanaceae family that are more tolerant to stress during and following germination.
Nile tilapia is the third most cultivated fish worldwide and a novel model species for evolutionary studies. Aiming to improve productivity and contribute to the selection of traits of economic ...impact, biotechnological approaches have been intensively applied to species enhancement. In this sense, recent studies have focused on the multiple roles played by microRNAs (miRNAs) in the post-transcriptional regulation of protein-coding genes involved in the emergence of phenotypes with relevance for aquaculture. However, there is still a growing demand for a reference resource dedicated to integrating Nile Tilapia miRNA information, obtained from both experimental and in silico approaches, and facilitating the analysis and interpretation of RNA sequencing data. Here, we present an open repository dedicated to Nile Tilapia miRNAs: the "miRTil database". The database stores data on 734 mature miRNAs identified in 11 distinct tissues and five key developmental stages. The database provides detailed information about miRNA structure, genomic context, predicted targets, expression profiles, and relative 5p/3p arm usage. Additionally, miRTil also includes a comprehensive pre-computed miRNA-target interaction network containing 4936 targets and 19,580 interactions.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
The emergence of -omics technologies has allowed the collection of vast amounts of data on biological systems. Although, the pace of such collection has been exponential, the impact of these data ...remains small on many critical biomedical applications such as drug development. Limited resources, high costs, and low hit-to-lead ratio have led researchers to search for more cost effective methodologies. A possible alternative is to incorporate computational methods of potential drug target prediction early during drug discovery workflow. Computational methods based on systems approaches have the advantage of taking into account the global properties of a molecule not limited to its sequence, structure or function. Machine learning techniques are powerful tools that can extract relevant information from massive and noisy data sets. In recent years the scientific community has explored the combined power of these fields to propose increasingly accurate and low cost methods to propose interesting drug targets. In this mini-review, we describe promising approaches based on the simultaneous use of systems biology and machine learning to access gene and protein druggability. Moreover, we discuss the state-of-the-art of this emerging and interdisciplinary field, discussing data sources, algorithms and the performance of the different methodologies. Finally, we indicate interesting avenues of research and some remaining open challenges.