Traditional drug discovery faces a severe efficacy crisis. Repurposing of registered drugs provides an alternative with lower costs and faster drug development timelines. However, the data necessary ...for the identification of disease modules, i.e. pathways and sub-networks describing the mechanisms of complex diseases which contain potential drug targets, are scattered across independent databases. Moreover, existing studies are limited to predictions for specific diseases or non-translational algorithmic approaches. There is an unmet need for adaptable tools allowing biomedical researchers to employ network-based drug repurposing approaches for their individual use cases. We close this gap with NeDRex, an integrative and interactive platform for network-based drug repurposing and disease module discovery. NeDRex integrates ten different data sources covering genes, drugs, drug targets, disease annotations, and their relationships. NeDRex allows for constructing heterogeneous biological networks, mining them for disease modules, prioritizing drugs targeting disease mechanisms, and statistical validation. We demonstrate the utility of NeDRex in five specific use-cases.
Many bacteria can form wall-deficient variants, or L-forms, that divide by a simple mechanism that does not require the FtsZ-based cell division machinery. Here, we use microfluidic systems to probe ...the growth, chromosome cycle and division mechanism of Bacillus subtilis L-forms. We find that forcing cells into a narrow linear configuration greatly improves the efficiency of cell growth and chromosome segregation. This reinforces the view that L-form division is driven by an excess accumulation of surface area over volume. Cell geometry also plays a dominant role in controlling the relative positions and movement of segregating chromosomes. Furthermore, the presence of the nucleoid appears to influence division both via a cell volume effect and by nucleoid occlusion, even in the absence of FtsZ. Our results emphasise the importance of geometric effects for a range of crucial cell functions, and are of relevance for efforts to develop artificial or minimal cell systems.
Abstract
Background
Probabilistic functional integrated networks (PFINs) are designed to aid our understanding of cellular biology and can be used to generate testable hypotheses about protein ...function. PFINs are generally created by scoring the quality of interaction datasets against a Gold Standard dataset, usually chosen from a separate high-quality data source, prior to their integration. Use of an external Gold Standard has several drawbacks, including data redundancy, data loss and the need for identifier mapping, which can complicate the network build and impact on PFIN performance. Additionally, there typically are no Gold Standard data for non-model organisms.
Results
We describe the development of an integration technique, ssNet, that scores and integrates both high-throughput and low-throughout data from a single source database in a consistent manner without the need for an external Gold Standard dataset. Using data from
Saccharomyces cerevisiae
we show that ssNet is easier and faster, overcoming the challenges of data redundancy, Gold Standard bias and ID mapping. In addition ssNet results in less loss of data and produces a more complete network.
Conclusions
The ssNet method allows PFINs to be built successfully from a single database, while producing comparable network performance to networks scored using an external Gold Standard source and with reduced data loss.
A goal of the biotechnology industry is to be able to recognise detrimental cellular states that may lead to suboptimal or anomalous growth in a bacterial population. Our current knowledge of how ...different environmental treatments modulate gene regulation and bring about physiology adaptations is limited, and hence it is difficult to determine the mechanisms that lead to their effects. Patterns of gene expression, revealed using technologies such as microarrays or RNA-seq, can provide useful biomarkers of different gene regulatory states indicative of a bacterium's physiological status. It is desirable to have only a few key genes as the biomarkers to reduce the costs of determining the transcriptional state by opening the way for methods such as quantitative RT-PCR and amplicon panels. In this paper, we used unsupervised machine learning to construct a transcriptional landscape model from condition-dependent transcriptome data, from which we have identified 10 clusters of samples with differentiated gene expression profiles and linked to different cellular growth states. Using an iterative feature elimination strategy, we identified a minimal panel of 10 biomarker genes that achieved 100% cross-validation accuracy in predicting the cluster assignment. Moreover, we designed and evaluated a variety of data processing strategies to ensure our methods were able to generate meaningful transcriptional landscape models, capturing relevant biological processes. Overall, the computational strategies introduced in this study facilitate the identification of a detailed set of relevant cellular growth states, and how to sense them using a reduced biomarker panel.
The re-use of previously validated designs is critical to the evolution of synthetic biology from a research discipline to an engineering practice. Here we describe the Synthetic Biology Open ...Language (SBOL), a proposed data standard for exchanging designs within the synthetic biology community. SBOL represents synthetic biology designs in a community-driven, formalized format for exchange between software tools, research groups and commercial service providers. The SBOL Developers Group has implemented SBOL as an XML/RDF serialization and provides software libraries and specification documentation to help developers implement SBOL in their own software. We describe early successes, including a demonstration of the utility of SBOL for information exchange between several different software tools and repositories from both academic and industrial partners. As a community-driven standard, SBOL will be updated as synthetic biology evolves to provide specific capabilities for different aspects of the synthetic biology workflow.
Cellular senescence—the permanent arrest of cycling in normally proliferating cells such as fibroblasts—contributes both to age‐related loss of mammalian tissue homeostasis and acts as a tumour ...suppressor mechanism. The pathways leading to establishment of senescence are proving to be more complex than was previously envisaged. Combining in‐silico interactome analysis and functional target gene inhibition, stochastic modelling and live cell microscopy, we show here that there exists a dynamic feedback loop that is triggered by a DNA damage response (DDR) and, which after a delay of several days, locks the cell into an actively maintained state of ‘deep’ cellular senescence. The essential feature of the loop is that long‐term activation of the checkpoint gene CDKN1A (p21) induces mitochondrial dysfunction and production of reactive oxygen species (ROS) through serial signalling through GADD45‐MAPK14(p38MAPK)‐GRB2‐TGFBR2‐TGFβ. These ROS in turn replenish short‐lived DNA damage foci and maintain an ongoing DDR. We show that this loop is both necessary and sufficient for the stability of growth arrest during the establishment of the senescent phenotype.
Synopsis
The phenomenon of cellular ‘senescence’—the permanent arrest of division in normally proliferating mammalian cells such as fibroblasts—is thought to be a central component of the ageing process. Senescence contributes both to age‐related loss of tissue homeostasis, as the loss of division capacity leads to impaired cell renewal, and also to protect against cancer, because it acts to block the uncontrolled proliferation of cells that may give rise to a malignant tumour. Replicative senescence is triggered by uncapped telomeres or by ‘unrepairable’ non‐telomeric DNA damage. Both lesions initiate the same canonical DNA damage response (DDR) (d'Adda di Fagagna, 2008). This response is characterized by activation of sensor kinases (ATM/ATR, DNA‐PK), formation of DNA damage foci containing activated H2A.X (γH2A.X) and ultimately induction of cell cycle arrest through activation of checkpoint proteins, notably p53 (TP53) and the CDK inhibitor p21 (CDKN1A). This signalling pathway continues to contribute actively to the stability of the G0 arrest in fully senescent cells long after induction of senescence (d'Adda di Fagagna et al, 2003). However, senescence is more complex than mere CDKI‐mediated growth arrest. Senescent cells alter their expression of literally hundreds of genes (Shelton et al, 1999), prominent among these being pro‐inflammatory secretory genes (Coppe et al, 2008) and marker genes for a retrograde response induced by mitochondrial dysfunction (Passos et al, 2007a).
There is a growing evidence that multiple mechanisms interact to underpin ageing at the cellular level (Kirkwood, 2005; Passos et al, 2007b) necessitating a systems biology approach if the complex mechanisms of ageing are to be understood (Kirkwood, 2008). With respect to cell senescence, the two major unanswered questions are (i) How does a DNA lesion that can be repaired, at least in principle, induce and maintain irreversible growth arrest? and (ii) How does a growth arrest trigger a completely different cellular phenotype as soon as it becomes irreversible?
To understand those questions, we performed a kinetic analysis of the establishment phase of senescence initiated by DNA damage or telomere dysfunction, focussing on pathways downstream of the classical DDR. Using an approach that combined (i) in‐silico interactome analysis, (ii) functional target gene inhibition, (iii) stochastic modelling, and (iv) live cell microscopy, we identified a positive feedback loop between DDR and mitochondrial production of reactive oxygen species (ROS) as necessary and sufficient for long‐term maintenance of growth arrest. Using pathway log likelihood scores calculated by a quantitative in‐silico interactome analysis to guide siRNA and small molecule inhibition experiments, and using results of sequential and combined inhibition experiments to refine the predictions from the interactome analysis, we found that DDR triggered mitochondrial dysfunction leading to enhanced ROS activation through a linear signal transduction through TP53, CDKN1A, GADD45A, p38 (MAPK14), GRB2, TGFBR2 and TGFβ(Figure 2D). We hypothesized that these ROS stochastically generate novel DNA damage in the nucleus, thus forming a positive feedback loop contributing to the long‐term maintenance of DDR (Figure 3A). First confirmation came from static inhibitor experiments as before, showing that nuclear DNA damage foci frequencies in senescent cells were reduced if feedback signalling was suppressed. To formally establish the existence of a feedback loop and its relevance for senescence, we used live cell microscopy in combination with quantitative modelling.
We transformed the conceptual model shown in Figure 3A into a stochastic mechanistic model of the DDR feedback loop by extending the previously published model of the TP53/Mdm2 circuit (Proctor and Gray, 2008) to include reactions for synthesis/activation and degradation/deactivation/repair of CDKN1A, GADD45, MAPK14, ROS and DNA damage. The model replicated very precisely the kinetic behaviour of activated TP53, CDKN1A, ROS and DNA damage foci after initiation of senescence by irradiation. Having established its concordance with the experimental data, the model was then used to predict the effects of intervening in the feedback loop. The model predicted that any intervention reducing ROS levels by about half would decrease average DNA damage foci frequencies from six to four foci/nucleus within about 15 h. It further predicted that this would be sufficient to reduce CDKN1A to basal levels continuously for at least 6 h in about 20% of the treated cells, thus allowing a significant fraction of cells to escape from growth arrest and to resume proliferation. This should happen even if the intervention into the feedback loop was started at a late time point (e.g. 6 days) after induction of senescence.
To analyse DNA damage foci dynamics we used a reporter construct (AcGFP–53BP1c) that quantitatively reports single DNA damage foci kinetics in time‐resolved live cell microscopy (Nelson et al, 2009). Foci frequency measurements quantitatively confirmed the prediction from the stochastic model. More importantly, we found that many individual foci in both telomere‐ and stress‐dependent senescence had short lifespans with half‐lives below 15 h. Feedback loop inhibition reduced only the frequencies of short‐lived DNA damage foci in accordance with the hypothesis that ROS production contributed to DDR by constant replenishment of short‐lived DNA damage foci.
Finally, we inhibited signalling through the loop at different time points after induction of senescence by ionizing radiation and measured ROS levels, DNA damage foci frequencies and proliferation markers. Treatments with the MAPK14 inhibitor SB203580 or the free radical scavenger PBN were used to block the loop. The results quantitatively confirmed the model prediction and indicated that the feedback loop between DDR and ROS production was both necessary and sufficient to maintain cell cycle arrest for at least 6–10 days after induction of senescence. Interestingly, the loop was still active at later time points and in deep senescence, but proliferation arrest was then stabilized by additional factor(s). This indicated that certain features of the senescent phenotype‐like ROS production that might be responsible for the negative impact of senescent cells into their tissue environment can be successfully inhibited even in deep senescence. This may prove relevant for novel therapeutic studies aiming to modulate intracellular ROS levels in both aging and cancer.
The sustained activation of CDKN1A (p21/Waf1/Cip1) by a DNA damage response induces mitochondrial dysfunction and reactive oxygen species (ROS) production via signalling through CDKN1A‐GADD45A‐MAPK14‐ GRB2‐TGFBR2‐TGFbeta in senescing primary human and mouse cells in vitro and in vivo.
Enhanced ROS production in senescing cells generates additional DNA damage. Although this damage is repairable and transient, it elevates the average levels of DNA damage response permanently, thus forming a positive feedback loop.
This loop is necessary and sufficient to maintain the stability of growth arrest until a ‘point of no return’ is reached during establishment of senescence.
Drug development is both increasing in cost whilst decreasing in productivity. There is a general acceptance that the current paradigm of R&D needs to change. One alternative approach is drug ...repositioning. With target-based approaches utilised heavily in the field of drug discovery, it becomes increasingly necessary to have a systematic method to rank gene-disease associations. Although methods already exist to collect, integrate and score these associations, they are often not a reliable reflection of expert knowledge. Furthermore, the amount of data available in all areas covered by bioinformatics is increasing dramatically year on year. It thus makes sense to move away from more generalised hypothesis driven approaches to research to one that allows data to generate their own hypothesis. We introduce an integrated, data driven approach to drug repositioning. We first apply a Bayesian statistics approach to rank 309,885 gene-disease associations using existing knowledge. Ranked associations are then integrated with other biological data to produce a semantically-rich drug discovery network. Using this network, we show how our approach identifies diseases of the central nervous system (CNS) to be an area of interest. CNS disorders are identified due to the low numbers of such disorders that currently have marketed treatments, in comparison to other therapeutic areas. We then systematically mine our network for semantic subgraphs that allow us to infer drug-disease relations that are not captured in the network. We identify and rank 275,934 drug-disease has_indication associations after filtering those that are more likely to be side effects, whilst commenting on the top ranked associations in more detail. The dataset has been created in Neo4j and is available for download at https://bitbucket.org/ncl-intbio/genediseaserepositioning along with a Java implementation of the searching algorithm.
Motivation: In silico experiments in bioinformatics involve the co-ordinated use of computational tools and information repositories. A growing number of these resources are being made available with ...programmatic access in the form of Web services. Bioinformatics scientists will need to orchestrate these Web services in workflows as part of their analyses. Results: The Taverna project has developed a tool for the composition and enactment of bioinformatics workflows for the life sciences community. The tool includes a workbench application which provides a graphical user interface for the composition of workflows. These workflows are written in a new language called the simple conceptual unified flow language (Scufl), where by each step within a workflow represents one atomic task. Two examples are used to illustrate the ease by which in silico experiments can be represented as Scufl workflows using the workbench application. Availability: The Taverna workflow system is available as open source and can be downloaded with example Scufl workflows from http://taverna.sourceforge.net
A long-term objective of network medicine is to replace our current, mainly phenotype-based disease definitions by subtypes of health conditions corresponding to distinct pathomechanisms. For this, ...molecular and health data are modeled as networks and are mined for pathomechanisms. However, many such studies rely on large-scale disease association data where diseases are annotated using the very phenotype-based disease definitions the network medicine field aims to overcome. This raises the question to which extent the biases mechanistically inadequate disease annotations introduce in disease association data distort the results of studies which use such data for pathomechanism mining. We address this question using global- and local-scale analyses of networks constructed from disease association data of various types. Our results indicate that large-scale disease association data should be used with care for pathomechanism mining and that analyses of such data should be accompanied by close-up analyses of molecular data for well-characterized patient cohorts.