Abstract
Background
Even though several computational methods for rhythmicity detection and analysis of biological data have been proposed in recent years, classical trigonometric regression based on ...cosinor still has several advantages over these methods and is still widely used. Different software packages for cosinor-based rhythmometry exist, but lack certain functionalities and require data in different, non-unified input formats.
Results
We present CosinorPy, a Python implementation of cosinor-based methods for rhythmicity detection and analysis. CosinorPy merges and extends the functionalities of existing cosinor packages. It supports the analysis of rhythmic data using single- or multi-component cosinor models, automatic selection of the best model, population-mean cosinor regression, and differential rhythmicity assessment. Moreover, it implements functions that can be used in a design of experiments, a synthetic data generator, and import and export of data in different formats.
Conclusion
CosinorPy is an easy-to-use Python package for straightforward detection and analysis of rhythmicity requiring minimal statistical knowledge, and produces publication-ready figures. Its code, examples, and documentation are available to download from
https://github.com/mmoskon/CosinorPy
. CosinorPy can be installed manually or by using pip, the package manager for Python packages. The implementation reported in this paper corresponds to the software release v1.1.
In search of maximum non-overlapping codes Stanovnik, Lidija; Moškon, Miha; Mraz, Miha
Designs, codes, and cryptography,
05/2024, Letnik:
92, Številka:
5
Journal Article
Recenzirano
Odprti dostop
Non-overlapping codes are block codes that have arisen in diverse contexts of computer science and biology. Applications typically require finding non-overlapping codes with large cardinalities, but ...the maximum size of non-overlapping codes has been determined only for cases where the codeword length divides the size of the alphabet, and for codes with codewords of length two or three. For all other alphabet sizes and codeword lengths no computationally feasible way to identify non-overlapping codes that attain the maximum size has been found to date. Herein we characterize maximal non-overlapping codes. We formulate the maximum non-overlapping code problem as an integer optimization problem and determine necessary conditions for optimality of a non-overlapping code. Moreover, we solve several instances of the optimization problem to show that the hitherto known constructions do not generate the optimal codes for many alphabet sizes and codeword lengths. We also evaluate the number of distinct maximum non-overlapping codes.
Evolution-based processes have been widely applied in bioengineering as well as in cellular computing. For example, different approaches for protein evolution have been proposed. Moreover, ...implementations of evolutionary heuristics to solve optimisation problems using cellular populations have been demonstrated. However, heuristics implemented with cellular populations to optimise their own response, have not yet been reported. Here we present a heuristic optimisation framework that integrates a programmable synthetic evolution into a cellular population. The proposed evolution is based on the automatic selection of computing parts to execute a given objective. These parts are implemented in the form of plasmids, which are randomly distributed among a cellular population. Further evolution of their distribution is guided by a fitness function integrated within each cell in the population. While high values of fitness functions stimulate the propagation of computing parts composing optimal solutions through the population, low fitness values trigger the apoptosis of a cell. We provide a theoretical implementation of the framework in which we demonstrate the programmable evolution of different functions with different levels of complexity. To the best of our knowledge, our approach describes the first synthetic evolution framework for programmable self-optimisation of cellular populations. It requires little human intervention without a requirement to specify the exact implementation of a biological function the population should perform. Namely, the designer only needs to define the response the population should obtain and does not need to know how this response will be implemented. The proposed computational framework is available at
https://github.com/mmoskon/evolution
.
Boolean networks provide an effective mechanism for describing interactions and dynamics of gene regulatory networks (GRNs). Deriving accurate Boolean descriptions of GRNs is a challenging task. The ...number of experiments is usually much smaller than the number of genes. In addition, binarization leads to a loss of information and inconsistencies arise in binarized time-series data. The inference of Boolean networks from binarized time-series data alone often leads to complex and overfitted models. To obtain relevant Boolean models of gene regulatory networks, inference methods could incorporate data from multiple sources and prior knowledge in terms of general network structure and/or exact interactions. We propose the Boolean network inference method SAILoR (Structure-Aware Inference of Logic Rules). SAILoR incorporates time-series gene expression data in combination with provided reference networks to infer accurate Boolean models. SAILoR automatically extracts topological properties from reference networks. These can describe a more general structure of the GRN or can be more precise and describe specific interactions. SAILoR infers a Boolean network by learning from both continuous and binarized time-series data. It navigates between two main objectives, topological similarity to reference networks and correspondence with gene expression data. By incorporating the NSGA-II multi-objective genetic algorithm, SAILoR relies on the wisdom of crowds. Our results indicate that SAILoR can infer accurate and biologically relevant Boolean descriptions of GRNs from both a static and a dynamic perspective. We show that SAILoR improves the static accuracy of the inferred network compared to the network inference method dynGENIE3. Furthermore, we compared the performance of SAILoR with other Boolean network inference approaches including Best-Fit, REVEAL, MIBNI, GABNI, ATEN, and LogBTF. We have shown that by incorporating prior knowledge about the overall network structure, SAILoR can improve the structural correctness of the inferred Boolean networks while maintaining dynamic accuracy. To demonstrate the applicability of SAILoR, we inferred context-specific Boolean subnetworks of female Drosophila melanogaster before and after mating.
Artificial neural networks, inspired by the biological networks of the human brain, have become game-changing computing models in modern computer science. Inspired by their wide scope of ...applications, synthetic biology strives to create their biological counterparts, which we denote synthetic biological neural networks (SYNBIONNs). Their use in the fields of medicine, biosensors, biotechnology, and many more shows great potential and presents exciting possibilities. So far, many different synthetic biological networks have been successfully constructed, however, SYNBIONN implementations have been sparse. The latter are mostly based on neural networks pretrained in silico and being heavily dependent on extensive human input. In this paper, we review current implementations and models of SYNBIONNs. We briefly present the biological platforms that show potential for designing and constructing perceptrons and/or multilayer SYNBIONNs. We explore their future possibilities along with the challenges that must be overcome to successfully implement a scalable in vivo biological neural network capable of online learning.
Genes and gene products do not function in isolation but as components of complex networks of macromolecules through physical or biochemical interactions. Dependencies of gene mutations on genetic ...background (i.e., epistasis) are believed to play a role in understanding molecular underpinnings of complex diseases such as inflammatory bowel disease (IBD). However, the process of identifying such interactions is complex due to for instance the curse of high dimensionality, dependencies in the data and non-linearity. Here, we propose a novel approach for robust and computationally efficient epistasis detection. We do so by first reducing dimensionality, per gene via diffusion kernel principal components (kpc). Subsequently, kpc gene summaries are used for downstream analysis including the construction of a gene-based epistasis network. We show that our approach is not only able to recover known IBD associated genes but also additional genes of interest linked to this difficult gastrointestinal disease.
Ever since its foundational years, synthetic biology has been focused on the implementation of biological computing structures. In the beginning, engineered biological computation has mainly been ...based on uncoupled monoclonal cellular populations. Implementations of such computing structures were mostly inspired by digital electronic circuits and revealed many constraints that limited the advance of the field to relatively simple information processing structures. The focus has recently shifted towards the implementation of biological computing structures within coupled intercellular circuits composed of engineered cellular modules. These circuits have, however, advanced only to a certain point, namely to consist of a few engineered bacterial strains, which perform the computation. It is now time to make a transition from modules and relatively simple systems of biological processing structures to networks composing different strains each presenting a designated computing structure. In such networks, each strain is analogous to a logic chip on a breadboard circuit and is connected to other strains by means of intercellular communication mechanisms rather than copper wires. This analogy can be driven further to use a set of engineered biological modules to construct a complex computing system, such as a multicellular biological processor. We review the state of the art of distributed cellular computation, communication mechanisms, and computational analysis and design approaches for distributed biological computing. We demonstrate the potential next step in engineered biological computation by a proposal of a design of a multicellular biological processor. We demonstrate an analysis of the proposed computing network using in silico simulation and optimisation approaches. Finally, we discuss the potential applications of the reviewed distributed cellular computing structures to the field of neural computing.
Genome-scale metabolic models (GEMs) have found numerous applications in different domains, ranging from biotechnology to systems medicine. Herein, we overview the most popular algorithms for the ...automated reconstruction of context-specific GEMs using high-throughput experimental data. Moreover, we describe different datasets applied in the process, and protocols that can be used to further automate the model reconstruction and validation. Finally, we describe recent COVID-19 applications of context-specific GEMs, focusing on the analysis of metabolic implications, identification of biomarkers and potential drug targets.
The liver is to date the best example of a sexually dimorphic non-reproductive organ. Over 1,000 genes are differentially expressed between sexes indicating that female and male livers are two ...metabolically distinct organs. The spectrum of liver diseases is broad and is usually prevalent in one or the other sex, with different contributing genetic and environmental factors. It is thus difficult to predict individual's disease outcomes and treatment options. Systems approaches including mathematical modeling can aid importantly in understanding the multifactorial liver disease etiology leading toward tailored diagnostics, prognostics and therapy. The currently established computational models of hepatic metabolism that have proven to be essential for understanding of non-alcoholic fatty liver disease (NAFLD) and hepatocellular carcinoma (HCC) are limited to the description of gender-independent response or reflect solely the response of the males. Herein we present
, the first sex-based multi-tissue and multi-level liver metabolic computational model. The model was constructed based on
liver model
and the object-oriented modeling. The crucial factor in adaptation of liver metabolism to the sex is the inclusion of estrogen and androgen receptor responses to respective hormones and the link to sex-differences in growth hormone release. The model was extensively validated on literature data and experimental data obtained from wild type C57BL/6 mice fed with regular chow and western diet. These experimental results show extensive sex-dependent changes and could not be reproduced
with the uniform model
.
represents the first large-scale liver metabolic model, which allows a detailed insight into the sex-dependent complex liver pathologies, and how the genetic and environmental factors interact with the sex in disease appearance and progression. We used the model to identify the most important sex-dependent metabolic pathways, which are involved in accumulation of triglycerides representing initial steps of NAFLD. We identified PGC1A, PPARα, FXR, and LXR as regulatory factors that could become important in sex-dependent personalized treatment of NAFLD.