Inferring a Gene Regulatory Network (GRN) from gene expression data is a computationally expensive task, exacerbated by increasing data sizes due to advances in high-throughput gene profiling ...technology, such as single-cell RNA-seq. To equip researchers with a toolset to infer GRNs from large expression datasets, we propose GRNBoost2 and the Arboreto framework. GRNBoost2 is an efficient algorithm for regulatory network inference using gradient boosting, based on the GENIE3 architecture. Arboreto is a computational framework that scales up GRN inference algorithms complying with this architecture. Arboreto includes both GRNBoost2 and an improved implementation of GENIE3, as a user-friendly open source Python package.
Arboreto is available under the 3-Clause BSD license at http://arboreto.readthedocs.io.
Supplementary data are available at Bioinformatics online.
One of the challenges in multi-omics data analysis for precision medicine is the efficient exploration of undiscovered molecular interactions in disease processes. We present BioMOBS, a workflow ...consisting of two data visualization tools integrated with an open-source molecular information database to perform clinically relevant analyses (https://github.com/driesheylen123/BioMOBS). We performed exploratory pathway analysis with BioMOBS and demonstrate its ability to generate relevant molecular hypotheses, by reproducing recent findings in type 2 diabetes UK biobank data. The central visualisation tool, where data-driven and literature-based findings can be integrated, is available within the github link as well. BioMOBS is a workflow that leverages information from multiple data-driven interactive analyses and visually integrates it with established pathway knowledge. The demonstrated use cases place trust in the usage of BioMOBS as a procedure to offer clinically relevant insights in disease pathway analyses on various types of omics data.
Structural variations of DNA greater than 1 kilobase in size account for most bases that vary among human genomes, but are still relatively under-ascertained. Here we use tiling oligonucleotide ...microarrays, comprising 42 million probes, to generate a comprehensive map of 11,700 copy number variations (CNVs) greater than 443 base pairs, of which most (8,599) have been validated independently. For 4,978 of these CNVs, we generated reference genotypes from 450 individuals of European, African or East Asian ancestry. The predominant mutational mechanisms differ among CNV size classes. Retrotransposition has duplicated and inserted some coding and non-coding DNA segments randomly around the genome. Furthermore, by correlation with known trait-associated single nucleotide polymorphisms (SNPs), we identified 30 loci with CNVs that are candidates for influencing disease susceptibility. Despite this, having assessed the completeness of our map and the patterns of linkage disequilibrium between CNVs and SNPs, we conclude that, for complex traits, the heritability void left by genome-wide association studies will not be accounted for by common CNVs.
The BioRuby software toolkit contains a comprehensive set of free development tools and libraries for bioinformatics and molecular biology, written in the Ruby programming language. BioRuby has ...components for sequence analysis, pathway analysis, protein modelling and phylogenetic analysis; it supports many widely used data formats and provides easy access to databases, external programs and public web services, including BLAST, KEGG, GenBank, MEDLINE and GO. BioRuby comes with a tutorial, documentation and an interactive environment, which can be used in the shell, and in the web browser. Availability: BioRuby is free and open source software, made available under the Ruby license. BioRuby runs on all platforms that support Ruby, including Linux, Mac OS X and Windows. And, with JRuby, BioRuby runs on the Java Virtual Machine. The source code is available from http://www.bioruby.org/. Contact: katayama@bioruby.org
Precision medicine as a framework for disease diagnosis, treatment, and prevention at the molecular level has entered clinical practice. From the start, genetics has been an indispensable tool to ...understand and stratify the biology of chronic and complex diseases in precision medicine. However, with the advances in biomedical and omics technologies, quantitative proteomics is emerging as a powerful technology complementing genetics. Quantitative proteomics provide insight about the dynamic behaviour of proteins as they represent intermediate phenotypes. They provide direct biological insights into physiological patterns, while genetics accounting for baseline characteristics. Additionally, it opens a wide range of applications in clinical diagnostics, treatment stratification, and drug discovery. In this mini-review, we discuss the current status of quantitative proteomics in precision medicine including the available technologies and common methods to analyze quantitative proteomics data. Furthermore, we highlight the current challenges to put quantitative proteomics into clinical settings and provide a perspective to integrate proteomics data with genomics data for future applications in precision medicine.
Understanding complex systems often requires a bottom-up analysis towards a systems biology approach. The need to investigate a system, not only as individual components but as a whole, emerges. This ...can be done by examining the elementary constituents individually and then how these are connected. The myriad components of a system and their interactions are best characterized as networks and they are mainly represented as graphs where thousands of nodes are connected with thousands of vertices. In this article we demonstrate approaches, models and methods from the graph theory universe and we discuss ways in which they can be used to reveal hidden properties and features of a network. This network profiling combined with knowledge extraction will help us to better understand the biological significance of the system.
BackgroundCOVID-19 mortality, excess mortality, deaths per million population (DPM), infection fatality ratio (IFR) and case fatality ratio (CFR) are reported and compared for many countries ...globally. These measures may appear objective, however, they should be interpreted with caution.AimWe examined reported COVID-19-related mortality in Belgium from 9 March 2020 to 28 June 2020, placing it against the background of excess mortality and compared the DPM and IFR between countries and within subgroups.MethodsThe relation between COVID-19-related mortality and excess mortality was evaluated by comparing COVID-19 mortality and the difference between observed and weekly average predictions of all-cause mortality. DPM were evaluated using demographic data of the Belgian population. The number of infections was estimated by a stochastic compartmental model. The IFR was estimated using a delay distribution between infection and death.ResultsIn the study period, 9,621 COVID-19-related deaths were reported, which is close to the excess mortality estimated using weekly averages (8,985 deaths). This translates to 837 DPM and an IFR of 1.5% in the general population. Both DPM and IFR increase with age and are substantially larger in the nursing home population.DiscussionDuring the first pandemic wave, Belgium had no discrepancy between COVID-19-related mortality and excess mortality. In light of this close agreement, it is useful to consider the DPM and IFR, which are both age, sex, and nursing home population-dependent. Comparison of COVID-19 mortality between countries should rather be based on excess mortality than on COVID-19-related mortality.
Elucidating the genotype-phenotype connection is one of the big challenges of modern molecular biology. To fully understand this connection, it is necessary to consider the underlying networks and ...the time factor. In this context of data deluge and heterogeneous information, visualization plays an essential role in interpreting complex and dynamic topologies. Thus, software that is able to bring the network, phenotypic and temporal information together is needed. Arena3D has been previously introduced as a tool that facilitates link discovery between processes. It uses a layered display to separate different levels of information while emphasizing the connections between them. We present novel developments of the tool for the visualization and analysis of dynamic genotype-phenotype landscapes.
Version 2.0 introduces novel features that allow handling time course data in a phenotypic context. Gene expression levels or other measures can be loaded and visualized at different time points and phenotypic comparison is facilitated through clustering and correlation display or highlighting of impacting changes through time. Similarity scoring allows the identification of global patterns in dynamic heterogeneous data. In this paper we demonstrate the utility of the tool on two distinct biological problems of different scales. First, we analyze a medium scale dataset that looks at perturbation effects of the pluripotency regulator Nanog in murine embryonic stem cells. Dynamic cluster analysis suggests alternative indirect links between Nanog and other proteins in the core stem cell network. Moreover, recurrent correlations from the epigenetic to the translational level are identified. Second, we investigate a large scale dataset consisting of genome-wide knockdown screens for human genes essential in the mitotic process. Here, a potential new role for the gene lsm14a in cytokinesis is suggested. We also show how phenotypic patterning allows for extensive comparison and identification of high impact knockdown targets.
We present a new visualization approach for perturbation screens with multiple phenotypic outcomes. The novel functionality implemented in Arena3D enables effective understanding and comparison of temporal patterns within morphological layers, to help with the system-wide analysis of dynamic processes. Arena3D is available free of charge for academics as a downloadable standalone application from: http://arena3d.org/.
Genetic testing for monogenic diabetes is important for patient care. Given the extensive genetic and clinical heterogeneity of diabetes, exome sequencing might provide additional diagnostic ...potential when standard Sanger sequencing-based diagnostics is inconclusive.
The aim of the study was to examine the performance of exome sequencing for a molecular diagnosis of MODY in patients who have undergone conventional diagnostic sequencing of candidate genes with negative results.
We performed exome enrichment followed by high-throughput sequencing in nine patients with suspected MODY. They were Sanger sequencing-negative for mutations in the HNF1A, HNF4A, GCK, HNF1B and INS genes. We excluded common, non-coding and synonymous gene variants, and performed in-depth analysis on filtered sequence variants in a pre-defined set of 111 genes implicated in glucose metabolism.
On average, we obtained 45 X median coverage of the entire targeted exome and found 199 rare coding variants per individual. We identified 0-4 rare non-synonymous and nonsense variants per individual in our a priori list of 111 candidate genes. Three of the variants were considered pathogenic (in ABCC8, HNF4A and PPARG, respectively), thus exome sequencing led to a genetic diagnosis in at least three of the nine patients. Approximately 91% of known heterozygous SNPs in the target exomes were detected, but we also found low coverage in some key diabetes genes using our current exome sequencing approach. Novel variants in the genes ARAP1, GLIS3, MADD, NOTCH2 and WFS1 need further investigation to reveal their possible role in diabetes.
Our results demonstrate that exome sequencing can improve molecular diagnostics of MODY when used as a complement to Sanger sequencing. However, improvements will be needed, especially concerning coverage, before the full potential of exome sequencing can be realized.
To develop and test a novel approach to xenotransplantation of isolated preantral follicles underneath the kidney capsule of immunodeficient mice.
Prospective experimental animal study.
Academic ...research unit.
Healthy adult nude mice.
Bovine ovaries from fetuses (n = 3) and calves (n = 3) were enzymatically disaggregated and subsequently filtered. Isolated preantral follicles were suspended in phosphate buffered saline, and granulosa and stroma cells originating from the ovarian digest served as embedding matrix. The suspension was injected under the kidney capsule of adult nude mice.
Fourteen days after transplantation, follicular survival and proliferation in grafts was assessed by histology and proliferating cell nuclear antigen (PCNA) immunostaining, and was compared with ungrafted control tissue.
Primordial follicles decreased from 58.2% in control tissue to 17.1% in transplants in the fetal group, and from 76.0% to 17.2% in the calf group. Concomitantly, primary follicles increased from 13.4% to 62.2% in the fetal group, and from 5.4% to 63.5% in the calf group. Follicular proliferation measured by PCNA immunolabeling exhibited an increase from 40.6% growing follicles to 81.9% in the fetal group, and from 21.0% to 80.7% in the calf group.
The massive follicular activation following transplantation indicates that isolated preantral follicles are able to survive and grow 14 days after renal subcapsular xenotransplantation.