A heterogeneous disease such as cancer is activated through multiple pathways and different perturbations. Depending upon the activated pathway(s), the survival of the patients varies significantly ...and shows different efficacy to various drugs. Therefore, cancer subtype detection using genomics level data is a significant research problem. Subtype detection is often a complex problem, and in most cases, needs multi-omics data fusion to achieve accurate subtyping. Different data fusion and subtyping approaches have been proposed over the years, such as kernel-based fusion, matrix factorization, and deep learning autoencoders. In this paper, we compared the performance of different deep learning autoencoders for cancer subtype detection. We performed cancer subtype detection on four different cancer types from The Cancer Genome Atlas (TCGA) datasets using four autoencoder implementations. We also predicted the optimal number of subtypes in a cancer type using the silhouette score and found that the detected subtypes exhibit significant differences in survival profiles. Furthermore, we compared the effect of feature selection and similarity measures for subtype detection. For further evaluation, we used the Glioblastoma multiforme (GBM) dataset and identified the differentially expressed genes in each of the subtypes. The results obtained are consistent with other genomic studies and can be corroborated with the involved pathways and biological functions. Thus, it shows that the results from the autoencoders, obtained through the interaction of different datatypes of cancer, can be used for the prediction and characterization of patient subgroups and survival profiles.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Alzheimer's disease (AD) and Parkinson's disease (PD) are the most common neurodegenerative disorders related to aging. Though several risk factors are shared between these two diseases, the exact ...relationship between them is still unknown. In this paper, we analyzed how these two diseases relate to each other from the genomic, epigenomic, and transcriptomic viewpoints. Using an extensive literature mining, we first accumulated the list of genes from major genome-wide association (GWAS) studies. Based on these GWAS studies, we observed that only one gene (
) was shared between AD and PD. A subsequent literature search identified a few other genes involved in these two diseases, among which SIRT1 seemed to be the most prominent one. While we listed all the miRNAs that have been previously reported for AD and PD separately, we found only 15 different miRNAs that were reported in both diseases. In order to get better insights, we predicted the gene co-expression network for both AD and PD using network analysis algorithms applied to two GEO datasets. The network analysis revealed six clusters of genes related to AD and four clusters of genes related to PD; however, there was very low functional similarity between these clusters, pointing to insignificant similarity between AD and PD even at the level of affected biological processes. Finally, we postulated the putative epigenetic regulator modules that are common to AD and PD.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Bacteria carrying antibiotic resistance genes (ARGs) are naturally prevalent in lotic ecosystems such as rivers. Their ability to spread in anthropogenic waters could lead to the emergence of ...multidrug-resistant bacteria of clinical importance. For this study, three regions of the Isabela river, an important urban river in the city of Santo Domingo, were evaluated for the presence of ARGs. The Isabela river is surrounded by communities that do not have access to proper sewage systems; furthermore, water from this river is consumed daily for many activities, including recreation and sanitation. To assess the state of antibiotic resistance dissemination in the Isabela river, nine samples were collected from these three bluedistinct sites in June 2019 and isolates obtained from these sites were selected based on resistance to beta-lactams. Physico-chemical and microbiological parameters were in accordance with the Dominican legislation. Matrix-assisted laser desorption ionization-time of flight mass spectrometry analyses of ribosomal protein composition revealed a total of 8 different genera. Most common genera were as follows:
(44.6%) and
(18%). Twenty clinically important bacterial isolates were identified from urban regions of the river; these belonged to genera
(
= 9),
(
= 8),
(
= 2), and
(
= 1). Clinically important multi-resistant isolates were not obtained from rural areas. Fifteen isolates were selected for genome sequencing and analysis. Most isolates were resistant to at least three different families of antibiotics. Among beta-lactamase genes encountered, we found the presence of bla
, bla
, bla
, and bla
through both deep sequencing and PCR amplification. Bacteria found from genus
and
demonstrated ample repertoire of antibiotic resistance genes, including resistance from a family of last resort antibiotics reserved for dire infections: carbapenems. Some of the alleles found were KPC-3, OXA-1, OXA-72, OXA-132, CTX-M-55, CTX-M-15, and TEM-1.
One of the biggest problems that the Dominican Republic has had in recent decades is the efficient management of solid domestic waste. This problem has worsened in recent years due to the decrease in ...available areas for the construction of sanitary landfills, the lack of recycling culture in the population, the deficiency in waste collection, and the scarce legal controls aimed at preserving water, air and soil among other factors. The objective of this study is to explore the management of solid waste by the population and the municipality of Puñal, province of Santiago, to evaluate and analyze the situation and generation of solid waste, municipal solid waste management services, and the attitudes of the population regarding recycling projects and waste management. A total of 275 households from 29 localities in the municipality of Puñal were surveyed, which allowed for a significant population sample. According to the results, the most significant type of waste produced by families is organic waste, followed by plastic waste and paper. Of the total organic waste produced in the municipality, 53% of solids wastes are handled through the municipal waste collection system, while 47% is used as plant fertilizers or animal feed. On the other hand, most households receive the municipal waste collection service and pay for this service, through which the municipal government collects, processes, and deposits the waste in different landfills. However, a more efficient waste collection system and the development of programs and projects that allow households to manage the solid waste efficiently they produce would be necessary.
This study developed a computational tool with a graphical interface and a web-service that allows the identification of phage regions through homology search and gene clustering. It uses G+C content ...variation evaluation and tRNA prediction sites as evidence to reinforce the presence of prophages in indeterminate regions. Also, it performs the functional characterization of the prophages regions through data integration of biological databases. The performance of PhageWeb was compared to other available tools (PHASTER, Prophinder, and PhiSpy) using Sensitivity (Sn) and Positive Predictive Value (PPV) tests. As a reference for the tests, more than 80 manually annotated genomes were used. In the PhageWeb analysis, the Sn index was 86.1% and the PPV was approximately 87%, while the second best tool presented Sn and PPV values of 83.3 and 86.5%, respectively. These numbers allowed us to observe a greater precision in the regions identified by PhageWeb while compared to other prediction tools submitted to the same tests. Additionally, PhageWeb was much faster than the other computational alternatives, decreasing the processing time to approximately one-ninth of the time required by the second best software. PhageWeb is freely available at http://computationalbiology.ufpa.br/phageweb.
is a Gram-positive bacterium that causes caseous lymphadenitis, a disease that predominantly affects sheep, goat, cattle, buffalo, and horses, but has also been recognized in other animals. This ...bacterium generates a severe economic impact on countries producing meat. Gene expression studies using RNA-Seq are one of the most commonly used techniques to perform transcriptional experiments. Computational analysis of such data through reverse-engineering algorithms leads to a better understanding of the genome-wide complexity of gene interactomes, enabling the identification of genes having the most significant functions inferred by the activated stress response pathways. In this study, we identified the influential or causal genes from four RNA-Seq datasets from different stress conditions (high iron, low iron, acid, osmosis, and PH) in
, using a consensus-based network inference algorithm called miRsigand next identified the causal genes in the network using the miRinfluence tool, which is based on the influence diffusion model. We found that over 50% of the genes identified as influential had some essential cellular functions in the genomes. In the strains analyzed, most of the causal genes had crucial roles or participated in processes associated with the response to extracellular stresses, pathogenicity, membrane components, and essential genes. This research brings new insight into the understanding of virulence and infection by
.
This study compares the feasibility of matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) mass spectrometry with whole genome sequencing (WGS) for identifying bacteria in river ...surface water samples. We collected samples from four rivers in the Dominican Republic and used both techniques to characterize bacterial profiles. MALDI-TOF demonstrated high precision, with 86.2% similarity to WGS results, except for a few discordant cases due to database limitations. MALDI-TOF provided cost-effective and rapid identification, making it a promising alternative to WGS in resource-constrained regions. In particular, good effectiveness of MALDI-TOF in identifying bacteria with a high probability of being resistant to antibiotics was observed, which allows this technology to be used in the monitoring processes of this type of microorganism for their rapid, accurate, and low-cost identification. We found this technology to be advantageous for environmental bacterial profiling, with potential applications in understanding waterborne pathogenic bacteria. Our findings underline the relevance of MALDI-TOF in microbiology and its potential to expand its capabilities in bacterial identification and protein profiling.