Abstract Breast cancer (BC) is the most common malignancy affecting Western women today. It is estimated that as many as 10% of BC cases can be attributed to germline variants. However, the genetic ...basis of the majority of familial BC cases has yet to be identified. Discovering predisposing genes contributing to familial BC is challenging due to their presumed rarity, low penetrance, and complex biological mechanisms. Here, we focused on an analysis of rare missense variants in a cohort of 12 families of Middle Eastern origins characterized by a high incidence of BC cases. We devised a novel, high-throughput, variant analysis pipeline adapted for family studies, which aims to analyze variants at the protein level by employing state-of-the-art machine learning models and three-dimensional protein structural analysis. Using our pipeline, we analyzed 1218 rare missense variants that are shared between affected family members and classified 80 genes as candidate pathogenic. Among these genes, we found significant functional enrichment in peroxisomal and mitochondrial biological pathways which segregated across seven families in the study and covered diverse ethnic groups. We present multiple evidence that peroxisomal and mitochondrial pathways play an important, yet underappreciated, role in both germline BC predisposition and BC survival.
Abstract Thyroid cancer incidences endure to increase even though a large number of inspection tools have been developed recently. Since there is no standard and certain procedure to follow for the ...thyroid cancer diagnoses, clinicians require conducting various tests. This scrutiny process yields multi-dimensional big data and lack of a common approach leads to randomly distributed missing (sparse) data, which are both formidable challenges for the machine learning algorithms. This paper aims to develop an accurate and computationally efficient deep learning algorithm to diagnose the thyroid cancer. In this respect, randomly distributed missing data stemmed singularity in learning problems is treated and dimensionality reduction with inner and target similarity approaches are developed to select the most informative input datasets. In addition, size reduction with the hierarchical clustering algorithm is performed to eliminate the considerably similar data samples. Four machine learning algorithms are trained and also tested with the unseen data to validate their generalization and robustness abilities. The results yield 100% training and 83% testing preciseness for the unseen data. Computational time efficiencies of the algorithms are also examined under the equal conditions.
Abstract Chimeric antigen receptor (CAR) therapy has emerged as a ground-breaking advancement in cancer treatment, harnessing the power of engineered human immune cells to target and eliminate cancer ...cells. The escalating interest and investment in CAR therapy in recent years emphasize its profound significance in clinical research, positioning it as a rapidly expanding frontier in the field of personalized cancer therapies. A crucial step in CAR therapy design is choosing the right target as it determines the therapy’s effectiveness, safety and specificity against cancer cells, while sparing healthy tissues. Herein, we propose a suite of tools for the identification and analysis of potential CAR targets leveraging expression data from The Cancer Genome Atlas and Genotype-Tissue Expression Project, which are implemented in CARTAR website. These tools focus on pinpointing tumor-associated antigens, ensuring target selectivity and assessing specificity to avoid off-tumor toxicities and can be used to rationally designing dual CARs. In addition, candidate target expression can be explored in cancer cell lines using the expression data for the Cancer Cell Line Encyclopedia. To our best knowledge, CARTAR is the first website dedicated to the systematic search of suitable candidate targets for CAR therapy. CARTAR is publicly accessible at https://gmxenomica.github.io/CARTAR/.
Abstract With the increasing prevalence of age-related chronic diseases burdening healthcare systems, there is a pressing need for innovative management strategies. Our study focuses on the gut ...microbiota, essential for metabolic, nutritional, and immune functions, which undergoes significant changes with aging. These changes can impair intestinal function, leading to altered microbial diversity and composition that potentially influence health outcomes and disease progression. Using advanced metagenomic sequencing, we explore the potential of personalized probiotic supplements in 297 older adults by analyzing their gut microbiota. We identified distinctive Lactobacillus and Bifidobacterium signatures in the gut microbiota of older adults, revealing probiotic patterns associated with various population characteristics, microbial compositions, cognitive functions, and neuroimaging results. These insights suggest that tailored probiotic supplements, designed to match individual probiotic profile, could offer an innovative method for addressing age-related diseases and functional declines. Our findings enhance the existing evidence base for probiotic use among older adults, highlighting the opportunity to create more targeted and effective probiotic strategies. However, additional research is required to validate our results and further assess the impact of precision probiotics on aging populations. Future studies should employ longitudinal designs and larger cohorts to conclusively demonstrate the benefits of tailored probiotic treatments.
Abstract One of the prevalent chronic inflammatory disorders of the nasal mucosa, allergic rhinitis (AR) has become more widespread in recent years. Acupuncture pterygopalatine ganglion (aPPG) is an ...emerging alternative therapy that is used to treat AR, but the molecular mechanisms underlying its anti-inflammatory effects are unclear. This work methodically demonstrated the multi-target mechanisms of aPPG in treating AR based on bioinformatics/topology using techniques including text mining, bioinformatics, and network topology, among others. A total of 16 active biomarkers and 108 protein targets related to aPPG treatment of AR were obtained. A total of 345 Gene Ontology terms related to aPPG of AR were identified, and 135 pathways were screened based on Kyoto Encyclopedia of Genes and Genomes analysis. Our study revealed for the first time the multi-targeted mechanism of action of aPPG in the treatment of AR. In animal experiments, aPPG ameliorated rhinitis symptoms in OVA-induced AR rats; decreased serum immunoglobulin E, OVA-sIgE, and substance P levels; elevated serum neuropeptide Y levels; and modulated serum Th1/Th2/Treg/Th17 cytokine expression by a mechanism that may be related to the inhibition of activation of the TLR4/NF-κB/NLRP3 signaling pathway. In vivo animal experiments once again validated the results of the bioinformatics analysis. This study revealed a possible multi-target mechanism of action between aPPG and AR, provided new insights into the potential pathogenesis of AR, and proved that aPPG was a promising complementary alternative therapy for the treatment of AR.
Abstract In precision medicine, both predicting the disease susceptibility of an individual and forecasting its disease-free survival are areas of key research. Besides the classical epidemiological ...predictor variables, data from multiple (omic) platforms are increasingly available. To integrate this wealth of information, we propose new methodology to combine both cooperative learning, a recent approach to leverage the predictive power of several datasets, and polygenic hazard score models. Polygenic hazard score models provide a practitioner with a more differentiated view of the predicted disease-free survival than the one given by merely a point estimate, for instance computed with a polygenic risk score. Our aim is to leverage the advantages of cooperative learning for the computation of polygenic hazard score models via Cox’s proportional hazard model, thereby improving the prediction of the disease-free survival. In our experimental study, we apply our methodology to forecast the disease-free survival for Alzheimer’s disease (AD) using three layers of data. One layer contains epidemiological variables such as sex, APOE (apolipoprotein E, a genetic risk factor for AD) status and 10 leading principal components. Another layer contains selected genomic loci, and the last layer contains methylation data for selected CpG sites. We demonstrate that the survival curves computed via cooperative learning yield an AUC of around $0.7$, above the state-of-the-art performance of its competitors. Importantly, the proposed methodology returns (1) a linear score that can be easily interpreted (in contrast to machine learning approaches), and (2) a weighting of the predictive power of the involved data layers, allowing for an assessment of the importance of each omic (or other) platform. Similarly to polygenic hazard score models, our methodology also allows one to compute individual survival curves for each patient.
Abstract Spatial transcriptomics data play a crucial role in cancer research, providing a nuanced understanding of the spatial organization of gene expression within tumor tissues. Unraveling the ...spatial dynamics of gene expression can unveil key insights into tumor heterogeneity and aid in identifying potential therapeutic targets. However, in many large-scale cancer studies, spatial transcriptomics data are limited, with bulk RNA-seq and corresponding Whole Slide Image (WSI) data being more common (e.g. TCGA project). To address this gap, there is a critical need to develop methodologies that can estimate gene expression at near-cell (spot) level resolution from existing WSI and bulk RNA-seq data. This approach is essential for reanalyzing expansive cohort studies and uncovering novel biomarkers that have been overlooked in the initial assessments. In this study, we present STGAT (Spatial Transcriptomics Graph Attention Network), a novel approach leveraging Graph Attention Networks (GAT) to discern spatial dependencies among spots. Trained on spatial transcriptomics data, STGAT is designed to estimate gene expression profiles at spot-level resolution and predict whether each spot represents tumor or non-tumor tissue, especially in patient samples where only WSI and bulk RNA-seq data are available. Comprehensive tests on two breast cancer spatial transcriptomics datasets demonstrated that STGAT outperformed existing methods in accurately predicting gene expression. Further analyses using the TCGA breast cancer dataset revealed that gene expression estimated from tumor-only spots (predicted by STGAT) provides more accurate molecular signatures for breast cancer sub-type and tumor stage prediction, and also leading to improved patient survival and disease-free analysis. Availability: Code is available at https://github.com/compbiolabucf/STGAT.
The accelerated urban sprawl of cities around the world presents major challenges for urban planning and land resource management. In this context, it is crucial to have a detailed 3D representation ...of buildings enriched with accurate alphanumeric information. A distinctive aspect of this proposal is its specific focus on the spatial unit corresponding to buildings. In order to propose a domain model for the 3D representation of buildings, the national standard of Ecuador and the international standard (ISO 19152:2012 LADM) were considered. The proposal includes a detailed specification of attributes, both for the general subclass of buildings and for their infrastructure. The application of the domain model proposal was crucial in a study area located in the Riobamba canton, due to the characteristics of the buildings in that area. For this purpose, a geodatabase was created in pgAdmin4 with official information, taking into account the structure of the proposed model and linking it with geospatial data for an adequate management and 3D representation of the buildings in an open-source Geographic Information System. This application improves cadastral management in the study region and has wider implications. This model is intended to serve as a benchmark for other countries facing similar challenges in cadastral management and 3D representation of buildings, promote efficient urban development and contribute to global sustainable development.
Abstract Maternal Parathyroid Hormone-related Protein (PTHrP) is involved in the placental transport of calcium. Autonomous overproduction of PTHrP is a rare cause of hypercalcemia in pregnancy. ...Prior cases of PTHrP-induced hypercalcemia in pregnancy have been managed with either dopamine agonists, fetal delivery, termination of pregnancy, or mastectomy. However, PTHrP level normalization following mastectomy has not previously been documented. Herein, we present a 39-year-old female hospitalized at 19 weeks of gestation for acute encephalopathy due to PTHrP induced hypercalcemic crisis (calcium 15.8 mg/dL, PTHrp 46.5 pmol/L normal 0-3.4). Mammary hyperplasia resulting in gigantomastia significantly impaired her ability to ambulate and perform activities of daily living. She remained hypercalcemic during hospitalization despite aggressive hydration, calcitonin, and 2 weeks of dopamine agonist treatment. Bisphosphonate therapy was not administered due to pregnancy and potential effects on the fetus. Our patient underwent bilateral mastectomy along with excision of a large axillary mass. The pathology of all three specimens revealed mammary stromal hyperplasia. PTHrP was undetectable on post-op day 2 and calcium normalized by post-op day 3. At discharge, she was able to ambulate independently. To our knowledge, this is the first reported case of PTHrP induced hypercalcemia related to gigantomastia, documenting resolution of hypercalcemia, and PTHrP levels following mastectomy. Mastectomy is a potential option in the second trimester for pregnant patients with PTHrP induced severe hypercalcemia due to gigantomastia, refractory to treatment with dopamine agonist therapy.