Sepsis remains a major cause of mortality and morbidity in infants. In recent years, several gene marker strategies for the early identification of sepsis have been proposed but only a few have been ...independently validated for adult cohorts and applicability to infant sepsis remains unclear. Biomarkers to assess disease severity and risks of shock also represent an important unmet need.
To elucidate characteristics driving sepsis in infants, we assembled a multi-transcriptomic dataset from public microarray datasets originating from five independent studies pertaining to bacterial sepsis in infant < 6-months of age (total n=335). We utilized a COmbat co-normalization strategy to enable comparative evaluation across multiple studies while preserving the relationship between cases and controls.
We found good concordance with only two out of seven of the published adult sepsis gene signatures (accuracy > 80%), highlighting the narrow utility of adult-derived signatures for infant diagnosis. Pseudotime analysis of individual subjects' gene expression profiles showed a continuum of molecular changes forming tight clusters concurrent with disease progression between healthy controls and septic shock cases. In depth gene expression analyses between bacteremia, septic shock, and healthy controls characterized lymphocyte activity, hemostatic processes, and heightened innate immunity during the molecular transition toward a state of shock.
Our analysis revealed the presence of multiple significant transcriptomic perturbations that occur during the progression to septic shock in infants that are characterized by late-stage induction of clotting factors, in parallel with a heightened innate immune response and a suppression of adaptive cell functionality.
Feature selection is a critical step for translating advances afforded by systems-scale molecular profiling into actionable clinical insights. While data-driven methods are commonly utilized for ...selecting candidate genes, knowledge-driven methods must contend with the challenge of efficiently sifting through extensive volumes of biomedical information. This work aimed to assess the utility of large language models (LLMs) for knowledge-driven gene prioritization and selection.
In this proof of concept, we focused on 11 blood transcriptional modules associated with an Erythroid cells signature. We evaluated four leading LLMs across multiple tasks. Next, we established a workflow leveraging LLMs. The steps consisted of: (1) Selecting one of the 11 modules; (2) Identifying functional convergences among constituent genes using the LLMs; (3) Scoring candidate genes across six criteria capturing the gene's biological and clinical relevance; (4) Prioritizing candidate genes and summarizing justifications; (5) Fact-checking justifications and identifying supporting references; (6) Selecting a top candidate gene based on validated scoring justifications; and (7) Factoring in transcriptome profiling data to finalize the selection of the top candidate gene.
Of the four LLMs evaluated, OpenAI's GPT-4 and Anthropic's Claude demonstrated the best performance and were chosen for the implementation of the candidate gene prioritization and selection workflow. This workflow was run in parallel for each of the 11 erythroid cell modules by participants in a data mining workshop. Module M9.2 served as an illustrative use case. The 30 candidate genes forming this module were assessed, and the top five scoring genes were identified as BCL2L1, ALAS2, SLC4A1, CA1, and FECH. Researchers carefully fact-checked the summarized scoring justifications, after which the LLMs were prompted to select a top candidate based on this information. GPT-4 initially chose BCL2L1, while Claude selected ALAS2. When transcriptional profiling data from three reference datasets were provided for additional context, GPT-4 revised its initial choice to ALAS2, whereas Claude reaffirmed its original selection for this module.
Taken together, our findings highlight the ability of LLMs to prioritize candidate genes with minimal human intervention. This suggests the potential of this technology to boost productivity, especially for tasks that require leveraging extensive biomedical knowledge.
As the capacity for generating large-scale molecular profiling data continues to grow, the ability to extract meaningful biological knowledge from it remains a limitation. Here, we describe the ...development of a new fixed repertoire of transcriptional modules, BloodGen3, that is designed to serve as a stable reusable framework for the analysis and interpretation of blood transcriptome data. The construction of this repertoire is based on co-clustering patterns observed across sixteen immunological and physiological states encompassing 985 blood transcriptome profiles. Interpretation is supported by customized resources, including module-level analysis workflows, fingerprint grid plot visualizations, interactive web applications and an extensive annotation framework comprising functional profiling reports and reference transcriptional profiles. Taken together, this well-characterized and well-supported transcriptional module repertoire can be employed for the interpretation and benchmarking of blood transcriptome profiles within and across patient cohorts. Blood transcriptome fingerprints for the 16 reference cohorts can be accessed interactively via: https://drinchai.shinyapps.io/BloodGen3Module/ .
Neutrophil extracellular traps (NETs) are a recently identified, web-like, extracellular structure composed of decondensed nuclear DNA and associated antimicrobial granules. NETs are extruded into ...the extracellular environment via the reactive oxygen species (ROS)-dependent cell death pathway participating in inflammation and autoimmune diseases. Transketolase (TKT) is a thiamine pyrophosphate (vitamin B1)-dependent enzyme that links the pentose phosphate pathway with the glycolytic pathway by feeding excess sugar phosphates into the main carbohydrate metabolic pathways to generate biosynthetic reducing capacity in the form of NADPH as a substrate for ROS generation. In this work, TKT was selected as a lead candidate from 24 NET-associated proteins obtained by literature screening and knowledge gap assessment. Consequently, we determined whether TKT influenced NET formation in vitro. We firstly established that the release of ROS-dependent NETs was significantly decreased after purified human PMNs were pretreated with oxythiamine, a TKT inhibitor, and in a concentration dependent manner. As a cofactor for TKT reaction, we evaluated the release of NET formation either in vitamin B1 treatment or in combined use of oxythiamine and vitamin B1, and found that those treatments also exerted a significant suppressive effect on the amount of NET-DNA and ROS production. The regulation of TKT by oxythiamine and/or vitamin B1 may therefore be associated with response to the modulation of NET formation by preventing generation of excessive NETs in inflammatory diseases.
A potential role for the long-chain acyl-CoA synthetase family member 1 (ACSL1) in the immunobiology of sepsis was explored during a hands-on training workshop. Participants first assessed the ...robustness of the potential gap in biomedical knowledge identified via an initial screen of public transcriptome data and of the literature associated with ACSL1. Increase in ACSL1 transcript abundance during sepsis was confirmed in several independent datasets. Querying the ACSL1 literature also confirmed the absence of reports associating ACSL1 with sepsis. Inferences drawn from both the literature (via indirect associations) and public transcriptome data (via correlation) point to the likely participation of ACSL1 and ACSL4, another family member, in inflammasome activation in neutrophils during sepsis. Furthermore, available clinical data indicate that levels of ACSL1 and ACSL4 induction was significantly higher in fatal cases of sepsis. This denotes potential translational relevance and is consistent with involvement in pathways driving potentially deleterious systemic inflammation. Finally, while ACSL1 expression was induced in blood
by a wide range of pathogen-derived factors as well as TNF, induction of ACSL4 appeared restricted to flagellated bacteria and pathogen-derived TLR5 agonists and IFNG. Taken together, this joint review of public literature and omics data records points to two members of the acyl-CoA synthetase family potentially playing a role in inflammasome activation in neutrophils. Translational relevance of these observations in the context of sepsis and other inflammatory conditions remain to be investigated.
Immunomodulatory processes exert steering functions throughout pregnancy. Detecting diversions from this physiologic immune clock may help identify pregnant women at risk for pregnancy-associated ...complications. We present results from a data-driven selection process to develop a targeted panel of mRNAs that may prove effective in detecting pregnancies diverting from the norm.
Based on a
dataset from a resource-constrained setting and a dataset from a resource-rich area readily available in the public domain, whole blood gene expression profiles of uneventful pregnancies were captured at multiple time points during pregnancy. BloodGen3, a fixed blood transcriptional module repertoire, was employed to analyze and visualize gene expression patterns in the two datasets. Differentially expressed genes were identified by comparing their abundance to non-pregnant postpartum controls. The selection process for a targeted gene panel considered (i) transcript abundance in whole blood; (ii) degree of correlation with the BloodGen3 module; and (iii) pregnancy biology.
We identified 176 transcripts that were complemented with eight housekeeping genes. Changes in transcript abundance were seen in the early stages of pregnancy and similar patterns were observed in both datasets. Functional gene annotation suggested significant changes in the lymphoid, prostaglandin and inflammation-associated compartments, when compared to the postpartum controls.
The gene panel presented here holds promise for the development of predictive, targeted, transcriptional profiling assays. Such assays might become useful for monitoring of pregnant women, specifically to detect potential adverse events early. Prospective validation of this targeted assay, in-depth investigation of functional annotations of differentially expressed genes, and assessment of common pregnancy-associated complications with the aim to identify these early in pregnancy to improve pregnancy outcomes are the next steps.
Transcriptome profiling approaches have been widely used to investigate the mechanisms underlying psoriasis pathogenesis. Most researchers have measured changes in transcript abundance in skin ...biopsies; relatively few have examined transcriptome changes in the blood. Although less relevant to the study of psoriasis pathogenesis, blood transcriptome profiles can be readily compared across various diseases. Here, we used a pre-established set of 382 transcriptional modules as a common framework to compare changes in blood transcript abundance in two independent public psoriasis datasets. We then compared the resulting "transcriptional fingerprints" to those obtained for a reference set of 16 pathological or physiological states. The perturbations in blood transcript abundance in psoriasis were relatively subtle compared to the changes we observed in other autoimmune and auto-inflammatory diseases. However, we did observe a consistent pattern of changes for a set of modules associated with neutrophil activation and inflammation; interestingly, this pattern resembled that observed in patients with Kawasaki disease. This similarity between the blood-transcriptome signatures in psoriasis and Kawasaki disease suggests that the immune mechanisms driving their pathogenesis might be partially shared.
In addition to its canonical functions, vitamin D has been proposed to be an important mediator of the immune system. Despite ample sunshine, vitamin D deficiency is prevalent (>80%) in the Middle ...East, resulting in a high rate of supplementation. However, the underlying molecular mechanisms of the specific regimen prescribed and the potential factors affecting an individual’s response to vitamin D supplementation are not well characterized. Our objective is to describe the changes in the blood transcriptome and explore the potential mechanisms associated with vitamin D3 supplementation in one hundred vitamin D-deficient women who were given a weekly oral dose (50,000 IU) of vitamin D3 for three months. A high-throughput targeted PCR, composed of 264 genes representing the important blood transcriptomic fingerprints of health and disease states, was performed on pre and post-supplementation blood samples to profile the molecular response to vitamin D3. We identified 54 differentially expressed genes that were strongly modulated by vitamin D3 supplementation. Network analyses showed significant changes in the immune-related pathways such as TLR4/CD14 and IFN receptors, and catabolic processes related to NF-kB, which were subsequently confirmed by gene ontology enrichment analyses. We proposed a model for vitamin D3 response based on the expression changes of molecules involved in the receptor-mediated intra-cellular signaling pathways and the ensuing predicted effects on cytokine production. Overall, vitamin D3 has a strong effect on the immune system, G-coupled protein receptor signaling, and the ubiquitin system. We highlighted the major molecular changes and biological processes induced by vitamin D3, which will help to further investigate the effectiveness of vitamin D3 supplementation among individuals in the Middle East as well as other regions.
Sepsis is a complex heterogeneous condition, and the current lack of effective risk and outcome predictors hinders the improvement of its management. Using a reductionist approach leveraging publicly ...available transcriptomic data, we describe a knowledge gap for the role of ACVR1B (activin A receptor type 1B) in sepsis. ACVR1B, a member of the transforming growth factor-beta (TGF-beta) superfamily, was selected based on the following: 1) induction upon
exposure of neutrophils from healthy subjects with the serum of septic patients (GSE49755), and 2) absence or minimal overlap between ACVR1B, sepsis, inflammation, or neutrophil in published literature. Moreover,
expression is upregulated in septic melioidosis, a widespread cause of fatal sepsis in the tropics. Key biological concepts extracted from a series of PubMed queries established indirect links between ACVR1B and "cancer", "TGF-beta superfamily", "cell proliferation", "inhibitors of activin", and "apoptosis". We confirmed our observations by measuring ACVR1B transcript abundance in buffy coat samples obtained from healthy individuals (
=3) exposed to septic plasma (n = 26 melioidosis sepsis cases)
. Based on our re-investigation of publicly available transcriptomic data and newly generated
data, we provide perspective on the role of ACVR1B during sepsis. Additional experiments for addressing this knowledge gap are discussed.
Covid-19 morbidity and mortality are associated with a dysregulated immune response. Tools are needed to enhance existing immune profiling capabilities in affected patients. Here we aimed to develop ...an approach to support the design of targeted blood transcriptome panels for profiling the immune response to SARS-CoV-2 infection.
We designed a pool of candidates based on a pre-existing and well-characterized repertoire of blood transcriptional modules. Available Covid-19 blood transcriptome data was also used to guide this process. Further selection steps relied on expert curation. Additionally, we developed several custom web applications to support the evaluation of candidates.
As a proof of principle, we designed three targeted blood transcript panels, each with a different translational connotation: immunological relevance, therapeutic development relevance and SARS biology relevance.
Altogether the work presented here may contribute to the future expansion of immune profiling capabilities via targeted profiling of blood transcript abundance in Covid-19 patients.