The biomolecules inside or near cells form a complex interacting system. Cellular phenotypes and behaviors arise from the totality of interactions among the components of this system. A fruitful way ...of modeling interacting biomolecular systems is by network-based dynamic models that characterize each component by a state variable, and describe the change in the state variables due to the interactions in the system. Dynamic models can capture the stable state patterns of this interacting system and can connect them to different cell fates or behaviors. A Boolean or logic model characterizes each biomolecule by a binary state variable that relates the abundance of that molecule to a threshold abundance necessary for downstream processes. The regulation of this state variable is described in a parameter free manner, making Boolean modeling a practical choice for systems whose kinetic parameters have not been determined. Boolean models integrate the body of knowledge regarding the components and interactions of biomolecular systems, and capture the system's dynamic repertoire, for example the existence of multiple cell fates. These models were used for a variety of systems and led to important insights and predictions. Boolean models serve as an efficient exploratory model, a guide for follow-up experiments, and as a foundation for more quantitative models.
Pathway analysis is widely used to gain mechanistic insights from high-throughput omics data. However, most existing methods do not consider signal integration represented by pathway topology, ...resulting in enrichment of convergent pathways when downstream genes are modulated. Incorporation of signal flow and integration in pathway analysis could rank the pathways based on modulation in key regulatory genes. This implementation can be facilitated for large-scale data by discrete state network modeling due to simplicity in parameterization. Here, we model cellular heterogeneity using discrete state dynamics and measure pathway activities in cross-sectional data. We introduce a new algorithm, Boolean Omics Network Invariant-Time Analysis (BONITA), for signal propagation, signal integration, and pathway analysis. Our signal propagation approach models heterogeneity in transcriptomic data as arising from intercellular heterogeneity rather than intracellular stochasticity, and propagates binary signals repeatedly across networks. Logic rules defining signal integration are inferred by genetic algorithm and are refined by local search. The rules determine the impact of each node in a pathway, which is used to score the probability of the pathway's modulation by chance. We have comprehensively tested BONITA for application to transcriptomics data from translational studies. Comparison with state-of-the-art pathway analysis methods shows that BONITA has higher sensitivity at lower levels of source node modulation and similar sensitivity at higher levels of source node modulation. Application of BONITA pathway analysis to previously validated RNA-sequencing studies identifies additional relevant pathways in in-vitro human cell line experiments and in-vivo infant studies. Additionally, BONITA successfully detected modulation of disease specific pathways when comparing relevant RNA-sequencing data with healthy controls. Most interestingly, the two highest impact score nodes identified by BONITA included known drug targets. Thus, BONITA is a powerful approach to prioritize not only pathways but also specific mechanistic role of genes compared to existing methods. BONITA is available at: https://github.com/thakar-lab/BONITA.
Boolean modeling is a mathematical modeling framework used for defining and studying gene-regulatory networks (GRNs). It serves as a means to develop mechanistic models, offering insights into the ...trajectories and dynamic properties of GRNs. In this review, I delve into seminal papers published in the Journal of Theoretical Biology that have spearheaded this field. Additionally, I explore the application of these modeling methods in the current era of data-intensive science.
People living with HIV are at higher risk of atherosclerosis (AS). The pathogenesis of this risk is not fully understood. To assess the regulatory networks involved in AS we sequenced mRNA of the ...peripheral blood mononuclear cells (PBMCs) and measured cytokine and chemokine levels in the plasma of 13 persons living with HIV and 12 matched HIV-negative persons with and without AS. microRNAs (miRNAs) are known to play a role in HIV infection and may modulate gene regulation to drive AS. Hence, we further assessed miRNA expression in PBMCs of a subset of 12 HIV+ people with and without atherosclerosis. We identified 12 miRNAs differentially expressed between HIV+ AS+ and HIV+ , and validated 5 of those by RT-qPCR. While a few of these miRNAs have been implicated in HIV and atherosclerosis, others are novel. Integrating miRNA measurements with mRNA, we identified 27 target genes including SLC4A7, a critical sodium and bicarbonate transporter, that are potentially dysregulated during atherosclerosis. Additionally, we uncovered that levels of plasma cytokines were associated with transcription factor activity and miRNA expression in PBMCs. For example, BACH2 activity was associated with IL-1β, IL-15, and MIP-1α. IP10 and TNFα levels were associated with miR-124-3p. Finally, integration of all data types into a single network revealed increased importance of miRNAs in network regulation of the HIV+ group in contrast with increased importance of cytokines in the HIV+ AS+ group.
Aging and the age-associated decline of the proteome is determined in part through neuronal control of evolutionarily conserved transcriptional effectors, which safeguard homeostasis under ...fluctuating metabolic and stress conditions by regulating an expansive proteostatic network. We have discovered the
homeodomain-interacting protein kinase (HPK-1) acts as a key transcriptional effector to preserve neuronal integrity, function, and proteostasis during aging. Loss of
results in drastic dysregulation in expression of neuronal genes, including genes associated with neuronal aging. During normal aging
expression increases throughout the nervous system more broadly than any other kinase. Within the aging nervous system,
induction overlaps with key longevity transcription factors, which suggests
expression mitigates natural age-associated physiological decline. Consistently, pan-neuronal overexpression of
extends longevity, preserves proteostasis both within and outside of the nervous system, and improves stress resistance. Neuronal HPK-1 improves proteostasis through kinase activity. HPK-1 functions cell non-autonomously within serotonergic and GABAergic neurons to improve proteostasis in distal tissues by specifically regulating distinct components of the proteostatic network. Increased serotonergic HPK-1 enhances the heat shock response and survival to acute stress. In contrast, GABAergic HPK-1 induces basal autophagy and extends longevity, which requires
(MLX),
(TFEB), and
(FOXO). Our work establishes
as a key neuronal transcriptional regulator critical for preservation of neuronal function during aging. Further, these data provide novel insight as to how the nervous system partitions acute and chronic adaptive response pathways to delay aging by maintaining organismal homeostasis.
Despite advances in the gene-set enrichment analysis methods; inadequate definitions of gene-sets cause a major limitation in the discovery of novel biological processes from the transcriptomic ...datasets. Typically, gene-sets are obtained from publicly available pathway databases, which contain generalized definitions frequently derived by manual curation. Recently unsupervised clustering algorithms have been proposed to identify gene-sets from transcriptomics datasets deposited in public domain. These data-driven definitions of the gene-sets can be context-specific revealing novel biological mechanisms. However, the previously proposed algorithms for identification of data-driven gene-sets are based on hard clustering which do not allow overlap across clusters, a characteristic that is predominantly observed across biological pathways.
We developed a pipeline using fuzzy-C-means (FCM) soft clustering approach to identify gene-sets which recapitulates topological characteristics of biological pathways. Specifically, we apply our pipeline to derive gene-sets from transcriptomic data measuring response of monocyte derived dendritic cells and A549 epithelial cells to influenza infections. Our approach apply Ward's method for the selection of initial conditions, optimize parameters of FCM algorithm for human cell-specific transcriptomic data and identify robust gene-sets along with versatile viral responsive genes.
We validate our gene-sets and demonstrate that by identifying genes associated with multiple gene-sets, FCM clustering algorithm significantly improves interpretation of transcriptomic data facilitating investigation of novel biological processes by leveraging on transcriptomic data available in the public domain. We develop an interactive 'Fuzzy Inference of Gene-sets (FIGS)' package (GitHub: https://github.com/Thakar-Lab/FIGS ) to facilitate use of of pipeline. Future extension of FIGS across different immune cell-types will improve mechanistic investigation followed by high-throughput omics studies.
Neuroinflammation is thought to contribute to the pathogenesis of Alzheimer's disease (AD), yet numerous studies have demonstrated a beneficial role for neuroinflammation in amyloid plaque clearance. ...We have previously shown that sustained expression of IL-1β in the hippocampus of APP/PS1 mice decreases amyloid plaque burden independent of recruited CCR2
myeloid cells, suggesting resident microglia as the main phagocytic effectors of IL-1β-induced plaque clearance. To date, however, the mechanisms of IL-1β-induced plaque clearance remain poorly understood.
To determine whether microglia are involved in IL-1β-induced plaque clearance, APP/PS1 mice induced to express mature human IL-1β in the hippocampus via adenoviral transduction were treated with the Aβ fluorescent probe methoxy-X04 (MX04) and microglial internalization of fibrillar Aβ (fAβ) was analyzed by flow cytometry and immunohistochemistry. To assess microglial proliferation, APP/PS1 mice transduced with IL-1β or control were injected intraperitoneally with BrdU and hippocampal tissue was analyzed by flow cytometry. RNAseq analysis was conducted on microglia FACS sorted from the hippocampus of control or IL-1β-treated APP/PS1 mice. These microglia were also sorted based on MX04 labeling (MX04
and MX04
microglia).
Resident microglia (CD45
CD11b
) constituted > 70% of the MX04
cells in both Phe- and IL-1β-treated conditions, and < 15% of MX04
cells were recruited myeloid cells (CD45
CD11b
). However, IL-1β treatment did not augment the percentage of MX04
microglia nor the quantity of fAβ internalized by individual microglia. Instead, IL-1β increased the total number of MX04
microglia in the hippocampus due to IL-1β-induced proliferation. In addition, transcriptomic analyses revealed that IL-1β treatment was associated with large-scale changes in the expression of genes related to immune responses, proliferation, and cytokine signaling.
These studies show that IL-1β overexpression early in amyloid pathogenesis induces a change in the microglial gene expression profile and an expansion of microglial cells that facilitates Aβ plaque clearance.
Defective viral genomes (DVGs) have been identified in many RNA viruses as a major factor influencing antiviral immune response and viral pathogenesis. However, the generation and function of DVGs in ...SARS-CoV-2 infection are less known. In this study, we elucidated DVG generation in SARS-CoV-2 and its relationship with host antiviral immune response. We observed DVGs ubiquitously from transcriptome sequencing (RNA-seq) data sets of
infections and autopsy lung tissues of COVID-19 patients. Four genomic hot spots were identified for DVG recombination, and RNA secondary structures were suggested to mediate DVG formation. Functionally, bulk and single-cell RNA-seq analysis indicated the interferon (IFN) stimulation of SARS-CoV-2 DVGs. We further applied our criteria to the next-generation sequencing (NGS) data set from a published cohort study and observed a significantly higher amount and frequency of DVG in symptomatic patients than those in asymptomatic patients. Finally, we observed exceptionally diverse DVG populations in one immunosuppressive patient up to 140 days after the first positive test of COVID-19, suggesting for the first time an association between DVGs and persistent viral infections in SARS-CoV-2. Together, our findings strongly suggest a critical role of DVGs in modulating host IFN responses and symptom development, calling for further inquiry into the mechanisms of DVG generation and into how DVGs modulate host responses and infection outcome during SARS-CoV-2 infection.
Defective viral genomes (DVGs) are generated ubiquitously in many RNA viruses, including SARS-CoV-2. Their interference activity to full-length viruses and IFN stimulation provide the potential for them to be used in novel antiviral therapies and vaccine development. SARS-CoV-2 DVGs are generated through the recombination of two discontinuous genomic fragments by viral polymerase complex, and this recombination is also one of the major mechanisms for the emergence of new coronaviruses. Focusing on the generation and function of SARS-CoV-2 DVGs, these studies identify new hot spots for nonhomologous recombination and strongly suggest that the secondary structures within viral genomes mediate the recombination. Furthermore, these studies provide the first evidence for IFN stimulation activity of
DVGs during natural SARS-CoV-2 infection. These findings set up the foundation for further mechanism studies of SARS-CoV-2 recombination and provide evidence to harness the immunostimulatory potential of DVGs in the development of a vaccine and antivirals for SARS-CoV-2.
Coronavirus disease (COVID-19) is an infectious disease caused by the SARS coronavirus 2 (SARS-CoV-2) virus. Direct assessment, detection, and quantitative analysis using high throughput methods like ...single-cell RNA sequencing (scRNAseq) is imperative to understanding the host response to SARS-CoV-2. One barrier to studying SARS-CoV-2 in the laboratory setting is the requirement to process virus-infected cell cultures, and potentially infectious materials derived therefrom, under Biosafety Level 3 (BSL-3) containment. However, there are only 190 BSL3 laboratory facilities registered with the U.S. Federal Select Agent Program, as of 2020, and only a subset of these are outfitted with the equipment needed to perform high-throughput molecular assays. Here, we describe a method for preparing non-hazardous RNA samples from SARS-CoV-2 infected cells, that enables scRNAseq analyses to be conducted safely in a BSL2 facility-thereby making molecular assays of SARS-CoV-2 cells accessible to a much larger community of researchers. Briefly, we infected African green monkey kidney epithelial cells (Vero-E6) with SARS-CoV-2 for 96 hours, trypsin-dissociated the cells, and inactivated them with methanol-acetone in a single-cell suspension. Fixed cells were tested for the presence of infectious SARS-CoV-2 virions using the Tissue Culture Infectious Dose Assay (TCID50), and also tested for viability using flow cytometry. We then tested the dissociation and methanol-acetone inactivation method on primary human lung epithelial cells that had been differentiated on an air-liquid interface. Finally, we performed scRNAseq quality control analysis on the resulting cell populations to evaluate the effects of our virus inactivation and sample preparation protocol on the quality of the cDNA produced. We found that methanol-acetone inactivated SARS-CoV-2, fixed the lung epithelial cells, and could be used to obtain noninfectious, high-quality cDNA libraries. This methodology makes investigating SARS-CoV-2, and related high-containment RNA viruses at a single-cell level more accessible to an expanded community of researchers.
The identification of c‐Myc as a potential PSGL‐1 transcriptional regulator facilitates the determination of the physiological significance of PSGL‐1 induction, and provides novel therapeutic ...targets.
Leukocyte extravasation is a crucial feature of the normal immune response to disease and infection and is implicated in various pathologies during chronic inflammatory disease. P‐Selectin glycoprotein ligand‐1 (PSGL‐1) is critical for leukocyte extravasation; however, despite extensive study, it remains unclear how its expression is regulated, which in turn, impedes a more precise understanding of how its expression level affects transmigration. To investigate the regulation of PSGL‐1, 60 subjects, with or without HIV infection, were recruited and PSGL‐1 expression in monocytes was measured. PSGL‐1 was found to be up‐regulated on leukocytes from HIV‐infected individuals, and the physiologically relevant mediators soluble CD40 ligand (sCD40L) and glutamate were able to induce PSGL‐1 transcription in human monocytes ex vivo. HIV‐1 induced PSGL‐1 induction, and its dependence on CD40L was validated further by use of the mouse‐tropic HIV (EcoHIV) mouse model of HIV infection in C57BL/6 and CD40L knockout (KO) mice. To investigate crosstalk between the signaling cascades induced by CD40L and glutamate that lead to PSGL‐1 induction, a network‐based, discrete dynamic model was developed. The model reveals the MAPK pathway and oxidative stress as critical mediators of crosstalk between CD40L and glutamate‐induced pathways. Importantly, the model predicted induction of the c‐Myc transcription factor upon cotreatment, which was validated using transcriptomic data and pharmacologic inhibition of c‐Myc. This study suggests a novel systems serology approach for translational research and reveals a mechanism for PSGL‐1 transcriptional regulation, which might be leveraged to identify novel targets for therapeutic intervention.