The aryl hydrocarbon receptor (AHR) is a nuclear receptor that modulates the response to environmental stimuli. It was recognized historically for its role in toxicology but, in recent decades, it ...has been increasingly recognized as an important modulator of disease-especially for its role in modulating immune and inflammatory responses. AHR has been implicated in many diseases that are driven by immune/inflammatory processes, including major depressive disorder, multiple sclerosis, rheumatoid arthritis, asthma, and allergic responses, among others. The mechanisms by which AHR has been suggested to impact immune/inflammatory diseases include targeted gene expression and altered immune differentiation. It has been suggested that single nucleotide polymorphisms (SNPs) that are near AHR-regulated genes may contribute to AHR-dependent disease mechanisms/pathways. Further, we have found that SNPs that are outside of nuclear receptor binding sites (i.e., outside of AHR response elements (AHREs)) may contribute to AHR-dependent gene regulation in a SNP- and ligand-dependent manner. This review will discuss the evidence and mechanisms of AHR contributions to immune/inflammatory diseases and will consider the possibility that SNPs that are outside of AHR binding sites might contribute to AHR ligand-dependent inter-individual variation in disease pathophysiology and response to pharmacotherapeutics.
Using latent variables in gene expression data can help correct unobserved confounders and increase statistical power for expression quantitative trait Loci (eQTL) detection. The probabilistic ...estimation of expression residuals (PEER) and principal component analysis (PCA) are widely used methods that can remove unwanted variation and improve eQTL discovery power in bulk RNA-seq analysis. However, their performance has not been evaluated extensively in single-cell eQTL analysis, especially for different cell types. Potential challenges arise due to the structure of single-cell RNA-seq data, including sparsity, skewness, and mean-variance relationship. Here, we show by a series of analyses that PEER and PCA require additional quality control and data transformation steps on the pseudo-bulk matrix to obtain valid latent variables; otherwise, it can result in highly correlated factors (Pearson's correlation r = 0.63 ~ 0.99). Incorporating valid PFs/PCs in the eQTL association model would identify 1.7 ~ 13.3% more eGenes. Sensitivity analysis showed that the pattern of change between the number of eGenes detected and fitted PFs/PCs varied significantly in different cell types. In addition, using highly variable genes to generate latent variables could achieve similar eGenes discovery power as using all genes but save considerable computational resources (~ 6.2-fold faster).
Selective serotonin reuptake inhibitors (SSRIs) are standard of care for major depressive disorder (MDD) pharmacotherapy, but only approximately half of these patients remit on SSRI therapy. Our ...previous genome-wide association study identified a single-nucleotide polymorphism (SNP) signal across the glutamate-rich 3 (ERICH3) gene that was nearly genome-wide significantly associated with plasma serotonin (5-HT) concentrations, which were themselves associated with SSRI response for MDD patients enrolled in the Mayo Clinic PGRN-AMPS SSRI trial. In this study, we performed a meta-analysis which demonstrated that those SNPs were significantly associated with SSRI treatment outcomes in four independent MDD trials. However, the function of ERICH3 and molecular mechanism(s) by which it might be associated with plasma 5-HT concentrations and SSRI clinical response remained unclear. Therefore, we characterized the human ERICH3 gene functionally and identified ERICH3 mRNA transcripts and protein isoforms that are highly expressed in central nervous system cells. Coimmunoprecipitation identified a series of ERICH3 interacting proteins including clathrin heavy chain which are known to play a role in vesicular function. Immunofluorescence showed ERICH3 colocalization with 5-HT in vesicle-like structures, and ERICH3 knock-out dramatically decreased 5-HT staining in SK-N-SH cells as well as 5-HT concentrations in the culture media and cell lysates without changing the expression of 5-HT synthesizing or metabolizing enzymes. Finally, immunofluorescence also showed ERICH3 colocalization with dopamine in human iPSC-derived neurons. These results suggest that ERICH3 may play a significant role in vesicular function in serotonergic and other neuronal cell types, which might help explain its association with antidepressant treatment response.
Human pluripotent stem (hPS) cells can, in theory, be differentiated into any cell type, making them a powerful in vitro model for human biology. Recent technological advances have facilitated ...large-scale hPS cell studies that allow investigation of the genetic regulation of molecular phenotypes and their contribution to high-order phenotypes such as human disease. Integrating hPS cells with single-cell sequencing makes identifying context-dependent genetic effects during cell development or upon experimental manipulation possible. Here we discuss how the intersection of stem cell biology, population genetics and cellular genomics can help resolve the functional consequences of human genetic variation. We examine the critical challenges of integrating these fields and approaches to scaling them cost-effectively and practically. We highlight two areas of human biology that can particularly benefit from population-scale hPS cell studies, elucidating mechanisms underlying complex disease risk loci and evaluating relationships between common genetic variation and pharmacotherapeutic phenotypes.
The mechanisms by which DNA alleles contribute to disease risk, drug response, and other human phenotypes are highly context-specific, varying across cell types and different conditions. Human ...induced pluripotent stem cells are uniquely suited to study these context-dependent effects but cell lines from hundreds or thousands of individuals are required. Village cultures, where multiple induced pluripotent stem lines are cultured and differentiated in a single dish, provide an elegant solution for scaling induced pluripotent stem experiments to the necessary sample sizes required for population-scale studies. Here, we show the utility of village models, demonstrating how cells can be assigned to an induced pluripotent stem line using single-cell sequencing and illustrating that the genetic, epigenetic or induced pluripotent stem line-specific effects explain a large percentage of gene expression variation for many genes. We demonstrate that village methods can effectively detect induced pluripotent stem line-specific effects, including sensitive dynamics of cell states.
Greater than 90% of significant genome-wide association study (GWAS) single-nucleotide polymorphisms (SNPs) are in noncoding regions of the genome, but only 25.6% are known expression quantitative ...trait loci (eQTLs). Therefore, the function of many significant GWAS SNPs remains unclear. We have identified a novel type of eQTL for which SNPs distant from ligand-activated transcription factor (TF) binding sites can alter target gene expression in a SNP genotype-by-ligand-dependent fashion that we refer to as pharmacogenomic eQTLs (PGx-eQTLs)-loci that may have important pharmacotherapeutic implications. In the present study, we integrated chromatin immunoprecipitation-seq with RNA-seq and SNP genotype data for a panel of lymphoblastoid cell lines to identify 10 novel
PGx-eQTLs dependent on the ligand-activated TF aryl hydrocarbon receptor (AHR)-a critical environmental sensor for xenobiotic (drug) and immune response. Those 10
PGx-eQTLs were eQTLs only after AHR ligand treatment, even though the SNPs did not create/destroy an AHR response element-the DNA sequence motif recognized and bound by AHR. Additional functional studies in multiple cell lines demonstrated that some
PGx-eQTLs are functional in multiple cell types, whereas others displayed SNP-by-ligand-dependent effects in just one cell type. Furthermore, four of those
PGx-eQTLs had previously been associated with clinical phenotypes, indicating that those loci might have the potential to inform clinical decisions. Therefore, SNPs across the genome that are distant from TF binding sites for ligand-activated TFs might function as PGx-eQTLs and, as a result, might have important clinical implications for interindividual variation in drug response. SIGNIFICANCE STATEMENT: More than 90% of single-nucleotide polymorphisms (SNPs) that are associated with clinical phenotypes are located in noncoding regions of the genome. However, the mechanisms of action of many of those SNPs have not been elucidated, and drugs may unmask functional expression quantitative trail loci (eQTLs). In the current study, we used drugs that bind to the ligand-activated transcription factor aryl hydrocarbon receptor (AHR) and identified SNPs that were associated with interindividual variation in gene expression following drug exposure-termed pharmacogenomic (PGx)-eQTLs. Possibly of greater significance, those PGx-eQTL SNPs were outside of AHR binding sites, indicating that they do not interrupt AHR DNA recognition. PGx-eQTLs such as those described in this work may have crucial implications for interindividual variation in drug.
The discovery that somatic cells can be reprogrammed to induced pluripotent stem cells (iPSCs) has provided a foundation for in vitro human disease modelling, drug development and population genetics ...studies. Gene expression plays a critical role in complex disease risk and therapeutic response. However, while the genetic background of reprogrammed cell lines has been shown to strongly influence gene expression, the effect has not been evaluated at the level of individual cells which would provide significant resolution. By integrating single cell RNA-sequencing (scRNA-seq) and population genetics, we apply a framework in which to evaluate cell type-specific effects of genetic variation on gene expression.
Here, we perform scRNA-seq on 64,018 fibroblasts from 79 donors and map expression quantitative trait loci (eQTLs) at the level of individual cell types. We demonstrate that the majority of eQTLs detected in fibroblasts are specific to an individual cell subtype. To address if the allelic effects on gene expression are maintained following cell reprogramming, we generate scRNA-seq data in 19,967 iPSCs from 31 reprogramed donor lines. We again identify highly cell type-specific eQTLs in iPSCs and show that the eQTLs in fibroblasts almost entirely disappear during reprogramming.
This work provides an atlas of how genetic variation influences gene expression across cell subtypes and provides evidence for patterns of genetic architecture that lead to cell type-specific eQTL effects.
Recent innovations in single-cell RNA-sequencing (scRNA-seq) provide the technology to investigate biological questions at cellular resolution. Pooling cells from multiple individuals has become a ...common strategy, and droplets can subsequently be assigned to a specific individual by leveraging their inherent genetic differences. An implicit challenge with scRNA-seq is the occurrence of doublets-droplets containing two or more cells. We develop Demuxafy, a framework to enhance donor assignment and doublet removal through the consensus intersection of multiple demultiplexing and doublet detecting methods. Demuxafy significantly improves droplet assignment by separating singlets from doublets and classifying the correct individual.
Heterogeneity in the clinical presentation of major depressive disorder and response to antidepressants limits clinicians' ability to accurately predict a specific patient's eventual response to ...therapy. Validated depressive symptom profiles may be an important tool for identifying poor outcomes early in the course of treatment. To derive these symptom profiles, we first examined data from 947 depressed subjects treated with selective serotonin reuptake inhibitors (SSRIs) to delineate the heterogeneity of antidepressant response using probabilistic graphical models (PGMs). We then used unsupervised machine learning to identify specific depressive symptoms and thresholds of improvement that were predictive of antidepressant response by 4 weeks for a patient to achieve remission, response, or nonresponse by 8 weeks. Four depressive symptoms (depressed mood, guilt feelings and delusion, work and activities and psychic anxiety) and specific thresholds of change in each at 4 weeks predicted eventual outcome at 8 weeks to SSRI therapy with an average accuracy of 77% (p = 5.5E-08). The same four symptoms and prognostic thresholds derived from patients treated with SSRIs correctly predicted outcomes in 72% (p = 1.25E-05) of 1996 patients treated with other antidepressants in both inpatient and outpatient settings in independent publicly-available datasets. These predictive accuracies were higher than the accuracy of 53% for predicting SSRI response achieved using approaches that (i) incorporated only baseline clinical and sociodemographic factors, or (ii) used 4-week nonresponse status to predict likely outcomes at 8 weeks. The present findings suggest that PGMs providing interpretable predictions have the potential to enhance clinical treatment of depression and reduce the time burden associated with trials of ineffective antidepressants. Prospective trials examining this approach are forthcoming.
•This study was to assess whether three symptomatically defined phenotypes of MDD, (core depression, neurovegetative of melancholia and anxiety), could be differentiated based on acylcarnitine ...profiles at baseline, after eight weeks of citalopram/escitalopram treatment.•The current data demonstrated that these phenotypes have distinct patterns of acylcarnitine levels at baseline and after eight weeks of antidepressant treatment.•These findings may help to develop a metabolomic profile of MDD patients with the aim of improving subtype classification of the MDD syndrome.
Acylcarnitines have important functions in mitochondrial energetics and β-oxidation, and have been implicated to play a significant role in metabolic functions of the brain. This retrospective study examined whether plasma acylcarnitine profiles can help biochemically distinguish the three phenotypic subtypes of major depressive disorder (MDD): core depression (CD+), anxious depression (ANX+), and neurovegetative symptoms of melancholia (NVSM+).
Depressed outpatients (n = 240) from the Mayo Clinic Pharmacogenomics Research Network were treated with citalopram or escitalopram for eight weeks. Plasma samples collected at baseline and after eight weeks of treatment with citalopram or escitalopram were profiled for short-, medium- and long-chain acylcarnitine levels using AbsoluteIDQ®p180-Kit and LC-MS. Linear mixed effects models were used to examine whether acylcarnitine levels discriminated the clinical phenotypes at baseline or eight weeks post-treatment, and whether temporal changes in acylcarnitine profiles differed between groups.
Compared to ANX+, CD+ and NVSM+ had significantly lower concentrations of short- and long-chain acylcarnitines at both baseline and week 8. In NVSM+, the medium- and long-chain acylcarnitines were also significantly lower in NVSM+ compared to ANX+. Short-chain acylcarnitine levels increased significantly from baseline to week 8 in CD+ and ANX+, whereas medium- and long-chain acylcarnitines significantly decreased in CD+ and NVSM+.
In depressed patients treated with SSRIs, β-oxidation and mitochondrial energetics as evaluated by levels and changes in acylcarnitines may provide the biochemical basis of the clinical heterogeneity of MDD, especially when combined with clinical characteristics.