Osteoporosis is a devastating disease with an essential genetic component. GWAS have discovered genetic signals robustly associated with bone mineral density (BMD), but not the precise localization ...of effector genes. Here, we carry out physical and direct variant to gene mapping in human mesenchymal progenitor cell-derived osteoblasts employing a massively parallel, high resolution Capture C based method in order to simultaneously characterize the genome-wide interactions of all human promoters. By intersecting our Capture C and ATAC-seq data, we observe consistent contacts between candidate causal variants and putative target gene promoters in open chromatin for ~ 17% of the 273 BMD loci investigated. Knockdown of two novel implicated genes, ING3 at 'CPED1-WNT16' and EPDR1 at 'STARD3NL', inhibits osteoblastogenesis, while promoting adipogenesis. This approach therefore aids target discovery in osteoporosis, here on the example of two relevant genes involved in the fate determination of mesenchymal progenitors, and can be applied to other common genetic diseases.
A chronic proinflammatory state precedes pathological change in arterial endothelial cells located within regions of susceptibility to atherosclerosis. The potential contributions of regulatory ...microRNAs to this disequilibrium were investigated by artery site-specific profiling in normal adult swine. Expression of endothelial microRNA10a (miR-10a) was lower in the athero-susceptible regions of the inner aortic arch and aorto-renal branches than elsewhere. Expression of Homeobox A1 (HOXA1), a known miR-10a target, was up-regulated in the same locations. Endothelial transcriptome microarray analysis of miR-10a knockdown in cultured human aortic endothelial cells (HAEC) identified IκB/NF-κB–mediated inflammation as the top category of up-regulated biological processes. Phosphorylation of IκBα, a prerequisite for IκBα proteolysis and NF-κB activation, was significantly up-regulated in miR-10a knockdown HAEC and was accompanied by increased nuclear expression of NF-κB p65. The inflammatory biomarkers monocyte chemotactic protein 1 (MCP-1), IL-6, IL-8, vascular cell adhesion molecule 1 (VCAM-1), and E-selectin were elevated following miR-10a knockdown. Conversely, knockin of miR-10a (a conservative 25-fold increase) inhibited the basal expression of VCAM-1 and E-selectin in HAEC. Two key regulators of IκBα degradation—mitogen-activated kinase kinase kinase 7 (MAP3K7; TAK1) and β-transducin repeat-containing gene (βTRC)—contain a highly conserved miR-10a binding site in the 3' UTR. Both molecules were up-regulated by miR-10a knockdown and suppressed by miR-10a knockin, and evidence of direct miR-10a binding to the 3' UTR was demonstrated by luciferase assay. Comparative expression studies of endothelium located in athero-susceptible aortic arch and athero-protected descending thoracic aorta identified significantly up-regulated MAP3K7, βTRC, phopho-IκBα, and nuclear p65 expression suggesting that the differential expression of miR-10a contributes to the regulation of proinflammatory endothelial phenotypes in athero-susceptible regions in vivo.
Systemic lupus erythematosus (SLE) is mediated by autoreactive antibodies that damage multiple tissues. Genome-wide association studies (GWAS) link >60 loci with SLE risk, but the causal variants and ...effector genes are largely unknown. We generated high-resolution spatial maps of SLE variant accessibility and gene connectivity in human follicular helper T cells (TFH), a cell type required for anti-nuclear antibodies characteristic of SLE. Of the ~400 potential regulatory variants identified, 90% exhibit spatial proximity to genes distant in the 1D genome sequence, including variants that loop to regulate the canonical TFH genes BCL6 and CXCR5 as confirmed by genome editing. SLE 'variant-to-gene' maps also implicate genes with no known role in TFH/SLE disease biology, including the kinases HIPK1 and MINK1. Targeting these kinases in TFH inhibits production of IL-21, a cytokine crucial for class-switched B cell antibodies. These studies offer mechanistic insight into the SLE-associated regulatory architecture of the human genome.
The FoxA transcription factors are critical for liver development through their pioneering activity, which initiates a highly complex regulatory network thought to become progressively resistant to ...the loss of any individual hepatic transcription factor via mutual redundancy. To investigate the dispensability of FoxA factors for maintaining this regulatory network, we ablated all FoxA genes in the adult mouse liver. Remarkably, loss of FoxA caused rapid and massive reduction in the expression of critical liver genes. Activity of these genes was reduced back to the low levels of the fetal prehepatic endoderm stage, leading to necrosis and lethality within days. Mechanistically, we found FoxA proteins to be required for maintaining enhancer activity, chromatin accessibility, nucleosome positioning, and binding of HNF4α. Thus, the FoxA factors act continuously, guarding hepatic enhancer activity throughout adult life.
BACKGROUNDMultiple islet autoantibodies (AAbs) predict the development of type 1 diabetes (T1D) and hyperglycemia within 10 years. By contrast, T1D develops in only approximately 15% of individuals ...who are positive for single AAbs (generally against glutamic acid decarboxylase GADA); hence, the single GADA+ state may represent an early stage of T1D.METHODSHere, we functionally, histologically, and molecularly phenotyped human islets from nondiabetic GADA+ and T1D donors.RESULTSSimilar to the few remaining β cells in the T1D islets, GADA+ donor islets demonstrated a preserved insulin secretory response. By contrast, α cell glucagon secretion was dysregulated in both GADA+ and T1D islets, with impaired glucose suppression of glucagon secretion. Single-cell RNA-Seq of GADA+ α cells revealed distinct abnormalities in glycolysis and oxidative phosphorylation pathways and a marked downregulation of cAMP-dependent protein kinase inhibitor β (PKIB), providing a molecular basis for the loss of glucose suppression and the increased effect of 3-isobutyl-1-methylxanthine (IBMX) observed in GADA+ donor islets.CONCLUSIONWe found that α cell dysfunction was present during the early stages of islet autoimmunity at a time when β cell mass was still normal, raising important questions about the role of early α cell dysfunction in the progression of T1D.FUNDINGThis work was supported by grants from the NIH (3UC4DK112217-01S1, U01DK123594-02, UC4DK112217, UC4DK112232, U01DK123716, and P30 DK019525) and the Vanderbilt Diabetes Research and Training Center (DK20593).
Neurodegenerative diseases pose an extraordinary threat to the world’s aging population, yet no disease-modifying therapies are available. Although genome-wide association studies (GWASs) have ...identified hundreds of risk loci for neurodegeneration, the mechanisms by which these loci influence disease risk are largely unknown. Here, we investigated the association between common genetic variants at the 7p21 locus and risk of the neurodegenerative disease frontotemporal lobar degeneration. We showed that variants associated with disease risk correlate with increased expression of the 7p21 gene TMEM106B and no other genes; co-localization analyses implicated a common causal variant underlying both association with disease and association with TMEM106B expression in lymphoblastoid cell lines and human brain. Furthermore, increases in the amount of TMEM106B resulted in increases in abnormal lysosomal phenotypes and cell toxicity in both immortalized cell lines and neurons. We then combined fine-mapping, bioinformatics, and bench-based approaches to functionally characterize all candidate causal variants at this locus. This approach identified a noncoding variant, rs1990620, that differentially recruits CTCF in lymphoblastoid cell lines and human brain to influence CTCF-mediated long-range chromatin-looping interactions between multiple cis-regulatory elements, including the TMEM106B promoter. Our findings thus provide an in-depth analysis of the 7p21 locus linked by GWASs to frontotemporal lobar degeneration, nominating a causal variant and causal mechanism for allele-specific expression and disease association at this locus. Finally, we show that genetic variants associated with risk of neurodegenerative diseases beyond frontotemporal lobar degeneration are enriched in CTCF-binding sites found in brain-relevant tissues, implicating CTCF-mediated gene regulation in risk of neurodegeneration more generally.
The transcriptional mechanisms by which temporary exposure to developmental signals instigates adipocyte differentiation are unknown. During early adipogenesis, we find transient enrichment of the ...glucocorticoid receptor (GR), CCAAT/enhancer-binding protein beta (CEBPbeta), p300, mediator subunit 1, and histone H3 acetylation near genes involved in cell proliferation, development, and differentiation, including the gene encoding the master regulator of adipocyte differentiation, peroxisome proliferator-activated receptor gamma2 (PPARgamma2). Occupancy and enhancer function are triggered by adipogenic signals, and diminish upon their removal. GR, which is important for adipogenesis but need not be active in the mature adipocyte, functions transiently with other enhancer proteins to propagate a new program of gene expression that includes induction of PPARgamma2, thereby providing a memory of the earlier adipogenic signal. Thus, the conversion of preadipocyte to adipocyte involves the formation of an epigenomic transition state that is not observed in cells at the beginning or end of the differentiation process.
A typical task in bioinformatics consists of identifying which features are associated with a target outcome of interest and building a predictive model. Automated machine learning (AutoML) systems ...such as the Tree-based Pipeline Optimization Tool (TPOT) constitute an appealing approach to this end. However, in biomedical data, there are often baseline characteristics of the subjects in a study or batch effects that need to be adjusted for in order to better isolate the effects of the features of interest on the target. Thus, the ability to perform covariate adjustments becomes particularly important for applications of AutoML to biomedical big data analysis.
We developed an approach to adjust for covariates affecting features and/or target in TPOT. Our approach is based on regressing out the covariates in a manner that avoids 'leakage' during the cross-validation training procedure. We describe applications of this approach to toxicogenomics and schizophrenia gene expression data sets. The TPOT extensions discussed in this work are available at https://github.com/EpistasisLab/tpot/tree/v0.11.1-resAdj .
In this work, we address an important need in the context of AutoML, which is particularly crucial for applications to bioinformatics and medical informatics, namely covariate adjustments. To this end we present a substantial extension of TPOT, a genetic programming based AutoML approach. We show the utility of this extension by applications to large toxicogenomics and differential gene expression data. The method is generally applicable in many other scenarios from the biomedical field.
Here we describe an application to infectious disease epidemiology leveraging data from ClinEpiDB, a resource aimed at advancing global public health by facilitating the exploration and analysis of ...epidemiological studies 12. In the traditional regression-based analyses on this dataset reported in 14, the independent variables were age bin (< 5, 5–14, 15+), gender, a history of travel in the two weeks preceding the survey visit, a history of malaria in the past year, antimalarial use in the two weeks preceding the visit, reported use of repellent, and whether the visit occurred during the rainy season. For each of these three types, we ran TPOT 50 times with different random splits of the input data into training (75%) and hold-out testing (25%) portions. ...to mitigate the effect of the high imbalance between number of cases and controls, in each run we randomly undersampled the controls to equal the number of cases prior to the random split. Embedding AutoML tools within epidemiology platforms like ClinEpiDB would empower users to directly perform sophisticated analyses, accelerating the benefits derived from these public health resources.
Atherosclerosis is a heterogeneously distributed disease of arteries in which the endothelium plays an important central role. Spatial transcriptome profiling of endothelium in pre-lesional arteries ...has demonstrated differential phenotypes primed for athero-susceptibility at hemodynamic sites associated with disturbed blood flow. DNA methylation is a powerful epigenetic regulator of endothelial transcription recently associated with flow characteristics. We investigated differential DNA methylation in flow region-specific aortic endothelial cells in vivo in adult domestic male and female swine.
Genome-wide DNA methylation was profiled in endothelial cells (EC) isolated from two robust locations of differing patho-susceptibility:--an athero-susceptible site located at the inner curvature of the aortic arch (AA) and an athero-protected region in the descending thoracic (DT) aorta. Complete methylated DNA immunoprecipitation sequencing (MeDIP-seq) identified over 5500 endothelial differentially methylated regions (DMRs). DMR density was significantly enriched in exons and 5'UTR sequences of annotated genes, 60 of which are linked to cardiovascular disease. The set of DMR-associated genes was enriched in transcriptional regulation, pattern specification HOX loci, oxidative stress and the ER stress adaptive pathway, all categories linked to athero-susceptible endothelium. Examination of the relationship between DMR and mRNA in HOXA genes demonstrated a significant inverse relationship between CpG island promoter methylation and gene expression. Methylation-specific PCR (MSP) confirmed differential CpG methylation of HOXA genes, the ER stress gene ATF4, inflammatory regulator microRNA-10a and ARHGAP25 that encodes a negative regulator of Rho GTPases involved in cytoskeleton remodeling. Gender-specific DMRs associated with ciliogenesis that may be linked to defects in cilia development were also identified in AA DMRs.
An endothelial methylome analysis identifies epigenetic DMR characteristics associated with transcriptional regulation in regions of atherosusceptibility in swine aorta in vivo. The data represent the first methylome blueprint for spatio-temporal analyses of lesion susceptibility predisposing to endothelial dysfunction in complex flow environments in vivo.