Existing studies of chromatin conformation have primarily focused on potential enhancers interacting with gene promoters. By contrast, the interactivity of promoters per se, while equally critical to ...understanding transcriptional control, has been largely unexplored, particularly in a cell type-specific manner for blood lineage cell types. In this study, we leverage promoter capture Hi-C data across a compendium of blood lineage cell types to identify and characterize cell type-specific super-interactive promoters (SIPs). Notably, promoter-interacting regions (PIRs) of SIPs are more likely to overlap with cell type-specific ATAC-seq peaks and GWAS variants for relevant blood cell traits than PIRs of non-SIPs. Moreover, PIRs of cell-type-specific SIPs show enriched heritability of relevant blood cell trait (s), and are more enriched with GWAS variants associated with blood cell traits compared to PIRs of non-SIPs. Further, SIP genes tend to express at a higher level in the corresponding cell type. Importantly, SIP subnetworks incorporating cell-type-specific SIPs and ATAC-seq peaks help interpret GWAS variants. Examples include GWAS variants associated with platelet count near the megakaryocyte SIP gene EPHB3 and variants associated lymphocyte count near the native CD4 T-Cell SIP gene ETS1. Interestingly, around 25.7% ~ 39.6% blood cell traits GWAS variants residing in SIP PIR regions disrupt transcription factor binding motifs. Importantly, our analysis shows the potential of using promoter-centric analyses of chromatin spatial organization data to identify biologically important genes and their regulatory regions.
Biomarkers of chronic inflammation (such as C-reactive protein) have long been associated with cardiovascular disease and mortality; however, biomarkers involved in antiviral cytokine induction and ...adaptive immune system activation remain largely unexamined. We hypothesized the cytokine interferon gamma inducible protein 10 (IP-10) would be associated with clinical and subclinical cardiovascular disease and all-cause mortality in African Americans. We assessed these associations in the Jackson Heart Study (JHS) cohort and the REasons for Geographic and Racial Differences in Stroke (REGARDS) study. There was a modest association of IP-10 with higher odds of left ventricular hypertrophy (OR = 1.20 (95% confidence interval (CI) 1.03, 1.41) per standard deviation (SD) higher natural log-transformed IP-10 in JHS). We did not observe associations with ankle brachial index, intima-media thickness, or arterial calcification. Each SD higher increment of ln-transformed IP-10 concentration was associated with incident heart failure (hazard ratio (HR) 1.26; 95% CI 1.11, 1.42, p = 4x10-4) in JHS, and with overall mortality in both JHS (HR 1.12 per SD, 95% CI 1.03, 1.21, p = 7.5x10-3) and REGARDS (HR 1.31 per SD, 95% CI 1.10, 1.55, p = 2.0 x 10-3), adjusting for cardiovascular risk factors and C-reactive protein. However, we found no association between IP-10 and stroke or coronary heart disease. These results suggest a role of IP-10 in heart failure and mortality risk independent of C-reactive protein. Further research is needed to investigate how the body's response to chronic viral infection may mediate heart failure and overall mortality risk in African Americans.
African ancestry individuals with comparable overall anthropometric measures to Europeans have lower abdominal adiposity. To explore the genetic underpinning of different adiposity patterns, we ...investigated whether genetic risk scores for well-studied adiposity phenotypes like body mass index (BMI) and waist circumference (WC) also predict other, less commonly measured adiposity measures in 2420 African American individuals from the Jackson Heart Study. Polygenic risk scores (PRS) were calculated using GWAS-significant variants extracted from published studies mostly representing European ancestry populations for BMI, waist-hip ratio (WHR) adjusted for BMI (WHR.sub.BMIadj ), waist circumference adjusted for BMI (WC.sub.BMIadj ), and body fat percentage (BF%). Associations between each PRS and adiposity measures including BF%, subcutaneous adiposity tissue (SAT), visceral adiposity tissue (VAT) and VAT:SAT ratio (VSR) were examined using multivariable linear regression, with or without BMI adjustment. In non-BMI adjusted models, all phenotype-PRS were found to be positive predictors of BF%, SAT and VAT. WHR-PRS was a positive predictor of VSR, but BF% and BMI-PRS were negative predictors of VSR. After adjusting for BMI, WHR-PRS remained a positive predictor of BF%, VAT and VSR but not SAT. WC-PRS was a positive predictor of SAT and VAT; BF%-PRS was a positive predictor of BF% and SAT only. These analyses suggest that genetically driven increases in BF% strongly associate with subcutaneous rather than visceral adiposity and BF% is strongly associated with BMI but not central adiposity-associated genetic variants. How common genetic variants may contribute to observed differences in adiposity patterns between African and European ancestry individuals requires further study.
Age-related clonal hematopoiesis of indeterminate potential (CHIP), defined as clonally expanded leukemogenic sequence variations (particularly in DNMT3A, TET2, ASXL1, and JAK2) in asymptomatic ...individuals, is associated with cardiovascular events, including recurrent heart failure (HF).
This study sought to evaluate whether CHIP is associated with incident HF.
CHIP status was obtained from whole exome or genome sequencing of blood DNA in participants without prevalent HF or hematological malignancy from 5 cohorts. Cox proportional hazards models were performed within each cohort, adjusting for demographic and clinical risk factors, followed by fixed-effect meta-analyses. Large CHIP clones (defined as variant allele frequency >10%), HF with or without baseline coronary heart disease, and left ventricular ejection fraction were evaluated in secondary analyses.
Of 56,597 individuals (59% women, mean age 58 years at baseline), 3,406 (6%) had CHIP, and 4,694 developed HF (8.3%) over up to 20 years of follow-up. CHIP was prospectively associated with a 25% increased risk of HF in meta-analysis (hazard ratio: 1.25; 95% confidence interval: 1.13-1.38) with consistent associations across cohorts. ASXL1, TET2, and JAK2 sequence variations were each associated with an increased risk of HF, whereas DNMT3A sequence variations were not associated with HF. Secondary analyses suggested large CHIP was associated with a greater risk of HF (hazard ratio: 1.29; 95% confidence interval: 1.15-1.44), and the associations for CHIP on HF with and without prior coronary heart disease were homogenous. ASXL1 sequence variations were associated with reduced left ventricular ejection fraction.
CHIP, particularly sequence variations in ASXL1, TET2, and JAK2, represents a new risk factor for HF.
Polygenic risk scores (PRS) have shown successes in clinics, but most PRS methods focus only on participants with distinct primary continental ancestry without accommodating recently-admixed ...individuals with mosaic continental ancestry backgrounds for different segments of their genomes. Here, we develop GAUDI, a novel penalized-regression-based method specifically designed for admixed individuals. GAUDI explicitly models ancestry-differential effects while borrowing information across segments with shared ancestry in admixed genomes. We demonstrate marked advantages of GAUDI over other methods through comprehensive simulation and real data analyses for traits with associated variants exhibiting ancestral-differential effects. Leveraging data from the Women's Health Initiative study, we show that GAUDI improves PRS prediction of white blood cell count and C-reactive protein in African Americans by > 64% compared to alternative methods, and even outperforms PRS-CSx with large European GWAS for some scenarios. We believe GAUDI will be a valuable tool to mitigate disparities in PRS performance in admixed individuals.
Physical fitness is a well-known correlate of health and the aging process and DNA methylation (DNAm) data can capture aging via epigenetic clocks. However, current epigenetic clocks did not yet use ...measures of mobility, strength, lung, or endurance fitness in their construction. We develop blood-based DNAm biomarkers for fitness parameters gait speed (walking speed), maximum handgrip strength, forced expiratory volume in one second (FEV1), and maximal oxygen uptake (VO2max) which have modest correlation with fitness parameters in five large-scale validation datasets (average r between 0.16-0.48). We then use these DNAm fitness parameter biomarkers with DNAmGrimAge, a DNAm mortality risk estimate, to construct DNAmFitAge, a new biological age indicator that incorporates physical fitness. DNAmFitAge is associated with low-intermediate physical activity levels across validation datasets (
= 6.4E-13), and younger/fitter DNAmFitAge corresponds to stronger DNAm fitness parameters in both males and females. DNAmFitAge is lower (
= 0.046) and DNAmVO2max is higher (
= 0.023) in male body builders compared to controls. Physically fit people have a younger DNAmFitAge and experience better age-related outcomes: lower mortality risk (
= 7.2E-51), coronary heart disease risk (
= 2.6E-8), and increased disease-free status (
= 1.1E-7). These new DNAm biomarkers provide researchers a new method to incorporate physical fitness into epigenetic clocks.
Heart failure (HF) risk and the underlying risk factors vary by race. Traditional models for HF risk prediction treat race as a covariate in risk prediction and do not account for significant ...parameters such as cardiac biomarkers. Machine learning (ML) may offer advantages over traditional modeling techniques to develop race-specific HF risk prediction models and to elucidate important contributors of HF development across races.
We performed a retrospective analysis of 4 large, community cohort studies (ARIC Atherosclerosis Risk in Communities, DHS Dallas Heart Study, JHS Jackson Heart Study, and MESA Multi-Ethnic Study of Atherosclerosis) with adjudicated HF events. The study included participants who were >40 years of age and free of HF at baseline. Race-specific ML models for HF risk prediction were developed in the JHS cohort (for Black race-specific model) and White adults from ARIC (for White race-specific model). The models included 39 candidate variables across demographic, anthropometric, medical history, laboratory, and electrocardiographic domains. The ML models were externally validated and compared with prior established traditional and non-race-specific ML models in race-specific subgroups of the pooled MESA/DHS cohort and Black participants of ARIC. The Harrell C-index and Greenwood-Nam-D'Agostino χ
tests were used to assess discrimination and calibration, respectively.
The ML models had excellent discrimination in the derivation cohorts for Black (n=4141 in JHS, C-index=0.88) and White (n=7858 in ARIC, C-index=0.89) participants. In the external validation cohorts, the race-specific ML model demonstrated adequate calibration and superior discrimination (Black individuals, C-index=0.80-0.83; White individuals, C-index=0.82) compared with established HF risk models or with non-race-specific ML models derived with race included as a covariate. Among the risk factors, natriuretic peptide levels were the most important predictor of HF risk across both races, followed by troponin levels in Black and ECG-based Cornell voltage in White individuals. Other key predictors of HF risk among Black individuals were glycemic parameters and socioeconomic factors. In contrast, prevalent cardiovascular disease and traditional cardiovascular risk factors were stronger predictors of HF risk in White adults.
Race-specific and ML-based HF risk models that integrate clinical, laboratory, and biomarker data demonstrated superior performance compared with traditional HF risk and non-race-specific ML models. This approach identifies distinct race-specific contributors of HF.
Autism spectrum disorder (ASD) is a highly heritable neurodevelopmental disorder. Large genetically informative cohorts of individuals with ASD have led to the identification of a limited number of ...common genome-wide significant (GWS) risk loci to date. However, many more common genetic variants are expected to contribute to ASD risk given the high heritability. Here, we performed a genome-wide association study (GWAS) on 6222 case-pseudocontrol pairs from the Simons Foundation Powering Autism Research for Knowledge (SPARK) dataset to identify additional common genetic risk factors and molecular mechanisms underlying risk for ASD. We identified one novel GWS locus from the SPARK GWAS and four significant loci, including an additional novel locus from meta-analysis with a previous GWAS. We replicated the previous observation of significant enrichment of ASD heritability within regulatory regions of the developing cortex, indicating that disruption of gene regulation during neurodevelopment is critical for ASD risk. We further employed a massively parallel reporter assay (MPRA) and identified a putative causal variant at the novel locus from SPARK GWAS with strong impacts on gene regulation (rs7001340). Expression quantitative trait loci data demonstrated an association between the risk allele and decreased expression of DDHD2 (DDHD domain containing 2) in both adult and prenatal brains. In conclusion, by integrating genetic association data with multi-omic gene regulatory annotations and experimental validation, we fine-mapped a causal risk variant and demonstrated that DDHD2 is a novel gene associated with ASD risk.
Most genome-wide association and fine-mapping studies to date have been conducted in individuals of European descent, and genetic studies of populations of Hispanic/Latino and African ancestry are ...limited. In addition, these populations have more complex linkage disequilibrium structure. In order to better define the genetic architecture of these understudied populations, we leveraged >100,000 phased sequences available from deep-coverage whole genome sequencing through the multi-ethnic NHLBI Trans-Omics for Precision Medicine (TOPMed) program to impute genotypes into admixed African and Hispanic/Latino samples with genome-wide genotyping array data. We demonstrated that using TOPMed sequencing data as the imputation reference panel improves genotype imputation quality in these populations, which subsequently enhanced gene-mapping power for complex traits. For rare variants with minor allele frequency (MAF) < 0.5%, we observed a 2.3- to 6.1-fold increase in the number of well-imputed variants, with 11-34% improvement in average imputation quality, compared to the state-of-the-art 1000 Genomes Project Phase 3 and Haplotype Reference Consortium reference panels. Impressively, even for extremely rare variants with minor allele count <10 (including singletons) in the imputation target samples, average information content rescued was >86%. Subsequent association analyses of TOPMed reference panel-imputed genotype data with hematological traits (hemoglobin (HGB), hematocrit (HCT), and white blood cell count (WBC)) in ~21,600 African-ancestry and ~21,700 Hispanic/Latino individuals identified associations with two rare variants in the HBB gene (rs33930165 with higher WBC p = 8.8x10-15 in African populations, rs11549407 with lower HGB p = 1.5x10-12 and HCT p = 8.8x10-10 in Hispanics/Latinos). By comparison, neither variant would have been genome-wide significant if either 1000 Genomes Project Phase 3 or Haplotype Reference Consortium reference panels had been used for imputation. Our findings highlight the utility of the TOPMed imputation reference panel for identification of novel rare variant associations not previously detected in similarly sized genome-wide studies of under-represented African and Hispanic/Latino populations.
Systemic lupus erythematosus (SLE) is a chronic systemic autoimmune disease that disproportionately affects African American females. The causes of SLE are unknown but postulated to be a combination ...of genetic predisposition and environmental triggers. Vitamin D deficiency is one of the possible environmental triggers. In this study we evaluated relationships between vitamin D status, cellular aging (telomere length) and anti-telomere antibodies among African American Gullah women with SLE. The study population included African American female SLE patients and unaffected controls from the Sea Island region of South Carolina. Serum 25-hydroxyvitamin D levels were measured using a nonchromatographic radioimmunoassay. Telomere length was measured in genomic DNA of peripheral blood mononuclear cells (PBMCs) by monochrome multiplex quantitative PCR. Anti-telomere antibody levels were measured by enzyme-linked immunosorbent assay (ELISA). Patients with SLE had significantly shorter telomeres and higher anti-telomere antibody titers compared to age- and gender-matched unaffected controls. There was a positive correlation between anti-telomere antibody levels and disease activity among patients and a significant correlation of shorter telomeres with lower 25-hydroxyvitamin D levels in both patients and controls. In follow-up examination of a subset of the patients, the patients who remained vitamin D deficient tended to have shorter telomeres than those patients whose 25-hydroxyvitamin D levels were repleted. Increasing 25-hydroxyvitamin D levels in African American patients with SLE may be beneficial in maintaining telomere length and preventing cellular aging. Moreover, anti-telomere antibody levels may be a promising biomarker of SLE status and disease activity.