We report a genome-wide association study (GWAS) of coronary artery disease (CAD) incorporating nearly a quarter of a million cases, in which existing studies are integrated with data from cohorts of ...white, Black and Hispanic individuals from the Million Veteran Program. We document near equivalent heritability of CAD across multiple ancestral groups, identify 95 novel loci, including nine on the X chromosome, detect eight loci of genome-wide significance in Black and Hispanic individuals, and demonstrate that two common haplotypes at the 9p21 locus are responsible for risk stratification in all populations except those of African origin, in which these haplotypes are virtually absent. Moreover, in the largest GWAS for angiographically derived coronary atherosclerosis performed to date, we find 15 loci of genome-wide significance that robustly overlap with established loci for clinical CAD. Phenome-wide association analyses of novel loci and polygenic risk scores (PRSs) augment signals related to insulin resistance, extend pleiotropic associations of these loci to include smoking and family history, and precisely document the markedly reduced transferability of existing PRSs to Black individuals. Downstream integrative analyses reinforce the critical roles of vascular endothelial, fibroblast, and smooth muscle cells in CAD susceptibility, but also point to a shared biology between atherosclerosis and oncogenesis. This study highlights the value of diverse populations in further characterizing the genetic architecture of CAD.
Dental caries and periodontitis account for a vast burden of morbidity and healthcare spending, yet their genetic basis remains largely uncharacterized. Here, we identify self-reported dental disease ...proxies which have similar underlying genetic contributions to clinical disease measures and then combine these in a genome-wide association study meta-analysis, identifying 47 novel and conditionally-independent risk loci for dental caries. We show that the heritability of dental caries is enriched for conserved genomic regions and partially overlapping with a range of complex traits including smoking, education, personality traits and metabolic measures. Using cardio-metabolic traits as an example in Mendelian randomization analysis, we estimate causal relationships and provide evidence suggesting that the processes contributing to dental caries may have undesirable downstream effects on health.
Hispanics living in the USA may have unrecognized potential birthplace and lifestyle influences on the gut microbiome. We report a cross-sectional analysis of 1674 participants from four centers of ...the Hispanic Community Health Study/Study of Latinos (HCHS/SOL), aged 18 to 74 years old at recruitment.
Amplicon sequencing of 16S rRNA gene V4 and fungal ITS1 fragments from self-collected stool samples indicate that the host microbiome is determined by sociodemographic and migration-related variables. Those who relocate from Latin America to the USA at an early age have reductions in Prevotella to Bacteroides ratios that persist across the life course. Shannon index of alpha diversity in fungi and bacteria is low in those who relocate to the USA in early life. In contrast, those who relocate to the USA during adulthood, over 45 years old, have high bacterial and fungal diversity and high Prevotella to Bacteroides ratios, compared to USA-born and childhood arrivals. Low bacterial diversity is associated in turn with obesity. Contrasting with prior studies, our study of the Latino population shows increasing Prevotella to Bacteroides ratio with greater obesity. Taxa within Acidaminococcus, Megasphaera, Ruminococcaceae, Coriobacteriaceae, Clostridiales, Christensenellaceae, YS2 (Cyanobacteria), and Victivallaceae are significantly associated with both obesity and earlier exposure to the USA, while Oscillospira and Anaerotruncus show paradoxical associations with both obesity and late-life introduction to the USA.
Our analysis of the gut microbiome of Latinos demonstrates unique features that might be responsible for health disparities affecting Hispanics living in the USA.
The vast majority of genome-wide association study (GWAS) findings reported to date are from populations with European Ancestry (EA), and it is not yet clear how broadly the genetic associations ...described will generalize to populations of diverse ancestry. The Population Architecture Using Genomics and Epidemiology (PAGE) study is a consortium of multi-ancestry, population-based studies formed with the objective of refining our understanding of the genetic architecture of common traits emerging from GWAS. In the present analysis of five common diseases and traits, including body mass index, type 2 diabetes, and lipid levels, we compare direction and magnitude of effects for GWAS-identified variants in multiple non-EA populations against EA findings. We demonstrate that, in all populations analyzed, a significant majority of GWAS-identified variants have allelic associations in the same direction as in EA, with none showing a statistically significant effect in the opposite direction, after adjustment for multiple testing. However, 25% of tagSNPs identified in EA GWAS have significantly different effect sizes in at least one non-EA population, and these differential effects were most frequent in African Americans where all differential effects were diluted toward the null. We demonstrate that differential LD between tagSNPs and functional variants within populations contributes significantly to dilute effect sizes in this population. Although most variants identified from GWAS in EA populations generalize to all non-EA populations assessed, genetic models derived from GWAS findings in EA may generate spurious results in non-EA populations due to differential effect sizes. Regardless of the origin of the differential effects, caution should be exercised in applying any genetic risk prediction model based on tagSNPs outside of the ancestry group in which it was derived. Models based directly on functional variation may generalize more robustly, but the identification of functional variants remains challenging.
Although the prevalence of obesity has increased in recent years, individuals who are obese early in life have not been studied over time to determine whether they develop severe obesity in ...adulthood, thus limiting effective interventions to reduce severe obesity incidence and its potentially life-threatening associated conditions.
To determine incidence and risk of severe obesity in adulthood by adolescent weight status.
A cohort of 8834 individuals aged 12 to 21 years enrolled in 1996 in wave II of the US National Longitudinal Study of Adolescent Health, followed up into adulthood (ages 18-27 years during wave III 2001-2002 and ages 24-33 years during wave IV 2007-2009). Height and weight were obtained via anthropometry and surveys administered in study participants' homes using standardized procedures.
New cases of adult-onset severe obesity were calculated by sex, race/ethnicity, and adolescent weight status. Sex-stratified, discrete time hazard models estimated the net effect of adolescent obesity (aged <20 years; body mass index BMI ≥95th percentile of the sex-specific BMI-for-age growth chart or BMI ≥30.0) on risk of severe obesity incidence in adulthood (aged ≥20 years; BMI ≥40.0), adjusting for race/ethnicity and age and weighted for national representation.
In 1996, 79 (1.0%; 95% confidence interval CI, 0.7%-1.4%) adolescents were severely obese; 60 (70.5%; 95% CI, 57.2%-83.9%) remained severely obese in adulthood. By 2009, 703 (7.9%; 95% CI, 7.4%-8.5%) non-severely obese adolescents had become severely obese in adulthood, with the highest rates for non-Hispanic black women. Obese adolescents were significantly more likely to develop severe obesity in young adulthood than normal-weight or overweight adolescents (hazard ratio, 16.0; 95% CI, 12.4-20.5).
In this cohort, obesity in adolescence was significantly associated with increased risk of incident severe obesity in adulthood, with variations by sex and race/ethnicity.
Variation in levels of the human metabolome reflect changes in homeostasis, providing a window into health and disease. The genetic impact on circulating metabolites in Hispanics, a population with ...high cardiometabolic disease burden, is largely unknown. We conducted genome-wide association analyses on 640 circulating metabolites in 3,926 Hispanic Community Health Study/Study of Latinos participants. The estimated heritability for 640 metabolites ranged between 0%–54% with a median at 2.5%. We discovered 46 variant-metabolite pairs (p value < 1.2 × 10−10, minor allele frequency ≥ 1%, proportion of variance explained PEV mean = 3.4%, PEVrange = 1%–22%) with generalized effects in two population-based studies and confirmed 301 known locus-metabolite associations. Half of the identified variants with generalized effect were located in genes, including five nonsynonymous variants. We identified co-localization with the expression quantitative trait loci at 105 discovered and 151 known loci-metabolites sets. rs5855544, upstream of SLC51A, was associated with higher levels of three steroid sulfates and co-localized with expression levels of SLC51A in several tissues. Mendelian randomization (MR) analysis identified several metabolites associated with coronary heart disease (CHD) and type 2 diabetes. For example, two variants located in or near CYP4F2 (rs2108622 and rs79400241, respectively), involved in vitamin E metabolism, were associated with the levels of octadecanedioate and vitamin E metabolites (gamma-CEHC and gamma-CEHC glucuronide); MR analysis showed that genetically high levels of these metabolites were associated with lower odds of CHD. Our findings document the genetic architecture of circulating metabolites in an underrepresented Hispanic/Latino community, shedding light on disease etiology.
There is no agnostic GWAS evidence for the genetic control of IL-1β expression in periodontal disease. Here we report a GWAS for "high" gingival crevicular fluid IL-1β expression among 4910 ...European-American adults and identify association signals in the IL37 locus. rs3811046 at this locus (p = 3.3 × 10
) is associated with severe chronic periodontitis (OR = 1.50; 95% CI = 1.12-2.00), 10-year incident tooth loss (≥3 teeth: RR = 1.33; 95% CI = 1.09-1.62) and aggressive periodontitis (OR = 1.12; 95% CI = 1.01-1.26) in an independent sample of 4927 German/Dutch adults. The minor allele at rs3811046 is associated with increased expression of IL-1β in periodontal tissue. In RAW macrophages, PBMCs and transgenic mice, the IL37 variant increases expression of IL-1β and IL-6, inducing more severe periodontal disease, while IL-37 protein production is impaired and shows reduced cleavage by caspase-1. A second variant in the IL37 locus (rs2708943, p = 4.2 × 10
) associates with attenuated IL37 mRNA expression. Overall, we demonstrate that IL37 variants modulate the inflammatory cascade in periodontal disease.
Prior GWAS have identified loci associated with red blood cell (RBC) traits in populations of European, African, and Asian ancestry. These studies have not included individuals with an Amerindian ...ancestral background, such as Hispanics/Latinos, nor evaluated the full spectrum of genomic variation beyond single nucleotide variants. Using a custom genotyping array enriched for Amerindian ancestral content and 1000 Genomes imputation, we performed GWAS in 12,502 participants of Hispanic Community Health Study and Study of Latinos (HCHS/SOL) for hematocrit, hemoglobin, RBC count, RBC distribution width (RDW), and RBC indices. Approximately 60% of previously reported RBC trait loci generalized to HCHS/SOL Hispanics/Latinos, including African ancestral alpha- and beta-globin gene variants. In addition to the known 3.8kb alpha-globin copy number variant, we identified an Amerindian ancestral association in an alpha-globin regulatory region on chromosome 16p13.3 for mean corpuscular volume and mean corpuscular hemoglobin. We also discovered and replicated three genome-wide significant variants in previously unreported loci for RDW (SLC12A2 rs17764730, PSMB5 rs941718), and hematocrit (PROX1 rs3754140). Among the proxy variants at the SLC12A2 locus we identified rs3812049, located in a bi-directional promoter between SLC12A2 (which encodes a red cell membrane ion-transport protein) and an upstream anti-sense long-noncoding RNA, LINC01184, as the likely causal variant. We further demonstrate that disruption of the regulatory element harboring rs3812049 affects transcription of SLC12A2 and LINC01184 in human erythroid progenitor cells. Together, these results reinforce the importance of genetic study of diverse ancestral populations, in particular Hispanics/Latinos.
A key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and ...structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline
to map and characterize structural variants in 17,795 deeply sequenced human genomes. We publicly release site-frequency data to create the largest, to our knowledge, whole-genome-sequencing-based structural variant resource so far. On average, individuals carry 2.9 rare structural variants that alter coding regions; these variants affect the dosage or structure of 4.2 genes and account for 4.0-11.2% of rare high-impact coding alleles. Using a computational model, we estimate that structural variants account for 17.2% of rare alleles genome-wide, with predicted deleterious effects that are equivalent to loss-of-function coding alleles; approximately 90% of such structural variants are noncoding deletions (mean 19.1 per genome). We report 158,991 ultra-rare structural variants and show that 2% of individuals carry ultra-rare megabase-scale structural variants, nearly half of which are balanced or complex rearrangements. Finally, we infer the dosage sensitivity of genes and noncoding elements, and reveal trends that relate to element class and conservation. This work will help to guide the analysis and interpretation of structural variants in the era of whole-genome sequencing.
Obesity and related comorbidities are major health concerns among many US immigrant populations. Emerging evidence suggests a potential involvement of the gut microbiome. Here, we evaluated gut ...microbiome features and their associations with immigration, dietary intake, and obesity in 2640 individuals from a population-based study of US Hispanics/Latinos.
The fecal shotgun metagenomics data indicate that greater US exposure is associated with reduced ɑ-diversity, reduced functions of fiber degradation, and alterations in individual taxa, potentially related to a westernized diet. However, a majority of gut bacterial genera show paradoxical associations, being reduced with US exposure and increased with fiber intake, but increased with obesity. The observed paradoxical associations are not explained by host characteristics or variation in bacterial species but might be related to potential microbial co-occurrence, as seen by positive correlations among Roseburia, Prevotella, Dorea, and Coprococcus. In the conditional analysis with mutual adjustment, including all genera associated with both obesity and US exposure in the same model, the positive associations of Roseburia and Prevotella with obesity did not persist, suggesting that their positive associations with obesity might be due to their co-occurrence and correlations with obesity-related taxa, such as Dorea and Coprococcus.
Among US Hispanics/Latinos, US exposure is associated with unfavorable gut microbiome profiles for obesity risk, potentially related to westernized diet during acculturation. Microbial co-occurrence could be an important factor to consider in future studies relating individual gut microbiome taxa to environmental factors and host health and disease.