The intestinal microflora, typically equated with bacteria, influences diseases such as obesity and inflammatory bowel disease. Here, we show that the mammalian gut contains a rich fungal community ...that interacts with the immune system through the innate immune receptor Dectin-1. Mice lacking Dectin-1 exhibited increased susceptibility to chemically induced colitis, which was the result of altered responses to indigenous fungi. In humans, we identified a polymorphism in the gene for Dectin-1 (CLEC7A) that is strongly linked to a severe form of ulcerative colitis. Together, our findings reveal a eukaryotic fungal community in the gut (the "mycobiome") that coexists with bacteria and substantially expands the repertoire of organisms interacting with the intestinal immune system to influence health and disease.
Mitochondrial DNA copy number (mtDNA-CN), a measure of the number of mitochondrial genomes per cell, is a minimally invasive proxy measure for mitochondrial function and has been associated with ...several aging-related diseases. Although quantitative real-time PCR (qPCR) is the current gold standard method for measuring mtDNA-CN, mtDNA-CN can also be measured from genotyping microarray probe intensities and DNA sequencing read counts. To conduct a comprehensive examination on the performance of these methods, we use known mtDNA-CN correlates (age, sex, white blood cell count, Duffy locus genotype, incident cardiovascular disease) to evaluate mtDNA-CN calculated from qPCR, two microarray platforms, as well as whole genome (WGS) and whole exome sequence (WES) data across 1,085 participants from the Atherosclerosis Risk in Communities (ARIC) study and 3,489 participants from the Multi-Ethnic Study of Atherosclerosis (MESA). We observe mtDNA-CN derived from WGS data is significantly more associated with known correlates compared to all other methods (p < 0.001). Additionally, mtDNA-CN measured from WGS is on average more significantly associated with traits by 5.6 orders of magnitude and has effect size estimates 5.8 times more extreme than the current gold standard of qPCR. We further investigated the role of DNA extraction method on mtDNA-CN estimate reproducibility and found mtDNA-CN estimated from cell lysate is significantly less variable than traditional phenol-chloroform-isoamyl alcohol (p = 5.44x10-4) and silica-based column selection (p = 2.82x10-7). In conclusion, we recommend the field moves towards more accurate methods for mtDNA-CN, as well as re-analyze trait associations as more WGS data becomes available from larger initiatives such as TOPMed.
Dilated cardiomyopathy (DCM) is an important cause of heart failure and the leading indication for heart transplantation. Many rare genetic variants have been associated with DCM, but common variant ...studies of the disease have yielded few associated loci. As structural changes in the heart are a defining feature of DCM, we report a genome-wide association study of cardiac magnetic resonance imaging (MRI)-derived left ventricular measurements in 36,041 UK Biobank participants, with replication in 2184 participants from the Multi-Ethnic Study of Atherosclerosis. We identify 45 previously unreported loci associated with cardiac structure and function, many near well-established genes for Mendelian cardiomyopathies. A polygenic score of MRI-derived left ventricular end systolic volume strongly associates with incident DCM in the general population. Even among carriers of TTN truncating mutations, this polygenic score influences the size and function of the human heart. These results further implicate common genetic polymorphisms in the pathogenesis of DCM.
We introduce Giraffe, a pangenome short-read mapper that can efficiently map to a collection of haplotypes threaded through a sequence graph. Giraffe maps sequencing reads to thousands of human ...genomes at a speed comparable to that of standard methods mapping to a single reference genome. The increased mapping accuracy enables downstream improvements in genome-wide genotyping pipelines for both small variants and larger structural variants. We used Giraffe to genotype 167,000 structural variants, discovered in long-read studies, in 5202 diverse human genomes that were sequenced using short reads. We conclude that pangenomics facilitates a more comprehensive characterization of variation and, as a result, has the potential to improve many genomic analyses.
Genome wide association studies (GWAS) have revealed 11 independent risk loci for polycystic ovary syndrome (PCOS), a common disorder in young women characterized by androgen excess and ...oligomenorrhea. To put these risk loci and the single nucleotide polymorphisms (SNPs) therein into functional context, we measured DNA methylation and gene expression in subcutaneous adipose tissue biopsies to identify PCOS-specific alterations. Two genes from the LHCGR region, STON1-GTF2A1L and LHCGR, were overexpressed in PCOS. In analysis stratified by obesity, LHCGR was overexpressed only in non-obese PCOS women. Although not differentially expressed in the entire PCOS group, INSR was underexpressed in obese PCOS subjects only. Alterations in gene expression in the LHCGR, RAB5B and INSR regions suggest that SNPs in these loci may be functional and could affect gene expression directly or indirectly via epigenetic alterations. We identified reduced methylation in the LHCGR locus and increased methylation in the INSR locus, changes that are concordant with the altered gene expression profiles. Complex patterns of meQTL and eQTL were identified in these loci, suggesting that local genetic variation plays an important role in gene regulation. We propose that non-obese PCOS women possess significant alterations in LH receptor expression, which drives excess androgen secretion from the ovary. Alternatively, obese women with PCOS possess alterations in insulin receptor expression, with underexpression in metabolic tissues and overexpression in the ovary, resulting in peripheral insulin resistance and excess ovarian androgen production. These studies provide a genetic and molecular basis for the reported clinical heterogeneity of PCOS.
In classifying Crohn's disease (CD) location, proximal (L4) disease includes esophagogastroduodenal (EGD) and jejunal disease. Our aim was to determine the influence of proximal disease on outcomes ...of behavior and need for surgery and to determine if there was significant clinical heterogeneity between EGD and jejunal disease.
We performed a cross-sectional query of the NIDDK (National Institute of Diabetes and Digestive and Kidney Disease) Inflammatory Bowel Disease Genetics Consortium (IBDGC) database of patients with a confirmed diagnosis of CD and phenotyped per the IBDGC manual. Presence of any L4, L4-EGD, L4-jejunal, and non-L4 disease (L1-ileal, L2-colonic, and L3-ileocolonic) was compared with demographic features including age, race, ethnicity, smoking and inflammatory bowel disease (IBD) family history, diagnosis age, disease duration, clinical outcomes of inflammatory, stricturing or penetrating behavior, and CD abdominal surgeries. Univariate and multivariable analyses were performed with R.
Among 2,105 patients with complete disease location data, 346 had L4 disease (175 L4-EGD, 115 L4-jejunal, and 56 EGD and jejunal) with 321 having concurrent L1-L3 disease. In all, 1,759 had only L1-L3 disease. L4 vs. non-L4 patients were more likely (P<0.001) to be younger at diagnosis, non-smokers, have coexisting ileal involvement, and have stricturing disease. L4-jejunal vs. L4-EGD patients were at least twice as likely (P<0.001) to have had ileal disease, stricturing behavior, and any or multiple abdominal surgeries. Remarkably, L4-jejunal patients had more (P<0.001) stricturing behavior and multiple abdominal surgeries than non-L4 ileal disease patients. Logistic regression showed stricturing risks were ileal (without proximal) site (odds ratio (OR) 3.18; 95% confidence interval 2.23-4.64), longer disease duration (OR 1.33/decade; 1.19-1.49), jejunal site (OR 2.90; 1.89-4.45), and older age at diagnosis (OR 1.21/decade; 1.10-1.34). Multiple surgery risks were disease duration (OR 3.74/decade; 3.05-4.64), penetrating disease (OR 2.60; 1.64-4.21), and jejunal site (OR 2.39; 1.36-4.20), with short duration from diagnosis to first surgery protective (OR 0.87/decade to first surgery; 0.84-0.90).
Jejunal disease is a significantly greater risk factor for stricturing disease and multiple abdominal surgeries than either EGD or ileal (without proximal) disease. The Montreal site classification should be revised to include separate designations for jejunal and EGD disease.
Developing countries and Indigenous populations are disproportionately affected by global trends in diabetes (T2DM), but inconsistent data are available to corroborate this pattern in Guatemala and ...indigenous communities in Central America. Historic estimates of T2DM, using a variety of sampling techniques and diagnostic methods, in Guatemala include a T2DM prevalence of: 4·2% (1970) and 8·4% (2003). Objectives of this geographically randomized, cross-sectional analysis of risk include: (1) use HbA1c to determine prevalence of T2DM and prediabetes in rural Indigenous community of Atitlán (2) identify risk factors for T2DM including age, BMI and gender.
A spatially random sampling method was used to identify 400 subjects. Prevalence was compared using the confidence interval method, and logistic regression and linear regression were used to assess association between diabetes and risk factors.
The overall prevalence of T2DM using HbA1c was 13·81% and prediabetes was also 13·81% in Atitlán, representing a tripling in diabetes from historic estimates and a large population with pre-diabetes. The probability of diabetes increased dramatically with increasing age, however no significant overall relationship existed with gender or BMI.
Diabetes is a larger epidemic than previously expected and appears to be related to ageing rather than BMI. Our proposed explanations for these findings include: possible Indigenous unique genetic susceptibility to T2DM, shortcomings in BMI as a metric for adiposity in assessing risk, changes in lifestyle and diet, and an overall aging population. The conclusion of this study suggest that (1) T2DM in rural regions of Guatemala may be of epidemic proportion. With pre-diabetes, more than 25% of the population will be diabetic in the very near future; (2) Age is a significant risk factor in the Indigenous population but BMI is not. This suggests that in some populations diabetes may be a disease of ageing.
BACKGROUND:The relative prevalence and clinical importance of monogenic mutations related to familial hypercholesterolemia and of high polygenic score (cumulative impact of many common variants) ...pathways for early-onset myocardial infarction remain uncertain. Whole-genome sequencing enables simultaneous ascertainment of both monogenic mutations and polygenic score for each individual.
METHODS:We performed deep-coverage whole-genome sequencing of 2081 patients from 4 racial subgroups hospitalized in the United States with early-onset myocardial infarction (age ≤55 years) recruited with a 2:1 female-to-male enrollment design. We compared these genomes with those of 3761 population-based control subjects. We first identified individuals with a rare, monogenic mutation related to familial hypercholesterolemia. Second, we calculated a recently developed polygenic score of 6.6 million common DNA variants to quantify the cumulative susceptibility conferred by common variants. We defined high polygenic score as the top 5% of the control distribution because this cutoff has previously been shown to confer similar risk to that of familial hypercholesterolemia mutations.
RESULTS:The mean age of the 2081 patients presenting with early-onset myocardial infarction was 48 years, and 66% were female. A familial hypercholesterolemia mutation was present in 36 of these patients (1.7%) and was associated with a 3.8-fold (95% CI, 2.1–6.8; P<0.001) increased odds of myocardial infarction. Of the patients with early-onset myocardial infarction, 359 (17.3%) carried a high polygenic score, associated with a 3.7-fold (95% CI, 3.1–4.6; P<0.001) increased odds. Mean estimated untreated low-density lipoprotein cholesterol was 206 mg/dL in those with a familial hypercholesterolemia mutation, 132 mg/dL in those with high polygenic score, and 122 mg/dL in those in the remainder of the population. Although associated with increased risk in all racial groups, high polygenic score demonstrated the strongest association in white participants (P for heterogeneity=0.008).
CONCLUSIONS:Both familial hypercholesterolemia mutations and high polygenic score are associated with a >3-fold increased odds of early-onset myocardial infarction. However, high polygenic score has a 10-fold higher prevalence among patients presents with early-onset myocardial infarction.
CLINICAL TRIAL REGISTRATION:URLhttps://www.clinicaltrials.gov. Unique identifierNCT00597922.
BACKGROUND:The genetic basis of left ventricular (LV) image-derived phenotypes, which play a vital role in the diagnosis, management, and risk stratification of cardiovascular diseases, is unclear at ...present.
METHODS:The LV parameters were measured from the cardiovascular magnetic resonance studies of the UK Biobank. Genotyping was done using Affymetrix arrays, augmented by imputation. We performed genome-wide association studies of 6 LV traits—LV end-diastolic volume, LV end-systolic volume, LV stroke volume, LV ejection fraction, LV mass, and LV mass to end-diastolic volume ratio. The replication analysis was performed in the MESA study (Multi-Ethnic Study of Atherosclerosis). We identified the candidate genes at genome-wide significant loci based on the evidence from extensive bioinformatic analyses. Polygenic risk scores were constructed from the summary statistics of LV genome-wide association studies to predict the heart failure events.
RESULTS:The study comprised 16 923 European UK Biobank participants (mean age 62.5 years; 45.8% men) without prevalent myocardial infarction or heart failure. We discovered 14 genome-wide significant loci (3 loci each for LV end-diastolic volume, LV end-systolic volume, and LV mass to end-diastolic volume ratio; 4 loci for LV ejection fraction, and 1 locus for LV mass) at a stringent P<1×10. Three loci were replicated at Bonferroni significance and 7 loci at nominal significance (P<0.05 with concordant direction of effect) in the MESA study (n=4383). Follow-up bioinformatic analyses identified 28 candidate genes that were enriched in the cardiac developmental pathways and regulation of the LV contractile mechanism. Eight genes (TTN, BAG3, GRK5, HSPB7, MTSS1, ALPK3, NMB, and MMP11) supported by at least 2 independent lines of in silico evidence were implicated in the cardiac morphogenesis and heart failure development. The polygenic risk scores of LV phenotypes were predictive of heart failure in a holdout UK Biobank sample of 3106 cases and 224 134 controls (odds ratio 1.41, 95% CI 1.26 – 1.58, for the top quintile versus the bottom quintile of the LV end-systolic volume risk score).
CONCLUSIONS:We report 14 genetic loci and indicate several candidate genes that not only enhance our understanding of the genetic architecture of prognostically important LV phenotypes but also shed light on potential novel therapeutic targets for LV remodeling.