Exome sequencing studies in complex diseases are challenged by the allelic heterogeneity, large number and modest effect sizes of associated variants on disease risk and the presence of large numbers of neutral variants, even in phenotypically relevant genes. Isolated populations with recent bottlenecks offer advantages for studying rare variants in complex diseases as they have deleterious variants that are present at higher frequencies as well as a substantial reduction in rare neutral variation. To explore the potential of the Finnish founder population for studying low-frequency (0.5-5%) variants in complex diseases, we compared exome sequence data on 3,000 Finns to the same number of non-Finnish Europeans and discovered that, despite having fewer variable sites overall, the average Finn has more low-frequency loss-of-function variants and complete gene knockouts. We then used several well-characterized Finnish population cohorts to study the phenotypic effects of 83 enriched loss-of-function variants across 60 phenotypes in 36,262 Finns. Using a deep set of quantitative traits collected on these cohorts, we show 5 associations (P < 5×10⁻⁸) including splice variants in LPA that lowered plasma lipoprotein(a) levels (P = 1.5×10⁻¹¹⁷). Through accessing the national medical records of these participants, we evaluate the LPA finding via Mendelian randomization and confirm that these splice variants confer protection from cardiovascular disease (OR = 0.84, P = 3×10⁻⁴), demonstrating for the first time that very low levels of LPA in humans are correlated with protection from cardiovascular disease, with potential therapeutic implications for cardiovascular diseases. More generally, this study articulates substantial advantages for studying the role of rare variation in complex phenotypes in founder populations like the Finns by combining a unique population genetic history with data from large population cohorts and centralized research access to National Health Registers.
A genotype-first approach makes it possible to systematically identify carriers of pathogenic variants in the BRCA1/2 genes, which confer a high risk of familial breast and ovarian cancer. Participants of the Estonian biobank have expressed support for the disclosure of clinically significant findings. With an Estonian biobank cohort, we applied a genotype-first approach, contacted carriers, and offered return of results with genetic counseling. We evaluated participants' responses to and the clinical utility of the reporting of actionable genetic findings. Twenty-two of 40 contacted carriers of 17 pathogenic BRCA1/2 variants responded and chose to receive results. Eight of these 22 participants qualified for high-risk assessment based on National Comprehensive Cancer Network criteria. Twenty of 21 counseled participants appreciated being contacted. Relatives of 10 participants underwent cascade screening. Five of 16 eligible female BRCA1/2 variant carriers chose to undergo risk-reducing surgery, and 10 adhered to surveillance recommendations over the 30-month follow-up period. We recommend the return of results to population-based biobank participants; this approach could be viewed as a model for population-wide genetic testing. The genotype-first approach permits the identification of individuals at high risk who would not be identified by application of an approach based on personal and family histories only.
Clinical nutrition research often lacks robust markers of compliance, complicating the interpretation of clinical trials and observational studies of free-living subjects.
We aimed to examine metabolomics profiles in response to 3 diets that differed widely in macronutrient composition during a controlled feeding protocol.
Twenty-one adults with a high body mass index (in kg/m²; mean ± SD: 34.4 ± 4.9) were given hypocaloric diets to promote weight loss corresponding to 10-15% of initial body weight. They were then studied during weight stability while consuming 3 test diets, each for a 4-wk period according to a crossover design: low fat (60% carbohydrate, 20% fat, 20% protein), low glycemic index (40% carbohydrate, 40% fat, 20% protein), or very-low carbohydrate (10% carbohydrate, 60% fat, 30% protein). Plasma samples were obtained at baseline and at the end of each 4-wk period in the fasting state for metabolomics analysis by using liquid chromatography-tandem mass spectrometry. Statistical analyses included adjustment for multiple comparisons.
Of 333 metabolites, we identified 152 whose concentrations differed for ≥1 diet compared with the others, including diacylglycerols and triacylglycerols, branched-chain amino acids, and markers reflecting metabolic status. Analysis of groups of related metabolites, with the use of either principal components or pathways, revealed coordinated metabolic changes affected by dietary composition, including pathways related to amino acid metabolism. We constructed a classifier using the metabolites that differed between diets and were able to correctly identify the test diet from metabolite profiles in 60 of 63 cases (>95% accuracy). Analyses also suggest differential effects by diet on numerous cardiometabolic disease risk factors.
Metabolomic profiling may be used to assess compliance during clinical nutrition trials and the validity of dietary assessment in observational studies. In addition, this methodology may help elucidate mechanistic pathways linking diet to chronic disease risk. This trial was registered at clinicaltrials.gov as NCT00315354.
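The reported classifier accuracy can be checked with a short calculation. The sketch below assumes that all 21 participants contributed one plasma profile per test diet, which is consistent with the 63 cases reported; the variable names are illustrative and are not taken from the study.

```python
# Crossover design: each participant is profiled once per test diet.
# Assumption: all 21 participants completed all 3 diets (21 * 3 = 63 cases).
participants = 21
diets = ["low fat", "low glycemic index", "very-low carbohydrate"]

total_profiles = participants * len(diets)
correct = 60  # correctly identified test diets, as reported

accuracy = correct / total_profiles
print(f"{correct}/{total_profiles} profiles classified correctly ({accuracy:.1%})")
```

With three diets, guessing at random would be right about one time in three, so the reported figure is far above chance.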
We conducted genome-wide association studies (GWAS) of relative intake from the macronutrients fat, protein, carbohydrates, and sugar in over 235,000 individuals of European ancestries. We identified 21 unique, approximately independent lead SNPs. Fourteen lead SNPs are uniquely associated with one macronutrient at genome-wide significance (P < 5 × 10⁻⁸), while five of the 21 lead SNPs reach suggestive significance (P < 1 × 10⁻⁵) for at least one other macronutrient. While the phenotypes are genetically correlated, each phenotype carries a partially unique genetic architecture. Relative protein intake exhibits the strongest relationships with poor health, including positive genetic associations with obesity, type 2 diabetes, and heart disease (r_g ≈ 0.15-0.5). In contrast, relative carbohydrate and sugar intake have negative genetic correlations with waist circumference, waist-hip ratio, and neighborhood deprivation (|r_g| ≈ 0.1-0.3) and positive genetic correlations with physical activity (r_g ≈ 0.1 and 0.2). Relative fat intake has no consistent pattern of genetic correlations with poor health but has a negative genetic correlation with educational attainment (r_g ≈ −0.1). Although our analyses do not allow us to draw causal conclusions, we find no evidence of negative health consequences associated with relative carbohydrate, sugar, or fat intake. However, our results are consistent with the hypothesis that relative protein intake plays a role in the etiology of metabolic dysfunction.
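The two significance tiers used for the lead SNPs can be expressed as a simple threshold rule. This is a minimal sketch that assumes the conventional cutoffs of 5 × 10⁻⁸ for genome-wide significance and 1 × 10⁻⁵ for suggestive significance; the function name and the example P values are hypothetical and are not results from the study.

```python
# Conventional GWAS thresholds (assumed here, not derived from the data).
GENOME_WIDE = 5e-8
SUGGESTIVE = 1e-5


def significance_tier(p_value: float) -> str:
    """Return the significance tier for a single association P value."""
    if not 0.0 <= p_value <= 1.0:
        raise ValueError("P value must lie in [0, 1]")
    if p_value < GENOME_WIDE:
        return "genome-wide"
    if p_value < SUGGESTIVE:
        return "suggestive"
    return "not significant"


# Hypothetical P values for one SNP across three macronutrients.
for trait, p in {"fat": 2e-9, "protein": 3e-6, "sugar": 0.04}.items():
    print(trait, significance_tier(p))
```

A lead SNP would count as uniquely associated with one macronutrient if it reaches the first tier for that trait and neither tier for the others.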
Features of the gut microbiota have been associated with several chronic diseases and longevity in preclinical models as well as in observational studies. Whether these relations underlie causal effects in humans remains to be established. We aimed to determine whether the gut microbiota influences cardiometabolic traits as well as the risk of chronic diseases and human longevity using a comprehensive 2-sample Mendelian randomization approach. We included as exposures 10 gut-associated metabolites and pathways and the abundance of 57 microbial taxa. We included as outcomes nine cardiometabolic traits (fasting glucose, fasting insulin, systolic blood pressure, diastolic blood pressure, HDL cholesterol, LDL cholesterol, triglycerides, estimated glomerular filtration rate, body mass index (BMI)), eight chronic diseases previously linked with the gut microbiota in observational studies (Alzheimer's disease, depression, type 2 diabetes, non-alcoholic fatty liver disease, coronary artery disease (CAD), stroke, osteoporosis and chronic kidney disease), as well as parental lifespan and longevity. We found 7 associations with evidence of causality before and after sensitivity analyses, but not after multiple testing correction (1198 tests). Most effect sizes (4/7) were small. The two largest exposure-outcome effects were markedly attenuated towards the null upon inclusion of BMI or alcohol intake frequency in multivariable MR analyses. While finding robust genetic instruments for microbiota features is challenging, which may inflate type 2 errors, these results do not support a large causal impact of human gut microbiota features on cardiometabolic traits, chronic diseases or longevity. These results also suggest that the previously documented associations between gut microbiota and human health outcomes may not always underlie causal relations.
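To illustrate how strict a correction over 1198 tests can be, the sketch below computes a Bonferroni-adjusted threshold. It assumes a family-wise error rate of 0.05 and a Bonferroni correction; the abstract does not state which correction method or alpha was used, and the example P value is hypothetical.

```python
# Assumptions: Bonferroni correction at a family-wise alpha of 0.05.
# The source reports only the number of tests (1198).
ALPHA = 0.05
N_TESTS = 1198

bonferroni_threshold = ALPHA / N_TESTS


def survives_correction(p_value: float) -> bool:
    """True if a P value stays significant after Bonferroni correction."""
    return p_value < bonferroni_threshold


print(f"Per-test threshold: {bonferroni_threshold:.2e}")
# A hypothetical nominally significant result that fails the correction.
print(survives_correction(0.001))
```

Under these assumptions an association would need a P value roughly 1000 times smaller than the nominal 0.05 level to remain significant.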
Pernicious anemia is a rare condition characterized by vitamin B12 deficiency anemia due to lack of intrinsic factor, often caused by autoimmune gastritis. Patients with pernicious anemia have a higher incidence of other autoimmune disorders, such as type 1 diabetes, vitiligo, and autoimmune thyroid issues. Therefore, the disease has a clear autoimmune basis, although the genetic susceptibility factors have thus far remained poorly studied. We conduct a genome-wide association study meta-analysis in 2166 cases and 659,516 European controls from population-based biobanks and identify genome-wide significant signals in or near the PTPN22 (rs6679677, p = 1.91 × 10
, OR = 1.63), PNPT1 (rs12616502, p = 3.14 × 10
, OR = 1.70), HLA-DQB1 (rs28414666, p = 1.40 × 10
, OR = 1.38), IL2RA (rs2476491, p = 1.90 × 10
, OR = 1.22) and AIRE (rs74203920, p = 2.33 × 10
, OR = 1.83) genes, thus providing robust associations between pernicious anemia and genetic risk factors.
Inappropriate activation or inadequate regulation of CD4+ and CD8+ T cells may contribute to the initiation and progression of multiple autoimmune and inflammatory diseases. Studies on disease-associated genetic polymorphisms have highlighted the importance of biological context for many regulatory variants, which is particularly relevant in understanding the genetic regulation of the immune system and its cellular phenotypes. Here we show cell type-specific regulation of transcript levels of genes associated with several autoimmune diseases in CD4+ and CD8+ T cells including a trans-acting regulatory locus at chr12q13.2 containing the rs1131017 SNP in the RPS26 gene. Most remarkably, we identify a common missense variant in IL27, associated with type 1 diabetes that results in decreased functional activity of the protein and reduced expression levels of downstream IRF1 and STAT1 in CD4+ T cells only. Altogether, our results indicate that eQTL mapping in purified T cells provides novel functional insights into polymorphisms and pathways associated with autoimmune diseases.
The admixture between modern humans and Neandertals has resulted in ∼2% of the genomes of present-day non-Africans being composed of Neandertal DNA. Introgressed Neandertal DNA has been demonstrated to significantly affect the transcriptomic landscape in people today and via this molecular mechanism influence phenotype variation as well. However, little is known about how much of that regulatory impact is mediated through long-range regulatory effects that have been shown to explain ∼20% of expression variation. Here we identified 60 transcription factors (TFs) with their top cis-eQTL SNP in GTEx being of Neandertal ancestry and predicted long-range Neandertal DNA-induced regulatory effects by screening for the predicted target genes of those TFs. We show that the TFs form a significantly connected protein–protein interaction network. Among them are JUN and PRDM5, two brain-expressed TFs that have their predicted target genes enriched in regions devoid of Neandertal DNA. Archaic cis-eQTLs for the 60 TFs include multiple candidates for local adaptation, some of which show significant allele frequency increases over the last ∼10,000 years. A large proportion of the cis-eQTL-associated archaic SNPs have additional associations with various immune traits, schizophrenia, blood cell type composition and anthropometric measures. Finally, we demonstrate that our results are consistent with those of Neandertal DNA-associated empirical trans-eQTLs. Our results suggest that Neandertal DNA significantly influences regulatory networks, that its regulatory reach goes beyond the 40% of genomic sequence it still covers in present-day non-Africans and that via the investigated mechanism Neandertal DNA influences the phenotypic variation in people today.
Yermakovich et al. explore the long-range regulatory effects of Neandertal DNA in modern humans by scanning for transcription factors with eQTL variants of likely Neandertal ancestry and investigating their predicted target genes. Their results suggest that Neandertal DNA significantly influences regulatory networks, reaching beyond the 40% of genomic sequence it still covers in present-day non-Africans, and phenotypic variation in people today.
Type 2 diabetes (T2D) is a very common disease in humans. Here we conduct a meta-analysis of genome-wide association studies (GWAS) with ~16 million genetic variants in 62,892 T2D cases and 596,424 controls of European ancestry. We identify 139 common and 4 rare variants associated with T2D, 42 of which (39 common and 3 rare variants) are independent of the known variants. Integration of the gene expression data from blood (n = 14,115 and 2765) with the GWAS results identifies 33 putative functional genes for T2D, 3 of which were targeted by approved drugs. A further integration of DNA methylation (n = 1980) and epigenomic annotation data highlights 3 genes (CAMK1D, TP53INP1, and ATP5G1) with plausible regulatory mechanisms, whereby a genetic variant exerts an effect on T2D through epigenetic regulation of gene expression. Our study uncovers additional loci, proposes putative genetic regulatory mechanisms for T2D, and provides evidence of purifying selection for T2D-associated variants.
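Because cases are far outnumbered by controls, the statistical power of such a meta-analysis is lower than the raw total suggests. A minimal sketch follows, using the commonly cited effective sample size formula for case-control designs, N_eff = 4 / (1/N_cases + 1/N_controls); applying this formula here is an illustrative assumption, not a calculation reported in the abstract.

```python
# Sample sizes as reported in the abstract.
N_CASES = 62_892
N_CONTROLS = 596_424


def effective_sample_size(n_cases: int, n_controls: int) -> float:
    """Effective N of a case-control study relative to a balanced design."""
    return 4.0 / (1.0 / n_cases + 1.0 / n_controls)


total = N_CASES + N_CONTROLS
case_fraction = N_CASES / total
n_eff = effective_sample_size(N_CASES, N_CONTROLS)

print(f"Total N: {total:,}")
print(f"Case fraction: {case_fraction:.1%}")
print(f"Effective N: {n_eff:,.0f}")
```

For a perfectly balanced study the formula returns the total sample size, so the gap between the two numbers reflects the cost of the case-control imbalance.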
We use a genome-wide association of 1 million parental lifespans of genotyped subjects and data on mortality risk factors to validate previously unreplicated findings near
,
,
,
,
, and 13q21.31, and identify and replicate novel findings near
,
, and
. We also validate previous findings near 5q33.3/
and
, whilst finding contradictory evidence at other loci. Gene set and cell-specific analyses show that expression in foetal brain cells and adult dorsolateral prefrontal cortex is enriched for lifespan variation, as are gene pathways involving lipid proteins and homeostasis, vesicle-mediated transport, and synaptic function. Individual genetic variants that increase dementia, cardiovascular disease, and lung cancer - but not other cancers - explain the most variance. Resulting polygenic scores show a mean lifespan difference of around five years of life across the deciles.