The collection of fecal material and developments in sequencing technologies have enabled standardised and non-invasive gut microbiome profiling. Microbiome composition from several large cohorts ...have been cross-sectionally linked to various lifestyle factors and diseases. In spite of these advances, prospective associations between microbiome composition and health have remained uncharacterised due to the lack of sufficiently large and representative population cohorts with comprehensive follow-up data. Here, we analyse the long-term association between gut microbiome variation and mortality in a well-phenotyped and representative population cohort from Finland (n = 7211). We report robust taxonomic and functional microbiome signatures related to the Enterobacteriaceae family that are associated with mortality risk during a 15-year follow-up. Our results extend previous cross-sectional studies, and help to establish the basis for examining long-term associations between human gut microbiome composition, incident outcomes, and general health status.
Blood lipids and metabolites are markers of current health and future disease risk. Here, we describe plasma nuclear magnetic resonance (NMR) biomarker data for 118,461 participants in the UK ...Biobank. The biomarkers cover 249 measures of lipoprotein lipids, fatty acids, and small molecules such as amino acids, ketones, and glycolysis metabolites. We provide an atlas of associations of these biomarkers to prevalence, incidence, and mortality of over 700 common diseases ( nightingalehealth.com/atlas ). The results reveal a plethora of biomarker associations, including susceptibility to infectious diseases and risk of various cancers, joint disorders, and mental health outcomes, indicating that abundant circulating lipids and metabolites are risk markers beyond cardiometabolic diseases. Clustering analyses indicate similar biomarker association patterns across different disease types, suggesting latent systemic connectivity in the susceptibility to a diverse set of diseases. This work highlights the value of NMR based metabolic biomarker profiling in large biobanks for public health research and translation.
Background and Aims
The effects of alcohol use in nonalcoholic fatty liver disease are unclear. We investigated the impact of alcohol use in fatty liver disease on incident liver, cardiovascular, and ...malignant disease, as well as death.
Approach and Results
Our study comprised 8,345 persons with hepatic steatosis (fatty liver index >60) who participated in health‐examination surveys (FINRISK 1992‐2012 or Health 2000), with available data on baseline alcohol intake. Main exclusions were baseline clinical liver disease, viral hepatitis, ethanol intake >50 g/day, and current abstainers. Data were linked with national registers for hospital admissions, malignancies, and death regarding liver, cardiovascular, and malignant disease, as well as all‐cause death. Adjustment were for multiple confounders. Alcohol consumption showed a dose‐dependent risk increase for incident advanced liver disease and malignancies. Consuming 10‐19 g/day of alcohol in general or 0‐9 g/day as nonwine beverages doubled the risk for advanced liver disease compared to lifetime abstainers. In contrast, alcohol intake up to 49 g/day was associated with a 22%‐40% reduction of incident cardiovascular disease (CVD). We observed a J‐shaped association between alcohol intake and all‐cause death with a maximal risk reduction of 21% (95% confidence interval, 5%‐34%) at alcohol intake of 0‐9 g/day compared to lifetime abstainers. However, these benefits on CVD and mortality were only observed in never smokers. Alcohol intake >30 g/day yielded increased risk estimates for mortality compared to lifetime abstainers. In a subpopulation with longitudinal data, alcohol intake remained stable over time in >80% of subjects.
Conclusions
Even low alcohol intake in fatty liver disease is associated with increased risks for advanced liver disease and cancer. Low to moderate alcohol use is associated with reduced mortality and CVD risk but only among never smokers.
The Estonian Biobank cohort is a volunteer-based sample of the Estonian resident adult population (aged ≥18 years). The current number of participants-close to 52000--represents a large proportion, ...5%, of the Estonian adult population, making it ideally suited to population-based studies. General practitioners (GPs) and medical personnel in the special recruitment offices have recruited participants throughout the country. At baseline, the GPs performed a standardized health examination of the participants, who also donated blood samples for DNA, white blood cells and plasma tests and filled out a 16-module questionnaire on health-related topics such as lifestyle, diet and clinical diagnoses described in WHO ICD-10. A significant part of the cohort has whole genome sequencing (100), genome-wide single nucleotide polymorphism (SNP) array data (20 000) and/or NMR metabolome data (11 000) available (http://www.geenivaramu.ee/for-scientists/data-release/). The data are continuously updated through periodical linking to national electronic databases and registries. A part of the cohort has been re-contacted for follow-up purposes and resampling, and targeted invitations are possible for specific purposes, for example people with a specific diagnosis. The Estonian Genome Center of the University of Tartu is actively collaborating with many universities, research institutes and consortia and encourages fellow scientists worldwide to co-initiate new academic or industrial joint projects with us.
Circulating cytokines and growth factors are regulators of inflammation and have been implicated in autoimmune and metabolic diseases. In this genome-wide association study (GWAS) of up to 8,293 ...Finns we identified 27 genome-widely significant loci (p < 1.2 × 10−9) for one or more cytokines. Fifteen of the associated variants had expression quantitative trait loci in whole blood. We provide genetic instruments to clarify the causal roles of cytokine signaling and upstream inflammation in immune-related and other chronic diseases. We further link inflammatory markers with variants previously associated with autoimmune diseases such as Crohn disease, multiple sclerosis, and ulcerative colitis and hereby elucidate the molecular mechanisms underpinning these diseases and suggest potential drug targets.
Evaluation of O2PLS in Omics data integration Bouhaddani, Said El; Houwing-Duistermaat, Jeanine; Salo, Perttu ...
BMC bioinformatics,
2016-Jan-20, 2016-1-20, 20160120, Letnik:
17 Suppl 2, Številka:
29
Journal Article
Recenzirano
Odprti dostop
Rapid computational and technological developments made large amounts of omics data available in different biological levels. It is becoming clear that simultaneous data analysis methods are needed ...for better interpretation and understanding of the underlying systems biology. Different methods have been proposed for this task, among them Partial Least Squares (PLS) related methods. To also deal with orthogonal variation, systematic variation in the data unrelated to one another, we consider the Two-way Orthogonal PLS (O2PLS): an integrative data analysis method which is capable of modeling systematic variation, while providing more parsimonious models aiding interpretation.
A simulation study to assess the performance of O2PLS showed positive results in both low and higher dimensions. More noise (50 % of the data) only affected the systematic part estimates. A data analysis was conducted using data on metabolomics and transcriptomics from a large Finnish cohort (DILGOM). A previous sequential study, using the same data, showed significant correlations between the Lipo-Leukocyte (LL) module and lipoprotein metabolites. The O2PLS results were in agreement with these findings, identifying almost the same set of co-varying variables. Moreover, our integrative approach identified other associative genes and metabolites, while taking into account systematic variation in the data. Including orthogonal components enhanced overall fit, but the orthogonal variation was difficult to interpret.
Simulations showed that the O2PLS estimates were close to the true parameters in both low and higher dimensions. In the presence of more noise (50 %), the orthogonal part estimates could not distinguish well between joint and unique variation. The joint estimates were not systematically affected. Simultaneous analysis with O2PLS on metabolome and transcriptome data showed that the LL module, together with VLDL and HDL metabolites, were important for the metabolomic and transcriptomic relation. This is in agreement with an earlier study. In addition more gene expression and metabolites are identified being important for the joint covariation.
Genetics plays an important role in coronary heart disease (CHD) but the clinical utility of genomic risk scores (GRSs) relative to clinical risk scores, such as the Framingham Risk Score (FRS), is ...unclear. Our aim was to construct and externally validate a CHD GRS, in terms of lifetime CHD risk and relative to traditional clinical risk scores.
We generated a GRS of 49 310 SNPs based on a CARDIoGRAMplusC4D Consortium meta-analysis of CHD, then independently tested it using five prospective population cohorts (three FINRISK cohorts, combined n = 12 676, 757 incident CHD events; two Framingham Heart Study cohorts (FHS), combined n = 3406, 587 incident CHD events). The GRS was associated with incident CHD (FINRISK HR = 1.74, 95% confidence interval (CI) 1.61-1.86 per S.D. of GRS; Framingham HR = 1.28, 95% CI 1.18-1.38), and was largely unchanged by adjustment for known risk factors, including family history. Integration of the GRS with the FRS or ACC/AHA13 scores improved the 10 years risk prediction (meta-analysis C-index: +1.5-1.6%, P < 0.001), particularly for individuals ≥60 years old (meta-analysis C-index: +4.6-5.1%, P < 0.001). Importantly, the GRS captured substantially different trajectories of absolute risk, with men in the top 20% of attaining 10% cumulative CHD risk 12-18 y earlier than those in the bottom 20%. High genomic risk was partially compensated for by low systolic blood pressure, low cholesterol level, and non-smoking.
A GRS based on a large number of SNPs improves CHD risk prediction and encodes different trajectories of lifetime risk not captured by traditional clinical risk scores.
The role of metabolic syndrome (MetS) as a preceding metabolic state for type 2 diabetes and cardiovascular disease is widely recognised. To accumulate knowledge of the pathological mechanisms behind ...the condition at the methylation level, we conducted an epigenome-wide association study (EWAS) of MetS and its components, testing 1187 individuals of European ancestry for approximately 470 000 methylation sites throughout the genome. Methylation site cg19693031 in gene TXNIP -previously associated with type 2 diabetes, glucose and lipid metabolism, associated with fasting glucose level (P = 1.80 × 10
). Cg06500161 in gene ABCG1 associated both with serum triglycerides (P = 5.36 × 10
) and waist circumference (P = 5.21 × 10
). The previously identified type 2 diabetes-associated locus cg08309687 in chromosome 21 associated with waist circumference for the first time (P = 2.24 × 10
). Furthermore, a novel HDL association with cg17901584 in chromosome 1 was identified (P = 7.81 × 10
). Our study supports previous genetic studies of MetS, finding that lipid metabolism plays a key role in pathology of the syndrome. We provide evidence regarding a close interplay with glucose metabolism. Finally, we suggest that in attempts to identify methylation loci linking separate MetS components, cg19693031 appears to represent a strong candidate.
Polygenic scores (PSs) are becoming a useful tool to identify individuals with high genetic risk for complex diseases, and several projects are currently testing their utility for translational ...applications. It is also tempting to use PSs to assess whether genetic variation can explain a part of the geographic distribution of a phenotype. However, it is not well known how the population genetic properties of the training and target samples affect the geographic distribution of PSs. Here, we evaluate geographic differences, and related biases, of PSs in Finland in a geographically well-defined sample of 2,376 individuals from the National FINRISK study. First, we detect geographic differences in PSs for coronary artery disease (CAD), rheumatoid arthritis, schizophrenia, waist-hip ratio (WHR), body-mass index (BMI), and height, but not for Crohn disease or ulcerative colitis. Second, we use height as a model trait to thoroughly assess the possible population genetic biases in PSs and apply similar approaches to the other phenotypes. Most importantly, we detect suspiciously large accumulations of geographic differences for CAD, WHR, BMI, and height, suggesting bias arising from the population’s genetic structure rather than from a direct genotype-phenotype association. This work demonstrates how sensitive the geographic patterns of current PSs are for small biases even within relatively homogeneous populations and provides simple tools to identify such biases. A thorough understanding of the effects of population genetic structure on PSs is essential for translational applications of PSs.
Plasma protein
-glycan profiling integrates information on enzymatic protein glycosylation, which is a highly controlled ubiquitous posttranslational modification. Here we investigate the ability of ...the plasma
-glycome to predict incidence of type 2 diabetes and cardiovascular diseases (CVDs; i.e., myocardial infarction and stroke).
Based on the prospective European Prospective Investigation of Cancer (EPIC)-Potsdam cohort (
= 27,548), we constructed case-cohorts including a random subsample of 2,500 participants and all physician-verified incident cases of type 2 diabetes (
= 820; median follow-up time 6.5 years) and CVD (
= 508; median follow-up time 8.2 years). Information on the relative abundance of 39
-glycan groups in baseline plasma samples was generated by chromatographic profiling. We selected predictive
-glycans for type 2 diabetes and CVD separately, based on cross-validated machine learning, nonlinear model building, and construction of weighted prediction scores. This workflow for CVD was applied separately in men and women.
The
-glycan-based type 2 diabetes score was strongly predictive for diabetes risk in an internal validation cohort (weighted C-index 0.83, 95% CI 0.78-0.88), and this finding was externally validated in the Finland Cardiovascular Risk Study (FINRISK) cohort.
-glycans were moderately predictive for CVD incidence (weighted C-indices 0.66, 95% CI 0.60-0.72, for men; 0.64, 95% CI 0.55-0.73, for women). Information on the selected
-glycans improved the accuracy of established and clinically applied risk prediction scores for type 2 diabetes and CVD.
Selected
-glycans improve type 2 diabetes and CVD prediction beyond established risk markers. Plasma protein
-glycan profiling may thus be useful for risk stratification in the context of precisely targeted primary prevention of cardiometabolic diseases.