Microbiome research is starting to move beyond the exploratory phase towards interventional trials and therefore well-characterized cohorts will be instrumental for generating hypotheses and ...providing new knowledge. As part of the Estonian Biobank, we established the Estonian Microbiome Cohort which includes stool, oral and plasma samples from 2509 participants and is supplemented with multi-omic measurements, questionnaires, and regular linkages to national electronic health records. Here we analyze stool data from deep metagenomic sequencing together with rich phenotyping, including 71 diseases, 136 medications, 21 dietary questions, 5 medical procedures, and 19 other factors. We identify numerous relationships (n = 3262) with different microbiome features. In this study, we extend the understanding of microbiome-host interactions using electronic health data and show that long-term antibiotic usage, independent from recent administration, has a significant impact on the microbiome composition, partly explaining the common associations between diseases.
The prognostic and diagnostic value of microRNA (miRNA) expression aberrations in lung cancer has been studied intensely in recent years. However, due to the application of different technological ...platforms and small sample size, the miRNA expression profiling efforts have led to inconsistent results between the studies. We performed a comprehensive meta‐analysis of 20 published miRNA expression studies in lung cancer, including a total of 598 tumor and 528 non‐cancerous control samples. Using a recently published robust rank aggregation method, we identified a statistically significant miRNA meta‐signature of seven upregulated (miR‐21, miR‐210, miR‐182, miR‐31, miR‐200b, miR‐205 and miR‐183) and eight downregulated (miR‐126‐3p, miR‐30a, miR‐30d, miR‐486‐5p, miR‐451a, miR‐126‐5p, miR‐143 and miR‐145) miRNAs. We conducted a gene set enrichment analysis to identify pathways that are most strongly affected by altered expression of these miRNAs. We found that meta‐signature miRNAs cooperatively target functionally related and biologically relevant genes in signaling and developmental pathways. We have shown that such meta‐analysis approach is suitable and effective solution for identification of statistically significant miRNA meta‐signature by combining several miRNA expression studies. This method allows the analysis of data produced by different technological platforms that cannot be otherwise directly compared or in the case when raw data are unavailable.
What's new?
The prognostic and diagnostic value of microRNA (miRNA) expression aberrations in lung cancer has been studied intensely in recent years. However, due to the application of different technological platforms and small sample size, the miRNA expression profiling efforts have led to inconsistent results. Using a meta‐analysis of more than 1100 lung cancer and non‐cancerous samples from 20 original studies, here the authors have identified a meta‐signature of seven up‐ and eight down‐regulated miRNAs. Their analysis highlights the challenges related with the development of miRNA‐based tests and emphasizes the need for rigorous evaluation of the results before proceeding to clinical trials.
Full text
Available for:
BFBNIB, FZAB, GIS, IJS, KILJ, NLZOH, NUK, OILJ, SAZU, SBCE, SBMB, UL, UM, UPUK
Pernicious anemia is a rare condition characterized by vitamin B12 deficiency anemia due to lack of intrinsic factor, often caused by autoimmune gastritis. Patients with pernicious anemia have a ...higher incidence of other autoimmune disorders, such as type 1 diabetes, vitiligo, and autoimmune thyroid issues. Therefore, the disease has a clear autoimmune basis, although the genetic susceptibility factors have thus far remained poorly studied. We conduct a genome-wide association study meta-analysis in 2166 cases and 659,516 European controls from population-based biobanks and identify genome-wide significant signals in or near the PTPN22 (rs6679677, p = 1.91 × 10
, OR = 1.63), PNPT1 (rs12616502, p = 3.14 × 10
, OR = 1.70), HLA-DQB1 (rs28414666, p = 1.40 × 10
, OR = 1.38), IL2RA (rs2476491, p = 1.90 × 10
, OR = 1.22) and AIRE (rs74203920, p = 2.33 × 10
, OR = 1.83) genes, thus providing robust associations between pernicious anemia and genetic risk factors.
Comparing transcript levels between healthy and diseased individuals allows the identification of differentially expressed genes, which may be causes, consequences or mere correlates of the disease ...under scrutiny. We propose a method to decompose the observational correlation between gene expression and phenotypes driven by confounders, forward- and reverse causal effects. The bi-directional causal effects between gene expression and complex traits are obtained by Mendelian Randomization integrating summary-level data from GWAS and whole-blood eQTLs. Applying this approach to complex traits reveals that forward effects have negligible contribution. For example, BMI- and triglycerides-gene expression correlation coefficients robustly correlate with trait-to-expression causal effects (r
= 0.11, P
= 2.0 × 10
and r
= 0.13, P
= 1.1 × 10
), but not detectably with expression-to-trait effects. Our results demonstrate that studies comparing the transcriptome of diseased and healthy subjects are more prone to reveal disease-induced gene expression changes rather than disease causing ones.
Accurate prediction of an individual's phenotype from their DNA sequence is one of the great promises of genomics and precision medicine. We extend a powerful individual-level data Bayesian multiple ...regression model (BayesR) to one that utilises summary statistics from genome-wide association studies (GWAS), SBayesR. In simulation and cross-validation using 12 real traits and 1.1 million variants on 350,000 individuals from the UK Biobank, SBayesR improves prediction accuracy relative to commonly used state-of-the-art summary statistics methods at a fraction of the computational resources. Furthermore, using summary statistics for variants from the largest GWAS meta-analysis (n ≈ 700, 000) on height and BMI, we show that on average across traits and two independent data sets that SBayesR improves prediction R
by 5.2% relative to LDpred and by 26.5% relative to clumping and p value thresholding.
Using effect estimates from genome-wide association studies (GWAS), we identified a genetic risk score (GRS) that has the strongest association with type 2 diabetes (T2D) status in a population-based ...cohort and investigated its potential for prospective T2D risk assessment.
By varying the number of single-nucleotide polymorphisms (SNPs) and their respective weights, alternative versions of GRS can be computed. They were tested in 1,181 T2D cases and 9,092 controls of the Estonian Biobank cohort. The best-fitting GRS was chosen for the subsequent analysis of incident T2D (386 cases).
The best fit was provided by a novel doubly weighted GRS that captures the effect of 1,000 SNPs. The hazard for incident T2D was 3.45 times (95% CI: 2.31–5.17) higher in the highest GRS quintile compared with the lowest quintile, after adjusting for body mass index and other known predictors. Adding GRS to the prediction model for 5-year T2D risk resulted in continuous net reclassification improvement of 0.324 (95% CI: 0.211–0.444). In addition, a significant effect of the GRS on all-cause and cardiovascular mortality was observed.
The proposed GRS would improve the accuracy of T2D risk prediction when added to the currently used set of predictors.
Genet Med19 3, 322–329.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
We develop a Bayesian mixed linear model that simultaneously estimates single-nucleotide polymorphism (SNP)-based heritability, polygenicity (proportion of SNPs with nonzero effects), and the ...relationship between SNP effect size and minor allele frequency for complex traits in conventionally unrelated individuals using genome-wide SNP data. We apply the method to 28 complex traits in the UK Biobank data (N = 126,752) and show that on average, 6% of SNPs have nonzero effects, which in total explain 22% of phenotypic variance. We detect significant (P < 0.05/28) signatures of natural selection in the genetic architecture of 23 traits, including reproductive, cardiovascular, and anthropometric traits, as well as educational attainment. The significant estimates of the relationship between effect size and minor allele frequency in complex traits are consistent with a model of negative (or purifying) selection, as confirmed by forward simulation. We conclude that negative selection acts pervasively on the genetic variants associated with human complex traits.
The Estonian Biobank cohort is a volunteer-based sample of the Estonian resident adult population (aged ≥18 years). The current number of participants-close to 52000--represents a large proportion, ...5%, of the Estonian adult population, making it ideally suited to population-based studies. General practitioners (GPs) and medical personnel in the special recruitment offices have recruited participants throughout the country. At baseline, the GPs performed a standardized health examination of the participants, who also donated blood samples for DNA, white blood cells and plasma tests and filled out a 16-module questionnaire on health-related topics such as lifestyle, diet and clinical diagnoses described in WHO ICD-10. A significant part of the cohort has whole genome sequencing (100), genome-wide single nucleotide polymorphism (SNP) array data (20 000) and/or NMR metabolome data (11 000) available (http://www.geenivaramu.ee/for-scientists/data-release/). The data are continuously updated through periodical linking to national electronic databases and registries. A part of the cohort has been re-contacted for follow-up purposes and resampling, and targeted invitations are possible for specific purposes, for example people with a specific diagnosis. The Estonian Genome Center of the University of Tartu is actively collaborating with many universities, research institutes and consortia and encourages fellow scientists worldwide to co-initiate new academic or industrial joint projects with us.
We analyzed the mRNA levels for 36,778 transcript expression traits (probes) from 2,765 individuals to comprehensively investigate the genetic architecture and degree of missing heritability for gene ...expression in peripheral blood. We identified 11,204 cis and 3,791 trans independent expression quantitative trait loci (eQTL) by using linear mixed models to perform genome-wide association analyses. Furthermore, using information on both closely and distantly related individuals, heritability was estimated for all expression traits. Of the set of expressed probes (15,966), 10,580 (66%) had an estimated narrow-sense heritability (h2) greater than zero with a mean (median) value of 0.192 (0.142). Across these probes, on average the proportion of genetic variance explained by all eQTL (hCOJO2) was 31% (0.060/0.192), meaning that 69% is missing, with the sentinel SNP of the largest eQTL explaining 87% (0.052/0.060) of the variance attributed to all identified cis- and trans-eQTL. For the same set of probes, the genetic variance attributed to genome-wide common (MAF > 0.01) HapMap 3 SNPs (hg2) accounted for on average 48% (0.093/0.192) of h2. Taken together, the evidence suggests that approximately half the genetic variance for gene expression is not tagged by common SNPs, and of the variance that is tagged by common SNPs, a large proportion can be attributed to identifiable eQTL of large effect, typically in cis. Finally, we present evidence that, compared with a meta-analysis, using individual-level data results in an increase of approximately 50% in power to detect eQTL.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
The role of metabolic syndrome (MetS) as a preceding metabolic state for type 2 diabetes and cardiovascular disease is widely recognised. To accumulate knowledge of the pathological mechanisms behind ...the condition at the methylation level, we conducted an epigenome-wide association study (EWAS) of MetS and its components, testing 1187 individuals of European ancestry for approximately 470 000 methylation sites throughout the genome. Methylation site cg19693031 in gene TXNIP -previously associated with type 2 diabetes, glucose and lipid metabolism, associated with fasting glucose level (P = 1.80 × 10
). Cg06500161 in gene ABCG1 associated both with serum triglycerides (P = 5.36 × 10
) and waist circumference (P = 5.21 × 10
). The previously identified type 2 diabetes-associated locus cg08309687 in chromosome 21 associated with waist circumference for the first time (P = 2.24 × 10
). Furthermore, a novel HDL association with cg17901584 in chromosome 1 was identified (P = 7.81 × 10
). Our study supports previous genetic studies of MetS, finding that lipid metabolism plays a key role in pathology of the syndrome. We provide evidence regarding a close interplay with glucose metabolism. Finally, we suggest that in attempts to identify methylation loci linking separate MetS components, cg19693031 appears to represent a strong candidate.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK