Abstract
Gene co-expression networks can be used to associate genes of unknown function with biological processes, to prioritize candidate disease genes or to discern transcriptional regulatory ...programmes. With recent advances in transcriptomics and next-generation sequencing, co-expression networks constructed from RNA sequencing data also enable the inference of functions and disease associations for non-coding genes and splice variants. Although gene co-expression networks typically do not provide information about causality, emerging methods for differential co-expression analysis are enabling the identification of regulatory genes underlying various phenotypes. Here, we introduce and guide researchers through a (differential) co-expression analysis. We provide an overview of methods and tools used to create and analyse co-expression networks constructed from gene expression data, and we explain how these can be used to identify genes with a regulatory role in disease. Furthermore, we discuss the integration of other data types with co-expression networks and offer future perspectives of co-expression analysis.
Genetic association studies, in particular the genome-wide association study (GWAS) design, have provided a wealth of novel insights into the aetiology of a wide range of human diseases and traits, ...in particular cardiovascular diseases and lipid biomarkers. The next challenge consists of understanding the molecular basis of these associations. The integration of multiple association datasets, including gene expression datasets, can contribute to this goal. We have developed a novel statistical methodology to assess whether two association signals are consistent with a shared causal variant. An application is the integration of disease scans with expression quantitative trait locus (eQTL) studies, but any pair of GWAS datasets can be integrated in this framework. We demonstrate the value of the approach by re-analysing a gene expression dataset in 966 liver samples with a published meta-analysis of lipid traits including >100,000 individuals of European ancestry. Combining all lipid biomarkers, our re-analysis supported 26 out of 38 reported colocalisation results with eQTLs and identified 14 new colocalisation results, hence highlighting the value of a formal statistical test. In three cases of reported eQTL-lipid pairs (SYPL2, IFT172, TBKBP1) for which our analysis suggests that the eQTL pattern is not consistent with the lipid association, we identify alternative colocalisation results with SORT1, GCKR, and KPNB1, indicating that these genes are more likely to be causal in these genomic intervals. A key feature of the method is the ability to derive the output statistics from single SNP summary statistics, hence making it possible to perform systematic meta-analysis type comparisons across multiple GWAS datasets (implemented online at http://coloc.cs.ucl.ac.uk/coloc/). Our methodology provides information about candidate causal genes in associated intervals and has direct implications for the understanding of complex diseases as well as the design of drugs to target disease pathways.
Gut microbiota has been implicated in major diseases affecting the human population and has also been linked to triglycerides and high-density lipoprotein levels in the circulation. Recent ...development in metabolomics allows classifying the lipoprotein particles into more details. Here, we examine the impact of gut microbiota on circulating metabolites measured by Nuclear Magnetic Resonance technology in 2309 individuals from the Rotterdam Study and the LifeLines-DEEP cohort. We assess the relationship between gut microbiota and metabolites by linear regression analysis while adjusting for age, sex, body-mass index, technical covariates, medication use, and multiple testing. We report an association of 32 microbial families and genera with very-low-density and high-density subfractions, serum lipid measures, glycolysis-related metabolites, ketone bodies, amino acids, and acute-phase reaction markers. These observations provide insights into the role of microbiota in host metabolism and support the potential of gut microbiota as a target for therapeutic and preventive interventions.
Microbiome-wide association studies on large population cohorts have highlighted associations between the gut microbiome and complex traits, including type 2 diabetes (T2D) and obesity
. However, the ...causal relationships remain largely unresolved. We leveraged information from 952 normoglycemic individuals for whom genome-wide genotyping, gut metagenomic sequence and fecal short-chain fatty acid (SCFA) levels were available
, then combined this information with genome-wide-association summary statistics for 17 metabolic and anthropometric traits. Using bidirectional Mendelian randomization (MR) analyses to assess causality
, we found that the host-genetic-driven increase in gut production of the SCFA butyrate was associated with improved insulin response after an oral glucose-tolerance test (P = 9.8 × 10
), whereas abnormalities in the production or absorption of another SCFA, propionate, were causally related to an increased risk of T2D (P = 0.004). These data provide evidence of a causal effect of the gut microbiome on metabolic traits and support the use of MR as a means to elucidate causal relationships from microbiome-wide association findings.
Deep sequencing of the gut microbiomes of 1135 participants from a Dutch population-based cohort shows relations between the microbiome and 126 exogenous and intrinsic host factors, including 31 ...intrinsic factors, 12 diseases, 19 drug groups, 4 smoking categories, and 60 dietary factors. These factors collectively explain 18.7% of the variation seen in the interindividual distance of microbial composition. We could associate 110 factors to 125 species and observed that fecal chromogranin A (CgA), a protein secreted by enteroendocrine cells, was exclusively associated with 61 microbial species whose abundance collectively accounted for 53% of microbial composition. Low CgA concentrations were seen in individuals with a more diverse microbiome. These results are an important step toward a better understanding of environment-diet-microbe-host interactions.
Patients with IBD display substantial heterogeneity in clinical characteristics. We hypothesise that individual differences in the complex interaction of the host genome and the gut microbiota can ...explain the onset and the heterogeneous presentation of IBD. Therefore, we performed a case-control analysis of the gut microbiota, the host genome and the clinical phenotypes of IBD.
Stool samples, peripheral blood and extensive phenotype data were collected from 313 patients with IBD and 582 truly healthy controls, selected from a population cohort. The gut microbiota composition was assessed by tag-sequencing the 16S rRNA gene. All participants were genotyped. We composed genetic risk scores from 11 functional genetic variants proven to be associated with IBD in genes that are directly involved in the bacterial handling in the gut:
,
,
,
and
.
Strikingly, we observed significant alterations of the gut microbiota of healthy individuals with a high genetic risk for IBD: the IBD genetic risk score was significantly associated with a decrease in the genus
in healthy controls (false discovery rate 0.017). Moreover, disease location was a major determinant of the gut microbiota: the gut microbiota of patients with colonic Crohn's disease (CD) is different from that of patients with ileal CD, with a decrease in alpha diversity associated to ileal disease (p=3.28×10
).
We show for the first time that genetic risk variants associated with IBD influence the gut microbiota in healthy individuals.
are acetate-to-butyrate converters, and a decrease has already been observed in patients with IBD.
Evidence suggests that the gut microbiome is involved in the development of cardiovascular disease, with the host-microbe interaction regulating immune and metabolic pathways. However, there was no ...firm evidence for associations between microbiota and metabolic risk factors for cardiovascular disease from large-scale studies in humans. In particular, there was no strong evidence for association between cardiovascular disease and aberrant blood lipid levels.
To identify intestinal bacteria taxa, whose proportions correlate with body mass index and lipid levels, and to determine whether lipid variance can be explained by microbiota relative to age, sex, and host genetics.
We studied 893 subjects from the Life-Lines-DEEP population cohort. After correcting for age and sex, we identified 34 bacterial taxa associated with body mass index and blood lipids; most are novel associations. Cross-validation analysis revealed that microbiota explain 4.5% of the variance in body mass index, 6% in triglycerides, and 4% in high-density lipoproteins, independent of age, sex, and genetic risk factors. A novel risk model, including the gut microbiome explained ≤ 25.9% of high-density lipoprotein variance, significantly outperforming the risk model without microbiome. Strikingly, the microbiome had little effect on low-density lipoproteins or total cholesterol.
Our studies suggest that the gut microbiome may play an important role in the variation in body mass index and blood lipid levels, independent of age, sex, and host genetics. Our findings support the potential of therapies altering the gut microbiome to control body mass, triglycerides, and high-density lipoproteins.
The gut microbiome is affected by multiple factors, including genetics. In this study, we assessed the influence of host genetics on microbial species, pathways and gene ontology categories, on the ...basis of metagenomic sequencing in 1,514 subjects. In a genome-wide analysis, we identified associations of 9 loci with microbial taxonomies and 33 loci with microbial pathways and gene ontology terms at P < 5 × 10
. Additionally, in a targeted analysis of regions involved in complex diseases, innate and adaptive immunity, or food preferences, 32 loci were identified at the suggestive level of P < 5 × 10
. Most of our reported associations are new, including genome-wide significance for the C-type lectin molecules CLEC4F-CD207 at 2p13.3 and CLEC4A-FAM90A1 at 12p13. We also identified association of a functional LCT SNP with the Bifidobacterium genus (P = 3.45 × 10
) and provide evidence of a gene-diet interaction in the regulation of Bifidobacterium abundance. Our results demonstrate the importance of understanding host-microbe interactions to gain better insight into human health.
The main challenge for gaining biological insights from genetic associations is identifying which genes and pathways explain the associations. Here we present DEPICT, an integrative tool that employs ...predicted gene functions to systematically prioritize the most likely causal genes at associated loci, highlight enriched pathways and identify tissues/cell types where genes from associated loci are highly expressed. DEPICT is not limited to genes with established functions and prioritizes relevant gene sets for many phenotypes.