Deep sequencing of the gut microbiomes of 1135 participants from a Dutch population-based cohort shows relations between the microbiome and 126 exogenous and intrinsic host factors, including 31 ...intrinsic factors, 12 diseases, 19 drug groups, 4 smoking categories, and 60 dietary factors. These factors collectively explain 18.7% of the variation seen in the interindividual distance of microbial composition. We could associate 110 factors to 125 species and observed that fecal chromogranin A (CgA), a protein secreted by enteroendocrine cells, was exclusively associated with 61 microbial species whose abundance collectively accounted for 53% of microbial composition. Low CgA concentrations were seen in individuals with a more diverse microbiome. These results are an important step toward a better understanding of environment-diet-microbe-host interactions.
The gut microbiome is affected by multiple factors, including genetics. In this study, we assessed the influence of host genetics on microbial species, pathways and gene ontology categories, on the ...basis of metagenomic sequencing in 1,514 subjects. In a genome-wide analysis, we identified associations of 9 loci with microbial taxonomies and 33 loci with microbial pathways and gene ontology terms at P < 5 × 10
. Additionally, in a targeted analysis of regions involved in complex diseases, innate and adaptive immunity, or food preferences, 32 loci were identified at the suggestive level of P < 5 × 10
. Most of our reported associations are new, including genome-wide significance for the C-type lectin molecules CLEC4F-CD207 at 2p13.3 and CLEC4A-FAM90A1 at 12p13. We also identified association of a functional LCT SNP with the Bifidobacterium genus (P = 3.45 × 10
) and provide evidence of a gene-diet interaction in the regulation of Bifidobacterium abundance. Our results demonstrate the importance of understanding host-microbe interactions to gain better insight into human health.
The LifeLines Cohort Study is a large population-based cohort study and biobank that was established as a resource for research on complex interactions between environmental, phenotypic and genomic ...factors in the development of chronic diseases and healthy ageing. Between 2006 and 2013, inhabitants of the northern part of The Netherlands and their families were invited to participate, thereby contributing to a three-generation design. Participants visited one of the LifeLines research sites for a physical examination, including lung function, ECG and cognition tests, and completed extensive questionnaires. Baseline data were collected for 167 729 participants, aged from 6 months to 93 years. Follow-up visits are scheduled every 5 years, and in between participants receive follow-up questionnaires. Linkage is being established with medical registries and environmental data. LifeLines contains information on biochemistry, medical history, psychosocial characteristics, lifestyle and more. Genomic data are available including genome-wide genetic data of 15 638 participants. Fasting blood and 24-h urine samples are processed on the day of collection and stored at -80 °C in a fully automated storage facility. The aim of LifeLines is to be a resource for the national and international scientific community. Requests for data and biomaterials can be submitted to the LifeLines Research Office LLscience@umcg.nl.
Trait-associated genetic variants affect complex phenotypes primarily via regulatory mechanisms on the transcriptome. To investigate the genetics of gene expression, we performed cis- and ...trans-expression quantitative trait locus (eQTL) analyses using blood-derived expression from 31,684 individuals through the eQTLGen Consortium. We detected cis-eQTL for 88% of genes, and these were replicable in numerous tissues. Distal trans-eQTL (detected for 37% of 10,317 trait-associated variants tested) showed lower replication rates, partially due to low replication power and confounding by cell type composition. However, replication analyses in single-cell RNA-seq data prioritized intracellular trans-eQTL. Trans-eQTL exerted their effects via several mechanisms, primarily through regulation by transcription factors. Expression of 13% of the genes correlated with polygenic scores for 1,263 phenotypes, pinpointing potential drivers for those traits. In summary, this work represents a large eQTL resource, and its results serve as a starting point for in-depth interpretation of complex phenotypes.
The recent proliferation of high-resolution mass spectrometers has generated a wealth of new data analysis methods. However, flexible integration of these methods into configurations best suited to ...the research question is hampered by heterogeneous file formats and monolithic software development. The mzXML, mzData, and mzML file formats have enabled uniform access to unprocessed raw data. In this paper we present our efforts to produce an equally simple and powerful format, PeakML, to uniformly exchange processed intermediary and result data. To demonstrate the versatility of PeakML, we have developed an open source Java toolkit for processing, filtering, and annotating mass spectra in a customizable pipeline (mzMatch), as well as a user-friendly data visualization environment (PeakML Viewer). The PeakML format in particular enables the flexible exchange of processed data between software created by different groups or companies, as we illustrate by providing a PeakML-based integration of the widely used XCMS package with mzMatch data processing tools. As an added advantage, downstream analysis can benefit from direct access to the full mass trace information underlying summarized mass spectrometry results, providing the user with the means to rapidly verify results. The PeakML/mzMatch software is freely available at http://mzmatch.sourceforge.net, with documentation, tutorials, and a community forum.
Differences in susceptibility to immune-mediated diseases are determined by variability in immune responses. In three studies within the Human Functional Genomics Project, we assessed the effect of ...environmental and non-genetic host factors of the genetic make-up of the host and of the intestinal microbiome on the cytokine responses in humans. We analyzed the association of these factors with circulating mediators and with six cytokines after stimulation with 19 bacterial, fungal, viral, and non-microbial metabolic stimuli in 534 healthy subjects. In this first study, we show a strong impact of non-genetic host factors (e.g., age and gender) on cytokine production and circulating mediators. Additionally, annual seasonality is found to be an important environmental factor influencing cytokine production. Alpha-1-antitrypsin concentrations partially mediate the seasonality of cytokine responses, whereas the effect of vitamin D levels is limited. The complete dataset has been made publicly available as a comprehensive resource for future studies.
Display omitted
Display omitted
•HFGP cohort: host/environment, genetics, and microbiome affect cytokine production•IFN-γ and IL-22, but not IL-17 and Mo-derived cytokine responses, decrease with age•Gender affects cytokine responses, resulting in disease susceptibility differences•Cytokine responses are season dependent, influenced by AAT concentrations
As part of the Human Functional Genomics Project, mapping of environmental and non-genetic host factors reveals critical associations between age, gender, and annual seasonality in inter-individual variability of immune cell function.
Genome-wide association studies have identified thousands of genetic variants that are associated with disease
. Most of these variants have small effect sizes, but their downstream expression ...effects, so-called expression quantitative trait loci (eQTLs), are often large
and celltype-specific
. To identify these celltype-specific eQTLs using an unbiased approach, we used single-cell RNA sequencing to generate expression profiles of ~25,000 peripheral blood mononuclear cells from 45 donors. We identified previously reported cis-eQTLs, but also identified new celltype-specific cis-eQTLs. Finally, we generated personalized co-expression networks and identified genetic variants that significantly alter co-expression relationships (which we termed 'co-expression QTLs'). Single-cell eQTL analysis thus allows for the identification of genetic variants that impact regulatory networks.
As part of the Human Functional Genomics Project, which aims to understand the factors that determine the variability of immune responses, we investigated genetic variants affecting cytokine ...production in response to ex vivo stimulation in two independent cohorts of 500 and 200 healthy individuals. We demonstrate a strong impact of genetic heritability on cytokine production capacity after challenge with bacterial, fungal, viral, and non-microbial stimuli. In addition to 17 novel genome-wide significant cytokine QTLs (cQTLs), our study provides a comprehensive picture of the genetic variants that influence six different cytokines in whole blood, blood mononuclear cells, and macrophages. Important biological pathways that contain cytokine QTLs map to pattern recognition receptors (TLR1-6-10 cluster), cytokine and complement inhibitors, and the kallikrein system. The cytokine QTLs show enrichment for monocyte-specific enhancers, are more often located in regions under positive selection, and are significantly enriched among SNPs associated with infections and immune-mediated diseases.
Display omitted
Display omitted
•Host genetics play a major role in inter-individual variability in cytokine responses•Identification of 17 novel genome-wide significant cytokine QTLs (cQTLs)•cQTLs generally overlap with genomic regions under positive selection•Many cQTLs overlap with loci associated with infectious and immune-mediated diseases
As part of the Human Functional Genomics Project, examination of millions of human genetic variants demonstrates that host genetics plays a significant role in inter-individual variability of cytokine production in response to different types of microbial stimuli.