Abstract
Gene co-expression networks can be used to associate genes of unknown function with biological processes, to prioritize candidate disease genes or to discern transcriptional regulatory ...programmes. With recent advances in transcriptomics and next-generation sequencing, co-expression networks constructed from RNA sequencing data also enable the inference of functions and disease associations for non-coding genes and splice variants. Although gene co-expression networks typically do not provide information about causality, emerging methods for differential co-expression analysis are enabling the identification of regulatory genes underlying various phenotypes. Here, we introduce and guide researchers through a (differential) co-expression analysis. We provide an overview of methods and tools used to create and analyse co-expression networks constructed from gene expression data, and we explain how these can be used to identify genes with a regulatory role in disease. Furthermore, we discuss the integration of other data types with co-expression networks and offer future perspectives of co-expression analysis.
Circulating proteins are vital in human health and disease and are frequently used as biomarkers for clinical decision-making or as targets for pharmacological intervention. Here, we map and ...replicate protein quantitative trait loci (pQTL) for 90 cardiovascular proteins in over 30,000 individuals, resulting in 451 pQTLs for 85 proteins. For each protein, we further perform pathway mapping to obtain trans-pQTL gene and regulatory designations. We substantiate these regulatory findings with orthogonal evidence for trans-pQTLs using mouse knockdown experiments (ABCA1 and TRIB1) and clinical trial results (chemokine receptors CCR2 and CCR5), with consistent regulation. Finally, we evaluate known drug targets, and suggest new target candidates or repositioning opportunities using Mendelian randomization. This identifies 11 proteins with causal evidence of involvement in human disease that have not previously been targeted, including EGF, IL-16, PAPPA, SPON1, F3, ADM, CASP-8, CHI3L1, CXCL16, GDF15 and MMP-12. Taken together, these findings demonstrate the utility of large-scale mapping of the genetics of the proteome and provide a resource for future precision studies of circulating proteins in human health.
Trait-associated genetic variants affect complex phenotypes primarily via regulatory mechanisms on the transcriptome. To investigate the genetics of gene expression, we performed cis- and ...trans-expression quantitative trait locus (eQTL) analyses using blood-derived expression from 31,684 individuals through the eQTLGen Consortium. We detected cis-eQTL for 88% of genes, and these were replicable in numerous tissues. Distal trans-eQTL (detected for 37% of 10,317 trait-associated variants tested) showed lower replication rates, partially due to low replication power and confounding by cell type composition. However, replication analyses in single-cell RNA-seq data prioritized intracellular trans-eQTL. Trans-eQTL exerted their effects via several mechanisms, primarily through regulation by transcription factors. Expression of 13% of the genes correlated with polygenic scores for 1,263 phenotypes, pinpointing potential drivers for those traits. In summary, this work represents a large eQTL resource, and its results serve as a starting point for in-depth interpretation of complex phenotypes.
Understanding the difference in genetic regulation of gene expression between brain and blood is important for discovering genes for brain-related traits and disorders. Here, we estimate the ...correlation of genetic effects at the top-associated cis-expression or -DNA methylation (DNAm) quantitative trait loci (cis-eQTLs or cis-mQTLs) between brain and blood (r
). Using publicly available data, we find that genetic effects at the top cis-eQTLs or mQTLs are highly correlated between independent brain and blood samples (Formula: see text for cis-eQTLs and Formula: see text for cis-mQTLs). Using meta-analyzed brain cis-eQTL/mQTL data (n = 526 to 1194), we identify 61 genes and 167 DNAm sites associated with four brain-related phenotypes, most of which are a subset of the discoveries (97 genes and 295 DNAm sites) using data from blood with larger sample sizes (n = 1980 to 14,115). Our results demonstrate the gain of power in gene discovery for brain-related phenotypes using blood cis-eQTL/mQTL data with large sample sizes.
Circulating proteins have important functions in inflammation and a broad range of diseases. To identify genetic influences on inflammation-related proteins, we conducted a genome-wide protein ...quantitative trait locus (pQTL) study of 91 plasma proteins measured using the Olink Target platform in 14,824 participants. We identified 180 pQTLs (59 cis, 121 trans). Integration of pQTL data with eQTL and disease genome-wide association studies provided insight into pathogenesis, implicating lymphotoxin-α in multiple sclerosis. Using Mendelian randomization (MR) to assess causality in disease etiology, we identified both shared and distinct effects of specific proteins across immune-mediated diseases, including directionally discordant effects of CD40 on risk of rheumatoid arthritis versus multiple sclerosis and inflammatory bowel disease. MR implicated CXCL5 in the etiology of ulcerative colitis (UC) and we show elevated gut CXCL5 transcript expression in patients with UC. These results identify targets of existing drugs and provide a powerful resource to facilitate future drug target prioritization.
Allele-specific gene expression associated with genetic variation in regulatory regions can play an important role in the development of complex traits. We hypothesized that polymorphisms in microRNA ...(miRNA) response elements (MRE-SNPs) that either disrupt a miRNA binding site or create a new miRNA binding site can affect the allele-specific expression of target genes. By integrating public expression quantitative trait locus (eQTL) data, miRNA binding site predictions, small RNA sequencing, and Argonaute crosslinking immunoprecipitation (AGO-CLIP) datasets, we identified genetic variants that can affect gene expression by modulating miRNA binding efficiency. We also identified MRE-SNPs located in regions associated with complex traits, indicating possible causative mechanisms associated with these loci. The results of this study expand the current understanding of gene expression regulation and help to interpret the mechanisms underlying eQTL effects.
Most existing TWAS tools require individual-level eQTL reference data and thus are not applicable to summary-level reference eQTL datasets. The development of TWAS methods that can harness ...summary-level reference data is valuable to enable TWAS in broader settings and enhance power due to increased reference sample size. Thus, we develop a TWAS framework called OTTERS (Omnibus Transcriptome Test using Expression Reference Summary data) that adapts multiple polygenic risk score (PRS) methods to estimate eQTL weights from summary-level eQTL reference data and conducts an omnibus TWAS. We show that OTTERS is a practical and powerful TWAS tool by both simulations and application studies.
Microbiome-wide association studies on large population cohorts have highlighted associations between the gut microbiome and complex traits, including type 2 diabetes (T2D) and obesity
. However, the ...causal relationships remain largely unresolved. We leveraged information from 952 normoglycemic individuals for whom genome-wide genotyping, gut metagenomic sequence and fecal short-chain fatty acid (SCFA) levels were available
, then combined this information with genome-wide-association summary statistics for 17 metabolic and anthropometric traits. Using bidirectional Mendelian randomization (MR) analyses to assess causality
, we found that the host-genetic-driven increase in gut production of the SCFA butyrate was associated with improved insulin response after an oral glucose-tolerance test (P = 9.8 × 10
), whereas abnormalities in the production or absorption of another SCFA, propionate, were causally related to an increased risk of T2D (P = 0.004). These data provide evidence of a causal effect of the gut microbiome on metabolic traits and support the use of MR as a means to elucidate causal relationships from microbiome-wide association findings.
The prognostic and diagnostic value of microRNA (miRNA) expression aberrations in lung cancer has been studied intensely in recent years. However, due to the application of different technological ...platforms and small sample size, the miRNA expression profiling efforts have led to inconsistent results between the studies. We performed a comprehensive meta‐analysis of 20 published miRNA expression studies in lung cancer, including a total of 598 tumor and 528 non‐cancerous control samples. Using a recently published robust rank aggregation method, we identified a statistically significant miRNA meta‐signature of seven upregulated (miR‐21, miR‐210, miR‐182, miR‐31, miR‐200b, miR‐205 and miR‐183) and eight downregulated (miR‐126‐3p, miR‐30a, miR‐30d, miR‐486‐5p, miR‐451a, miR‐126‐5p, miR‐143 and miR‐145) miRNAs. We conducted a gene set enrichment analysis to identify pathways that are most strongly affected by altered expression of these miRNAs. We found that meta‐signature miRNAs cooperatively target functionally related and biologically relevant genes in signaling and developmental pathways. We have shown that such meta‐analysis approach is suitable and effective solution for identification of statistically significant miRNA meta‐signature by combining several miRNA expression studies. This method allows the analysis of data produced by different technological platforms that cannot be otherwise directly compared or in the case when raw data are unavailable.
What's new?
The prognostic and diagnostic value of microRNA (miRNA) expression aberrations in lung cancer has been studied intensely in recent years. However, due to the application of different technological platforms and small sample size, the miRNA expression profiling efforts have led to inconsistent results. Using a meta‐analysis of more than 1100 lung cancer and non‐cancerous samples from 20 original studies, here the authors have identified a meta‐signature of seven up‐ and eight down‐regulated miRNAs. Their analysis highlights the challenges related with the development of miRNA‐based tests and emphasizes the need for rigorous evaluation of the results before proceeding to clinical trials.