Inverse‐variance weighted two‐sample Mendelian randomization (IVW‐MR) is the most widely used approach that utilizes genome‐wide association studies (GWAS) summary statistics to infer the existence ...and the strength of the causal effect between an exposure and an outcome. Estimates from this approach can be subject to different biases due to the use of weak instruments and winner's curse, which can change as a function of the overlap between the exposure and outcome samples. We developed a method (MRlap) that simultaneously considers weak instrument bias and winner's curse while accounting for potential sample overlap. Assuming spike‐and‐slab genomic architecture and leveraging linkage disequilibrium score regression and other techniques, we could analytically derive, reliably estimate, and hence correct for the bias of IVW‐MR using association summary statistics only. We tested our approach using simulated data for a wide range of realistic settings. In all the explored scenarios, our correction reduced the bias, in some situations by as much as 30‐fold. In addition, our results are consistent with the fact that the strength of the biases will decrease as the sample size increases and we also showed that the overall bias is also dependent on the genetic architecture of the exposure, and traits with low heritability and/or high polygenicity are more strongly affected. Applying MRlap to obesity‐related exposures revealed statistically significant differences between IVW‐based and corrected effects, both for nonoverlapping and fully overlapping samples. Our method not only reduces bias in causal effect estimation but also enables the use of much larger GWAS sample sizes, by allowing for potentially overlapping samples.
Mapping perturbed molecular circuits that underlie complex diseases remains a great challenge. We developed a comprehensive resource of 394 cell type- and tissue-specific gene regulatory networks for ...human, each specifying the genome-wide connectivity among transcription factors, enhancers, promoters and genes. Integration with 37 genome-wide association studies (GWASs) showed that disease-associated genetic variants--including variants that do not reach genome-wide significance--often perturb regulatory modules that are highly specific to disease-relevant cell types or tissues. Our resource opens the door to systematic analysis of regulatory programs across hundreds of human cell types and tissues (http://regulatorycircuits.org).
Abstract
Summary
Increasing sample size is not the only strategy to improve discovery in Genome Wide Association Studies (GWASs) and we propose here an approach that leverages published studies of ...related traits to improve inference. Our Bayesian GWAS method derives informative prior effects by leveraging GWASs of related risk factors and their causal effect estimates on the focal trait using multivariable Mendelian randomization. These prior effects are combined with the observed effects to yield Bayes Factors, posterior and direct effects. The approach not only increases power, but also has the potential to dissect direct and indirect biological mechanisms.
Availability and implementation
bGWAS package is freely available under a GPL-2 License, and can be accessed, alongside with user guides and tutorials, from https://github.com/n-mounier/bGWAS.
Supplementary information
Supplementary data are available at Bioinformatics online.
The growing sample size of genome-wide association studies has facilitated the discovery of gene-environment interactions (GxE). Here we propose a maximum likelihood method to estimate the ...contribution of GxE to continuous traits taking into account all interacting environmental variables, without the need to measure any. Extensive simulations demonstrate that our method provides unbiased interaction estimates and excellent coverage. We also offer strategies to distinguish specific GxE from general scale effects. Applying our method to 32 traits in the UK Biobank reveals that while the genetic risk score (GRS) of 376 variants explains 5.2% of body mass index (BMI) variance, GRSxE explains an additional 1.9%. Nevertheless, this interaction holds for any variable with identical correlation to BMI as the GRS, hence may not be GRS-specific. Still, we observe that the global contribution of specific GRSxE to complex traits is substantial for nine obesity-related measures (including leg impedance and trunk fat-free mass).
As most of the heritability of complex traits is attributed to common and low frequency genetic variants, imputing them by combining genotyping chips and large sequenced reference panels is the most ...cost-effective approach to discover the genetic basis of these traits. Association summary statistics from genome-wide meta-analyses are available for hundreds of traits. Updating these to ever-increasing reference panels is very cumbersome as it requires reimputation of the genetic data, rerunning the association scan, and meta-analysing the results. A much more efficient method is to directly impute the summary statistics, termed as summary statistics imputation, which we improved to accommodate variable sample size across SNVs. Its performance relative to genotype imputation and practical utility has not yet been fully investigated. To this end, we compared the two approaches on real (genotyped and imputed) data from 120K samples from the UK Biobank and show that, genotype imputation boasts a 3- to 5-fold lower root-mean-square error, and better distinguishes true associations from null ones: We observed the largest differences in power for variants with low minor allele frequency and low imputation quality. For fixed false positive rates of 0.001, 0.01, 0.05, using summary statistics imputation yielded a decrease in statistical power by 9, 43 and 35%, respectively. To test its capacity to discover novel associations, we applied summary statistics imputation to the GIANT height meta-analysis summary statistics covering HapMap variants, and identified 34 novel loci, 19 of which replicated using data in the UK Biobank. Additionally, we successfully replicated 55 out of the 111 variants published in an exome chip study. Our study demonstrates that summary statistics imputation is a very efficient and cost-effective way to identify and fine-map trait-associated loci. Moreover, the ability to impute summary statistics is important for follow-up analyses, such as Mendelian randomisation or LD-score regression.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Comparing transcript levels between healthy and diseased individuals allows the identification of differentially expressed genes, which may be causes, consequences or mere correlates of the disease ...under scrutiny. We propose a method to decompose the observational correlation between gene expression and phenotypes driven by confounders, forward- and reverse causal effects. The bi-directional causal effects between gene expression and complex traits are obtained by Mendelian Randomization integrating summary-level data from GWAS and whole-blood eQTLs. Applying this approach to complex traits reveals that forward effects have negligible contribution. For example, BMI- and triglycerides-gene expression correlation coefficients robustly correlate with trait-to-expression causal effects (r
= 0.11, P
= 2.0 × 10
and r
= 0.13, P
= 1.1 × 10
), but not detectably with expression-to-trait effects. Our results demonstrate that studies comparing the transcriptome of diseased and healthy subjects are more prone to reveal disease-induced gene expression changes rather than disease causing ones.
Quantitative changes in leptin concentration lead to alterations in food intake and body weight, but the regulatory mechanisms that control leptin gene expression are poorly understood. Here we ...report that fat-specific and quantitative leptin expression is controlled by redundant cis elements and trans factors interacting with the proximal promoter together with a long noncoding RNA (lncOb). Diet-induced obese mice lacking lncOb show increased fat mass with reduced plasma leptin levels and lose weight after leptin treatment, whereas control mice do not. Consistent with this finding, large-scale genetic studies of humans reveal a significant association of single-nucleotide polymorphisms (SNPs) in the region of human lncOb with lower plasma leptin levels and obesity. These results show that reduced leptin gene expression can lead to a hypoleptinemic, leptin-responsive form of obesity and provide a framework for elucidating the pathogenic mechanism in the subset of obese patients with low endogenous leptin levels.
High-dimensional omics datasets provide valuable resources to determine the causal role of molecular traits in mediating the path from genotype to phenotype. Making use of molecular quantitative ...trait loci (QTL) and genome-wide association study (GWAS) summary statistics, we propose a multivariable Mendelian randomization (MVMR) framework to quantify the proportion of the impact of the DNA methylome (DNAm) on complex traits that is propagated through the assayed transcriptome. Evaluating 50 complex traits, we find that on average at least 28.3% (95% CI: 26.9%-29.8%) of DNAm-to-trait effects are mediated through (typically multiple) transcripts in the cis-region. Several regulatory mechanisms are hypothesized, including methylation of the promoter probe cg10385390 (chr1:8'022'505) increasing the risk for inflammatory bowel disease by reducing PARK7 expression. The proposed integrative framework can be extended to other omics layers to identify causal molecular chains, providing a powerful tool to map and interpret GWAS signals.
Telomeres form repeated DNA sequences at the ends of chromosomes, which shorten with each cell division. Yet, factors modulating telomere attrition and the health consequences thereof are not fully ...understood. To address this, we leveraged data from 326,363 unrelated UK Biobank participants of European ancestry.
Using linear regression and bidirectional univariable and multivariable Mendelian randomization (MR), we elucidate the relationships between leukocyte telomere length (LTL) and 142 complex traits, including diseases, biomarkers, and lifestyle factors. We confirm that telomeres shorten with age and show a stronger decline in males than in females, with these factors contributing to the majority of the 5.4% of LTL variance explained by the phenome. MR reveals 23 traits modulating LTL. Smoking cessation and high educational attainment associate with longer LTL, while weekly alcohol intake, body mass index, urate levels, and female reproductive events, such as childbirth, associate with shorter LTL. We also identify 24 traits affected by LTL, with risk for cardiovascular, pulmonary, and some autoimmune diseases being increased by short LTL, while longer LTL increased risk for other autoimmune conditions and cancers. Through multivariable MR, we show that LTL may partially mediate the impact of educational attainment, body mass index, and female age at childbirth on proxied lifespan.
Our study sheds light on the modulators, consequences, and the mediatory role of telomeres, portraying an intricate relationship between LTL, diseases, lifestyle, and socio-economic factors.