Genome-wide association studies (GWAS) have identified thousands of genetic variants associated with human complex traits. However, the genes or functional DNA elements through which these variants exert their effects on the traits are often unknown. We propose a method (called SMR) that integrates summary-level data from GWAS with data from expression quantitative trait locus (eQTL) studies to identify genes whose expression levels are associated with a complex trait because of pleiotropy. We apply the method to five human complex traits using GWAS data on up to 339,224 individuals and eQTL data on 5,311 individuals, and we prioritize 126 genes (for example, TRAF1 and ANKRD55 for rheumatoid arthritis and SNX19 and NMRAL1 for schizophrenia), of which 25 genes are new candidates; 77 genes are not the nearest annotated gene to the top associated GWAS SNP. These genes provide important leads to design future functional studies to understand the mechanism whereby DNA variation leads to complex trait variation.
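The abstract above does not state the SMR test statistic. As a minimal sketch, assuming the form in which the SMR test is commonly described (the effect of expression on the trait estimated as the ratio of the GWAS SNP effect to the eQTL SNP effect, with an approximate 1-degree-of-freedom chi-square statistic built from the two z-scores), the calculation for a single top cis-eQTL SNP could look like this. The function name, argument names, and input numbers are hypothetical and chosen for illustration only.

```python
import math


def smr_test(b_zx, se_zx, b_zy, se_zy):
    """Sketch of an SMR-style test for one SNP (assumed form, not taken from the abstract).

    b_zx, se_zx: SNP effect on gene expression and its standard error (eQTL study).
    b_zy, se_zy: SNP effect on the trait and its standard error (GWAS).
    """
    z_zx = b_zx / se_zx  # eQTL z-score
    z_zy = b_zy / se_zy  # GWAS z-score

    # Estimated effect of expression on the trait (ratio of the two SNP effects).
    b_xy = b_zy / b_zx

    # Approximate chi-square statistic with 1 degree of freedom.
    t_smr = (z_zy ** 2 * z_zx ** 2) / (z_zy ** 2 + z_zx ** 2)

    # Upper-tail probability of a 1-df chi-square: P(X > t) = erfc(sqrt(t / 2)).
    p_value = math.erfc(math.sqrt(t_smr / 2.0))
    return b_xy, t_smr, p_value


# Hypothetical summary statistics, for illustration only.
b_xy, t_smr, p_value = smr_test(b_zx=0.5, se_zx=0.05, b_zy=0.1, se_zy=0.02)
print(f"b_xy={b_xy:.3f}, T_SMR={t_smr:.2f}, p={p_value:.2e}")
```

A significant result from this statistic alone would not separate pleiotropy from linkage between distinct causal variants; the sketch covers only the association step.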
We develop a Bayesian mixed linear model that simultaneously estimates single-nucleotide polymorphism (SNP)-based heritability, polygenicity (proportion of SNPs with nonzero effects), and the relationship between SNP effect size and minor allele frequency for complex traits in conventionally unrelated individuals using genome-wide SNP data. We apply the method to 28 complex traits in the UK Biobank data (N = 126,752) and show that on average, 6% of SNPs have nonzero effects, which in total explain 22% of phenotypic variance. We detect significant (P < 0.05/28) signatures of natural selection in the genetic architecture of 23 traits, including reproductive, cardiovascular, and anthropometric traits, as well as educational attainment. The significant estimates of the relationship between effect size and minor allele frequency in complex traits are consistent with a model of negative (or purifying) selection, as confirmed by forward simulation. We conclude that negative selection acts pervasively on the genetic variants associated with human complex traits.
The identification of genes and regulatory elements underlying the associations discovered by GWAS is essential to understanding the aetiology of complex traits (including diseases). Here, we demonstrate an analytical paradigm of prioritizing genes and regulatory elements at GWAS loci for follow-up functional studies. We perform an integrative analysis that uses summary-level SNP data from multi-omics studies to detect DNA methylation (DNAm) sites associated with gene expression and phenotype through shared genetic effects (i.e., pleiotropy). We identify pleiotropic associations between 7858 DNAm sites and 2733 genes. These DNAm sites are enriched in enhancers and promoters, and >40% of them are mapped to distal genes. Further pleiotropic association analyses, which link both the methylome and transcriptome to 12 complex traits, identify 149 DNAm sites and 66 genes, indicating a plausible mechanism whereby the effect of a genetic variant on phenotype is mediated by genetic regulation of transcription through DNAm.
SNPs discovered by genome-wide association studies (GWASs) account for only a small fraction of the genetic variation of complex traits in human populations. Where is the remaining heritability? We estimated the proportion of variance for human height explained by 294,831 SNPs genotyped on 3,925 unrelated individuals using a linear model analysis, and validated the estimation method with simulations based on the observed genotype data. We show that 45% of variance can be explained by considering all SNPs simultaneously. Thus, most of the heritability is not missing but has not previously been detected because the individual effects are too small to pass stringent significance tests. We provide evidence that the remaining heritability is due to incomplete linkage disequilibrium between causal variants and genotyped SNPs, exacerbated by causal variants having lower minor allele frequency than the SNPs explored to date.
We present an approximate conditional and joint association analysis that can use summary-level statistics from a meta-analysis of genome-wide association studies (GWAS) and estimated linkage disequilibrium (LD) from a reference sample with individual-level genotype data. Using this method, we analyzed meta-analysis summary data from the GIANT Consortium for height and body mass index (BMI), with the LD structure estimated from genotype data in two independent cohorts. We identified 36 loci with multiple associated variants for height (38 leading and 49 additional SNPs, 87 in total) via a genome-wide SNP selection procedure. The 49 new SNPs explain approximately 1.3% of variance, nearly doubling the heritability explained at the 36 loci. We did not find any locus showing multiple associated SNPs for BMI. The method we present is computationally fast and is also applicable to case-control data, which we demonstrate in an example from meta-analysis of type 2 diabetes by the DIAGRAM Consortium.
Abstract
BACKGROUND
Endometriosis remains a poorly understood disease, despite its high prevalence and debilitating symptoms. The overlap in symptoms and the increased risk of multiple other traits in women with endometriosis are becoming increasingly apparent through epidemiological data. Genetic studies offer a method of investigating these comorbid relationships through the assessment of causal relationships with Mendelian randomization (MR), as well as identification of shared genetic variants and genes involved across traits. This has the capacity to identify risk factors for endometriosis as well as provide insight into the aetiology of disease.
OBJECTIVE AND RATIONALE
We aim to review the current literature assessing the relationship between endometriosis and other traits using genomic data, primarily through the methods of MR and genetic correlation. We critically examine the limitations of these studies in accordance with the assumptions of the utilized methods.
SEARCH METHODS
The PubMed database was used to search for peer-reviewed original research articles using the terms ‘Mendelian randomization endometriosis’ and ‘“genetic correlation” endometriosis’. Additionally, a Google Scholar search using the terms ‘“endometriosis” “mendelian randomization” “genetic correlation”’ was performed. All relevant publications (n = 21) published up until 7 October 2022 were included in this review. Upon compilation of all traits with published MR and/or genetic correlation with endometriosis, additional epidemiological and genetic information on their comorbidity with endometriosis was sourced by searching for the trait in conjunction with ‘endometriosis’ on Google Scholar.
OUTCOMES
The association between endometriosis and multiple pain, gynaecological, cancer, inflammatory, gastrointestinal, psychological, and anthropometric traits has been assessed using MR analysis and genetic correlation analysis. Genetic correlation analyses provide evidence that genetic factors contributing to endometriosis are shared with multiple traits: migraine, uterine fibroids, subtypes of ovarian cancer, melanoma, asthma, gastro-oesophageal reflux disease, gastritis/duodenitis, and depression, suggesting the involvement of multiple biological mechanisms in endometriosis. The assessment of causality with MR has revealed several potential causes (e.g. depression) and outcomes (e.g. ovarian cancer and uterine fibroids) of a genetic predisposition to endometriosis; however, interpretation of these results requires consideration of potential violations of the MR assumptions.
WIDER IMPLICATIONS
Genomic studies have demonstrated that there is a molecular basis for the co-occurrence of endometriosis with other traits. Dissection of this overlap has identified shared genes and pathways, which provide insight into the biology of endometriosis. Thoughtful MR studies are necessary to ascertain causality of the comorbidities of endometriosis. Given the significant diagnostic delay of endometriosis of 7–11 years, determining risk factors is necessary to aid diagnosis and reduce the disease burden. Identification of traits for which endometriosis is a risk factor is important for holistic treatment and counselling of the patient. The use of genomic data to disentangle the overlap of endometriosis with other traits has provided insights into the aetiology of endometriosis.
GRAPHICAL ABSTRACT
Endometriosis is associated with psychiatric (blue), gastrointestinal (green), cancer (yellow), gynaecological (purple), immune (pink), and pain (red) comorbidities through a causal mechanism and/or shared genetic background.
Endometriosis is a heritable common gynaecological condition influenced by multiple genetic and environmental factors. Genome-wide association studies (GWASs) have proved successful in identifying common genetic variants of moderate effects for various complex diseases. To date, eight GWAS and replication studies from multiple populations have been published on endometriosis. In this review, we investigate the consistency and heterogeneity of the results across all the studies and their implications for an improved understanding of the aetiology of the condition.
Meta-analyses were conducted on four GWASs and four replication studies including a total of 11 506 cases and 32 678 controls, and on the subset of studies that investigated associations for revised American Fertility Society (rAFS) Stage III/IV including 2859 cases. The datasets included 9039 cases and 27 343 controls of European (Australia, Belgium, Italy, UK, USA) and 2467 cases and 5335 controls of Japanese ancestry. Fixed-effects and Han and Eskin random-effects models, and heterogeneity statistics (Cochran's Q test), were used to investigate the evidence of the nine reported genome-wide significant loci across datasets and populations.
Meta-analysis showed that seven out of nine loci had consistent directions of effect across studies and populations, and six out of nine remained genome-wide significant (P < 5 × 10⁻⁸), including rs12700667 on 7p15.2 (P = 1.6 × 10⁻⁹), rs7521902 near WNT4 (P = 1.8 × 10⁻¹⁵), rs10859871 near VEZT (P = 4.7 × 10⁻¹⁵), rs1537377 near CDKN2B-AS1 (P = 1.5 × 10⁻⁸), rs7739264 near ID4 (P = 6.2 × 10⁻¹⁰) and rs13394619 in GREB1 (P = 4.5 × 10⁻⁸). In addition to the six loci, two showed borderline genome-wide significant associations with Stage III/IV endometriosis, including rs1250248 in FN1 (P = 8 × 10⁻⁸) and rs4141819 on 2p14 (P = 9.2 × 10⁻⁸). Two independent inter-genic loci, rs4141819 and rs6734792 on chromosome 2, showed significant evidence of heterogeneity across datasets (P < 0.005). Eight of the nine loci had stronger effect sizes among Stage III/IV cases, implying that they are likely to be implicated in the development of moderate to severe, or ovarian, disease. While three out of nine loci were inter-genic, the remaining were in or near genes with known functions of biological relevance to endometriosis, varying from roles in developmental pathways to cellular growth/carcinogenesis.
Our meta-analysis shows remarkable consistency in endometriosis GWAS results across studies, with little evidence of population-based heterogeneity. The results also show that the phenotypic classifications used in GWAS to date have been limited. Stronger associations with Stage III/IV disease observed for most loci emphasize the importance for future studies to include detailed sub-phenotype information. Functional studies in relevant tissues are needed to understand the effect of the variants on downstream biological pathways.
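The fixed-effects model and Cochran's Q heterogeneity statistic named in the methods of this abstract can be illustrated with a minimal sketch, assuming the standard inverse-variance-weighted formulation. The function name and the per-study effect sizes below are hypothetical and are not taken from the study; the random-effects model is not covered.

```python
import math


def fixed_effect_meta(betas, ses):
    """Inverse-variance-weighted fixed-effects meta-analysis with Cochran's Q.

    betas: per-study effect estimates (e.g. log odds ratios) for one SNP.
    ses:   per-study standard errors, in the same order.
    """
    weights = [1.0 / se ** 2 for se in ses]  # inverse-variance weights
    w_sum = sum(weights)

    # Pooled effect and its standard error.
    beta = sum(w * b for w, b in zip(weights, betas)) / w_sum
    se = math.sqrt(1.0 / w_sum)

    # Two-sided p-value from the normal distribution.
    z = beta / se
    p_value = math.erfc(abs(z) / math.sqrt(2.0))

    # Cochran's Q: weighted squared deviations from the pooled effect.
    q = sum(w * (b - beta) ** 2 for w, b in zip(weights, betas))
    df = len(betas) - 1

    # I^2: share of variation attributed to heterogeneity, floored at zero.
    i2 = max(0.0, (q - df) / q) if q > 0 else 0.0
    return {"beta": beta, "se": se, "p": p_value, "q": q, "df": df, "i2": i2}


# Hypothetical per-study estimates for a single SNP, for illustration only.
result = fixed_effect_meta(betas=[0.10, 0.30], ses=[0.10, 0.10])
print(result)
```

Under the null hypothesis of no heterogeneity, Q is compared against a chi-square distribution with `df` degrees of freedom; that p-value is omitted here to keep the sketch within the standard library.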
Endometriosis affects 1 in 9 women, yet it is poorly understood with long diagnostic delays, invasive diagnoses, and poor treatment outcomes. Characterised by the presence of endometrial-like tissue outside of the uterus, its main symptoms are pain and infertility. Endometriosis often co-occurs with other conditions, which may provide insights into the origins of endometriosis.
Here a polygenic risk score phenome-wide association study of endometriosis was conducted in the UK Biobank to investigate the pleiotropic effects of a genetic liability to endometriosis. The relationships between the polygenic risk score for endometriosis and health conditions, blood and urine biomarkers and reproductive factors were investigated separately in females, males and females without an endometriosis diagnosis. The relationship between endometriosis and the blood and urine biomarkers was further investigated using genetic correlation and Mendelian randomisation approaches to identify causal relationships.
Multiple health conditions, blood and urine biomarkers and reproductive factors were associated with genetic liability to endometriosis in each group, indicating many endometriosis comorbidities are not dependent on the physical manifestation of endometriosis. Differences in the associated traits between males and females highlighted the importance of sex-specific pathways in the overlap of endometriosis with many other traits. Notably, an association of genetic liability to endometriosis with lower testosterone levels was identified. Follow-up analysis utilising Mendelian randomisation approaches suggested lower testosterone may be causal for both endometriosis and clear cell ovarian cancer.
This study highlights the diversity of the pleiotropic effects of genetic risk to endometriosis irrespective of a diagnosis of endometriosis. A key finding was the identification of a causal effect of the genetic liability to lower testosterone on endometriosis using Mendelian randomisation.
DNA methylation plays an important role in the regulation of transcription. Genetic control of DNA methylation is a potential candidate for explaining the many identified SNP associations with disease that are not found in coding regions. We replicated 52,916 cis and 2,025 trans DNA methylation quantitative trait loci (mQTL) using methylation from whole blood measured on Illumina HumanMethylation450 arrays in the Brisbane Systems Genetics Study (n = 614 from 177 families) and the Lothian Birth Cohorts of 1921 and 1936 (combined n = 1366). The trans mQTL SNPs were found to be over-represented in 1 Mbp subtelomeric regions, and on chromosomes 16 and 19. There was a significant increase in trans mQTL DNA methylation sites in upstream and 5' UTR regions. The genetic heritability of a number of complex traits and diseases was partitioned into components due to mQTL and the remainder of the genome. Significant enrichment was observed for height (p = 2.1 × 10[…]), ulcerative colitis (p = 2 × 10[…]), Crohn's disease (p = 6 × 10[…]) and coronary artery disease (p = 5.5 × 10[…]) when compared to a random sample of SNPs with matched minor allele frequency, although this enrichment is explained by the genomic location of the mQTL SNPs.
The Genetics of Endometriosis. Montgomery, Grant W. Twin Research and Human Genetics, 04/2020, Volume 23, Issue 2. Journal Article. Peer reviewed.
Mapping genetic risk factors for endometriosis continues from early studies on women's health initiated by Nick Martin and Susan Treloar. Their initial recruitment of endometriosis cases and family members received a major boost and became a flagship project within the Cooperative Research Centre (CRC) for the Discovery of Common Human Disease. We extended the study through a formal collaboration with Professor Stephen Kennedy and his group in Oxford. Our first joint scientific meeting was held in Brisbane and was sadly memorable as the day the planes were flown into the Twin Towers in New York. Our initial collaboration expanded into the International Endometriosis Genetics Consortium (IEGC). The IEGC now has 15 groups around the world, and the most recent meta-analysis will be published this year.