Trait-associated genetic variants affect complex phenotypes primarily via regulatory mechanisms on the transcriptome. To investigate the genetics of gene expression, we performed cis- and trans-expression quantitative trait locus (eQTL) analyses using blood-derived expression from 31,684 individuals through the eQTLGen Consortium. We detected cis-eQTL for 88% of genes, and these were replicable in numerous tissues. Distal trans-eQTL (detected for 37% of 10,317 trait-associated variants tested) showed lower replication rates, partially due to low replication power and confounding by cell type composition. However, replication analyses in single-cell RNA-seq data prioritized intracellular trans-eQTL. Trans-eQTL exerted their effects via several mechanisms, primarily through regulation by transcription factors. Expression of 13% of the genes correlated with polygenic scores for 1,263 phenotypes, pinpointing potential drivers for those traits. In summary, this work represents a large eQTL resource, and its results serve as a starting point for in-depth interpretation of complex phenotypes.
Autism spectrum disorder (ASD) is a highly heritable and heterogeneous group of neurodevelopmental phenotypes diagnosed in more than 1% of children. Common genetic variants contribute substantially to ASD susceptibility, but to date no individual variants have been robustly associated with ASD. With a marked sample-size increase from a unique Danish population resource, we report a genome-wide association meta-analysis of 18,381 individuals with ASD and 27,969 controls that identified five genome-wide-significant loci. Leveraging GWAS results from three phenotypes with significantly overlapping genetic architectures (schizophrenia, major depression, and educational attainment), we identified seven additional loci shared with other traits at equally strict significance levels. Dissecting the polygenic architecture, we found both quantitative and qualitative polygenic heterogeneity across ASD subtypes. These results highlight biological insights, particularly relating to neuronal function and corticogenesis, and establish that GWAS performed at scale will be much more productive in the near term in ASD.
Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40-50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10-20% (14-24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.
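As a rough sanity check on the coverage figure in the height abstract, the sketch below multiplies the reported number of segments by the reported mean segment size and divides by the genome length. The genome length of about 3.1 Gb is an assumption that does not come from the abstract, and the variable names are illustrative only.

```python
# Figures quoted in the abstract above.
n_segments = 7_209        # non-overlapping genomic segments
mean_segment_kb = 90      # "a mean size of around 90 kb"

# Assumption (not stated in the abstract): a haploid human genome of roughly 3.1 Gb.
GENOME_BP = 3.1e9

# Total base pairs covered by the segments, and the share of the genome they span.
covered_bp = n_segments * mean_segment_kb * 1_000
covered_fraction = covered_bp / GENOME_BP

print(f"{covered_bp / 1e6:.0f} Mb covered, {covered_fraction:.1%} of the genome")
```

Under that assumed genome length the segments span roughly 649 Mb, which agrees with the "about 21%" stated in the abstract.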
We assembled an ancestrally diverse collection of genome-wide association studies (GWAS) of type 2 diabetes (T2D) in 180,834 affected individuals and 1,159,055 controls (48.9% non-European descent) through the Diabetes Meta-Analysis of Trans-Ethnic association studies (DIAMANTE) Consortium. Multi-ancestry GWAS meta-analysis identified 237 loci attaining stringent genome-wide significance (P < 5 × 10), which were delineated to 338 distinct association signals. Fine-mapping of these signals was enhanced by the increased sample size and expanded population diversity of the multi-ancestry meta-analysis, which localized 54.4% of T2D associations to a single variant with >50% posterior probability. This improved fine-mapping enabled systematic assessment of candidate causal genes and molecular mechanisms through which T2D associations are mediated, laying the foundations for functional investigations. Multi-ancestry genetic risk scores enhanced transferability of T2D prediction across diverse populations. Our study provides a step toward more effective clinical translation of T2D GWAS to improve global health for all, irrespective of genetic background.
Objective: Tourette’s syndrome is polygenic and highly heritable. Genome-wide association study (GWAS) approaches are useful for interrogating the genetic architecture and determinants of Tourette’s syndrome and other tic disorders. The authors conducted a GWAS meta-analysis and probed aggregated Tourette’s syndrome polygenic risk to test whether Tourette’s and related tic disorders have an underlying shared genetic etiology and whether Tourette’s polygenic risk scores correlate with worst-ever tic severity and may represent a potential predictor of disease severity.
Methods: GWAS meta-analysis, gene-based association, and genetic enrichment analyses were conducted in 4,819 Tourette’s syndrome case subjects and 9,488 control subjects. Replication of top loci was conducted in an independent population-based sample (706 case subjects, 6,068 control subjects). Relationships between Tourette’s polygenic risk scores (PRSs), other tic disorders, ascertainment, and tic severity were examined.
Results: GWAS and gene-based analyses identified one genome-wide significant locus within FLT3 on chromosome 13, rs2504235, although this association was not replicated in the population-based sample. Genetic variants spanning evolutionarily conserved regions significantly explained 92.4% of Tourette’s syndrome heritability. Tourette’s-associated genes were significantly preferentially expressed in dorsolateral prefrontal cortex. Tourette’s PRS significantly predicted both Tourette’s syndrome and tic spectrum disorders status in the population-based sample. Tourette’s PRS also significantly correlated with worst-ever tic severity and was higher in case subjects with a family history of tics than in simplex case subjects.
Conclusions: Modulation of gene expression through noncoding variants, particularly within cortico-striatal circuits, is implicated as a fundamental mechanism in Tourette’s syndrome pathogenesis. At a genetic level, tic disorders represent a continuous spectrum of disease, supporting the unification of Tourette’s syndrome and other tic disorders in future diagnostic schemata. Tourette’s PRSs derived from sufficiently large samples may be useful in the future for predicting conversion of transient tics to chronic tic disorders, as well as tic persistence and lifetime tic severity.
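For readers unfamiliar with polygenic risk scores, a PRS is commonly computed as a weighted sum of effect-allele dosages, with weights taken from GWAS effect sizes. The sketch below illustrates only that general definition; it is not the authors' pipeline, and the function name, weights, and genotypes are all hypothetical.

```python
def polygenic_score(dosages, weights):
    """Return the weighted sum of effect-allele dosages (0, 1, or 2 per variant)."""
    if len(dosages) != len(weights):
        raise ValueError("dosages and weights must cover the same variants")
    return sum(d * w for d, w in zip(dosages, weights))


# Hypothetical per-variant effect sizes (e.g. log odds ratios); not from the study.
weights = [0.12, -0.05, 0.30]

# Hypothetical effect-allele counts for two individuals at the same three variants.
person_a = [2, 0, 1]
person_b = [0, 2, 0]

score_a = polygenic_score(person_a, weights)
score_b = polygenic_score(person_b, weights)
print(score_a, score_b)
```

In practice, scores are usually standardized within a cohort before being tested against an outcome such as case status or tic severity.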
Leveraging linkage disequilibrium (LD) patterns as representative of population substructure enables the discovery of additive association signals in genome-wide association studies (GWASs). Standard GWASs are well-powered to interrogate additive models; however, new approaches are required for investigating other modes of inheritance such as dominance and epistasis. Epistasis, or non-additive interaction between genes, exists across the genome but often goes undetected because of a lack of statistical power. Furthermore, the adoption of LD pruning as customary in standard GWASs excludes detection of sites that are in LD but might underlie the genetic architecture of complex traits. We hypothesize that uncovering long-range interactions between loci with strong LD due to epistatic selection can elucidate genetic mechanisms underlying common diseases. To investigate this hypothesis, we tested for associations between 23 common diseases and 5,625,845 epistatic SNP-SNP pairs (determined by Ohta’s D statistics) in long-range LD (>0.25 cM). Across five disease phenotypes, we identified one significant and four near-significant associations that replicated in two large genotype-phenotype datasets (UK Biobank and eMERGE). The genes that were most likely involved in the replicated associations were (1) members of highly conserved gene families with complex roles in multiple pathways, (2) essential genes, and/or (3) genes that were associated in the literature with complex traits that display variable expressivity. These results support the highly pleiotropic and conserved nature of variants in long-range LD under epistatic selection. Our work supports the hypothesis that epistatic interactions regulate diverse clinical mechanisms and might especially be driving factors in conditions with a wide range of phenotypic outcomes.
This study investigates epistasis in the genetic architecture of common diseases by using long-range linkage disequilibrium patterns. One significant and four near-significant associations across five disease phenotypes were identified, highlighting the pleiotropic and conserved nature of variants under epistatic selection. These findings provide insights into the genetic mechanisms underlying complex diseases.
Underrepresentation of Asian genomes has hindered population and medical genetics research on Asians, leading to population disparities in precision medicine. By whole-genome sequencing of 4,810 Singapore Chinese, Malays, and Indians, we found 98.3 million SNPs and small insertions or deletions, over half of which are novel. Population structure analysis demonstrated great representation of Asian genetic diversity by three ethnicities in Singapore and revealed a Malay-related novel ancestry component. Furthermore, demographic inference suggested that Malays split from Chinese ∼24,800 years ago and experienced significant admixture with East Asians ∼1,700 years ago, coinciding with the Austronesian expansion. Additionally, we identified 20 candidate loci for natural selection, 14 of which harbored robust associations with complex traits and diseases. Finally, we show that our data can substantially improve genotype imputation in diverse Asian and Oceanian populations. These results highlight the value of our data as a resource to empower human genetics discovery across broad geographic regions.
• Discovery of 52 million novel variants by 13.7× WGS of 4,810 Singaporeans
• Insights into population structure and evolutionary history of Asians
• Identification of 20 loci under selection that are enriched for GWAS signals
• Substantial improvement of imputation in diverse Asian and Oceanian populations
Because of Singapore’s unique history of immigration, whole-genome sequence analysis of 4,810 Singaporeans provides a snapshot of the genetic diversity across East, Southeast, and South Asia.
Idiopathic pulmonary fibrosis (IPF) is a devastating lung disease of unknown etiology. The genes TOLLIP and MUC5B play important roles in lung host defense, which is an immune process influenced by oxidative signaling. Whether polymorphisms in TOLLIP and MUC5B modify the effect of immunosuppressive and antioxidant therapy in individuals with IPF is unknown.
To determine whether single-nucleotide polymorphisms (SNPs) within TOLLIP and MUC5B modify the effect of interventions in subjects participating in the Evaluating the Effectiveness of Prednisone, Azathioprine, and N-Acetylcysteine in Patients with Idiopathic Pulmonary Fibrosis (PANTHER-IPF) clinical trial.
SNPs within TOLLIP (rs5743890/rs5743894/rs5743854/rs3750920) and MUC5B (rs35705950) were genotyped. Interaction modeling was conducted with multivariable Cox regression followed by genotype-stratified survival analysis using a composite endpoint of death, transplantation, hospitalization, or a decline of ≥ 10% in FVC.
Significant interaction was observed between N-acetylcysteine (NAC) therapy and rs3750920 within TOLLIP (P interaction = 0.001). After stratifying by rs3750920 genotype, NAC therapy was associated with a significant reduction in composite endpoint risk (hazard ratio, 0.14; 95% confidence interval, 0.02-0.83; P = 0.03) in those with a TT genotype, but a nonsignificant increase in composite endpoint risk (hazard ratio, 3.23; 95% confidence interval, 0.79-13.16; P = 0.10) was seen in those with a CC genotype. These findings were then replicated in an independent IPF cohort.
NAC may be an efficacious therapy for individuals with IPF with an rs3750920 (TOLLIP) TT genotype, but it was associated with a trend toward harm in those with a CC genotype. A genotype-stratified prospective clinical trial should be conducted before any recommendation regarding the use of off-label NAC to treat IPF.
RATIONALE: Congenital heart disease (CHD) is among the most common birth defects. Most cases are of unknown pathogenesis.
OBJECTIVE: To determine the contribution of de novo copy number variants (CNVs) in the pathogenesis of sporadic CHD.
METHODS AND RESULTS: We studied 538 CHD trios using genome-wide dense single nucleotide polymorphism arrays and whole exome sequencing. Results were experimentally validated using digital droplet polymerase chain reaction. We compared validated CNVs in CHD cases with CNVs in 1301 healthy control trios. The 2 complementary high-resolution technologies identified 63 validated de novo CNVs in 51 CHD cases. A significant increase in CNV burden was observed when comparing CHD trios with healthy trios, using either single nucleotide polymorphism array (P=7×10; odds ratio, 4.6) or whole exome sequencing data (P=6×10; odds ratio, 3.5) and remained after removing 16% of de novo CNV loci previously reported as pathogenic (P=0.02; odds ratio, 2.7). We observed recurrent de novo CNVs on 15q11.2 encompassing CYFIP1, NIPA1, and NIPA2 and single de novo CNVs encompassing DUSP1, JUN, JUP, MED15, MED9, PTPRE, SREBF1, TOP2A, and ZEB2, genes that interact with established CHD proteins NKX2-5 and GATA4. Integrating de novo variants in whole exome sequencing and CNV data suggests that ETS1 is the pathogenic gene altered by 11q24.2-q25 deletions in Jacobsen syndrome and that CTBP2 is the pathogenic gene in 10q subtelomeric deletions.
CONCLUSIONS: We demonstrate a significantly increased frequency of rare de novo CNVs in CHD patients compared with healthy controls and suggest several novel genetic loci for CHD.
Body fat distribution is a heritable trait and a well-established predictor of adverse metabolic outcomes, independent of overall adiposity. To increase our understanding of the genetic basis of body fat distribution and its molecular links to cardiometabolic traits, here we conduct genome-wide association meta-analyses of traits related to waist and hip circumferences in up to 224,459 individuals. We identify 49 loci (33 new) associated with waist-to-hip ratio adjusted for body mass index (BMI), and an additional 19 loci newly associated with related waist and hip circumference measures (P < 5 × 10⁻⁸). In total, 20 of the 49 waist-to-hip ratio adjusted for BMI loci show significant sexual dimorphism, 19 of which display a stronger effect in women. The identified loci were enriched for genes expressed in adipose tissue and for putative regulatory elements in adipocytes. Pathway analyses implicated adipogenesis, angiogenesis, transcriptional regulation and insulin resistance as processes affecting fat distribution, providing insight into potential pathophysiological mechanisms.