Pleiotropy, a phenomenon in which a single gene affects multiple phenotypes, is becoming very common among different cancer types and cancer-related phenotypes, such as those in hormonal, ...cardiometabolic and inflammatory/immune conditions. The discovery of pleiotropic associations can improve our understanding of cancer and help to target investigation of genes with greater clinical relevance.
Genome-wide association studies (GWAS) have successfully identified tens of thousands of genetic variants associated with various phenotypes, but together they explain only a fraction of ...heritability, suggesting many variants have yet to be discovered. Recently it has been recognized that incorporating functional information of genetic variants can improve power for identifying novel loci. For example, S-PrediXcan and TWAS tested the association of predicted gene expression with phenotypes based on GWAS summary statistics by leveraging the information on genetic regulation of gene expression and found many novel loci. However, as genetic variants may have effects on more than one gene and through different mechanisms, these methods likely only capture part of the total effects of these variants. In this paper, we propose a summary statistics-based mixed effects score test (sMiST) that tests for the total effect of both the effect of the mediator by imputing genetically predicted gene expression, like S-PrediXcan and TWAS, and the direct effects of individual variants. It allows for multiple functional annotations and multiple genetically predicted mediators. It can also perform conditional association analysis while adjusting for other genetic variants (e.g., known loci for the phenotype). Extensive simulation and real data analyses demonstrate that sMiST yields p-values that agree well with those obtained from individual level data but with substantively improved computational speed. Importantly, a broad application of sMiST to GWAS is possible, as only summary statistics of genetic variant associations are required. We apply sMiST to a large-scale GWAS of colorectal cancer using summary statistics from ∼120, 000 study participants and gene expression data from the Genotype-Tissue Expression (GTEx) project. We identify several novel and secondary independent genetic loci.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Investigating genetic architecture of complex traits in ancestrally diverse populations is imperative to understand the etiology of disease. However, the current paucity of genetic research in people ...of African and Latin American ancestry, Hispanic and indigenous peoples in the United States is likely to exacerbate existing health disparities for many common diseases. The Population Architecture using Genomics and Epidemiology, Phase II (PAGE II), Study was initiated in 2013 by the National Human Genome Research Institute to expand our understanding of complex trait loci in ethnically diverse and well characterized study populations. To meet this goal, the Multi-Ethnic Genotyping Array (MEGA) was designed to substantially improve fine-mapping and functional discovery by increasing variant coverage across multiple ethnicities at known loci for metabolic, cardiovascular, renal, inflammatory, anthropometric, and a variety of lifestyle traits. Studying the frequency distribution of clinically relevant mutations, putative risk alleles, and known functional variants across multiple populations will provide important insight into the genetic architecture of complex diseases and facilitate the discovery of novel, sometimes population-specific, disease associations. DNA samples from 51,650 self-identified African ancestry (17,328), Hispanic/Latino (22,379), Asian/Pacific Islander (8,640), and American Indian (653) and an additional 2,650 participants of either South Asian or European ancestry, and other reference panels have been genotyped on MEGA by PAGE II. MEGA was designed as a new resource for studying ancestrally diverse populations. Here, we describe the methodology for selecting trait-specific content for use in multi-ethnic populations and how enriching MEGA for this content may contribute to deeper biological understanding of the genetic etiology of complex disease.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Circulating white blood cell and platelet traits are clinically linked to various disease outcomes and differ across individuals and ancestry groups. Genetic factors play an important role in ...determining these traits and many loci have been identified. However, most of these findings were identified in populations of European ancestry (EA), with African Americans (AA), Hispanics/Latinos (HL), and other races/ethnicities being severely underrepresented.
We performed ancestry-combined and ancestry-specific genome-wide association studies (GWAS) for white blood cell and platelet traits in the ancestrally diverse Population Architecture using Genomics and Epidemiology (PAGE) Study, including 16,201 AA, 21,347 HL, and 27,236 EA participants. We identified six novel findings at suggestive significance (P < 5E-8), which need confirmation, and independent signals at six previously established regions at genome-wide significance (P < 2E-9). We confirmed multiple previously reported genome-wide significant variants in the single variant association analysis and multiple genes using PrediXcan. Evaluation of loci reported from a Euro-centric GWAS indicated attenuation of effect estimates in AA and HL compared to EA populations.
Our results highlighted the potential to identify ancestry-specific and ancestry-agnostic variants in participants with diverse backgrounds and advocate for continued efforts in improving inclusion of racially/ethnically diverse populations in genetic association studies for complex traits.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Quantitative red blood cell (RBC) traits are highly polygenic clinically relevant traits, with approximately 500 reported GWAS loci. The majority of RBC trait GWAS have been performed in European- or ...East Asian-ancestry populations, despite evidence that rare or ancestry-specific variation contributes substantially to RBC trait heritability. Recently developed combined-phenotype methods which leverage genetic trait correlation to improve statistical power have not yet been applied to these traits. Here we leveraged correlation of seven quantitative RBC traits in performing a combined-phenotype analysis in a multi-ethnic study population.
We used the adaptive sum of powered scores (aSPU) test to assess combined-phenotype associations between ~ 21 million SNPs and seven RBC traits in a multi-ethnic population (maximum n = 67,885 participants; 24% African American, 30% Hispanic/Latino, and 43% European American; 76% female). Thirty-nine loci in our multi-ethnic population contained at least one significant association signal (p < 5E-9), with lead SNPs at nine loci significantly associated with three or more RBC traits. A majority of the lead SNPs were common (MAF > 5%) across all ancestral populations. Nineteen additional independent association signals were identified at seven known loci (HFE, KIT, HBS1L/MYB, CITED2/FILNC1, ABO, HBA1/2, and PLIN4/5). For example, the HBA1/2 locus contained 14 conditionally independent association signals, 11 of which were previously unreported and are specific to African and Amerindian ancestries. One variant in this region was common in all ancestries, but exhibited a narrower LD block in African Americans than European Americans or Hispanics/Latinos. GTEx eQTL analysis of all independent lead SNPs yielded 31 significant associations in relevant tissues, over half of which were not at the gene immediately proximal to the lead SNP.
This work identified seven loci containing multiple independent association signals for RBC traits using a combined-phenotype approach, which may improve discovery in genetically correlated traits. Highly complex genetic architecture at the HBA1/2 locus was only revealed by the inclusion of African Americans and Hispanics/Latinos, underscoring the continued importance of expanding large GWAS to include ancestrally diverse populations.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
The T cell receptor (TCR) is a central regulator of the adaptive immune system. Efficient structural organization and signaling of TCR/CD3 underlies the development and function of T cells. T cell ...maturation defects are associated with autoimmunity, suggesting an underlying defect in TCR selection. To explore quantitative defects as a cause for altered TCR selection at the level of the TCR/CD3 protein complex, we compared TCR expression densities on CD4 T cells from subjects with type 1 diabetes (T1D) (n=80) and nondiabetes controls (n=37) . We also investigated a quantitative defect in genes in the TCR/CD3 genome for possible underlying overmethylation in T1D. All 13 gene regions of the TCR segments and all 4 versions of CD3 proteins were studied for methylation and protein differences in T1D vs. controls. Significant quantitative defects in TCR and CD3 proteins were observed in CD4+ T cells of patients with T1D vs. controls. As a cause for these defects, the TCR complex genes and associated CD3 genes were overmethylated, resulting in downregulated cell surface expression. Evaluation of TCRαβ expression in CD4+ T cells at the protein level confirmed methylation patterns, showing that the population of TCRαβ+ cells was significantly reduced in T1D vs. controls (p=0.005) . The MFI density of TCRαβ antibody in T1D was also significantly decreased vs. controls (p=0.01) . All CD3 genes except CD3ε had hypermethylated patterns in T1D, confirmed by RNAseq analysis. The percentage CD3+ T cells in the CD4+ T cell population was reduced in T1D vs. controls (p=0.04) , as was the MFI of CD3+ T cells (p=0.02) . Patients with T1D have quantitative defects in the TCR/CD3 protein complex, resulting in decreased expression. It is recognized that TCR triggering is affected by intermolecular distance between TCR proteins and that activation diminishes when proximity is increased between TCR surface proteins. In T1D, this might be a novel mechanism for failed T cell selection leading to autoimmunity.
Disclosure
W.Kuhtreiber: None. A.H.Lee: None. A.Aristarkhova: None. H.F.Dias: None. N.Ng: None. S.A.Bien: Employee; Adaptive Biotechnologies. D.L.Faustman: None.
The past decade has seen a technological revolution in human genetics that has empowered population-level investigations into genetic associations with phenotypes. Although these discoveries rely on ...genetic variation across individuals, association studies have overwhelmingly been performed in populations of European descent. In this review, we describe limitations faced by single-population studies and provide an overview of strategies to improve global representation in existing data sets and future human genomics research via diversity-focused, multiethnic studies. We highlight the successes of individual studies and meta-analysis consortia that have provided unique knowledge. Additionally, we outline the approach taken by the Population Architecture Using Genomics and Epidemiology (PAGE) study to develop best practices for performing genetic epidemiology in multiethnic contexts. Finally, we discuss how limiting investigations to single populations impairs findings in the clinical domain for both rare-variant identification and genetic risk prediction.
A growing knowledge base of genetic and environmental information has greatly enabled the study of disease risk factors. However, the computational complexity and statistical burden of testing all ...variants by all environments has required novel study designs and hypothesis-driven approaches. We discuss how incorporating biological knowledge from model organisms, functional genomics, and integrative approaches can empower the discovery of novel gene-environment interactions and discuss specific methodological considerations with each approach. We consider specific examples where the application of these approaches has uncovered effects of gene-environment interactions relevant to drug response and immunity, and we highlight how such improvements enable a greater understanding of the pathogenesis of disease and the realization of precision medicine.
Abstract
Background
Increasing evidence suggests that conventional adenomas (CAs) and serrated polyps (SPs) represent two distinct groups of precursor lesions for colorectal cancer (CRC). The ...influence of common genetic variants on risk of CAs and SPs remain largely unknown.
Methods
Among 27 426 participants within three prospective cohort studies, we created a weighted genetic risk score (GRS) based on 40 CRC-related single nucleotide polymorphisms (SNPs) identified in previous genome-wide association studies; and we examined the association of GRS (per one standard deviation increment) with risk of CAs, SPs and synchronous CAs and SPs, by multivariable logistic regression. We also analysed individual variants in the secondary analysis.
Results
During 18–20 years of follow-up, we documented 2952 CAs, 1585 SPs and 794 synchronous CAs and SPs. Higher GRS was associated with increased risk of CAs odds ratio (OR) = 1.17, 95% confidence interval (CI): 1.12-1.21 and SPs (OR = 1.09, 95% CI: 1.03-1.14), with a stronger association for CAs than SPs (Pheterogeneity=0.01). An even stronger association was found for patients with synchronous CAs and SPs (OR = 1.32), advanced CAs (OR = 1.22) and multiple CAs (OR = 1.25). Different sets of variants were associated with CAs and SPs, with a Spearman correlation coefficient of 0.02 between the ORs associating the 40 SNPs with the two lesions. After correcting for multiple testing, three variants were associated with CAs (rs3802842, rs6983267 and rs7136702) and two with SPs (rs16892766 and rs4779584).
Conclusions
Common genetic variants play a potential role in the conventional and serrated pathways of CRC. Different sets of variants are identified for the two pathways, further supporting the aetiological heterogeneity of CRC.
Although genome-wide association studies (GWAS) have identified hundreds of risk loci for breast and prostate cancer, only a few studies have characterized the GWAS association signals across ...functional genomic annotations with a particular focus on single nucleotide polymorphisms (SNPs) located in DNA regulatory elements. In this study, we investigated the enrichment pattern of GWAS signals for breast and prostate cancer in genomic functional regions located in normal tissue and cancer cell lines. We quantified the overall enrichment of SNPs with breast and prostate cancer association
p
values < 1 × 10
−8
across regulatory categories. We then obtained annotations for DNaseI hypersensitive sites (DHS), typical enhancers, and super enhancers across multiple tissue types, to assess if significant GWAS signals were selectively enriched in annotations found in disease-related tissue. Finally, we quantified the enrichment of breast and prostate cancer SNP heritability in regulatory regions, and compared the enrichment pattern of SNP heritability with GWAS signals. DHS, typical enhancers, and super enhancers identified in the breast cancer cell line MCF-7 were observed with the highest enrichment of genome-wide significant variants for breast cancer. For prostate cancer, GWAS signals were mostly enriched in DHS and typical enhancers identified in the prostate cancer cell line LNCaP. With progressively stringent GWAS
p
value thresholds, an increasing trend of enrichment was observed for both diseases in DHS, typical enhancers, and super enhancers located in disease-related tissue. Results from heritability enrichment analysis supported the selective enrichment pattern of functional genomic regions in disease-related cell lines for both breast and prostate cancer. Our results suggest the importance of studying functional annotations identified in disease-related tissues when characterizing GWAS results, and further demonstrate the role of germline DNA regulatory elements from disease-related tissue in breast and prostate carcinogenesis.