Identifying transcription factors (TFs) whose DNA bindings are altered by genetic variants that regulate susceptibility genes is imperative to understand transcriptional dysregulation in disease ...etiology. Here, we develop a statistical framework to analyze extensive ChIP-seq and GWAS data and identify 22 breast cancer risk-associated TFs. We find that, by analyzing genetic variations of TF-DNA bindings, the interaction of FOXA1 with co-factors such as ESR1 and E2F1, and the interaction of TFs with chromatin features (i.e., enhancers) play a key role in breast cancer susceptibility. Using genetic variants occupied by the 22 TFs, transcriptome-wide association analyses identify 52 previously unreported breast cancer susceptibility genes, including seven with evidence of essentiality from functional screens in breast relevant cell lines. We show that FOXA1 and co-factors form a core TF-transcriptional network regulating the susceptibility genes. Our findings provide additional insights into genetic variations of TF-DNA bindings (particularly for FOXA1) underlying breast cancer susceptibility.
Background: Ginseng, an herbal remedy, has been commonly used in Asian countries to promote longevity and health for over 2,000 years. However, the association of ginseng consumption with all-cause ...and cause-specific mortality is still unclear. We analyzed the association of total and major cause-specific mortality (cardiovascular disease CVD, cancer, and other death) with consumption of ginseng (primarily American and white ginseng).Methods: This study included 56,183 female participants with an average follow-up of 14.7 years in the Shanghai Women’s Health Study, an ongoing prospective cohort study. Data were assessed via an in-person interview conducted at baseline recruitment. Cox proportional hazards models were used to estimate hazard ratios (HRs) and 95% confidence intervals (CIs) for ginseng-mortality associations after adjusting for confounders.Results: Compared with those who never used ginseng, regular ginseng use was associated with significantly reduced all-cause mortality (HR 0.92; 95% CI, 0.87–0.98). This inverse association was seen primarily among those who consumed ginseng for perceived general health benefit (HR 0.90; 95% CI, 0.85–0.96). A significant dose-response association was observed between duration of ginseng use and total mortality (HR 0.85, for using ≥6 years vs never use; P for trend <0.001), CVD mortality (HR 0.83; P for trend = 0.019), and other-cause mortality (HR 0.76; P for trend = 0.001). However, no dose-response association was observed between amount of ginseng consumption and mortality outcomes.Conclusion: Regular ginseng consumption, particularly over a long duration, was associated with decreased risk of all causes of death, death due to CVD, and death due to certain other diseases.
Transcriptome-wide association studies (TWAS) have successfully discovered many putative disease susceptibility genes. However, TWAS may suffer from inaccuracy of gene expression predictions due to ...inclusion of non-regulatory variants. By integrating prior knowledge of susceptible transcription factor occupied elements, we develop sTF-TWAS and demonstrate that it outperforms existing TWAS approaches in both simulation and real data analyses. Under the sTF-TWAS framework, we build genetic models to predict alternative splicing and gene expression in normal breast, prostate and lung tissues from the Genotype-Tissue Expression project and apply these models to data from large genome-wide association studies (GWAS) conducted among European-ancestry populations. At Bonferroni-corrected P < 0.05, we identify 354 putative susceptibility genes for these cancers, including 189 previously unreported in GWAS loci and 45 in loci unreported by GWAS. These findings provide additional insight into the genetic susceptibility of human cancers. Additionally, we show the generalizability of the sTF-TWAS on non-cancer diseases.
Obesity is a disorder with a complex genetic etiology, and its epidemic is a worldwide problem. Although multiple genetic loci associated with body mass index, the most common measure of obesity, ...have been identified in European populations, few studies have focused on Asian populations. Here we report a genome-wide association study and replication studies with 62,245 east Asian subjects, which identified two new body mass index-associated loci in the CDKAL1 locus at 6p22 (rs2206734, P = 1.4 × 10(-11)) and the KLF9 locus at 9q21 (rs11142387, P = 1.3 × 10(-9)), as well as several previously reported loci (the SEC16B, BDNF, FTO, MC4R and GIPR loci, P < 5.0 × 10(-8)). We subsequently performed gene-gene interaction analyses and identified an interaction (P = 2.0 × 10(-8)) between a SNP in the KLF9 locus (rs11142387) and one in the MSTN (also known as GDF8) locus at 2q32 (rs13034723). These findings should provide useful insights into the etiology of obesity.
Prostaglandins play a critical role in inflammatory response. To investigate the association of urinary PGE-M, a stable end-product of prostaglandin E2 (PGE
) with overall and cause-specific ...mortality and examine potential effect modifiers, we obtained urinary PGE-M levels of 2927 non-cancerous adults from our previous case-control studies nested in the Shanghai Women's Health Study and Shanghai Men's Health Study, two cohort studies conducted in Shanghai, China. Mortality data and modifiable factors associated with urinary PGE-M were obtained from the parent cohort studies. Using linear regression models, we found that high urinary PGE-M levels were significantly associated with low education, heaving smoking, old age at urine collection, and abdominal obesity. Using Cox proportional hazards models, we found that increase (per standard deviation) of urinary PGE-M levels were significantly associated with overall mortality (adjusted hazard ratio = 1.19, 95% confidence interval: 1.07, 1.33) and particularly deaths from cardiometabolic diseases (adjusted hazard ratio = 1.27, 95% confidence interval: 1.11, 1.44). The increased death risks persisted across different time intervals during the follow-up and were stronger among participants who were younger than 60 (P = 0.0014 for all- cause mortality and P = 0.007 for deaths from cardiometabolic diseases) at urine collection or perhaps among those who had higher education.
Purpose
Limited studies have been conducted to evaluate pathogenetic mutations in breast cancer predisposition genes among Chinese women. To fully characterize germline mutations of these genes in ...this population, we used the whole-exome sequencing data in a population-based case–control study conducted in Shanghai, China.
Methods
We evaluated exonic, splicing, and copy number variants in 11 established and 14 candidate breast cancer predisposition genes in 831 invasive breast cancer cases and 839 controls. We identified 55 pathogenic variants, including 15 newly identified in this study.
Results
Approximately 8% of the cases and 0.6% of the cancer-free controls carried these pathogenetic variants (
P
= 3.05 × 10
−15
). Among cases, 3.7% had a
BRCA
2 pathogenic variant and 1.6% had a
BRCA1
pathogenic variant, while 2.5% had a pathogenic variant in other genes including
ATM, CHEK2, NBN, NF1, CDH1, PALB2, PTEN, TP53
as well as
BARD1, BRIP,
and
RAD51D
. Patients with
BRCA
1/2 pathogenic variants were more likely to have a family history of breast cancer and hormone receptor negative tumors compared with patients without pathogenic variants.
Conclusions
This study highlighted the importance of hereditary breast cancer genes in the breast cancer etiology in this understudied population. Together with previous studies in East Asian women, this study suggested a relatively more prominent role of
BRCA2
compared to
BRCA1
. This study also provides additional evidence to design cost-efficient genetic testing among Chinese women for risk assessment and early detection of breast cancer.
We carried out a genome-wide association study among Chinese women to identify risk variants for breast cancer. After analyzing 607,728 SNPs in 1,505 cases and 1,522 controls, we selected 29 SNPs for ...a fast-track replication in an independent set of 1,554 cases and 1,576 controls. We further investigated four replicated loci in a third set of samples comprising 3,472 cases and 900 controls. SNP rs2046210 at 6q25.1, located upstream of the gene encoding estrogen receptor α (ESR1), showed strong and consistent association with breast cancer across all three stages. Adjusted odds ratio (95% CI) were 1.36 (1.24-1.49) and 1.59 (1.40-1.82), respectively, for genotypes A/G and A/A versus G/G (P for trend 2.0 × 10−15) in the pooled analysis of samples from all three stages. We also found a similar, albeit weaker, association in an independent study comprising 1,591 cases and 1,466 controls of European ancestry (Ptrend = 0.01). These results strongly implicate 6q25.1 as a susceptibility locus for breast cancer.
Multiple genetic loci associated with obesity or body mass index (BMI) have been identified through genome-wide association studies conducted predominantly in populations of European ancestry. We ...performed a meta-analysis of associations between BMI and approximately 2.4 million SNPs in 27,715 east Asians, which was followed by in silico and de novo replication studies in 37,691 and 17,642 additional east Asians, respectively. We identified ten BMI-associated loci at genome-wide significance (P < 5.0 × 10(-8)), including seven previously identified loci (FTO, SEC16B, MC4R, GIPR-QPCTL, ADCY3-DNAJC27, BDNF and MAP2K5) and three novel loci in or near the CDKAL1, PCSK1 and GP2 genes. Three additional loci nearly reached the genome-wide significance threshold, including two previously identified loci in the GNPDA2 and TFAP2B genes and a newly identified signal near PAX6, all of which were associated with BMI with P < 5.0 × 10(-7). Findings from this study may shed light on new pathways involved in obesity and demonstrate the value of conducting genetic studies in non-European populations.
In a three-stage genome-wide association study among East Asian women including 22,780 cases and 24,181 controls, we identified 3 genetic loci newly associated with breast cancer risk, including ...rs4951011 at 1q32.1 (in intron 2 of the ZC3H11A gene; P=8.82×10(-9)), rs10474352 at 5q14.3 (near the ARRDC3 gene; P=1.67×10(-9)) and rs2290203 at 15q26.1 (in intron 14 of the PRC1 gene; P=4.25×10(-8)). We replicated these associations in 16,003 cases and 41,335 controls of European ancestry (P=0.030, 0.004 and 0.010, respectively). Data from the ENCODE Project suggest that variants rs4951011 and rs10474352 might be located in an enhancer region and transcription factor binding sites, respectively. This study provides additional insights into the genetics and biology of breast cancer.
Background & Aims Known genetic factors explain only a small fraction of genetic variation in colorectal cancer (CRC). We conducted a genome-wide association study to identify risk loci for CRC. ...Methods This discovery stage included 8027 cases and 22,577 controls of East-Asian ancestry. Promising variants were evaluated in studies including as many as 11,044 cases and 12,047 controls. Tumor-adjacent normal tissues from 188 patients were analyzed to evaluate correlations of risk variants with expression levels of nearby genes. Potential functionality of risk variants were evaluated using public genomic and epigenomic databases. Results We identified 4 loci associated with CRC risk; P values for the most significant variant in each locus ranged from 3.92 × 10−8 to 1.24 × 10−12 : 6p21.1 (rs4711689), 8q23.3 (rs2450115, rs6469656), 10q24.3 (rs4919687), and 12p13.3 (rs11064437). We also identified 2 risk variants at loci previously associated with CRC: 10q25.2 (rs10506868) and 20q13.3 (rs6061231). These risk variants, conferring an approximate 10%–18% increase in risk per allele, are located either inside or near protein-coding genes that include TFEB (lysosome biogenesis and autophagy), eukaryotic translation initiation factor 3, subunit H (initiation of translation), cytochrome P450, family 17, subfamily A, polypeptide 1 (steroidogenesis), splA/ryanodine receptor domain and SOCS box containing 2 (proteasome degradation), and RPS21 (ribosome biogenesis). Gene expression analyses showed a significant association ( P < .05) for rs4711689 with TFEB , rs6469656 with eukaryotic translation initiation factor 3, subunit H, rs11064437 with splA/ryanodine receptor domain and SOCS box containing 2, and rs6061231 with RPS21. Conclusions We identified susceptibility loci and genes associated with CRC risk, linking CRC predisposition to steroid hormone, protein synthesis and degradation, and autophagy pathways and providing added insight into the mechanism of CRC pathogenesis.