Genome-wide association studies have identified more than 70 common variants that are associated with breast cancer risk. Most of these variants map to non-protein-coding regions and several map to ...gene deserts, regions of several hundred kilobases lacking protein-coding genes. We hypothesized that gene deserts harbor long-range regulatory elements that can physically interact with target genes to influence their expression. To test this, we developed Capture Hi-C (CHi-C), which, by incorporating a sequence capture step into a Hi-C protocol, allows high-resolution analysis of targeted regions of the genome. We used CHi-C to investigate long-range interactions at three breast cancer gene deserts mapping to 2q35, 8q24.21, and 9q31.2. We identified interaction peaks between putative regulatory elements ("bait fragments") within the captured regions and "targets" that included both protein-coding genes and long noncoding (lnc) RNAs over distances of 6.6 kb to 2.6 Mb. Target protein-coding genes were IGFBP5, KLF4, NSMCE2, and MYC; and target lncRNAs included DIRC3, PVT1, and CCDC26. For one gene desert, we were able to define two SNPs (rs12613955 and rs4442975) that were highly correlated with the published risk variant and that mapped within the bait end of an interaction peak. In vivo ChIP-qPCR data show that one of these, rs4442975, affects the binding of FOXA1 and implicate this SNP as a putative functional variant.
Genome-wide association studies (GWAS) have transformed understanding of susceptibility to testicular germ cell tumors (TGCTs), but much of the heritability remains unexplained. Here we report a new ...GWAS, a meta-analysis with previous GWAS and a replication series, totaling 7,319 TGCT cases and 23,082 controls. We identify 19 new TGCT risk loci, roughly doubling the number of known TGCT risk loci to 44. By performing in situ Hi-C in TGCT cells, we provide evidence for a network of physical interactions among all 44 TGCT risk SNPs and candidate causal genes. Our findings implicate widespread disruption of developmental transcriptional regulators as a basis of TGCT susceptibility, consistent with failed primordial germ cell differentiation as an initiating step in oncogenesis. Defective microtubule assembly and dysregulation of KIT-MAPK signaling also feature as recurrently disrupted pathways. Our findings support a polygenic model of risk and provide insight into the biological basis of TGCT.
Full text
Available for:
IJS, NUK, SBMB, UL, UM, UPUK
Patients with dyskeratosis congenita (DC), a heterogeneous inherited bone marrow failure syndrome, have abnormalities in telomere biology, including very short telomeres and germline mutations in
...DKC1,
TERC,
TERT, or
NOP10, but ∼60% of DC patients lack an identifiable mutation. With the very short telomere phenotype and a highly penetrant, rare disease model, a linkage scan was performed on a family with autosomal-dominant DC and no mutations in
DKCI,
TERC, or
TERT. Evidence favoring linkage was found at 2p24 and 14q11.2, and this led to the identification of
TINF2 (14q11.2) mutations, K280E, in the proband and her five affected relatives and
TINF2 R282H in three additional unrelated DC probands, including one with Revesz syndrome; a fifth DC proband had a R282S mutation.
TINF2 mutations were not present in unaffected relatives, DC probands with mutations in
DKC1,
TERC, or
TERT or 298 control subjects. We demonstrate that a fifth gene,
TINF2, is mutated in classical DC and, for the first time, in Revesz syndrome. This represents the first shelterin complex mutation linked to human disease and confirms the role of very short telomeres as a diagnostic test for DC.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
Genome-wide association studies (GWAS) have identified approximately 100 breast cancer risk loci. Translating these findings into a greater understanding of the mechanisms that influence disease risk ...requires identification of the genes or non-coding RNAs that mediate these associations. Here, we use Capture Hi-C (CHi-C) to annotate 63 loci; we identify 110 putative target genes at 33 loci. To assess the support for these target genes in other data sources we test for associations between levels of expression and SNP genotype (eQTLs), disease-specific survival (DSS), and compare them with somatically mutated cancer genes. 22 putative target genes are eQTLs, 32 are associated with DSS and 14 are somatically mutated in breast, or other, cancers. Identifying the target genes at GWAS risk loci will lead to a greater understanding of the mechanisms that influence breast cancer risk and prognosis.
Genome-wide association studies have identified several common genetic variants associated with breast cancer risk. It is likely, however, that a substantial proportion of such loci have not yet been ...discovered.
We compared 296,114 tagging single-nucleotide polymorphisms in 1694 breast cancer case subjects (92% with two primary cancers or at least two affected first-degree relatives) and 2365 control subjects, with validation in three independent series totaling 11,880 case subjects and 12,487 control subjects. Odds ratios (ORs) and associated 95% confidence intervals (CIs) in each stage and all stages combined were calculated using unconditional logistic regression. Heterogeneity was evaluated with Cochran Q and I(2) statistics. All statistical tests were two-sided.
We identified a novel risk locus for breast cancer at 9q31.2 (rs865686: OR = 0.89, 95% CI = 0.85 to 0.92, P = 1.75 × 10(-10)). This single-nucleotide polymorphism maps to a gene desert, the nearest genes being Kruppel-like factor 4 (KLF4, 636 kb centromeric), RAD23 homolog B (RAD23B, 794 kb centromeric), and actin-like 7A (ACTL7A, 736 kb telomeric). We also identified two variants (rs3734805 and rs9383938) mapping to 6q25.1 estrogen receptor 1 (ESR1), which were associated with breast cancer in subjects of northern European ancestry (rs3734805: OR = 1.19, 95% CI = 1.11 to 1.27, P = 1.35 × 10(-7); rs9383938: OR = 1.18, 95% CI = 1.11 to 1.26, P = 1.41 × 10(-7)). A variant mapping to 10q26.13, approximately 300 kb telomeric to the established risk locus within the second intron of FGFR2, was also associated with breast cancer risk, although not at genome-wide statistical significance (rs10510102: OR = 1.12, 95% CI = 1.07 to 1.17, P = 1.58 × 10(-6)).
These findings provide further evidence on the role of genetic variation in the etiology of breast cancer. Fine mapping will be needed to identify causal variants and to determine their functional effects.
Increased mitochondrial DNA (mtDNA) copy number in peripheral blood cells (PBC) has been associated with the risk of developing several tumor types. Here we evaluate sources of variation of this ...biomarker and its association with breast cancer risk in a prospective cohort study. mtDNA copy number was measured using quantitative real-time PCR on PBC DNA samples from participants in the UK-based Breakthrough Generations Study. Temporal and assay variation was evaluated in a serial study of 91 women, with two blood samples collected approximately 6-years apart. Then, associations with breast cancer risk factors and risk were evaluated in 1,108 cases and 1,099 controls using a nested case-control design. In the serial study, mtDNA copy number showed low assay variation but large temporal variation assay intraclass correlation coefficient (ICC), 79.3%-87.9%; temporal ICC, 38.3%). Higher mtDNA copy number was significantly associated with younger age at blood collection, being premenopausal, having an older age at menopause, and never taking HRT, both in cases and controls. Based on measurements in a single blood sample taken on average 6 years before diagnosis, higher mtDNA copy number was associated with increased breast cancer risk OR (95% CI) for highest versus lowest quartile, 1.37 (1.02-1.83); P trend = 0.007. In conclusion, mtDNA copy number is associated with breast cancer risk and represents a promising biomarker for risk assessment. The relatively large temporal variation should be taken into account in future analyses.
Variants of the MSTN gene encoding myostatin are associated with muscle hypertrophy phenotypes in a range of mammalian species, most notably cattle, dogs, mice, and humans. Using a sample of ...registered Thoroughbred horses (n = 148), we have identified a novel MSTN sequence polymorphism that is strongly associated (g.66493737C>T, P = 4.85×10⁻⁸) with best race distance among elite racehorses (n = 79). This observation was independently validated (P = 1.91×10⁻⁶) in a resampled group of Thoroughbreds (n = 62) and in a cohort of Thoroughbreds (n = 37, P = 0.0047) produced by the same trainer. We observed that C/C horses are suited to fast, short-distance races; C/T horses compete favorably in middle-distance races; and T/T horses have greater stamina. Evaluation of retrospective racecourse performance (n = 142) and stallion progeny performance predict that C/C and C/T horses are more likely to be successful two-year-old racehorses than T/T animals. Here we describe for the first time the identification of a gene variant in Thoroughbred racehorses that is predictive of genetic potential for an athletic phenotype.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
We report a genome-wide association study in 10,286 cases and 9,135 controls of European ancestry in the Cancer Genetic Markers of Susceptibility (CGEMS) initiative. We identify a new association ...with prostate cancer risk on chromosome 8q24 (rs620861, P = 1.3 × 10−10, heterozygote OR = 1.17, 95% CI 1.10-1.24; homozygote OR = 1.33, 95% CI 1.21-1.45). This defines a new locus associated with prostate cancer susceptibility on 8q24.
Full text
Available for:
DOBA, IJS, IZUM, KILJ, NUK, PILJ, PNG, SAZU, UILJ, UKNU, UL, UM, UPUK
To interrogate the complex mechanisms involved in the later stages of cancer metastasis, we designed a functional in vivo RNA interference (RNAi) screen combined with next-generation sequencing. ...Using this approach, we identified the sialyltransferase ST6GalNAc2 as a novel breast cancer metastasis suppressor. Mechanistically, ST6GalNAc2 silencing alters the profile of O-glycans on the tumor cell surface, facilitating binding of the soluble lectin galectin-3. This then enhances tumor cell retention and emboli formation at metastatic sites leading to increased metastatic burden, events that can be completely blocked by galectin-3 inhibition. Critically, elevated ST6GALNAC2, but not galectin-3, expression in estrogen receptor-negative breast cancers significantly correlates with reduced frequency of metastatic events and improved survival. These data demonstrate that the prometastatic role of galectin-3 is regulated by its ability to bind to the tumor cell surface and highlight the potential of monitoring ST6GalNAc2 expression to stratify patients with breast cancer for treatment with galectin-3 inhibitors.
Thoroughbred horses have been selected for exceptional racing performance resulting in system-wide structural and functional adaptations contributing to elite athletic phenotypes. Because selection ...has been recent and intense in a closed population that stems from a small number of founder animals Thoroughbreds represent a unique population within which to identify genomic contributions to exercise-related traits. Employing a population genetics-based hitchhiking mapping approach we performed a genome scan using 394 autosomal and X chromosome microsatellite loci and identified positively selected loci in the extreme tail-ends of the empirical distributions for (1) deviations from expected heterozygosity (Ewens-Watterson test) in Thoroughbred (n = 112) and (2) global differentiation among four geographically diverse horse populations (F(ST)). We found positively selected genomic regions in Thoroughbred enriched for phosphoinositide-mediated signalling (3.2-fold enrichment; P<0.01), insulin receptor signalling (5.0-fold enrichment; P<0.01) and lipid transport (2.2-fold enrichment; P<0.05) genes. We found a significant overrepresentation of sarcoglycan complex (11.1-fold enrichment; P<0.05) and focal adhesion pathway (1.9-fold enrichment; P<0.01) genes highlighting the role for muscle strength and integrity in the Thoroughbred athletic phenotype. We report for the first time candidate athletic-performance genes within regions targeted by selection in Thoroughbred horses that are principally responsible for fatty acid oxidation, increased insulin sensitivity and muscle strength: ACSS1 (acyl-CoA synthetase short-chain family member 1), ACTA1 (actin, alpha 1, skeletal muscle), ACTN2 (actinin, alpha 2), ADHFE1 (alcohol dehydrogenase, iron containing, 1), MTFR1 (mitochondrial fission regulator 1), PDK4 (pyruvate dehydrogenase kinase, isozyme 4) and TNC (tenascin C). Understanding the genetic basis for exercise adaptation will be crucial for the identification of genes within the complex molecular networks underlying obesity and its consequential pathologies, such as type 2 diabetes. Therefore, we propose Thoroughbred as a novel in vivo large animal model for understanding molecular protection against metabolic disease.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK