Papillary renal cell carcinoma (pRCC) is an important subtype of kidney cancer with a problematic pathological classification and highly variable clinical behaviour. Here we sequence the genomes or ...exomes of 31 pRCCs, and in four tumours, multi-region sequencing is undertaken. We identify BAP1, SETD2, ARID2 and Nrf2 pathway genes (KEAP1, NHE2L2 and CUL3) as probable drivers, together with at least eight other possible drivers. However, only ~10% of tumours harbour detectable pathogenic changes in any one driver gene, and where present, the mutations are often predicted to be present within cancer sub-clones. We specifically detect parallel evolution of multiple SETD2 mutations within different sub-regions of the same tumour. By contrast, large copy number gains of chromosomes 7, 12, 16 and 17 are usually early, monoclonal changes in pRCC evolution. The predominance of large copy number variants as the major drivers for pRCC highlights an unusual mode of tumorigenesis that may challenge precision medicine approaches.
Genome-wide association (GWA) studies have identified multiple loci at which common variants modestly influence the risk of developing colorectal cancer (CRC). To enhance power to identify additional ...loci with similar effect sizes, we conducted a meta-analysis of two GWA studies, comprising 13,315 individuals genotyped for 38,710 common tagging SNPs. We undertook replication testing in up to eight independent case-control series comprising 27,418 subjects. We identified four previously unreported CRC risk loci at 14q22.2 (rs4444235, BMP4; P = 8.1 x 10(-10)), 16q22.1 (rs9929218, CDH1; P = 1.2 x 10(-8)), 19q13.1 (rs10411210, RHPN2; P = 4.6 x 10(-9)) and 20p12.3 (rs961253; P = 2.0 x 10(-10)). These findings underscore the value of large sample series for discovery and follow-up of genetic variants contributing to the etiology of CRC.
Genome-wide association studies (GWAS) of colorectal cancer (CRC) have identified 23 susceptibility loci thus far. Analyses of previously conducted GWAS indicate additional risk loci are yet to be ...discovered. To identify novel CRC susceptibility loci, we conducted a new GWAS and performed a meta-analysis with five published GWAS (totalling 7,577 cases and 9,979 controls of European ancestry), imputing genotypes utilising the 1000 Genomes Project. The combined analysis identified new, significant associations with CRC at 1p36.2 marked by rs72647484 (minor allele frequency MAF = 0.09) near CDC42 and WNT4 (P = 1.21 × 10(-8), odds ratio OR = 1.21 ) and at 16q24.1 marked by rs16941835 (MAF = 0.21, P = 5.06 × 10(-8); OR = 1.15) within the long non-coding RNA (lncRNA) RP11-58A18.1 and ~500 kb from the nearest coding gene FOXL1. Additionally we identified a promising association at 10p13 with rs10904849 intronic to CUBN (MAF = 0.32, P = 7.01 × 10(-8); OR = 1.14). These findings provide further insights into the genetic and biological basis of inherited genetic susceptibility to CRC. Additionally, our analysis further demonstrates that imputation can be used to exploit GWAS data to identify novel disease-causing variants.
Objective Colorectal cancer (CRC) has a substantial heritable component. Common genetic variation has been shown to contribute to CRC risk. A study was conducted in a large multi-population study to ...assess the feasibility of CRC risk prediction using common genetic variant data combined with other risk factors. A risk prediction model was built and applied to the Scottish population using available data. Design Nine populations of European descent were studied to develop and validate CRC risk prediction models. Binary logistic regression was used to assess the combined effect of age, gender, family history (FH) and genotypes at 10 susceptibility loci that individually only modestly influence CRC risk. Risk models were generated from case-control data incorporating genotypes alone (n=39 266) and in combination with gender, age and FH (n=11 324). Model discriminatory performance was assessed using 10-fold internal cross-validation and externally using 4187 independent samples. The 10-year absolute risk was estimated by modelling genotype and FH with age- and gender-specific population risks. Results The median number of risk alleles was greater in cases than controls (10 vs 9, p<2.2×10−16), confirmed in external validation sets (Sweden p=1.2×10−6, Finland p=2×10−5). The mean per-allele increase in risk was 9% (OR 1.09; 95% CI 1.05 to 1.13). Discriminative performance was poor across the risk spectrum (area under curve for genotypes alone 0.57; area under curve for genotype/age/gender/FH 0.59). However, modelling genotype data, FH, age and gender with Scottish population data shows the practicalities of identifying a subgroup with >5% predicted 10-year absolute risk. Conclusion Genotype data provide additional information that complements age, gender and FH as risk factors, but individualised genetic risk prediction is not currently feasible. Nonetheless, the modelling exercise suggests public health potential since it is possible to stratify the population into CRC risk categories, thereby informing targeted prevention and surveillance.
To identify colorectal cancer (CRC) susceptibility alleles, we conducted a genome-wide association study. In phase 1, we genotyped 550,163 tagSNPs in 940 familial colorectal tumor cases (627 CRC, 313 ...high-risk adenoma) and 965 controls. In phase 2, we genotyped 42,708 selected SNPs in 2,873 CRC cases and 2,871 controls. In phase 3, we evaluated 11 SNPs showing association at P < 10−4 in a joint analysis of phases 1 and 2 in 4,287 CRC cases and 3,743 controls. Two SNPs were taken forward to phase 4 genotyping (10,731 CRC cases and 10,961 controls from eight centers). In addition to the previously reported 8q24, 15q13 and 18q21 CRC risk loci, we identified two previously unreported associations: rs10795668, located at 10p14 (P = 2.5 × 10−13 overall; P = 6.9 × 10−12 replication), and rs16892766, at 8q23.3 (P = 3.3 × 10−18 overall; P = 9.6 × 10−17 replication), which tags a plausible causative gene, EIF3H. These data provide further evidence for the 'common-disease common-variant' model of CRC predisposition.
Genome-wide association studies (GWAS) have identified 14 tagging single nucleotide polymorphisms (tagSNPs) that are associated with the risk of colorectal cancer (CRC), and several of these tagSNPs ...are near bone morphogenetic protein (BMP) pathway loci. The penalty of multiple testing implicit in GWAS increases the attraction of complementary approaches for disease gene discovery, including candidate gene- or pathway-based analyses. The strongest candidate loci for additional predisposition SNPs are arguably those already known both to have functional relevance and to be involved in disease risk. To investigate this proposition, we searched for novel CRC susceptibility variants close to the BMP pathway genes GREM1 (15q13.3), BMP4 (14q22.2), and BMP2 (20p12.3) using sample sets totalling 24,910 CRC cases and 26,275 controls. We identified new, independent CRC predisposition SNPs close to BMP4 (rs1957636, P = 3.93×10(-10)) and BMP2 (rs4813802, P = 4.65×10(-11)). Near GREM1, we found using fine-mapping that the previously-identified association between tagSNP rs4779584 and CRC actually resulted from two independent signals represented by rs16969681 (P = 5.33×10(-8)) and rs11632715 (P = 2.30×10(-10)). As low-penetrance predisposition variants become harder to identify-owing to small effect sizes and/or low risk allele frequencies-approaches based on informed candidate gene selection may become increasingly attractive. Our data emphasise that genetic fine-mapping studies can deconvolute associations that have arisen owing to independent correlation of a tagSNP with more than one functional SNP, thus explaining some of the apparently missing heritability of common diseases.
Candidate gene studies have reported CYP19A1 variants to be associated with endometrial cancer and with estradiol (E2) concentrations. We analyzed 2937 single nucleotide polymorphisms (SNPs) in 6608 ...endometrial cancer cases and 37 925 controls and report the first genome wide-significant association between endometrial cancer and a CYP19A1 SNP (rs727479 in intron 2, P=4.8×10−11). SNP rs727479 was also among those most strongly associated with circulating E2 concentrations in 2767 post-menopausal controls (P=7.4×10−8). The observed endometrial cancer odds ratio per rs727479 A-allele (1.15, CI=1.11–1.21) is compatible with that predicted by the observed effect on E2 concentrations (1.09, CI=1.03–1.21), consistent with the hypothesis that endometrial cancer risk is driven by E2. From 28 candidate-causal SNPs, 12 co-located with three putative gene-regulatory elements and their risk alleles associated with higher CYP19A1 expression in bioinformatical analyses. For both phenotypes, the associations with rs727479 were stronger among women with a higher BMI (Pinteraction=0.034 and 0.066 respectively), suggesting a biologically plausible gene-environment interaction.
Epidemiological, biological, and molecular data suggest links between endometriosis and endometrial cancer, with recent epidemiological studies providing evidence for an association between a ...previous diagnosis of endometriosis and risk of endometrial cancer. We used genetic data as an alternative approach to investigate shared biological etiology of these two diseases. Genetic correlation analysis of summary level statistics from genomewide association studies (GWAS) using LD Score regression revealed moderate but significant genetic correlation (rg = 0.23, P = 9.3 × 10−3), and SNP effect concordance analysis provided evidence for significant SNP pleiotropy (P = 6.0 × 10−3) and concordance in effect direction (P = 2.0 × 10−3) between the two diseases. Cross‐disease GWAS meta‐analysis highlighted 13 distinct loci associated at P ≤ 10−5 with both endometriosis and endometrial cancer, with one locus (SNP rs2475335) located within PTPRD associated at a genomewide significant level (P = 4.9 × 10−8, OR = 1.11, 95% CI = 1.07–1.15). PTPRD acts in the STAT3 pathway, which has been implicated in both endometriosis and endometrial cancer. This study demonstrates the value of cross‐disease genetic analysis to support epidemiological observations and to identify biological pathways of relevance to multiple diseases.
Investigating shared biological etiology between endometriosis and endometrial cancer using genomewide genetic data has revealed moderate, significant genetic correlation between them and highlighted 13 genetic loci associated at P ≤ 10−5 with both diseases. One of these SNPs, associated at a genomewide significant level (P = 4.9 × 10−8, OR = 1.11, 95% CI = 1.07–1.15), is located within the PTPRD gene, which acts in the STAT3 pathway implicated in both endometriosis and endometrial cancer. Our study demonstrates the value of cross‐disease genetic analysis to support epidemiological observations and to identify biological pathways of relevance to multiple diseases.
Endometrial cancer is the most common gynecological malignancy in the developed world. Although there is evidence of genetic predisposition to the disease, most of the genetic risk remains ...unexplained. We present the meta-analysis results of four genome-wide association studies (4907 cases and 11 945 controls total) in women of European ancestry. We describe one new locus reaching genome-wide significance (P < 5 × 10
) at 6p22.3 (rs1740828; P = 2.29 × 10
, OR = 1.20), providing evidence of an additional region of interest for genetic susceptibility to endometrial cancer.
Whilst common genetic variation in many non-coding genomic regulatory regions are known to impart risk of colorectal cancer (CRC), much of the heritability of CRC remains unexplained. To examine the ...role of recurrent coding sequence variation in CRC aetiology, we genotyped 12,638 CRCs cases and 29,045 controls from six European populations. Single-variant analysis identified a coding variant (rs3184504) in SH2B3 (12q24) associated with CRC risk (OR = 1.08, P = 3.9 × 10(-7)), and novel damaging coding variants in 3 genes previously tagged by GWAS efforts; rs16888728 (8q24) in UTP23 (OR = 1.15, P = 1.4 × 10(-7)); rs6580742 and rs12303082 (12q13) in FAM186A (OR = 1.11, P = 1.2 × 10(-7) and OR = 1.09, P = 7.4 × 10(-8)); rs1129406 (12q13) in ATF1 (OR = 1.11, P = 8.3 × 10(-9)), all reaching exome-wide significance levels. Gene based tests identified associations between CRC and PCDHGA genes (P < 2.90 × 10(-6)). We found an excess of rare, damaging variants in base-excision (P = 2.4 × 10(-4)) and DNA mismatch repair genes (P = 6.1 × 10(-4)) consistent with a recessive mode of inheritance. This study comprehensively explores the contribution of coding sequence variation to CRC risk, identifying associations with coding variation in 4 genes and PCDHG gene cluster and several candidate recessive alleles. However, these findings suggest that recurrent, low-frequency coding variants account for a minority of the unexplained heritability of CRC.