Long non-coding RNAs (lncRNAs) are emerging as crucial regulators of cellular processes and diseases such as cancer; however, their functions remain poorly characterised. Several studies have ...demonstrated that lncRNAs are typically disease and tumour subtype specific, particularly in breast cancer where lncRNA expression alone is sufficient to discriminate samples based on hormone status and molecular intrinsic subtype. However, little attempt has been made to assess the reproducibility of lncRNA signatures across more than one dataset. In this work, we derive consensus lncRNA signatures indicative of breast cancer subtype based on two clinical RNA-Seq datasets: the Utah Breast Cancer Study and The Cancer Genome Atlas, through integration of differential expression and hypothesis-free clustering analyses. The most consistent signature is associated with breast cancers of the basal-like subtype, leading us to generate a putative set of six lncRNA basal-like breast cancer markers, at least two of which may have a role in cis-regulation of known poor prognosis markers. Through in silico functional characterization of individual signatures and integration of expression data from pre-clinical cancer models, we discover that discordance between signatures derived from different clinical cohorts can arise from the strong influence of non-cancerous cells in tumour samples. As a consequence, we identify nine lncRNAs putatively associated with breast cancer associated fibroblasts, or the immune response. Overall, our study establishes the confounding effects of tumour purity on lncRNA signature derivation, and generates several novel hypotheses on the role of lncRNAs in basal-like breast cancers and the tumour microenvironment.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Uptake of risk-reducing surgery has increased among women at high risk of epithelial ovarian cancer. We sought to characterise familial risk of epithelial ovarian cancer histotypes in a ...population-based study after accounting for gynaecological surgeries, including bilateral oophorectomy.
We compared risk of epithelial ovarian cancer in relatives of 3536 epithelial ovarian cancer cases diagnosed in 1966-2016 and relatives of 35 326 matched controls. We used Cox competing risk models, incorporating bilateral oophorectomy as a competing risk, to estimate the relative risk of ovarian cancer in first-degree (FDR), second-degree (SDR) and third-degree (TDR) relatives from 1966 to 2016. We also estimated relative risks in time periods before (1966-1994, 1995-2004) and after (2005-2016) formal recommendations were made for prophylactic oophorectomy among women with pathogenic variants in
.
The relative risks of epithelial ovarian cancer in FDRs, SDRs and TDRs of cases versus controls were 1.68 (95% CI 1.39 to 2.04), 1.51 (95% CI 1.30 to 1.75) and 1.34 (95% CI 1.20 to 1.48), respectively. Relative risks were greatest for high-grade serous, mucinous and 'other epithelial' histotypes. Relative risks were attenuated for case FDRs, but not for SDRs or TDRs, from 2005 onwards, consistent with the timing of recommendations for prophylactic surgery.
Familial risk of epithelial ovarian cancer extends to TDRs, especially for high-grade serous and mucinous histotypes. Distant relatives share genes but minimal environment, highlighting the importance of germline inherited genetics in ovarian cancer aetiology. Increased ovarian cancer risk in distant relatives has implications for counselling and recommendations for prophylactic surgeries that, from our data, appear only to reach FDRs.
The high-risk pedigree (HRP) design is an established strategy to discover rare, highly-penetrant, Mendelian-like causal variants. Its success, however, in complex traits has been modest, largely due ...to challenges of genetic heterogeneity and complex inheritance models. We describe a HRP strategy that addresses intra-familial heterogeneity, and identifies inherited segments important for mapping regulatory risk. We apply this new Shared Genomic Segment (SGS) method in 11 extended, Utah, multiple myeloma (MM) HRPs, and subsequent exome sequencing in SGS regions of interest in 1063 MM / MGUS (monoclonal gammopathy of undetermined significance-a precursor to MM) cases and 964 controls from a jointly-called collaborative resource, including cases from the initial 11 HRPs. One genome-wide significant 1.8 Mb shared segment was found at 6q16. Exome sequencing in this region revealed predicted deleterious variants in USP45 (p.Gln691* and p.Gln621Glu), a gene known to influence DNA repair through endonuclease regulation. Additionally, a 1.2 Mb segment at 1p36.11 is inherited in two Utah HRPs, with coding variants identified in ARID1A (p.Ser90Gly and p.Met890Val), a key gene in the SWI/SNF chromatin remodeling complex. Our results provide compelling statistical and genetic evidence for segregating risk variants for MM. In addition, we demonstrate a novel strategy to use large HRPs for risk-variant discovery more generally in complex traits.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Transcriptome studies are gaining momentum in genomic epidemiology, and the need to incorporate these data in multivariable models alongside other risk factors brings demands for new approaches.
Here ...we describe SPECTRA, an approach to derive quantitative variables that capture the intrinsic variation in gene expression of a tissue type. We applied the SPECTRA approach to bulk RNA sequencing from malignant cells (CD138+) in patients from the Multiple Myeloma Research Foundation CoMMpass study.
A set of 39 spectra variables were derived to represent multiple myeloma cells. We used these variables in predictive modeling to determine spectra-based risk scores for overall survival, progression-free survival, and time to treatment failure. Risk scores added predictive value beyond known clinical and expression risk factors and replicated in an external dataset. Spectrum variable S5, a significant predictor for all three outcomes, showed pre-ranked gene set enrichment for the unfolded protein response, a mechanism targeted by proteasome inhibitors which are a common first line agent in multiple myeloma treatment. We further used the 39 spectra variables in descriptive modeling, with significant associations found with tumor cytogenetics, race, gender, and age at diagnosis; factors known to influence multiple myeloma incidence or progression.
Quantitative variables from the SPECTRA approach can predict clinical outcomes in multiple myeloma and provide a new avenue for insight into tumor differences by demographic groups.
The SPECTRA approach provides a set of quantitative phenotypes that deeply profile a tissue and allows for more comprehensive modeling of gene expression with other risk factors.
Several chronic lymphocytic leukaemia (CLL) susceptibility loci have been reported; however, much of the heritable risk remains unidentified. Here we perform a meta-analysis of six genome-wide ...association studies, imputed using a merged reference panel of 1,000 Genomes and UK10K data, totalling 6,200 cases and 17,598 controls after replication. We identify nine risk loci at 1p36.11 (rs34676223, P=5.04 × 10
), 1q42.13 (rs41271473, P=1.06 × 10
), 4q24 (rs71597109, P=1.37 × 10
), 4q35.1 (rs57214277, P=3.69 × 10
), 6p21.31 (rs3800461, P=1.97 × 10
), 11q23.2 (rs61904987, P=2.64 × 10
), 18q21.1 (rs1036935, P=3.27 × 10
), 19p13.3 (rs7254272, P=4.67 × 10
) and 22q13.33 (rs140522, P=2.70 × 10
). These new and established risk loci map to areas of active chromatin and show an over-representation of transcription factor binding for the key determinants of B-cell development and immune response.
Can we simultaneously assess risk for multiple cancers to identify familial multicancer patterns in families of azoospermic and severely oligozoospermic men?
Distinct familial cancer patterns were ...observed in the azoospermia and severe oligozoospermia cohorts, suggesting heterogeneity in familial cancer risk by both type of subfertility and within subfertility type.
Subfertile men and their relatives show increased risk for certain cancers including testicular, thyroid, and pediatric.
A retrospective cohort of subfertile men (N = 786) was identified and matched to fertile population controls (N = 5674). Family members out to third-degree relatives were identified for both subfertile men and fertile population controls (N = 337 754). The study period was 1966-2017. Individuals were censored at death or loss to follow-up, loss to follow-up occurred if they left Utah during the study period.
Azoospermic (0 × 106/mL) and severely oligozoospermic (<1.5 × 106/mL) men were identified in the Subfertility Health and Assisted Reproduction and the Environment cohort (SHARE). Subfertile men were age- and sex-matched 5:1 to fertile population controls and family members out to third-degree relatives were identified using the Utah Population Database (UPDB). Cancer diagnoses were identified through the Utah Cancer Registry. Families containing ≥10 members with ≥1 year of follow-up 1966-2017 were included (azoospermic: N = 426 families, 21 361 individuals; oligozoospermic: N = 360 families, 18 818 individuals). Unsupervised clustering based on standardized incidence ratios for 34 cancer phenotypes in the families was used to identify familial multicancer patterns; azoospermia and severe oligospermia families were assessed separately.
Compared to control families, significant increases in cancer risks were observed in the azoospermia cohort for five cancer types: bone and joint cancers hazard ratio (HR) = 2.56 (95% CI = 1.48-4.42), soft tissue cancers HR = 1.56 (95% CI = 1.01-2.39), uterine cancers HR = 1.27 (95% CI = 1.03-1.56), Hodgkin lymphomas HR = 1.60 (95% CI = 1.07-2.39), and thyroid cancer HR = 1.54 (95% CI = 1.21-1.97). Among severe oligozoospermia families, increased risk was seen for three cancer types: colon cancer HR = 1.16 (95% CI = 1.01-1.32), bone and joint cancers HR = 2.43 (95% CI = 1.30-4.54), and testis cancer HR = 2.34 (95% CI = 1.60-3.42) along with a significant decrease in esophageal cancer risk HR = 0.39 (95% CI = 0.16-0.97). Thirteen clusters of familial multicancer patterns were identified in families of azoospermic men, 66% of families in the azoospermia cohort showed population-level cancer risks, however, the remaining 12 clusters showed elevated risk for 2-7 cancer types. Several of the clusters with elevated cancer risks also showed increased odds of cancer diagnoses at young ages with six clusters showing increased odds of adolescent and young adult (AYA) diagnosis odds ratio (OR) = 1.96-2.88 and two clusters showing increased odds of pediatric cancer diagnosis (OR = 3.64-12.63). Within the severe oligozoospermia cohort, 12 distinct familial multicancer clusters were identified. All 12 clusters showed elevated risk for 1-3 cancer types. An increase in odds of cancer diagnoses at young ages was also seen in five of the severe oligozoospermia familial multicancer clusters, three clusters showed increased odds of AYA diagnosis (OR = 2.19-2.78) with an additional two clusters showing increased odds of a pediatric diagnosis (OR = 3.84-9.32).
Although this study has many strengths, including population data for family structure, cancer diagnoses and subfertility, there are limitations. First, semen measures are not available for the sample of fertile men. Second, there is no information on medical comorbidities or lifestyle risk factors such as smoking status, BMI, or environmental exposures. Third, all of the subfertile men included in this study were seen at a fertility clinic for evaluation. These men were therefore a subset of the overall population experiencing fertility problems and likely represent those with the socioeconomic means for evaluation by a physician.
This analysis leveraged unique population-level data resources, SHARE and the UPDB, to describe novel multicancer clusters among the families of azoospermic and severely oligozoospermic men. Distinct overall multicancer risk and familial multicancer patterns were observed in the azoospermia and severe oligozoospermia cohorts, suggesting heterogeneity in cancer risk by type of subfertility and within subfertility type. Describing families with similar cancer risk patterns provides a new avenue to increase homogeneity for focused gene discovery and environmental risk factor studies. Such discoveries will lead to more accurate risk predictions and improved counseling for patients and their families.
This work was funded by GEMS: Genomic approach to connecting Elevated germline Mutation rates with male infertility and Somatic health (Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD): R01 HD106112). The authors have no conflicts of interest relevant to this work.
N/A.
Abstract
Inherited genetic risk factors play a role in multiple myeloma (MM), yet considerable missing heritability exists. Rare risk variants at genome-wide association study (GWAS) loci are a new ...avenue to explore. Pleiotropy between lymphoid neoplasms (LNs) has been suggested in family history and genetic studies, but no studies have interrogated sequencing for pleiotropic genes or rare risk variants. Sequencing genetically enriched cases can help discover rarer variants. We analyzed exome sequencing in familial or early-onset MM cases to identify rare, functionally relevant variants near GWAS loci for a range of LNs. A total of 149 distinct and significant LN GWAS loci have been published. We identified six recurrent, rare, potentially deleterious variants within 5 kb of significant GWAS single nucleotide polymorphisms in 75 MM cases. Mutations were observed in BTNL2, EOMES, TNFRSF13B, IRF8, ACOXL and TSPAN32. All six genes replicated in an independent set of 255 early-onset MM or familial MM or precursor cases. Expansion of our analyses to the full length of these six genes resulted in a list of 39 rare and deleterious variants, seven of which segregated in MM families. Three genes also had significant rare variant burden in 733 sporadic MM cases compared with 935 control individuals: IRF8 (P = 1.0 × 10−6), EOMES (P = 6.0 × 10−6) and BTNL2 (P = 2.1 × 10−3). Together, our results implicate six genes in MM risk, provide support for genetic pleiotropy between LN subtypes and demonstrate the utility of sequencing genetically enriched cases to identify functionally relevant variants near GWAS loci.
Suicide is the 10th leading cause of death in the United States. Although environment has undeniable impact, evidence suggests that genetic factors play a significant role in completed suicide. We ...linked a resource of ~ 4500 DNA samples from completed suicides obtained from the Utah Medical Examiner to genealogical records and medical records data available on over eight million individuals. This linking has resulted in the identification of high-risk extended families (7-9 generations) with significant familial risk of completed suicide. Familial aggregation across distant relatives minimizes effects of shared environment, provides more genetically homogeneous risk groups, and magnifies genetic risks through familial repetition. We analyzed Illumina PsychArray genotypes from suicide cases in 43 high-risk families, identifying 30 distinct shared genomic segments with genome-wide evidence (p = 2.02E-07-1.30E-18) of segregation with completed suicide. The 207 genes implicated by the shared regions provide a focused set of genes for further study; 18 have been previously associated with suicide risk. Although PsychArray variants do not represent exhaustive variation within the 207 genes, we investigated these for specific segregation within the high-risk families, and for association of variants with predicted functional impact in ~ 1300 additional Utah suicides unrelated to the discovery families. None of the limited PsychArray variants explained the high-risk family segregation; sequencing of these regions will be needed to discover segregating risk variants, which may be rarer or regulatory. However, additional association tests yielded four significant PsychArray variants (SP110, rs181058279; AGBL2, rs76215382; SUCLA2, rs121908538; APH1B, rs745918508), raising the likelihood that these genes confer risk of completed suicide.
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Age-related macular degeneration (AMD) is the most common cause of irreversible vision loss in the developed world and has a strong genetic predisposition. A locus at human chromosome 10q26 affects ...the risk of AMD, but the precise gene(s) have not been identified. We genotyped 581 AMD cases and 309 normal controls in a Caucasian cohort in Utah. We demonstrate that a single-nucleotide polymorphism, rs11200638, in the promoter region of HTRA1 is the most likely causal variant for AMD at 10q26 and is estimated to confer a population attributable risk of 49.3%. The HTRA1 gene encodes a secreted serine protease. Preliminary analysis of lymphocytes and retinal pigment epithelium from four AMD patients revealed that the risk allele was associated with elevated expression levels of HTRA1 mRNA and protein. We also found that drusen in the eyes of AMD patients were strongly immunolabeled with HTRA1 antibody. Together, these findings support a key role for HTRA1 in AMD susceptibility and identify a potential new pathway for AMD pathogenesis.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK