Extensive genomic characterization of human cancers presents the problem of inference from genomic abnormalities to cancer phenotypes. To address this problem, we analysed proteomes of colon and ...rectal tumours characterized previously by The Cancer Genome Atlas (TCGA) and perform integrated proteogenomic analyses. Somatic variants displayed reduced protein abundance compared to germline variants. Messenger RNA transcript abundance did not reliably predict protein abundance differences between tumours. Proteomics identified five proteomic subtypes in the TCGA cohort, two of which overlapped with the TCGA 'microsatellite instability/CpG island methylation phenotype' transcriptomic subtype, but had distinct mutation, methylation and protein expression patterns associated with different clinical outcomes. Although copy number alterations showed strong cis- and trans-effects on mRNA abundance, relatively few of these extend to the protein level. Thus, proteomics data enabled prioritization of candidate driver genes. The chromosome 20q amplicon was associated with the largest global changes at both mRNA and protein levels; proteomics data highlighted potential 20q candidates, including HNF4A (hepatocyte nuclear factor 4, alpha), TOMM34 (translocase of outer mitochondrial membrane 34) and SRC (SRC proto-oncogene, non-receptor tyrosine kinase). Integrated proteogenomic analysis provides functional context to interpret genomic abnormalities and affords a new paradigm for understanding cancer biology.
Here we present an optimized workflow for global proteome and phosphoproteome analysis of tissues or cell lines that uses isobaric tags (TMT (tandem mass tags)-10) for multiplexed analysis and ...relative quantification, and provides 3× higher throughput than iTRAQ (isobaric tags for absolute and relative quantification)-4-based methods with high intra- and inter-laboratory reproducibility. The workflow was systematically characterized and benchmarked across three independent laboratories using two distinct breast cancer subtypes from patient-derived xenograft models to enable assessment of proteome and phosphoproteome depth and quantitative reproducibility. Each plex consisted of ten samples, each being 300 μg of peptide derived from <50 mg of wet-weight tissue. Of the 10,000 proteins quantified per sample, we could distinguish 7,700 human proteins derived from tumor cells and 3100 mouse proteins derived from the surrounding stroma and blood. The maximum deviation across replicates and laboratories was <7%, and the inter-laboratory correlation for TMT ratio-based comparison of the two breast cancer subtypes was r > 0.88. The maximum deviation for the phosphoproteome coverage was <24% across laboratories, with an average of >37,000 quantified phosphosites per sample and differential quantification correlations of r > 0.72. The full procedure, including sample processing and data generation, can be completed within 10 d for ten tissue samples, and 100 samples can be analyzed in ~4 months using a single LC-MS/MS instrument. The high quality, depth, and reproducibility of the data obtained both within and across laboratories should enable new biological insights to be obtained from mass spectrometry-based proteomics analyses of cells and tissues together with proteogenomic data integration.
The four intrinsic subtypes of breast cancer, defined by differential expression of 50 genes (PAM50), have been shown to be predictive of risk of recurrence and benefit of hormonal therapy and ...chemotherapy. Here we describe the development of Prosigna™, a PAM50-based subtype classifier and risk model on the NanoString nCounter Dx Analysis System intended for decentralized testing in clinical laboratories.
514 formalin-fixed, paraffin-embedded (FFPE) breast cancer patient samples were used to train prototypical centroids for each of the intrinsic subtypes of breast cancer on the NanoString platform. Hierarchical cluster analysis of gene expression data was used to identify the prototypical centroids defined in previous PAM50 algorithm training exercises. 304 FFPE patient samples from a well annotated clinical cohort in the absence of adjuvant systemic therapy were then used to train a subtype-based risk model (i.e. Prosigna ROR score). 232 samples from a tamoxifen-treated patient cohort were used to verify the prognostic accuracy of the algorithm prior to initiating clinical validation studies.
The gene expression profiles of each of the four Prosigna subtype centroids were consistent with those previously published using the PCR-based PAM50 method. Similar to previously published classifiers, tumor samples classified as Luminal A by Prosigna had the best prognosis compared to samples classified as one of the three higher-risk tumor subtypes. The Prosigna Risk of Recurrence (ROR) score model was verified to be significantly associated with prognosis as a continuous variable and to add significant information over both commonly available IHC markers and Adjuvant! Online.
The results from the training and verification data sets show that the FDA-cleared and CE marked Prosigna test provides an accurate estimate of the risk of distant recurrence in hormone receptor positive breast cancer and is also capable of identifying a tumor's intrinsic subtype that is consistent with the previously published PCR-based PAM50 assay. Subsequent analytical and clinical validation studies confirm the clinical accuracy and technical precision of the Prosigna PAM50 assay in a decentralized setting.
To characterize patient-derived xenografts (PDXs) for functional studies, we made whole-genome comparisons with originating breast cancers representative of the major intrinsic subtypes. Structural ...and copy number aberrations were found to be retained with high fidelity. However, at the single-nucleotide level, variable numbers of PDX-specific somatic events were documented, although they were only rarely functionally significant. Variant allele frequencies were often preserved in the PDXs, demonstrating that clonal representation can be transplantable. Estrogen-receptor-positive PDXs were associated with ESR1 ligand-binding-domain mutations, gene amplification, or an ESR1/YAP1 translocation. These events produced different endocrine-therapy-response phenotypes in human, cell line, and PDX endocrine-response studies. Hence, deeply sequenced PDX models are an important resource for the search for genome-forward treatment options and capture endocrine-drug-resistance etiologies that are not observed in standard cell lines. The originating tumor genome provides a benchmark for assessing genetic drift and clonal representation after transplantation.
Display omitted
•Genome-wide structural variants are highly preserved in breast cancer PDX models•PDXs harbor rarely functionally significant single-nucleotide variants•PDXs harbor ESR1 mutations and translocations associated with endocrine therapy resistance•ESR1 gene amplification in PDX correlated with paradoxical estradiol-induced regression
In this study, Ellis and colleagues compare whole-tumor genomes from drug-resistant breast cancers with paired xenografts. Genomic fidelity upon transplantation was high for structural variants but variable at the single-nucleotide level. Therefore, tumor and xenograft whole-genome comparisons critically assess genetic drift and clonal representation. Additional analysis revealed ESR1 mutations, amplification, and translocations associated with endocrine resistance in lumenal xenografts. Sequenced patient-derived xenografts are an important resource for functional genomics and capture treatment-resistance etiologies that are not observed in standard cell lines.
Recent advances in mass spectrometry (MS) have enabled extensive analysis of cancer proteomes. Here, we employed quantitative proteomics to profile protein expression across 24 breast cancer ...patient-derived xenograft (PDX) models. Integrated proteogenomic analysis shows positive correlation between expression measurements from transcriptomic and proteomic analyses; further, gene expression-based intrinsic subtypes are largely re-capitulated using non-stromal protein markers. Proteogenomic analysis also validates a number of predicted genomic targets in multiple receptor tyrosine kinases. However, several protein/phosphoprotein events such as overexpression of AKT proteins and ARAF, BRAF, HSP90AB1 phosphosites are not readily explainable by genomic analysis, suggesting that druggable translational and/or post-translational regulatory events may be uniquely diagnosed by MS. Drug treatment experiments targeting HER2 and components of the PI3K pathway supported proteogenomic response predictions in seven xenograft models. Our study demonstrates that MS-based proteomics can identify therapeutic targets and highlights the potential of PDX drug response evaluation to annotate MS-based pathway activities.
Protein abundance and phosphorylation convey important information about pathway activity and molecular pathophysiology in diseases including cancer, providing biological insight, informing drug and ...diagnostic development, and guiding therapeutic intervention. Analyzed tissues are usually collected without tight regulation or documentation of ischemic time. To evaluate the impact of ischemia, we collected human ovarian tumor and breast cancer xenograft tissue without vascular interruption and performed quantitative proteomics and phosphoproteomics after defined ischemic intervals. Although the global expressed proteome and most of the >25,000 quantified phosphosites were unchanged after 60 min, rapid phosphorylation changes were observed in up to 24% of the phosphoproteome, representing activation of critical cancer pathways related to stress response, transcriptional regulation, and cell death. Both pan-tumor and tissue-specific changes were observed. The demonstrated impact of pre-analytical tissue ischemia on tumor biology mandates caution in interpreting stress-pathway activation in such samples and motivates reexamination of collection protocols for phosphoprotein analysis.
For many years, basic and clinical researchers have taken advantage of the analytical sensitivity and specificity afforded by mass spectrometry in the measurement of proteins. Clinical laboratories ...are now beginning to deploy these work flows as well. For assays that use proteolysis to generate peptides for protein quantification and characterization, synthetic stable isotope-labeled internal standard peptides are of central importance. No general recommendations are currently available surrounding the use of peptides in protein mass spectrometric assays.
The Clinical Proteomic Tumor Analysis Consortium of the National Cancer Institute has collaborated with clinical laboratorians, peptide manufacturers, metrologists, representatives of the pharmaceutical industry, and other professionals to develop a consensus set of recommendations for peptide procurement, characterization, storage, and handling, as well as approaches to the interpretation of the data generated by mass spectrometric protein assays. Additionally, the importance of carefully characterized reference materials-in particular, peptide standards for the improved concordance of amino acid analysis methods across the industry-is highlighted. The alignment of practices around the use of peptides and the transparency of sample preparation protocols should allow for the harmonization of peptide and protein quantification in research and clinical care.
Purpose
Multi-gene signatures provide biological insight and risk stratification in breast cancer. Intrinsic molecular subtypes defined by mRNA expression of 50 genes (PAM50) are prognostic in ...hormone-receptor positive postmenopausal breast cancer. Yet, for 25–40% in the PAM50 intermediate risk group, long-term risk remains uncertain. Our study aimed to (i) test the long-term prognostic value of the PAM50 signature in pre- and post-menopausal breast cancer; (ii) investigate if the PAM50 model could be improved by addition of other mRNAs implicated in oncogenesis.
Methods
We used archived FFPE samples from 1723 breast cancer survivors; high quality reads were obtained on 1253 samples. Transcript expression was quantified using a custom codeset with probes for > 100 targets. Cox models assessed gene signatures for breast cancer relapse and survival.
Results
Over 15 + years of follow-up, PAM50 subtypes were (
P
< 0.01) associated with breast cancer outcomes after accounting for tumor stage, grade and age at diagnosis. Results did not differ by menopausal status at diagnosis. Women with Luminal B (versus Luminal A) subtype had a > 60% higher hazard. Addition of a 13-gene hypoxia signature improved prognostication with > 40% higher hazard in the highest vs lowest hypoxia tertiles.
Conclusions
PAM50 intrinsic subtypes were independently prognostic for long-term breast cancer survival, irrespective of menopausal status. Addition of hypoxia signatures improved risk prediction. If replicated, incorporating the 13-gene hypoxia signature into the existing PAM50 risk assessment tool, may refine risk stratification and further clarify treatment for breast cancer.
To compare clinical, immunohistochemical (IHC), and gene expression models of prognosis applicable to formalin-fixed, paraffin-embedded blocks in a large series of estrogen receptor (ER)-positive ...breast cancers from patients uniformly treated with adjuvant tamoxifen.
Quantitative real-time reverse transcription-PCR (qRT-PCR) assays for 50 genes identifying intrinsic breast cancer subtypes were completed on 786 specimens linked to clinical (median follow-up, 11.7 years) and IHC ER, progesterone receptor (PR), HER2, and Ki67 data. Performance of predefined intrinsic subtype and risk-of-relapse scores was assessed using multivariable Cox models and Kaplan-Meier analysis. Harrell's C-index was used to compare fixed models trained in independent data sets, including proliferation signatures.
Despite clinical ER positivity, 10% of cases were assigned to nonluminal subtypes. qRT-PCR signatures for proliferation genes gave more prognostic information than clinical assays for hormone receptors or Ki67. In Cox models incorporating standard prognostic variables, hazard ratios for breast cancer disease-specific survival over the first 5 years of follow-up, relative to the most common luminal A subtype, are 1.99 95% confidence interval (CI), 1.09-3.64 for luminal B, 3.65 (95% CI, 1.64-8.16) for HER2-enriched subtype, and 17.71 (95% CI, 1.71-183.33) for the basal-like subtype. For node-negative disease, PAM50 qRT-PCR-based risk assignment weighted for tumor size and proliferation identifies a group with >95% 10-year survival without chemotherapy. In node-positive disease, PAM50-based prognostic models were also superior.
The PAM50 gene expression test for intrinsic biological subtype can be applied to large series of formalin-fixed, paraffin-embedded breast cancers, and gives more prognostic information than clinical factors and IHC using standard cut points.
Development of candidate cancer treatments is a resource-intensive process, with the research community continuing to investigate options beyond static genomic characterization. Toward this goal, we ...have established the genomic landscapes of 536 patient-derived xenograft (PDX) models across 25 cancer types, together with mutation, copy number, fusion, transcriptomic profiles, and NCI-MATCH arms. Compared with human tumors, PDXs typically have higher purity and fit to investigate dynamic driver events and molecular properties via multiple time points from same case PDXs. Here, we report on dynamic genomic landscapes and pharmacogenomic associations, including associations between activating oncogenic events and drugs, correlations between whole-genome duplications and subclone events, and the potential PDX models for NCI-MATCH trials. Lastly, we provide a web portal having comprehensive pan-cancer PDX genomic profiles and source code to facilitate identification of more druggable events and further insights into PDXs' recapitulation of human tumors.