Various genetic mutations associated with cancer are known to alter cell signaling, but it is not clear whether they dysregulate signaling pathways by altering the abundance of pathway proteins. ...Using a combination of RNA sequencing and ultrasensitive targeted proteomics, we defined the primary components-16 core proteins and 10 feedback regulators-of the epidermal growth factor receptor (EGFR)-mitogen-activated protein kinase (MAPK) pathway in normal human mammary epithelial cells and then quantified their absolute abundance across a panel of normal and breast cancer cell lines as well as fibroblasts. We found that core pathway proteins were present at very similar concentrations across all cell types, with a variance similar to that of proteins previously shown to display conserved abundances across species. In contrast, EGFR and transcriptionally controlled feedback regulators were present at highly variable concentrations. The absolute abundance of most core proteins was between 50,000 and 70,000 copies per cell, but the adaptors SOS1, SOS2, and GAB1 were found at far lower amounts (2000 to 5000 copies per cell). MAPK signaling showed saturation in all cells between 3000 and 10,000 occupied EGFRs, consistent with the idea that adaptors limit signaling. Our results suggest that the relative stoichiometry of core MAPK pathway proteins is very similar across different cell types, with cell-specific differences mostly restricted to variable amounts of feedback regulators and receptors. The low abundance of adaptors relative to EGFR could be responsible for previous observations that only a fraction of total cell surface EGFR is capable of rapid endocytosis, high-affinity binding, and mitogenic signaling.
In this review, we apply selected imputation strategies to label-free liquid chromatography–mass spectrometry (LC–MS) proteomics datasets to evaluate the accuracy with respect to metrics of variance ...and classification. We evaluate several commonly used imputation approaches for individual merits and discuss the caveats of each approach with respect to the example LC–MS proteomics data. In general, local similarity-based approaches, such as the regularized expectation maximization and least-squares adaptive algorithms, yield the best overall performances with respect to metrics of accuracy and robustness. However, no single algorithm consistently outperforms the remaining approaches, and in some cases, performing classification without imputation sometimes yielded the most accurate classification. Thus, because of the complex mechanisms of missing data in proteomics, which also vary from peptide to protein, no individual method is a single solution for imputation. On the basis of the observations in this review, the goal for imputation in the field of computational proteomics should be to develop new approaches that work generically for this data type and new strategies to guide users in the selection of the best imputation for their dataset and analysis objectives.
The throughput efficiency and increased depth of coverage provided by isobaric-labeled proteomics measurements have led to increased usage of these techniques. However, the structure of missing data ...is different than unlabeled studies, which prompts the need for this review to compare the efficacy of nine imputation methods on large isobaric-labeled proteomics data sets to guide researchers on the appropriateness of various imputation methods. Imputation methods were evaluated by accuracy, statistical hypothesis test inference, and run time. In general, expectation maximization and random forest imputation methods yielded the best performance, and constant-based methods consistently performed poorly across all data set sizes and percentages of missing values. For data sets with small sample sizes and higher percentages of missing data, results indicate that statistical inference with no imputation may be preferable. On the basis of the findings in this review, there are core imputation methods that perform better for isobaric-labeled proteomics data, but great care and consideration as to whether imputation is the optimal strategy should be given for data sets comprised of a small number of samples.
Cancer is driven by the acquisition of somatic DNA lesions. Distinguishing the early driver mutations from subsequent passenger mutations is key to molecular subtyping of cancers, understanding ...cancer progression, and the discovery of novel biomarkers. The advances of genomics technologies (whole‐genome exome, and transcript sequencing, collectively referred to as NGS (next‐generation sequencing)) have fueled recent studies on somatic mutation discovery. However, the vision is challenged by the complexity, redundancy, and errors in genomic data, and the difficulty of investigating the proteome translated portion of aberrant genes using only genomic approaches. Combination of proteomic and genomic technologies are increasingly being employed. Various strategies have been employed to allow the usage of large‐scale NGS data for conventional MS/MS searches. This paper provides a discussion of applying different strategies relating to large database search, and FDR (false discovery rate) ‐based error control, and their implication to cancer proteogenomics. Moreover, it extends and develops the idea of a unified genomic variant database that can be searched by any MS sample. A total of 879 BAM files downloaded from TCGA repository were used to create a 4.34 GB unified FASTA database that contained 2787062 novel splice junctions, 38 464 deletions, 1 105 insertions, and 182 302 substitutions. Proteomic data from a single ovarian carcinoma sample (439 858 spectra) was searched against the database. By applying the most conservative FDR measure, we have identified 524 novel peptides and 65 578 known peptides at 1% FDR threshold. The novel peptides include interesting examples of doubly mutated peptides, frame‐shifts, and nonsample‐recruited mutations, which emphasize the strength of our approach.
Our study details the stepwise evolution of gilteritinib resistance in FLT3-mutated acute myeloid leukemia (AML). Early resistance is mediated by the bone marrow microenvironment, which protects ...residual leukemia cells. Over time, leukemia cells evolve intrinsic mechanisms of resistance, or late resistance. We mechanistically define both early and late resistance by integrating whole-exome sequencing, CRISPR-Cas9, metabolomics, proteomics, and pharmacologic approaches. Early resistant cells undergo metabolic reprogramming, grow more slowly, and are dependent upon Aurora kinase B (AURKB). Late resistant cells are characterized by expansion of pre-existing NRAS mutant subclones and continued metabolic reprogramming. Our model closely mirrors the timing and mutations of AML patients treated with gilteritinib. Pharmacological inhibition of AURKB resensitizes both early resistant cell cultures and primary leukemia cells from gilteritinib-treated AML patients. These findings support a combinatorial strategy to target early resistant AML cells with AURKB inhibitors and gilteritinib before the expansion of pre-existing resistance mutations occurs.
Display omitted
•Stepwise model of early to late gilteritinib resistance recapitulates human disease•Early resistant cells in marrow microenvironment rely on AURKB to resume growth•Pre-existing NRAS mutations expand in late resistance and drive relapse•Metabolic reprogramming occurs during evolution of gilteritinib resistance
Gilteritinib is an effective FLT3 inhibitor for AML, but residual cells survive in the marrow microenvironment. Over time, these early resistant cells evolve intrinsic mechanisms of resistance leading to relapse. Joshi et al. use a comprehensive approach to interrogate the evolution of resistance; identifying AURKB as critical for early resistance.
Despite wide use of anti-vascular endothelial growth factor (VEGF) therapy for many solid cancers, most individuals become resistant to this therapy, leading to disease progression. Therefore, new ...biomarkers and strategies for blocking adaptive resistance of cancer to anti-VEGF therapy are needed. As described here, we demonstrate that cancer-derived small extracellular vesicles package increasing quantities of VEGF and other factors in response to anti-VEGF therapy. The packaging process of VEGF into small extracellular vesicles (EVs) is mediated by the tetraspanin CD63. Furthermore, small EV-VEGF (eVEGF) is not accessible to anti-VEGF antibodies and can trigger intracrine VEGF signaling in endothelial cells. eVEGF promotes angiogenesis and enhances tumor growth despite bevacizumab treatment. These data demonstrate a mechanism where VEGF is partitioned into small EVs and promotes tumor angiogenesis and progression. These findings have clinical implications for biomarkers and therapeutic strategies for ovarian cancer.
Display omitted
•Cancer cells package increasing amounts of VEGF in small EVs with anti-VEGF therapy•VEGF packaging into small EVs is mediated by the tetraspanin CD63•Anti-VEGF antibodies failed to recognize small EV-VEGF (eVEGF)•eVEGF triggers intracrine VEGF signaling and promotes angiogenesis
Ma et.al report that cancer-cell-derived small EVs contain increasing amounts of VEGF (eVEGF) and contribute to resistance to anti-VEGF therapy (AVT). CD63 is a potential mediator that regulates packaging of VEGF into small EVs. eVEGF can trigger intracrine VEGF signaling in endothelial cells and promote angiogenesis despite AVT.
Protein abundance and phosphorylation convey important information about pathway activity and molecular pathophysiology in diseases including cancer, providing biological insight, informing drug and ...diagnostic development, and guiding therapeutic intervention. Analyzed tissues are usually collected without tight regulation or documentation of ischemic time. To evaluate the impact of ischemia, we collected human ovarian tumor and breast cancer xenograft tissue without vascular interruption and performed quantitative proteomics and phosphoproteomics after defined ischemic intervals. Although the global expressed proteome and most of the >25,000 quantified phosphosites were unchanged after 60 min, rapid phosphorylation changes were observed in up to 24% of the phosphoproteome, representing activation of critical cancer pathways related to stress response, transcriptional regulation, and cell death. Both pan-tumor and tissue-specific changes were observed. The demonstrated impact of pre-analytical tissue ischemia on tumor biology mandates caution in interpreting stress-pathway activation in such samples and motivates reexamination of collection protocols for phosphoprotein analysis.
Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this “guilt-by-association” (GBA) approach. Recent ...advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC). Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies.
Many of the filters are statistical in nature, as- sessing the relative performance of candidate markers on relatively small, "convenience" sample sets that are themselves subject to experimental ...bias and con- founding factors. ...an imperative next step is the reproduction of the original result on a com- pletely independent sample set, which is presumed to have an independent set of potential confounders (8 ). Recent technical advances in both mass spectrometry platforms and in work flows for targeted mass spectrometry analyses, enabling detection and quantification at less than 1 ng/mL (13, 14), suggest that investigators might soon be able to proceed through the initial verification and preclinical validation phases without the requirement for antibody development, thus focusing antibody development only on those targets worth pursuing and additionally increasing the return on investment for highly rigorous antibody development efforts.
Mass spectrometry (MS) -based proteomics has become an indispensable tool with broad applications in systems biology and biomedical research. With recent advances in liquid chromatography (LC) and MS ...instrumentation, LC-MS is making increasingly significant contributions to clinical applications, especially in the area of cancer biomarker discovery and verification. To overcome challenges associated with analyses of clinical samples (for example, a wide dynamic range of protein concentrations in bodily fluids and the need to perform high throughput and accurate quantification of candidate biomarker proteins), significant efforts have been devoted to improve the overall performance of LC-MS-based clinical proteomics platforms. Reviewed here are the recent advances in LC-MS and its applications in cancer biomarker discovery and quantification, along with the potentials, limitations and future perspectives.