Traditional shotgun proteomics used to detect a mixture of hundreds to thousands of proteins through mass spectrometric analysis, has been the standard approach in research to profile protein content ...in a biological sample which could lead to the discovery of new (and all) protein candidates with diagnostic, prognostic, and therapeutic values. In practice, this approach requires significant resources and time, and does not necessarily represent the goal of the researcher who would rather study a subset of such discovered proteins (including their variations or posttranslational modifications) under different biological conditions. In this context, targeted proteomics is playing an increasingly important role in the accurate measurement of protein targets in biological samples in the hope of elucidating the molecular mechanism of cellular function via the understanding of intricate protein networks and pathways. One such (targeted) approach, selected reaction monitoring (or multiple reaction monitoring) mass spectrometry (MRM‐MS), offers the capability of measuring multiple proteins with higher sensitivity and throughput than shotgun proteomics. Developing and validating MRM‐MS‐based assays, however, is an extensive and iterative process, requiring a coordinated and collaborative effort by the scientific community through the sharing of publicly accessible data and datasets, bioinformatic tools, standard operating procedures, and well characterized reagents.
We performed the first proteogenomic characterization of hepatitis B virus (HBV)-related hepatocellular carcinoma (HCC) using paired tumor and adjacent liver tissues from 159 patients. Integrated ...proteogenomic analyses revealed consistency and discordance among multi-omics, activation status of key signaling pathways, and liver-specific metabolic reprogramming in HBV-related HCC. Proteomic profiling identified three subgroups associated with clinical and molecular attributes including patient survival, tumor thrombus, genetic profile, and the liver-specific proteome. These proteomic subgroups have distinct features in metabolic reprogramming, microenvironment dysregulation, cell proliferation, and potential therapeutics. Two prognostic biomarkers, PYCR2 and ADH1A, related to proteomic subgrouping and involved in HCC metabolic reprogramming, were identified. CTNNB1 and TP53 mutation-associated signaling and metabolic profiles were revealed, among which mutated CTNNB1-associated ALDOA phosphorylation was validated to promote glycolysis and cell proliferation. Our study provides a valuable resource that significantly expands the knowledge of HBV-related HCC and may eventually benefit clinical practice.
Display omitted
•Proteomic subgroups stratify patient survival and allocate specific treatments•Alterations of the liver-specific proteome and metabolism in HCC are identified•Multi-omics profile of key signaling and metabolic pathways in HCC is depicted•CTNNB1 mutation-associated ALDOA phosphorylation promotes HCC cell proliferation
Proteogenomic characterization of HBV-related hepatocellular carcinoma (HCC) using paired tumor and adjacent liver tissues identifies three subgroups with distinct features in metabolic reprogramming, microenvironment dysregulation, cell proliferation, and potential therapeutics.
Mutations of the KRAS gene are found in human cancers with high frequency and result in the constitutive activation of its protein products. This leads to aberrant regulation of downstream pathways, ...promoting cell survival, proliferation, and tumorigenesis that drive cancer progression and negatively affect treatment outcomes. Here, we describe a workflow that can detect and quantify mutation-specific consequences of KRAS biochemistry, namely linked changes in posttranslational modifications (PTMs). We combined immunoaffinity enrichment with detection by top-down mass spectrometry to discover and quantify proteoforms with or without the Gly13Asp mutation (G13D) specifically in the KRAS4b isoform. The workflow was applied first to isogenic KRAS colorectal cancer (CRC) cell lines and then to patient CRC tumors with matching KRAS genotypes. In two cellular models, a direct link between the knockout of the mutant G13D allele and the complete nitrosylation of cysteine 118 of the remaining WT KRAS4b was observed. Analysis of tumor samples quantified the percentage of mutant KRAS4b actually present in cancer tissue and identified major differences in the levels of C-terminal carboxymethylation, a modification critical for membrane association. These data from CRC cells and human tumors suggest mechanisms of posttranslational regulation that are highly context-dependent and which lead to preferential production of specific KRAS4b proteoforms.
Vasopressin's action in renal cells to regulate water transport depends on protein phosphorylation. Here we used mass spectrometry-based quantitative phosphoproteomics to identify signaling pathways ...involved in the short-term V2-receptor-mediated response in cultured collecting duct cells (mpkCCD) from mouse. Using Stable Isotope Labeling by Amino acids in Cell culture (SILAC) with two treatment groups (0.1 nM dDAVP or vehicle for 30 min), we carried out quantification of 2884 phosphopeptides. The majority (82%) of quantified phosphopeptides did not change in abundance in response to dDAVP. Analysis of the 273 phosphopeptides increased by dDAVP showed a predominance of so-called "basophilic" motifs consistent with activation of kinases of the AGC family. Increases in phosphorylation of several known protein kinase A targets were found. In addition, increased phosphorylation of targets of the calmodulin-dependent kinase family was seen, including autophosphorylation of calmodulin-dependent kinase 2 at T286. Analysis of the 254 phosphopeptides decreased in abundance by dDAVP showed a predominance of so-called "proline-directed" motifs, consistent with down-regulation of mitogen-activated or cyclin-dependent kinases. dDAVP decreased phosphorylation of both JNK1/2 (T183/Y185) and ERK1/2 (T183/Y185; T203/Y205), consistent with a decrease in activation of these proline-directed kinases in response to dDAVP. Both ERK and JNK were able to phosphorylate residue S261of aquaporin-2 in vitro, a site showing a decrease in phosphorylation in response to dDAVP in vivo. The data support roles for multiple vasopressin V2-receptor-dependent signaling pathways in the vasopressin signaling network of collecting duct cells, involving several kinases not generally accepted to regulate collecting duct function.
Coexpression of mRNAs under multiple conditions is commonly used to infer cofunctionality of their gene products despite well-known limitations of this “guilt-by-association” (GBA) approach. Recent ...advancements in mass spectrometry-based proteomic technologies have enabled global expression profiling at the protein level; however, whether proteome profiling data can outperform transcriptome profiling data for coexpression based gene function prediction has not been systematically investigated. Here, we address this question by constructing and analyzing mRNA and protein coexpression networks for three cancer types with matched mRNA and protein profiling data from The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC). Our analyses revealed a marked difference in wiring between the mRNA and protein coexpression networks. Whereas protein coexpression was driven primarily by functional similarity between coexpressed genes, mRNA coexpression was driven by both cofunction and chromosomal colocalization of the genes. Functionally coherent mRNA modules were more likely to have their edges preserved in corresponding protein networks than functionally incoherent mRNA modules. Proteomic data strengthened the link between gene expression and function for at least 75% of Gene Ontology (GO) biological processes and 90% of KEGG pathways. A web application Gene2Net (http://cptac.gene2net.org) developed based on the three protein coexpression networks revealed novel gene-function relationships, such as linking ERBB2 (HER2) to lipid biosynthetic process in breast cancer, identifying PLG as a new gene involved in complement activation, and identifying AEBP1 as a new epithelial-mesenchymal transition (EMT) marker. Our results demonstrate that proteome profiling outperforms transcriptome profiling for coexpression based gene function prediction. Proteomics should be integrated if not preferred in gene function and human disease studies.
To explore the biology of lung adenocarcinoma (LUAD) and identify new therapeutic opportunities, we performed comprehensive proteogenomic characterization of 110 tumors and 101 matched normal ...adjacent tissues (NATs) incorporating genomics, epigenomics, deep-scale proteomics, phosphoproteomics, and acetylproteomics. Multi-omics clustering revealed four subgroups defined by key driver mutations, country, and gender. Proteomic and phosphoproteomic data illuminated biology downstream of copy number aberrations, somatic mutations, and fusions and identified therapeutic vulnerabilities associated with driver events involving KRAS, EGFR, and ALK. Immune subtyping revealed a complex landscape, reinforced the association of STK11 with immune-cold behavior, and underscored a potential immunosuppressive role of neutrophil degranulation. Smoking-associated LUADs showed correlation with other environmental exposure signatures and a field effect in NATs. Matched NATs allowed identification of differentially expressed proteins with potential diagnostic and therapeutic utility. This proteogenomics dataset represents a unique public resource for researchers and clinicians seeking to better understand and treat lung adenocarcinomas.
Display omitted
•Comprehensive LUAD proteogenomics exposes multi-omic clusters and immune subtypes•Phosphoproteomics identifies candidate ALK-fusion diagnostic markers and targets•Candidate drug targets: PTPN11 (EGFR), SOS1 (KRAS), neutrophil degranulation (STK11)•Phospho and acetyl modifications denote tumor-specific markers and druggable proteins
Comprehensive proteogenomic characterization of lung adenocarcinomas and paired normal adjacent tissues from patients of diverse smoking status and country of origin yields insights into cancer taxonomy, oncogenesis, and immune response; offers novel candidate biomarkers and therapeutic targets; and provides a community resource for further discovery.
The KRAS gene is one of the most frequently mutated oncogenes in human cancer and gives rise to two isoforms, KRAS4A and KRAS4B. KRAS post-translational modifications (PTMs) have the potential to ...influence downstream signaling. However, the relationship between KRAS PTMs and oncogenic mutations remains unclear, and the extent of isoform-specific modification is unknown. Here, we present the first top–down proteomics study evaluating both KRAS4A and KRAS4B, resulting in 39 completely characterized proteoforms across colorectal cancer cell lines and primary tumor samples. We determined which KRAS PTMs are present, along with their relative abundance, and that proteoforms of KRAS4A versus KRAS4B are differentially modified. Moreover, we identified a subset of KRAS4B proteoforms lacking the C185 residue and associated C-terminal PTMs. By confocal microscopy, we confirmed that this truncated GFP-KRAS4BC185∗ proteoform is unable to associate with the plasma membrane, resulting in a decrease in mitogen-activated protein kinase signaling pathway activation. Collectively, our study provides a reference set of functionally distinct KRAS proteoforms and the colorectal cancer contexts in which they are present.
Methodologies that facilitate high-throughput proteomic analysis are a key step toward moving proteome investigations into clinical translation. Data independent acquisition (DIA) has potential as a ...high-throughput analytical method due to the reduced time needed for sample analysis, as well as its highly quantitative accuracy. However, a limiting feature of DIA methods is the sensitivity of detection of low abundant proteins and depth of coverage, which other mass spectrometry approaches address by two-dimensional fractionation (2D) to reduce sample complexity during data acquisition. In this study, we developed a 2D-DIA method intended for rapid- and deeper-proteome analysis compared to conventional 1D-DIA analysis. First, we characterized 96 individual fractions obtained from the protein standard, NCI-7, using a data-dependent approach (DDA), identifying a total of 151,366 unique peptides from 11,273 protein groups. We observed that the majority of the proteins can be identified from just a few selected fractions. By performing an optimization analysis, we identified six fractions with high peptide number and uniqueness that can account for 80% of the proteins identified in the entire experiment. These selected fractions were combined into a single sample which was then subjected to DIA (referred to as 2D-DIA) quantitative analysis. Furthermore, improved DIA quantification was achieved using a hybrid spectral library, obtained by combining peptides identified from DDA data with peptides identified directly from the DIA runs with the help of DIA-Umpire. The optimized 2D-DIA method allowed for improved identification and quantification of low abundant proteins compared to conventional unfractionated DIA analysis (1D-DIA). We then applied the 2D-DIA method to profile the proteomes of two breast cancer patient-derived xenograft (PDX) models, quantifying 6,217 and 6,167 unique proteins in basal- and luminal- tumors, respectively. Overall, this study demonstrates the potential of high-throughput quantitative proteomics using a novel 2D-DIA method.
The integration of mass spectrometry-based proteomics with next-generation DNA and RNA sequencing profiles tumors more comprehensively. Here this “proteogenomics” approach was applied to 122 ...treatment-naive primary breast cancers accrued to preserve post-translational modifications, including protein phosphorylation and acetylation. Proteogenomics challenged standard breast cancer diagnoses, provided detailed analysis of the ERBB2 amplicon, defined tumor subsets that could benefit from immune checkpoint therapy, and allowed more accurate assessment of Rb status for prediction of CDK4/6 inhibitor responsiveness. Phosphoproteomics profiles uncovered novel associations between tumor suppressor loss and targetable kinases. Acetylproteome analysis highlighted acetylation on key nuclear proteins involved in the DNA damage response and revealed cross-talk between cytoplasmic and mitochondrial acetylation and metabolism. Our results underscore the potential of proteogenomics for clinical investigation of breast cancer through more accurate annotation of targetable pathways and biological features of this remarkably heterogeneous malignancy.
Display omitted
•Comprehensive proteogenomics resource from prospectively collected breast tumors•Proteogenomics defines ERBB2 and Rb status with clinical implications•Acetylproteome profiling yields insights into subtype-specific cancer metabolism•Immune profiling nominates subsets of luminal tumors for immune therapy
Breast cancer is a highly heterogeneous disease with variable outcomes and subtype-driven treatment approaches, making precision medicine a considerable challenge. Proteogenomic analyses of 122 primary breast cancers provide insights into clinically relevant biology, including cell cycle dysregulation, tumor immunogenicity, aberrant metabolism, and heterogeneity in therapeutic target expression.
Protein biomarker discovery and validation in current omics era are vital for healthcare professionals to improve diagnosis, detect cancers at an early stage, identify the likelihood of cancer ...recurrence, stratify stages with differential survival outcomes, and monitor therapeutic responses. The success of such biomarkers would have a huge impact on how we improve the diagnosis and treatment of patients and alleviate the financial burden of healthcare systems. In the past, the genomics community (mostly through large-scale, deep genomic sequencing technologies) has been steadily improving our understanding of the molecular basis of disease, with a number of biomarker panels already authorized by the U.S. Food and Drug Administration (FDA) for clinical use (e.g., MammaPrint, two recently cleared devices using next-generation sequencing platforms to detect DNA changes in the cystic fibrosis transmembrane conductance regulator (CFTR) gene). Clinical proteomics, on the other hand, albeit its ability to delineate the functional units of a cell, more likely driving the phenotypic differences of a disease (i.e., proteins and protein–protein interaction networks and signaling pathways underlying the disease), “staggers” to make a significant impact with only an average ∼1.5 protein biomarkers per year approved by the FDA over the past 15–20 years. This statistic itself raises the concern that major roadblocks have been impeding an efficient transition of protein marker candidates in biomarker development despite major technological advances in proteomics in recent years.