Mitochondria are complex organelles that house essential pathways involved in energy metabolism, ion homeostasis, signalling and apoptosis. To understand mitochondrial pathways in health and disease, ...it is crucial to have an accurate inventory of the organelle's protein components. In 2008, we made substantial progress toward this goal by performing in-depth mass spectrometry of mitochondria from 14 organs, epitope tagging/microscopy and Bayesian integration to assemble MitoCarta (www.broadinstitute.org/pubs/MitoCarta): an inventory of genes encoding mitochondrial-localized proteins and their expression across 14 mouse tissues. Using the same strategy we have now reconstructed this inventory separately for human and for mouse based on (i) improved gene transcript models, (ii) updated literature curation, including results from proteomic analyses of mitochondrial sub-compartments, (iii) improved homology mapping and (iv) updated versions of all seven original data sets. The updated human MitoCarta2.0 consists of 1158 human genes, including 918 genes in the original inventory as well as 240 additional genes. The updated mouse MitoCarta2.0 consists of 1158 genes, including 967 genes in the original inventory plus 191 additional genes. The improved MitoCarta 2.0 inventory provides a molecular framework for system-level analysis of mammalian mitochondria.
Abstract
The extracellular matrix (ECM) is a complex and dynamic meshwork of cross-linked proteins that supports cell polarization and functions and tissue organization and homeostasis. Over the past ...few decades, mass-spectrometry-based proteomics has emerged as the method of choice to characterize the composition of the ECM of normal and diseased tissues. Here, we present a new release of MatrisomeDB, a searchable collection of curated proteomic data from 17 studies on the ECM of 15 different normal tissue types, six cancer types (different grades of breast cancers, colorectal cancer, melanoma, and insulinoma) and other diseases including vascular defects and lung and liver fibroses. MatrisomeDB (http://www.pepchem.org/matrisomedb) was built by retrieving raw mass spectrometry data files and reprocessing them using the same search parameters and criteria to allow for a more direct comparison between the different studies. The present release of MatrisomeDB includes 847 human and 791 mouse ECM proteoforms and over 350 000 human and 600 000 mouse ECM-derived peptide-to-spectrum matches. For each query, a hierarchically-clustered tissue distribution map, a peptide coverage map, and a list of post-translational modifications identified, are generated. MatrisomeDB is the most complete collection of ECM proteomic data to date and allows the building of a comprehensive ECM atlas.
Isobaric labeling using tandem mass tags (TMTs) is increasingly applied for deep-scale proteomic studies in a multitude of organisms and addressing diverse research questions. The cost of labeling ...reagents represents a substantial proportion of the total expenses for conducting such experiments. Here, Zecha et al. present an economically optimized TMT labeling approach that reduces the quantity of required labeling reagent by a factor of eight and reproducibly achieves complete labeling.
Display omitted
Highlights
•TMT labeling protocol with excellent intra- and interlaboratory reproducibility.•Complete in-solution labeling of peptides using 1/8 of recommended TMT quantities.•Demonstration of utility for deep-scale (phospho)proteome analysis.
Isobaric stable isotope labeling using, for example, tandem mass tags (TMTs) is increasingly being applied for large-scale proteomic studies. Experiments focusing on proteoform analysis in drug time course or perturbation studies or in large patient cohorts greatly benefit from the reproducible quantification of single peptides across samples. However, such studies often require labeling of hundreds of micrograms of peptides such that the cost for labeling reagents represents a major contribution to the overall cost of an experiment. Here, we describe and evaluate a robust and cost-effective protocol for TMT labeling that reduces the quantity of required labeling reagent by a factor of eight and achieves complete labeling. Under- and overlabeling of peptides derived from complex digests of tissues and cell lines were systematically evaluated using peptide quantities of between 12.5 and 800 μg and TMT-to-peptide ratios (wt/wt) ranging from 8:1 to 1:2 at different TMT and peptide concentrations. When reaction volumes were reduced to maintain TMT and peptide concentrations of at least 10 mm and 2 g/l, respectively, TMT-to-peptide ratios as low as 1:1 (wt/wt) resulted in labeling efficiencies of > 99% and excellent intra- and interlaboratory reproducibility. The utility of the optimized protocol was further demonstrated in a deep-scale proteome and phosphoproteome analysis of patient-derived xenograft tumor tissue benchmarked against the labeling procedure recommended by the TMT vendor. Finally, we discuss the impact of labeling reaction parameters for N-hydroxysuccinimide ester-based chemistry and provide guidance on adopting efficient labeling protocols for different peptide quantities.
The extracellular matrix (ECM) is a complex meshwork of cross-linked proteins providing both biophysical and biochemical cues that are important regulators of cell proliferation, survival, ...differentiation, and migration. We present here a proteomic strategy developed to characterize the in vivo ECM composition of normal tissues and tumors using enrichment of protein extracts for ECM components and subsequent analysis by mass spectrometry. In parallel, we have developed a bioinformatic approach to predict the in silico “matrisome” defined as the ensemble of ECM proteins and associated factors. We report the characterization of the extracellular matrices of murine lung and colon, each comprising more than 100 ECM proteins and each presenting a characteristic signature. Moreover, using human tumor xenografts in mice, we show that both tumor cells and stromal cells contribute to the production of the tumor matrix and that tumors of differing metastatic potential differ in both the tumor- and the stroma-derived ECM components. The strategy we describe and illustrate here can be broadly applied and, to facilitate application of these methods by others, we provide resources including laboratory protocols, inventories of ECM domains and proteins, and instructions for bioinformatically deriving the human and mouse matrisome.
The extracellular matrix (ECM) is a fundamental component of multicellular organisms that provides mechanical and chemical cues that orchestrate cellular and tissue organization and functions. ...Degradation, hyperproduction or alteration of the composition of the ECM cause or accompany numerous pathologies. Thus, a better characterization of ECM composition, metabolism, and biology can lead to the identification of novel prognostic and diagnostic markers and therapeutic opportunities. The development over the last few years of high-throughput (“omics”) approaches has considerably accelerated the pace of discovery in life sciences. In this review, we describe new bioinformatic tools and experimental strategies for ECM research, and illustrate how these tools and approaches can be exploited to provide novel insights in our understanding of ECM biology. We also introduce a web platform “the matrisome project” and the database MatrisomeDB that compiles in silico and in vivo data on the matrisome, defined as the ensemble of genes encoding ECM and ECM-associated proteins. Finally, we present a first draft of an ECM atlas built by compiling proteomics data on the ECM composition of 14 different tissues and tumor types.
•The matrisome is defined as the ensemble of 1000+ genes encoding ECM and ECM-associated proteins.•Bioinformatic and experimental approaches to study the ECM/matrisome are discussed.•We introduce a novel website and database MatrisomeDB to centralize resources on the matrisome.•We present a draft of an ECM atlas compiling proteomics data on the ECM of 14 different tissues and tumors.•“Omics” data provide novel insights into ECM functions in development, homeostasis and disease.
Genomic analyses in cancer have been enormously impactful, leading to the identification of driver mutations and development of targeted therapies. But the functions of the vast majority of somatic ...mutations and copy number variants in tumours remain unknown, and the causes of resistance to targeted therapies and methods to overcome them are poorly defined. Recent improvements in mass spectrometry-based proteomics now enable direct examination of the consequences of genomic aberrations, providing deep and quantitative characterization of tumour tissues. Integration of proteins and their post-translational modifications with genomic, epigenomic and transcriptomic data constitutes the new field of proteogenomics, and is already leading to new biological and diagnostic knowledge with the potential to improve our understanding of malignant transformation and therapeutic outcomes. In this Review we describe recent developments in proteogenomics and key findings from the proteogenomic analysis of a wide range of cancers. Considerations relevant to the selection and use of samples for proteogenomics and the current technologies used to generate, analyse and integrate proteomic with genomic data are described. Applications of proteogenomics in translational studies and immuno-oncology are rapidly emerging, and the prospect for their full integration into therapeutic trials and clinical care seems bright.
Identification of human leukocyte antigen (HLA)-bound peptides by liquid chromatography-tandem mass spectrometry (LC-MS/MS) is poised to provide a deep understanding of rules underlying antigen ...presentation. However, a key obstacle is the ambiguity that arises from the co-expression of multiple HLA alleles. Here, we have implemented a scalable mono-allelic strategy for profiling the HLA peptidome. By using cell lines expressing a single HLA allele, optimizing immunopurifications, and developing an application-specific spectral search algorithm, we identified thousands of peptides bound to 16 different HLA class I alleles. These data enabled the discovery of subdominant binding motifs and an integrative analysis quantifying the contribution of factors critical to epitope presentation, such as protein cleavage and gene expression. We trained neural-network prediction algorithms with our large dataset (>24,000 peptides) and outperformed algorithms trained on datasets of peptides with measured affinities. We thus demonstrate a strategy for systematically learning the rules of endogenous antigen presentation.
Display omitted
•24,000 HLA class I peptides were identified through a scalable MS-based pipeline.•Mono-allelic data revealed binding motifs that were validated biochemically.•Comprehensive analyses provide an updated portrait of antigen processing rules.•Neural networks were trained for 16 alleles and outperform standard by 2-fold.
HLA class I binding prediction has traditionally been based on biochemical binding experiments. Abelin and colleagues present an LC-MS/MS-based workflow and analytical framework that greatly accelerates gains in prediction performance. Key advances include the discovery of sequence motifs and improved quantification of the roles of gene expression and proteasomal processing.
Here we present an optimized workflow for global proteome and phosphoproteome analysis of tissues or cell lines that uses isobaric tags (TMT (tandem mass tags)-10) for multiplexed analysis and ...relative quantification, and provides 3× higher throughput than iTRAQ (isobaric tags for absolute and relative quantification)-4-based methods with high intra- and inter-laboratory reproducibility. The workflow was systematically characterized and benchmarked across three independent laboratories using two distinct breast cancer subtypes from patient-derived xenograft models to enable assessment of proteome and phosphoproteome depth and quantitative reproducibility. Each plex consisted of ten samples, each being 300 μg of peptide derived from <50 mg of wet-weight tissue. Of the 10,000 proteins quantified per sample, we could distinguish 7,700 human proteins derived from tumor cells and 3100 mouse proteins derived from the surrounding stroma and blood. The maximum deviation across replicates and laboratories was <7%, and the inter-laboratory correlation for TMT ratio-based comparison of the two breast cancer subtypes was r > 0.88. The maximum deviation for the phosphoproteome coverage was <24% across laboratories, with an average of >37,000 quantified phosphosites per sample and differential quantification correlations of r > 0.72. The full procedure, including sample processing and data generation, can be completed within 10 d for ten tissue samples, and 100 samples can be analyzed in ~4 months using a single LC-MS/MS instrument. The high quality, depth, and reproducibility of the data obtained both within and across laboratories should enable new biological insights to be obtained from mass spectrometry-based proteomics analyses of cells and tissues together with proteogenomic data integration.
We report a mass spectrometry-based method for the integrated analysis of protein expression, phosphorylation, ubiquitination and acetylation by serial enrichments of different post-translational ...modifications (SEPTM) from the same biological sample. This technology enabled quantitative analysis of nearly 8,000 proteins and more than 20,000 phosphorylation, 15,000 ubiquitination and 3,000 acetylation sites per experiment, generating a holistic view of cellular signal transduction pathways as exemplified by analysis of bortezomib-treated human leukemia cells.
Interactions between T cell receptors (TCRs) and their cognate tumour antigens are central to antitumour immune responses
; however, the relationship between phenotypic characteristics and TCR ...properties is not well elucidated. Here we show, by linking the antigenic specificity of TCRs and the cellular phenotype of melanoma-infiltrating lymphocytes at single-cell resolution, that tumour specificity shapes the expression state of intratumoural CD8
T cells. Non-tumour-reactive T cells were enriched for viral specificities and exhibited a non-exhausted memory phenotype, whereas melanoma-reactive lymphocytes predominantly displayed an exhausted state that encompassed diverse levels of differentiation but rarely acquired memory properties. These exhausted phenotypes were observed both among clonotypes specific for public overexpressed melanoma antigens (shared across different tumours) or personal neoantigens (specific for each tumour). The recognition of such tumour antigens was provided by TCRs with avidities inversely related to the abundance of cognate targets in melanoma cells and proportional to the binding affinity of peptide-human leukocyte antigen (HLA) complexes. The persistence of TCR clonotypes in peripheral blood was negatively affected by the level of intratumoural exhaustion, and increased in patients with a poor response to immune checkpoint blockade, consistent with chronic stimulation mediated by residual tumour antigens. By revealing how the quality and quantity of tumour antigens drive the features of T cell responses within the tumour microenvironment, we gain insights into the properties of the anti-melanoma TCR repertoire.