Many research questions in fields such as personalized medicine, drug screens or systems biology depend on obtaining consistent and quantitatively accurate proteomics data from many samples. SWATH‐MS ...is a specific variant of data‐independent acquisition (DIA) methods and is emerging as a technology that combines deep proteome coverage capabilities with quantitative consistency and accuracy. In a SWATH‐MS measurement, all ionized peptides of a given sample that fall within a specified mass range are fragmented in a systematic and unbiased fashion using rather large precursor isolation windows. To analyse SWATH‐MS data, a strategy based on peptide‐centric scoring has been established, which typically requires prior knowledge about the chromatographic and mass spectrometric behaviour of peptides of interest in the form of spectral libraries and peptide query parameters. This tutorial provides guidelines on how to set up and plan a SWATH‐MS experiment, how to perform the mass spectrometric measurement and how to analyse SWATH‐MS data using peptide‐centric scoring. Furthermore, concepts on how to improve SWATH‐MS data acquisition, potential trade‐offs of parameter settings and alternative data analysis strategies are discussed.
SWATH‐MS combines deep proteome coverage with quantitative consistency and accuracy and is often the method of choice for personalized medicine, drug screens or systems biology. This tutorial provides guidelines on how to set up SWATH‐MS experiments, perform the mass spectrometric measurements and analyse the data.
Full text
Available for:
FZAB, GIS, IJS, IZUM, KILJ, NLZOH, NUK, OILJ, PILJ, PNG, SAZU, SBCE, SBMB, UL, UM, UPUK
•Bacterial pathogens rewire host proteomes to enable their replication and spread.•Identifying host-pathogen protein interactions elucidates pathogenic rewiring.•Protein interaction studies are ...shifting towards quantitative and unbiased approaches.•Bacterial-host interaction studies require endogenous experimental systems.•Linking interspecies interaction studies to functional assays can measure their phenotypic impacts.
Infectious diseases are the result of molecular cross-talks between hosts and their pathogens. These cross-talks are in part mediated by host–pathogen protein–protein interactions (HP-PPI). HP-PPI play crucial roles in infections, as they may tilt the balance either in favor of the pathogens’ spread or their clearance. The identification of host proteins targeted by viral or bacterial pathogenic proteins necessary for the infection can provide insights into their underlying molecular mechanisms of pathogenicity, and potentially even single out pharmacological intervention targets. Here, we review the available methods to study HP-PPI, with a focus on recent mass spectrometry based methods to decipher bacterial–human infectious diseases and examine their relevance in uncovering host cell rewiring by pathogens.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UL, UM, UPCLJ, UPUK, ZRSKP
Targeted proteomics by selected/multiple reaction monitoring (S/MRM) or, on a larger scale, by SWATH (sequential window acquisition of all theoretical spectra) MS (mass spectrometry) typically relies ...on spectral reference libraries for peptide identification. Quality and coverage of these libraries are therefore of crucial importance for the performance of the methods. Here we present a detailed protocol that has been successfully used to build high-quality, extensive reference libraries supporting targeted proteomics by SWATH MS. We describe each step of the process, including data acquisition by discovery proteomics, assertion of peptide-spectrum matches (PSMs), generation of consensus spectra and compilation of MS coordinates that uniquely define each targeted peptide. Crucial steps such as false discovery rate (FDR) control, retention time normalization and handling of post-translationally modified peptides are detailed. Finally, we show how to use the library to extract SWATH data with the open-source software Skyline. The protocol takes 2-3 d to complete, depending on the extent of the library and the computational resources available.
In this Perspective, we discuss developments in mass-spectrometry-based proteomic technology over the past decade from the viewpoint of our laboratory. We also reflect on existing challenges and ...limitations, and explore the current and future roles of quantitative proteomics in molecular systems biology, clinical research and personalized medicine.
Protein complexes and protein interaction networks are essential mediators of most biological functions. Complexes supporting transient functions such as signal transduction processes are frequently ...subject to dynamic remodeling. Currently, the majority of studies on the composition of protein complexes are carried out by affinity purification and mass spectrometry (AP-MS) and present a static view of the system. For a better understanding of inherently dynamic biological processes, methods to reliably quantify temporal changes of protein interaction networks are essential. Here we used affinity purification combined with sequential window acquisition of all theoretical spectra (AP-SWATH) mass spectrometry to study the dynamics of the 14-3-3β scaffold protein interactome after stimulation of the insulin-PI3K-AKT pathway. The consistent and reproducible quantification of 1,967 proteins across all stimulation time points provided insights into the 14-3-3β interactome and its dynamic changes following IGF1 stimulation. We therefore establish AP-SWATH as a tool to quantify dynamic changes in protein-complex interaction networks.
Quantitative proteomics employing mass spectrometry is an indispensable tool in life science research. Targeted proteomics has emerged as a powerful approach for reproducible quantification but is ...limited in the number of proteins quantified. SWATH-mass spectrometry consists of data-independent acquisition and a targeted data analysis strategy that aims to maintain the favorable quantitative characteristics (accuracy, sensitivity, and selectivity) of targeted proteomics at large scale. While previous SWATH-mass spectrometry studies have shown high intra-lab reproducibility, this has not been evaluated between labs. In this multi-laboratory evaluation study including 11 sites worldwide, we demonstrate that using SWATH-mass spectrometry data acquisition we can consistently detect and reproducibly quantify >4000 proteins from HEK293 cells. Using synthetic peptide dilution series, we show that the sensitivity, dynamic range and reproducibility established with SWATH-mass spectrometry are uniformly achieved. This study demonstrates that the acquisition of reproducible quantitative proteomics data by multiple labs is achievable, and broadly serves to increase confidence in SWATH-mass spectrometry data acquisition as a reproducible method for large-scale protein quantification.SWATH-mass spectrometry consists of a data-independent acquisition and a targeted data analysis strategy that aims to maintain the favorable quantitative characteristics on the scale of thousands of proteins. Here, using data generated by eleven groups worldwide, the authors show that SWATH-MS is capable of generating highly reproducible data across different laboratories.
Data-independent acquisition modes isolate and concurrently fragment populations of different precursors by cycling through segments of a predefined precursor m/z range. Although these selection ...windows collectively cover the entire m/z range, overall, only a few per cent of all incoming ions are isolated for mass analysis. Here, we make use of the correlation of molecular weight and ion mobility in a trapped ion mobility device (timsTOF Pro) to devise a scan mode that samples up to 100% of the peptide precursor ion current in m/z and mobility windows. We extend an established targeted data extraction workflow by inclusion of the ion mobility dimension for both signal extraction and scoring and thereby increase the specificity for precursor identification. Data acquired from whole proteome digests and mixed organism samples demonstrate deep proteome coverage and a high degree of reproducibility as well as quantitative accuracy, even from 10 ng sample amounts.
Mycobacterium tuberculosis remains a health concern due to its ability to enter a non-replicative dormant state linked to drug resistance. Understanding transitions into and out of dormancy will ...inform therapeutic strategies. We implemented a universally applicable, label-free approach to estimate absolute cellular protein concentrations on a proteome-wide scale based on SWATH mass spectrometry. We applied this approach to examine proteomic reorganization of M. tuberculosis during exponential growth, hypoxia-induced dormancy, and resuscitation. The resulting data set covering >2,000 proteins reveals how protein biomass is distributed among cellular functions during these states. The stress-induced DosR regulon contributes 20% to cellular protein content during dormancy, whereas ribosomal proteins remain largely unchanged at 5%–7%. Absolute protein concentrations furthermore allow protein alterations to be translated into changes in maximal enzymatic reaction velocities, enhancing understanding of metabolic adaptations. Thus, global absolute protein measurements provide a quantitative description of microbial states, which can support the development of therapeutic interventions.
Display omitted
•A SWATH MS-based strategy can estimate genome-wide absolute protein concentrations•This technique revealed absolute protein abundances in Mtb during growth and dormancy•The DosR regulon makes up 20% of the total protein biomass during dormancy•Absolute protein abundances reveal maximal enzymatic reaction velocities in metabolism
Schubert et al. develop a strategy to estimate absolute cellular protein concentrations on a global scale based on the proteomic technique SWATH MS. They use this strategy to generate a resource of absolute protein concentrations in Mycobacterium tuberculosis during exponential growth, induction of dormancy, and resuscitation.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
The degree and the origins of quantitative variability of most human plasma proteins are largely unknown. Because the twin study design provides a natural opportunity to estimate the relative ...contribution of heritability and environment to different traits in human population, we applied here the highly accurate and reproducible SWATH mass spectrometry technique to quantify 1,904 peptides defining 342 unique plasma proteins in 232 plasma samples collected longitudinally from pairs of monozygotic and dizygotic twins at intervals of 2–7 years, and proportioned the observed total quantitative variability to its root causes, genes, and environmental and longitudinal factors. The data indicate that different proteins show vastly different patterns of abundance variability among humans and that genetic control and longitudinal variation affect protein levels and biological processes to different degrees. The data further strongly suggest that the plasma concentrations of clinical biomarkers need to be calibrated against genetic and temporal factors. Moreover, we identified 13 cis‐SNPs significantly influencing the level of specific plasma proteins. These results therefore have immediate implications for the effective design of blood‐based biomarker studies.
Synopsis
The degree and origins of the abundance variability of 342 human plasma proteins are analyzed by a longitudinal twin design and SWATH mass spectrometry. The results suggest genetic control and longitudinal variation affect protein levels and biological processes to different degrees.
We used the highly accurate and reproducible SWATH mass spectrometry technique to quantify 342 unique plasma proteins in 232 plasma samples collected longitudinally from pairs of monozygotic and dizygotic twins at intervals of 2–7 years.
The observed total quantitative variability of human plasma proteome is dissected to its root causes, genes, environment and longitudinal factors.
The roles of the heritable, environmental and longitudinal determinants in controlling plasma protein levels are different for different proteins and functional clusters, strongly suggesting that the plasma concentrations of clinical biomarkers need to be calibrated against genetic and temporal factors.
We further identified 13 cis‐SNPs significantly influencing the level of specific plasma proteins as protein quantitative trait loci (pQTLs), and five of them are associated with gene expression QTLs (eQTLs) in human tissues.
The degree and origins of the abundance variability of 342 human plasma proteins are analyzed by a longitudinal twin design and SWATH mass spectrometry. The results suggest genetic control and longitudinal variation affect protein levels and biological processes to different degrees.
Full text
Available for:
FZAB, GIS, IJS, IZUM, KILJ, NLZOH, NUK, OILJ, PILJ, PNG, SAZU, SBCE, SBMB, UL, UM, UPUK
Accurate measurements of cellular protein concentrations are invaluable to quantitative studies of gene expression and physiology in living cells. Here, we developed a versatile mass spectrometric ...workflow based on data‐independent acquisition proteomics (DIA/SWATH) together with a novel protein inference algorithm (xTop). We used this workflow to accurately quantify absolute protein abundances in Escherichia coli for > 2,000 proteins over > 60 growth conditions, including nutrient limitations, non‐metabolic stresses, and non‐planktonic states. The resulting high‐quality dataset of protein mass fractions allowed us to characterize proteome responses from a coarse (groups of related proteins) to a fine (individual) protein level. Hereby, a plethora of novel biological findings could be elucidated, including the generic upregulation of low‐abundant proteins under various metabolic limitations, the non‐specificity of catabolic enzymes upregulated under carbon limitation, the lack of large‐scale proteome reallocation under stress compared to nutrient limitations, as well as surprising strain‐dependent effects important for biofilm formation. These results present valuable resources for the systems biology community and can be used for future multi‐omics studies of gene regulation and metabolic control in E. coli.
Synopsis
Accurate proteomic measurements of absolute protein mass fractions in Escherichia coli allowed the characterization of proteome responses under > 60 diverse growth conditions from a coarse (groups of related proteins) to a fine (individual) protein level.
The study presents a mass spectrometric workflow based on data‐independent acquisition proteomics and a novel protein inference algorithm (xTop) optimized for absolute protein quantification.
The mass spectrometric data was benchmarked and calibrated with absolute protein mass fractions obtained by ribosome profiling.
A plethora of novel biological findings are presented, including lack of large‐scale proteome reallocation under stress compared to nutrient limitations, regulation of outer membrane proteins, and effects important for motility and biofilm formation.
Accurate proteomic measurements of absolute protein mass fractions in Escherichia coli allowed the characterization of proteome responses under > 60 diverse growth conditions from a coarse (groups of related proteins) to a fine (individual) protein level.
Full text
Available for:
FZAB, GIS, IJS, IZUM, KILJ, NLZOH, NUK, OILJ, PILJ, PNG, SAZU, SBCE, SBMB, UL, UM, UPUK