A main bottleneck in proteomics is the downstream biological analysis of highly multivariate quantitative protein abundance data generated using mass-spectrometry-based analysis. We developed the ...Perseus software platform (http://www.perseus-framework.org) to support biological and biomedical researchers in interpreting protein quantification, interaction and post-translational modification data. Perseus contains a comprehensive portfolio of statistical tools for high-dimensional omics data analysis covering normalization, pattern recognition, time-series analysis, cross-omics comparisons and multiple-hypothesis testing. A machine learning module supports the classification and validation of patient groups for diagnosis and prognosis, and it also detects predictive protein signatures. Central to Perseus is a user-friendly, interactive workflow environment that provides complete documentation of computational methods used in a publication. All activities in Perseus are realized as plugins, and users can extend the software by programming their own, which can be shared through a plugin store. We anticipate that Perseus's arsenal of algorithms and its intuitive usability will empower interdisciplinary analysis of complex large data sets.
Protein quantification without isotopic labels has been a long-standing interest in the proteomics field. However, accurate and robust proteome-wide quantification with label-free approaches remains ...a challenge. We developed a new intensity determination and normalization procedure called MaxLFQ that is fully compatible with any peptide or protein separation prior to LC-MS analysis. Protein abundance profiles are assembled using the maximum possible information from MS signals, given that the presence of quantifiable peptides varies from sample to sample. For a benchmark dataset with two proteomes mixed at known ratios, we accurately detected the mixing ratio over the entire protein expression range, with greater precision for abundant proteins. The significance of individual label-free quantifications was obtained via a t test approach. For a second benchmark dataset, we accurately quantify fold changes over several orders of magnitude, a task that is challenging with label-based methods. MaxLFQ is a generic label-free quantification technology that is readily applicable to many biological questions; it is compatible with standard statistical analysis workflows, and it has been validated in many and diverse biological projects. Our algorithms can handle very large experiments of 500+ samples in a manageable computing time. It is implemented in the freely available MaxQuant computational proteomics platform and works completely seamlessly at the click of a button.
The organization of a cell emerges from the interactions in protein networks. The interactome is critically dependent on the strengths of interactions and the cellular abundances of the connected ...proteins, both of which span orders of magnitude. However, these aspects have not yet been analyzed globally. Here, we have generated a library of HeLa cell lines expressing 1,125 GFP-tagged proteins under near-endogenous control, which we used as input for a next-generation interaction survey. Using quantitative proteomics, we detect specific interactions, estimate interaction stoichiometries, and measure cellular abundances of interacting proteins. These three quantitative dimensions reveal that the protein network is dominated by weak, substoichiometric interactions that play a pivotal role in defining network topology. The minority of stable complexes can be identified by their unique stoichiometry signature. This study provides a rich interaction dataset connecting thousands of proteins and introduces a framework for quantitative network analysis.
Display omitted
•Human interactome dataset connecting 5,400 proteins with 28,500 interactions•Three quantitative dimensions measure specificities, stoichiometries, and abundances•Stable complexes are rare but stand out by a signature of balanced stoichiometries•Weak interactions dominate the network and have critical topological properties
Weak interactions shape the cellular protein interaction network as determined from proteomic measures of cellular interaction specificities, the strength of those interactions, and the cellular copy numbers of the proteins involved.
Many proteins contain disordered regions of low-sequence complexity, which cause aging-associated diseases because they are prone to aggregate. Here, we study FUS, a prion-like protein containing ...intrinsically disordered domains associated with the neurodegenerative disease ALS. We show that, in cells, FUS forms liquid compartments at sites of DNA damage and in the cytoplasm upon stress. We confirm this by reconstituting liquid FUS compartments in vitro. Using an in vitro “aging” experiment, we demonstrate that liquid droplets of FUS protein convert with time from a liquid to an aggregated state, and this conversion is accelerated by patient-derived mutations. We conclude that the physiological role of FUS requires forming dynamic liquid-like compartments. We propose that liquid-like compartments carry the trade-off between functionality and risk of aggregation and that aberrant phase transitions within liquid-like compartments lie at the heart of ALS and, presumably, other age-related diseases.
Display omitted
Display omitted
•The ALS-associated protein FUS forms liquid compartments in vivo and in vitro•Liquid compartment formation is dependent on the prion-like low-complexity domain•Liquid compartments of FUS convert with time into an aberrant aggregated state•ALS patient mutations accelerate aberrant phase transitions of FUS
The ALS-associated protein FUS assembles into a liquid-like compartment to operate in vivo, but a risk of the functionality conferred by the liquid phase is aggregation to the disease-linked solid phase. Aging diseases caused by aggregation-prone proteins may arise from a failure to maintain liquid-phase homeostasis.
Absolute protein quantification using mass spectrometry (MS)-based proteomics delivers protein concentrations or copy numbers per cell. Existing methodologies typically require a combination of ...isotope-labeled spike-in references, cell counting, and protein concentration measurements. Here we present a novel method that delivers similar quantitative results directly from deep eukaryotic proteome datasets without any additional experimental steps. We show that the MS signal of histones can be used as a “proteomic ruler” because it is proportional to the amount of DNA in the sample, which in turn depends on the number of cells. As a result, our proteomic ruler approach adds an absolute scale to the MS readout and allows estimation of the copy numbers of individual proteins per cell. We compare our protein quantifications with values derived via the use of stable isotope labeling by amino acids in cell culture and protein epitope signature tags in a method that combines spike-in protein fragment standards with precise isotope label quantification. The proteomic ruler approach yields quantitative readouts that are in remarkably good agreement with results from the precision method. We attribute this surprising result to the fact that the proteomic ruler approach omits error-prone steps such as cell counting or protein concentration measurements. The proteomic ruler approach is readily applicable to any deep eukaryotic proteome dataset—even in retrospective analysis—and we demonstrate its usefulness with a series of mouse organ proteomes.
Genetic interaction (GI) maps, comprising pairwise measures of how strongly the function of one gene depends on the presence of a second, have enabled the systematic exploration of gene function in ...microorganisms. Here, we present a two-stage strategy to construct high-density GI maps in mammalian cells. First, we use ultracomplex pooled shRNA libraries (25 shRNAs/gene) to identify high-confidence hit genes for a given phenotype and effective shRNAs. We then construct double-shRNA libraries from these to systematically measure GIs between hits. A GI map focused on ricin susceptibility broadly recapitulates known pathways and provides many unexpected insights. These include a noncanonical role for COPI, a previously uncharacterized protein complex affecting toxin clearance, a specialized role for the ribosomal protein RPS25, and functionally distinct mammalian TRAPP complexes. The ability to rapidly generate mammalian GI maps provides a potentially transformative tool for defining gene function and designing combination therapies based on synergistic pairs.
Display omitted
► Ultracomplex shRNA library minimizes false positives/negatives in genome-wide screens ► Pooled double-shRNA strategy systematically maps genetic interactions between hits ► Application of two-step strategy identifies pathways controlling ricin susceptibility ► The resulting map uncovers functionally distinct mammalian TRAPP complexes
A high-throughput method that relies on the use of ultracomplex shRNA libraries makes it possible to create genetic interaction maps in mammalian cells. This approach will be applicable to many cellular processes and conditions, as illustrated by the discovery of distinct TRAPP complexes involved in endocytosis.
Protein–protein interactions are fundamental to the understanding of biological processes. Affinity purification coupled to mass spectrometry (AP-MS) is one of the most promising methods for their ...investigation. Previously, complexes were purified as much as possible, frequently followed by identification of individual gel bands. However, todays mass spectrometers are highly sensitive, and powerful quantitative proteomics strategies are available to distinguish true interactors from background binders. Here we describe a high performance affinity enrichment-mass spectrometry method for investigating protein–protein interactions, in which no attempt at purifying complexes to homogeneity is made. Instead, we developed analysis methods that take advantage of specific enrichment of interactors in the context of a large amount of unspecific background binders. We perform single-step affinity enrichment of endogenously expressed GFP-tagged proteins and their interactors in budding yeast, followed by single-run, intensity-based label-free quantitative LC-MS/MS analysis. Each pull-down contains around 2000 background binders, which are reinterpreted from troubling contaminants to crucial elements in a novel data analysis strategy. First the background serves for accurate normalization. Second, interacting proteins are not identified by comparison to a single untagged control strain, but instead to the other tagged strains. Third, potential interactors are further validated by their intensity profiles across all samples. We demonstrate the power of our AE-MS method using several well-known and challenging yeast complexes of various abundances. AE-MS is not only highly efficient and robust, but also cost effective, broadly applicable, and can be performed in any laboratory with access to high-resolution mass spectrometers.
Regulatory T (Treg) cells maintain immune homeostasis and prevent inflammatory and autoimmune responses. During development, thymocytes bearing a moderately self-reactive T cell receptor (TCR) can be ...selected to become Treg cells. Several observations suggest that also in the periphery mature Treg cells continuously receive self-reactive TCR signals. However, the importance of this inherent autoreactivity for Treg cell biology remains poorly defined. To address this open question, we genetically ablated the TCR of mature Treg cells in vivo. These experiments revealed that TCR-induced Treg lineage-defining Foxp3 expression and gene hypomethylation were uncoupled from TCR input in mature Treg cells. However, Treg cell homeostasis, cell-type-specific gene expression and suppressive function critically depend on continuous triggering of their TCR.
•Foxp3 expression and gene-specific hypomethylation are unaffected by TCR ablation•TCR expression is critical for effector Treg cell differentiation and maintenance•The homeostatic proliferation of Treg cells depends on TCR signals•TCR ablation impairs the functional abilities of Treg cells
The importance of T cell receptor (TCR) for mature regulatory T cell pool size and lineage identity is not well understood. Vahl and colleagues show that TCR signals are dispensable for Foxp3 expression, yet effector differentiation and function of regulatory T cells critically depend on continuous signaling through the TCR.
Functional genomics efforts face tradeoffs between number of perturbations examined and complexity of phenotypes measured. We bridge this gap with Perturb-seq, which combines droplet-based ...single-cell RNA-seq with a strategy for barcoding CRISPR-mediated perturbations, allowing many perturbations to be profiled in pooled format. We applied Perturb-seq to dissect the mammalian unfolded protein response (UPR) using single and combinatorial CRISPR perturbations. Two genome-scale CRISPR interference (CRISPRi) screens identified genes whose repression perturbs ER homeostasis. Subjecting ∼100 hits to Perturb-seq enabled high-precision functional clustering of genes. Single-cell analyses decoupled the three UPR branches, revealed bifurcated UPR branch activation among cells subject to the same perturbation, and uncovered differential activation of the branches across hits, including an isolated feedback loop between the translocon and IRE1α. These studies provide insight into how the three sensors of ER homeostasis monitor distinct types of stress and highlight the ability of Perturb-seq to dissect complex cellular responses.
Display omitted
•Perturb-seq allows parallel screening with rich phenotypic output from single cells•Simultaneous delivery and identification of up to three CRISPR perturbations•Genome-scale screens dissect the mammalian unfolded protein response•Analytical methods separate perturbation responses from confounding effects
A strategy for barcoding CRISPR-mediated perturbations allows pooled expression profiling via single-cell RNA sequencing. Application to the mammalian unfolded protein response then enabled systematic delineation of the transcriptional arms of the response and functional clustering of genes affecting ER homeostasis.
P granules are non-membrane-bound RNA-protein compartments that are involved in germline development in C. elegans. They are liquids that condense at one end of the embryo by localized phase ...separation, driven by gradients of polarity proteins such as the mRNA-binding protein MEX-5. To probe how polarity proteins regulate phase separation, we combined biochemistry and theoretical modeling. We reconstitute P granule-like droplets in vitro using a single protein PGL-3. By combining in vitro reconstitution with measurements of intracellular concentrations, we show that competition between PGL-3 and MEX-5 for mRNA can regulate the formation of PGL-3 droplets. Using theory, we show that, in a MEX-5 gradient, this mRNA competition mechanism can drive a gradient of P granule assembly with similar spatial and temporal characteristics to P granule assembly in vivo. We conclude that gradients of polarity proteins can position RNP granules during development by using RNA competition to regulate local phase separation.
Display omitted
•The protein PGL-3 can phase separate into P-granule-like droplets in vitro•Assembly of PGL-3 droplets at physiological concentration requires mRNA binding to PGL-3•MEX-5 inhibits mRNA-dependent droplet assembly by competing with PGL-3 for binding mRNA•Competition among MEX-5 and PGL-3 for mRNA can account for P granule segregation in vivo
Asymmetric positioning of cellular compartments formed by phase separation can be driven by competition between two mRNA binding proteins, one of which forms a spatial concentration gradient.