Phosphoproteomics integrating data-independent acquisition (DIA) enables deep phosphoproteome profiling with improved quantification reproducibility and accuracy compared to data-dependent acquisition (DDA)-based phosphoproteomics. DIA data mining relies heavily on a spectral library that in most cases is built from DDA analysis of the same sample. Construction of this project-specific DDA library impairs the analytical throughput, limits the proteome coverage, and increases the sample size for DIA phosphoproteomics. Herein we introduce a deep neural network, DeepPhospho, which conceptually differs from previous deep learning models to achieve accurate predictions of LC-MS/MS data for phosphopeptides. By leveraging in silico libraries generated by DeepPhospho, we establish a DIA workflow for phosphoproteome profiling which involves DIA data acquisition and data mining with DeepPhospho-predicted libraries, thus circumventing the need for DDA library construction. Our DeepPhospho-empowered workflow substantially expands the phosphoproteome coverage while maintaining high quantification performance, which leads to the discovery of more signaling pathways and regulated kinases in an EGF signaling study than the DDA library-based approach. DeepPhospho is provided as a web server as well as an offline app to facilitate user access to model training, predictions and library generation.
As one of the most successful therapeutic target families, G protein-coupled receptors (GPCRs) have experienced a transformation from random ligand screening to knowledge-driven drug design. We are witnessing tremendous recent progress in the understanding of their structure-function relationships, which has facilitated drug development at an unprecedented pace. This article intends to provide a comprehensive overview of this important field to a broader readership that shares common interests in drug discovery.
Data analysis represents a key challenge for untargeted metabolomics studies, and it commonly requires extensive processing of thousands of metabolite peaks included in raw high-resolution MS data. Although a number of software packages have been developed to facilitate untargeted data processing, they have not been comprehensively scrutinized for their capability of feature detection, quantification and marker selection using a well-defined benchmark sample set. In this study, we acquired a benchmark dataset from standard mixtures consisting of 1100 compounds with specified concentration ratios, including 130 compounds with significant variation of concentrations. The five software packages evaluated here (MS-DIAL, MZmine 2, XCMS, MarkerView, and Compound Discoverer) showed similar performance in detection of true features derived from compounds in the mixtures. However, significant differences between untargeted metabolomics software were observed in relative quantification of true features in the benchmark dataset. MZmine 2 outperformed the other software in terms of quantification accuracy, and it reported the most true discriminating markers together with the fewest false markers. Furthermore, we assessed selection of discriminating markers by different software using both the benchmark dataset and a real-case metabolomics dataset to propose combined usage of two software packages for increasing confidence of biomarker identification. Our findings from comprehensive evaluation of untargeted metabolomics software would help guide future improvements of these widely used bioinformatics tools and enable users to properly interpret their metabolomics results.
• Five widely used data analysis software packages for untargeted metabolomics were compared.
• Open source and commercial programs gave similar performance in detecting true features from benchmark samples.
• Significant difference of software in quantification accuracy for true features.
• Increased confidence of biomarker identification by combined use of XCMS and MZmine 2.
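The marker-selection comparison summarized above can be illustrated with a minimal Python sketch. It assumes that each tool's reported discriminating markers and the benchmark's truly varying compounds are available as sets of compound identifiers. The identifiers, the function names `score_markers` and `consensus_markers`, and the use of a plain set intersection to model "combined use" are all hypothetical choices for illustration, not the procedure described in the study.

```python
def score_markers(reported, true_markers):
    """Count true and false markers and derive precision/recall.

    `reported` and `true_markers` are iterables of compound identifiers
    (hypothetical IDs; the benchmark's real identifiers are not given here).
    """
    reported, true_markers = set(reported), set(true_markers)
    true_hits = reported & true_markers    # markers that truly vary
    false_hits = reported - true_markers   # markers reported in error
    precision = len(true_hits) / len(reported) if reported else 0.0
    recall = len(true_hits) / len(true_markers) if true_markers else 0.0
    return {
        "true": len(true_hits),
        "false": len(false_hits),
        "precision": precision,
        "recall": recall,
    }


def consensus_markers(markers_a, markers_b):
    """Assumed combination rule: keep only markers reported by both tools."""
    return set(markers_a) & set(markers_b)


# Toy data, invented for illustration only.
truth = {"c1", "c2", "c3", "c4"}
tool_a = {"c1", "c2", "c3", "x1"}
tool_b = {"c1", "c2", "c4", "x2"}

score_a = score_markers(tool_a, truth)
score_both = score_markers(consensus_markers(tool_a, tool_b), truth)
print(score_a)
print(score_both)
```

In this toy case the intersection removes both false markers at the cost of lower recall, which is the trade-off one would expect when two tools are required to agree.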
A plethora of software suites and multiple classes of spectral libraries have been developed to enhance the depth and robustness of data-independent acquisition (DIA) data processing. However, how the combination of a DIA software tool and a spectral library impacts the outcome of DIA proteomics and phosphoproteomics data analysis has rarely been investigated using benchmark data that mimic biological complexity. In this study, we create DIA benchmark data sets simulating the regulation of thousands of proteins in a complex background, which are collected on both an Orbitrap and a timsTOF instrument. We evaluate four commonly used software suites (DIA-NN, Spectronaut, MaxDIA and Skyline) combined with seven different spectral libraries in global proteome analysis. Moreover, we assess their performance in analyzing phosphopeptide standards and TNF-α-induced phosphoproteome regulation. Our study provides practical guidance on how to construct a robust data analysis pipeline for different proteomics studies implementing the DIA technique.
Adhesion G protein-coupled receptors (aGPCRs) are essential for a variety of physiological processes such as immune responses, organ development, cellular communication, proliferation and homeostasis. An intrinsic manner of activation that involves a tethered agonist in the N-terminal region of the receptor has been proposed for the aGPCRs, but its molecular mechanism remains elusive. Here we report the G protein-bound structures of ADGRD1 and ADGRF1, which exhibit many unique features with regard to the tethered agonism. The stalk region that precedes the first transmembrane helix acts as the tethered agonist by forming extensive interactions with the transmembrane domain; these interactions are mostly conserved in ADGRD1 and ADGRF1, suggesting that a common stalk-transmembrane domain interaction pattern is shared by members of the aGPCR family. A similar stalk binding mode is observed in the structure of autoproteolysis-deficient ADGRF1, supporting a cleavage-independent manner of receptor activation. The stalk-induced activation is facilitated by a cascade of inter-helix interaction cores that are conserved in position but show sequence variability in these two aGPCRs. Furthermore, the intracellular region of ADGRF1 contains a specific lipid-binding site, which proves to be functionally important and may serve as the recognition site for the previously discovered endogenous ADGRF1 ligand synaptamide. These findings highlight the diversity and complexity of the signal transduction mechanisms of the aGPCRs.
The tubular network is a critical part of the endoplasmic reticulum (ER). The network is shaped by the reticulons and REEPs/Yop1p that generate tubules by inducing high membrane curvature, and the dynamin-like GTPases atlastin and Sey1p/RHD3 that connect tubules via membrane fusion. However, the specific functions of this ER domain are not clear. Here, we isolated tubule-based microsomes from Saccharomyces cerevisiae via classical cell fractionation and detergent-free immunoprecipitation of Flag-tagged Yop1p, which specifically localizes to ER tubules. In quantitative comparisons of tubule-derived and total microsomes, we identified a total of 79 proteins that were enriched in the ER tubules, including known proteins that organize the tubular ER network. Functional categorization of the list of proteins revealed that the tubular ER network may be involved in membrane trafficking, lipid metabolism, organelle contact, and stress sensing. We propose that affinity isolation coupled with quantitative proteomics is a useful tool for investigating ER functions.
The properties of therapeutic proteins can be enhanced by chemical modification. Methods for site-specific protein conjugation are critical to such efforts. Here, we demonstrate that recombinant proteins expressed in mammalian cells can be site-specifically modified by using a genetically encoded aldehyde tag. We introduced the peptide sequence recognized by the endoplasmic reticulum (ER)-resident formylglycine generating enzyme (FGE), which can be as short as 6 residues, into heterologous proteins expressed in mammalian cells. Cotranslational modification of the proteins by FGE produced products bearing a unique aldehyde group. Proteins bearing this "aldehyde tag" were chemically modified by selective reaction with hydrazide- or aminooxy-functionalized reagents. We applied the technique to site-specific modification of monoclonal antibodies, the fastest growing class of biopharmaceuticals, as well as membrane-associated and cytosolic proteins expressed in mammalian cells.
Dear Editor,
Receptor-like kinases (RLKs) constitute the major family of cell surface-associated receptors in plants and play essential roles in perceiving extracellular signals 1. Over two hundred members of the largest subfamily of RLKs that contain leucine-rich repeat extracellular domains (LRR-RLK) are found in Arabidopsis, among which BRI1-associated kinase 1 (BAK1) is one of the best studied. BAK1 was initially identified based on its association with the LRR-RLK BRI1, which perceives brassinosteroid, an important hormone that regulates a wide range of developmental and physiological processes in plants 2. BAK1 also serves as a co-receptor for several other LRR-RLKs that perceive pathogen-associated molecular patterns (PAMPs), including flagellin-sensing 2 (FLS2) and elongation factor EF-Tu receptor 3, 4, and is therefore required for the innate immunity of plants. The reciprocal phosphorylation on the cytoplasmic domains of BAK1 and the ligand-binding RLKs in the complex is a prerequisite for the full activation of the receptor kinase 5. The crystal structure of BAK1 complexed with AvrPtoB, which is an effector secreted by Pseudomonas syringae pv. tomato to suppress PAMP-triggered immunity, recently revealed the mechanism by which BAK1 activity is inhibited 6. However, the mechanism for BAK1 activation, particularly the impact of phosphorylation of key residues on BAK1 activation, remains unclear.
Saccharomyces cerevisiae has been intensively studied for its responses to different environmental stresses such as heat shock through global omic analysis. However, the S. cerevisiae industrial strains with superior thermotolerance have not been explored in any proteomic studies for elucidating the tolerance mechanism. Recently a new diploid strain was obtained through evolutionary engineering of a parental industrial strain, and it exhibited even higher resistance to prolonged thermal stress. Herein, we performed iTRAQ-based quantitative proteomic analysis on both the parental and evolved industrial strains to further understand the mechanism of thermotolerant adaptation. Out of ∼2600 quantifiable proteins from biological quadruplicates, 193 and 204 proteins were differentially regulated in the parental and evolved strains, respectively, during heat-stressed growth. The proteomic response of the industrial strains cultivated under prolonged thermal stress turned out to be substantially different from that of the laboratory strain exposed to sudden heat shock. Further analysis of transcription factors underlying the proteomic perturbation also indicated a distinct regulatory mechanism of thermotolerance. Finally, a cochaperone Mdj1 and a metabolic enzyme Adh1 were selected to investigate their roles in mediating heat-stressed growth and ethanol production of yeasts. Our proteomic characterization of the industrial strain led to a comprehensive understanding of the molecular basis of thermotolerance, which would facilitate future improvement of this industrially important trait of S. cerevisiae by rational engineering.
G protein-coupled receptors (GPCRs) are a protein superfamily comprising >800 members that regulate numerous cellular and physiologic responses. GPCRs represent the largest class of therapeutic targets with implications in various diseases. Although advances in GPCR structural and pharmacological research have significantly improved our knowledge of GPCR signaling mechanisms, mapping diverse post-translational modifications (PTMs) of GPCR proteins and understanding their regulatory roles have received much less attention. Mass spectrometry-based proteomics has become the most popular technology for profiling protein PTMs in a systematic manner. Herein we provide an overview of PTM types, locations, crosstalk and dynamic regulation for different GPCRs that are characterized using proteomic and/or biochemical approaches. Our main focus is on glycosylation, phosphorylation, ubiquitination and palmitoylation that are known to modulate receptor folding, biosynthesis, trafficking, dimerization and signaling. Furthermore, we discuss the locations of specific PTM sites in the structure of a given GPCR and its signaling complex to highlight the importance of PTM regulation in the molecular basis of GPCRs, which may shed new light on structure-based drug discovery.