The signal peptidase complex (SPC) is an essential membrane complex in the endoplasmic reticulum (ER), where it removes signal peptides (SPs) from a large variety of secretory pre-proteins with ...exquisite specificity. Although the determinants of this process have been established empirically, the molecular details of SP recognition and removal remain elusive. Here, we show that the human SPC exists in two functional paralogs with distinct proteolytic subunits. We determined the atomic structures of both paralogs using electron cryo-microscopy and structural proteomics. The active site is formed by a catalytic triad and abuts the ER membrane, where a transmembrane window collectively formed by all subunits locally thins the bilayer. Molecular dynamics simulations indicate that this unique architecture generates specificity for SPs based on the length of their hydrophobic segments.
Display omitted
•The human SPC has two paralogs, both of which are structurally characterized here•The SPC is a serine protease with a catalytic Ser-His-Asp triad•The c-region binding pocket is conserved to bacterial signal peptidases•Membrane thinning by the SPC measures signal peptides for length before cleavage
The ER-resident signal peptidase complex (SPC) cleaves signal peptides of nascent secretory proteins and leaves transmembrane helices intact. Combining electron cryo-microscopy, mass spectrometry, and molecular dynamics simulations, Liaci et al. discover that the human SPC uses local membrane thinning and shape complementarity near the active site to generate substrate selectivity.
The Cmr complex is an RNA-guided endonuclease that cleaves foreign RNA targets as part of the CRISPR prokaryotic defense system. We investigated the molecular architecture of the P. furiosus Cmr ...complex using an integrative structural biology approach. We determined crystal structures of P. furiosus Cmr1, Cmr2, Cmr4, and Cmr6 and combined them with known structural information to interpret the cryo-EM map of the complex. To support structure determination, we obtained residue-specific interaction data using protein crosslinking and mass spectrometry. The resulting pseudoatomic model reveals how the superhelical backbone of the complex is defined by the polymerizing principles of Cmr4 and Cmr5 and how it is capped at the extremities by proteins of similar folds. The inner surface of the superhelix exposes conserved residues of Cmr4 that we show are required for target-cleavage activity. The structural and biochemical data thus identify Cmr4 as the conserved endoribonuclease of the Cmr complex.
Display omitted
•The core of the Cmr complex is formed by head-to-tail oligomers of Cmr4 and Cmr5•Cmr2-Cmr3 and Cmr6-Cmr1 mimic and terminate the head-to-tail oligomerization•In Cmr2-Cmr3, the HD nuclease domain of Cmr2 lies diametrically opposite to Cmr3•Cmr4 provides the RNA-target cleavage site on the inner surface of the superhelix
The Cmr complex is the only known CRISPR system that targets foreign RNA rather than DNA. Benda et al. have used a hybrid structural and mass spectrometry approach to obtain a pseudoatomic model of the P. furiosus Cmr complex that allows identifying the endoribonuclease cleavage site.
Protein secretion allows communication of distant cells in an organism and controls a broad range of physiological functions. We describe a quantitative, high-resolution mass spectrometric workflow ...to detect and quantify proteins that are released from immune cells upon receptor ligation. We quantified the time-resolved release of 775 proteins, including 52 annotated cytokines from only 150,000 primary Toll-like receptor 4-activated macrophages per condition. Achieving low picogram sensitivity, we detected secreted proteins whose abundance increased by a factor of more than 10,000 upon stimulation. Secretome to transcriptome comparisons revealed the transcriptionally decoupled release of lysosomal proteins. From genetic models, we defined secretory profiles that depended on distinct intracellular signaling adaptors and showed that secretion of many proinflammatory proteins is safeguarded by redundant mechanisms, whereas signaling adaptor synergy promoted the release of anti-inflammatory proteins.
The recent proliferation of high-resolution mass spectrometers has generated a wealth of new data analysis methods. However, flexible integration of these methods into configurations best suited to ...the research question is hampered by heterogeneous file formats and monolithic software development. The mzXML, mzData, and mzML file formats have enabled uniform access to unprocessed raw data. In this paper we present our efforts to produce an equally simple and powerful format, PeakML, to uniformly exchange processed intermediary and result data. To demonstrate the versatility of PeakML, we have developed an open source Java toolkit for processing, filtering, and annotating mass spectra in a customizable pipeline (mzMatch), as well as a user-friendly data visualization environment (PeakML Viewer). The PeakML format in particular enables the flexible exchange of processed data between software created by different groups or companies, as we illustrate by providing a PeakML-based integration of the widely used XCMS package with mzMatch data processing tools. As an added advantage, downstream analysis can benefit from direct access to the full mass trace information underlying summarized mass spectrometry results, providing the user with the means to rapidly verify results. The PeakML/mzMatch software is freely available at http://mzmatch.sourceforge.net, with documentation, tutorials, and a community forum.
Grana are a characteristic feature of higher plants' thylakoid membranes, consisting of stacks of appressed membranes enriched in Photosystem II (PSII) and associated light-harvesting complex II ...(LHCII) proteins, together forming the PSII-LHCII supercomplex. Grana stacks undergo light-dependent structural changes, mainly by reorganizing the supramolecular structure of PSII-LHCII supercomplexes. LHCII is vital for grana formation, in which also PSII-LHCII supercomplexes are involved. By combining top-down and crosslinking mass spectrometry we uncover the spatial organization of paired PSII-LHCII supercomplexes within thylakoid membranes. The resulting model highlights a basic molecular mechanism whereby plants maintain grana stacking at changing light conditions. This mechanism relies on interactions between stroma-exposed N-terminal loops of LHCII trimers and Lhcb4 subunits facing each other in adjacent membranes. The combination of light-dependent LHCII N-terminal trimming and extensive N-terminal α-acetylation likely affects interactions between pairs of PSII-LHCII supercomplexes across the stromal gap, ultimately mediating membrane folding in grana stacks.
A key step in mass spectrometry (MS)-based proteomics is the identification of peptides in sequence databases by their fragmentation spectra. Here we describe Andromeda, a novel peptide search engine ...using a probabilistic scoring model. On proteome data, Andromeda performs as well as Mascot, a widely used commercial search engine, as judged by sensitivity and specificity analysis based on target decoy searches. Furthermore, it can handle data with arbitrarily high fragment mass accuracy, is able to assign and score complex patterns of post-translational modifications, such as highly phosphorylated peptides, and accommodates extremely large databases. The algorithms of Andromeda are provided. Andromeda can function independently or as an integrated search engine of the widely used MaxQuant computational proteomics platform and both are freely available at www.maxquant.org. The combination enables analysis of large data sets in a simple analysis workflow on a desktop computer. For searching individual spectra Andromeda is also accessible via a web server. We demonstrate the flexibility of the system by implementing the capability to identify cofragmented peptides, significantly improving the total number of identified peptides.
Cells organize their actions partly through tightly controlled protein-protein interactions—collectively termed the interactome. Here we use crosslinking mass spectrometry (XL-MS) to chart the ...protein-protein interactions in intact human nuclei. Overall, we identified ∼8,700 crosslinks, of which 2/3 represent links connecting distinct proteins. From these data, we gain insights on interactions involving histone proteins. We observed that core histones on the nucleosomes expose well-defined interaction hot spots. For several nucleosome-interacting proteins, such as USF3 and Ran GTPase, the data allowed us to build low-resolution models of their binding mode to the nucleosome. For HMGN2, the data guided the construction of a refined model of the interaction with the nucleosome, based on complementary NMR, XL-MS, and modeling. Excitingly, the analysis of crosslinks carrying posttranslational modifications allowed us to extract how specific modifications influence nucleosome interactions. Overall, our data depository will support future structural and functional analysis of cell nuclei, including the nucleoprotein assemblies they harbor.
Eukaryotic cells employ three SMC (structural maintenance of chromosomes) complexes to control DNA folding and topology. The Smc5/6 complex plays roles in DNA repair and in preventing the ...accumulation of deleterious DNA junctions. To elucidate how specific features of Smc5/6 govern these functions, we reconstituted the yeast holo‐complex. We found that the Nse5/6 sub‐complex strongly inhibited the Smc5/6 ATPase by preventing productive ATP binding. This inhibition was relieved by plasmid DNA binding but not by short linear DNA, while opposing effects were observed without Nse5/6. We uncovered two binding sites for Nse5/6 on Smc5/6, based on an Nse5/6 crystal structure and cross‐linking mass spectrometry data. One binding site is located at the Smc5/6 arms and one at the heads, the latter likely exerting inhibitory effects on ATP hydrolysis. Cysteine cross‐linking demonstrated that the interaction with Nse5/6 anchored the ATPase domains in a non‐productive state, which was destabilized by ATP and DNA. Under similar conditions, the Nse4/3/1 module detached from the ATPase. Altogether, we show how DNA substrate selection is modulated by direct inhibition of the Smc5/6 ATPase by Nse5/6.
SYNOPSIS
The Smc5/6 complex prevents accumulation of toxic DNA junctions during DNA replication and repair, in order to enable faithful chromosome segregation in mitosis and meiosis. Biochemical reconstitution of the yeast holo‐complex reveals a central role of the Nse5/6 sub‐complex in DNA substrate selection.
Nse5/6 promotes ATP‐dependent, salt‐stable DNA association of Smc5/6.
Nse5/6 inhibits the Smc5/6 ATPase by preventing productive ATP binding in the absence of DNA.
Nse5/6 contacts the Smc5/6 hexamer via multiple interfaces including one at the SMC joint and one at the heads.
Lysine cross‐linking (XL‐MS) and cysteine cross‐linking uncover major conformational changes upon Nse5/6 association.
A crystal structure of the Nse5/6 core shows a HEAT‐repeat‐like organization.
Biochemical reconstitution and structural analyses provide insight into the architecture and substrate‐associated conformational changes of the yeast holo‐SMC5/6 complex.
Cellular functions are governed by molecular machines that assemble through protein-protein interactions. Their atomic details are critical to studying their molecular mechanisms. However, fewer than ...5% of hundreds of thousands of human protein interactions have been structurally characterized. Here we test the potential and limitations of recent progress in deep-learning methods using AlphaFold2 to predict structures for 65,484 human protein interactions. We show that experiments can orthogonally confirm higher-confidence models. We identify 3,137 high-confidence models, of which 1,371 have no homology to a known structure. We identify interface residues harboring disease mutations, suggesting potential mechanisms for pathogenic variants. Groups of interface phosphorylation sites show patterns of co-regulation across conditions, suggestive of coordinated tuning of multiple protein interactions as signaling responses. Finally, we provide examples of how the predicted binary complexes can be used to build larger assemblies helping to expand our understanding of human cell biology.