SARS-CoV-2 variants of interest and concern will continue to emerge for the duration of the COVID-19 pandemic. To map mutations in the receptor-binding domain (RBD) of the spike protein that affect ...binding to angiotensin-converting enzyme 2 (ACE2), the receptor for SARS-CoV-2, we applied in vitro evolution to affinity-mature the RBD. Multiple rounds of random mutagenic libraries of the RBD were sorted against decreasing concentrations of ACE2, resulting in the selection of higher affinity RBD binders. We found that mutations present in more transmissible viruses (S477N, E484K and N501Y) were preferentially selected in our high-throughput screen. Evolved RBD mutants include prominently the amino acid substitutions found in the RBDs of B.1.620, B.1.1.7 (Alpha), B1.351 (Beta) and P.1 (Gamma) variants. Moreover, the incidence of RBD mutations in the population as presented in the GISAID database (April 2021) is positively correlated with increased binding affinity to ACE2. Further in vitro evolution increased binding by 1,000-fold and identified mutations that may be more infectious if they evolve in the circulating viral population, for example, Q498R is epistatic to N501Y. We show that our high-affinity variant RBD-62 can be used as a drug to inhibit infection with SARS-CoV-2 and variants Alpha, Beta and Gamma in vitro. In a model of SARS-CoV-2 challenge in hamster, RBD-62 significantly reduced clinical disease when administered before or after infection. A 2.9 Å cryo-electron microscopy structure of the high-affinity complex of RBD-62 and ACE2, including all rapidly spreading mutations, provides a structural basis for future drug and vaccine development and for in silico evaluation of known antibodies.
Substantial improvements in enzyme activity demand multiple mutations at spatially proximal positions in the active site. Such mutations, however, often exhibit unpredictable epistatic (non-additive) ...effects on activity. Here we describe FuncLib, an automated method for designing multipoint mutations at enzyme active sites using phylogenetic analysis and Rosetta design calculations. We applied FuncLib to two unrelated enzymes, a phosphotriesterase and an acetyl-CoA synthetase. All designs were active, and most showed activity profiles that significantly differed from the wild-type and from one another. Several dozen designs with only 3–6 active-site mutations exhibited 10- to 4,000-fold higher efficiencies with a range of alternative substrates, including hydrolysis of the toxic organophosphate nerve agents soman and cyclosarin and synthesis of butyryl-CoA. FuncLib is implemented as a web server (http://FuncLib.weizmann.ac.il); it circumvents iterative, high-throughput experimental screens and opens the way to designing highly efficient and diverse catalytic repertoires.
Display omitted
•FuncLib is a new method that designs diverse multipoint mutants in enzyme active sites•Designs are efficient and functionally diverse, bypassing high-throughput screening•Designs exhibit up to 4 orders of magnitude improvement in several activities•FuncLib is implemented as a web-server (http://funclib.weizmann.ac.il)
Khersonsky et al. present FuncLib, an automated method for designing catalytic repertoires using phylogenetic analysis and Rosetta design calculations. FuncLib resulted in efficient enzymes, including new hydrolases with the potential to treat nerve agent poisoning.
Upon heterologous overexpression, many proteins misfold or aggregate, thus resulting in low functional yields. Human acetylcholinesterase (hAChE), an enzyme mediating synaptic transmission, is a ...typical case of a human protein that necessitates mammalian systems to obtain functional expression. We developed a computational strategy and designed an AChE variant bearing 51 mutations that improved core packing, surface polarity, and backbone rigidity. This variant expressed at ∼2,000-fold higher levels in E. coli compared to wild-type hAChE and exhibited 20°C higher thermostability with no change in enzymatic properties or in the active-site configuration as determined by crystallography. To demonstrate broad utility, we similarly designed four other human and bacterial proteins. Testing at most three designs per protein, we obtained enhanced stability and/or higher yields of soluble and active protein in E. coli. Our algorithm requires only a 3D structure and several dozen sequences of naturally occurring homologs, and is available at http://pross.weizmann.ac.il.
Display omitted
•A new computational method is used to stabilize five recalcitrant proteins•Designed variants show higher expression and stability with unmodified function•A designed human acetylcholinesterase variant expresses solubly in bacteria•The method is fully automated and implemented on a webserver
Heterologous expression of proteins and their mutants often results in misfolding and aggregation. Goldenzweig et al. (2016) developed an automated algorithm for protein stabilization requiring minimal experimental testing; for instance, the five tested variants of human acetylcholinesterase showed ≥100-fold higher soluble bacterial expression and higher melting temperatures than wild-type.
Molecular evolution has focused on the divergence of molecular functions, yet we know little about how structurally distinct protein folds emerge de novo. We characterized the evolutionary ...trajectories and selection forces underlying emergence of β-propeller proteins, a globular and symmetric fold group with diverse functions. The identification of short propeller-like motifs (<50 amino acids) in natural genomes indicated that they expanded via tandem duplications to form extant propellers. We phylogenetically reconstructed 47-residue ancestral motifs that form five-bladed lectin propellers via oligomeric assembly. We demonstrate a functional trajectory of tandem duplications of these motifs leading to monomeric lectins. Foldability, i.e., higher efficiency of folding, was the main parameter leading to improved functionality along the entire evolutionary trajectory. However, folding constraints changed along the trajectory: initially, conflicts between monomer folding and oligomer assembly dominated, whereas subsequently, upon tandem duplication, tradeoffs between monomer stability and foldability took precedence.
Display omitted
•Inferred 47-aminoacid ancestral motifs fold into functional β-propeller assemblies•Motif duplication, fusion, and diversification yield functional monomeric propellers•Folding efficiency was the key parameter optimized throughout propeller emergence•Single-motif precursors in extant genomes support the reconstructed emergence pathway
Experimental reconstruction of the emergence of a de novo protein indicates that “foldability” is the primary factor required for improved functionality along the entire evolutionary trajectory, although the parameters dictating optimal folding shifted as protein complexity increased.
Protein networks in all organisms comprise homologous interacting pairs. In these networks, some proteins are specific, interacting with one or a few binding partners, whereas others are ...multispecific and bind a range of targets. We describe an algorithm that starts from an interacting pair and designs dozens of new pairs with diverse backbone conformations at the binding site as well as new binding orientations and sequences. Applied to a high-affinity bacterial pair, the algorithm results in 18 new ones, with cognate affinities from pico- to micromolar. Three pairs exhibit 3-5 orders of magnitude switch in specificity relative to the wild type, whereas others are multispecific, collectively forming a protein-interaction network. Crystallographic analysis confirms design accuracy, including in new backbones and polar interactions. Preorganized polar interaction networks are responsible for high specificity, thus defining design principles that can be applied to program synthetic cellular interaction networks of desired affinity and specificity.
Antibodies developed for research and clinical applications may exhibit suboptimal stability, expressibility, or affinity. Existing optimization strategies focus on surface mutations, whereas natural ...affinity maturation also introduces mutations in the antibody core, simultaneously improving stability and affinity. To systematically map the mutational tolerance of an antibody variable fragment (Fv), we performed yeast display and applied deep mutational scanning to an anti-lysozyme antibody and found that many of the affinity-enhancing mutations clustered at the variable light-heavy chain interface, within the antibody core. Rosetta design combined enhancing mutations, yielding a variant with tenfold higher affinity and substantially improved stability. To make this approach broadly accessible, we developed AbLIFT, an automated web server that designs multipoint core mutations to improve contacts between specific Fv light and heavy chains (http://AbLIFT.weizmann.ac.il). We applied AbLIFT to two unrelated antibodies targeting the human antigens VEGF and QSOX1. Strikingly, the designs improved stability, affinity, and expression yields. The results provide proof-of-principle for bypassing laborious cycles of antibody engineering through automated computational affinity and stability design.
Automated design of enzymes with wild-type-like catalytic properties has been a long-standing but elusive goal. Here, we present a general, automated method for enzyme design through combinatorial ...backbone assembly. Starting from a set of homologous yet structurally diverse enzyme structures, the method assembles new backbone combinations and uses Rosetta to optimize the amino acid sequence, while conserving key catalytic residues. We apply this method to two unrelated enzyme families with TIM-barrel folds, glycoside hydrolase 10 (GH10) xylanases and phosphotriesterase-like lactonases (PLLs), designing 43 and 34 proteins, respectively. Twenty-one GH10 and seven PLL designs are active, including designs derived from templates with <25% sequence identity. Moreover, four designs are as active as natural enzymes in these families. Atomic accuracy in a high-activity GH10 design is further confirmed by crystallographic analysis. Thus, combinatorial-backbone assembly and design may be used to generate stable, active, and structurally diverse enzymes with altered selectivity or activity.
Computational design is a test of our understanding of enzyme catalysis and a means of engineering novel, tailor-made enzymes. While the de novo computational design of catalytically efficient ...enzymes remains a challenge, designed enzymes may comprise unique starting points for further optimization by directed evolution. Directed evolution of two computationally designed Kemp eliminases, KE07 and KE70, led to low to moderately efficient enzymes (k cₐₜ/ K ₘ values of ≤ 5 10 ⁴ M ⁻¹s ⁻¹). Here we describe the optimization of a third design, KE59. Although KE59 was the most catalytically efficient Kemp eliminase from this design series (by k cₐₜ/ K ₘ, and by catalyzing the elimination of nonactivated benzisoxazoles), its impaired stability prevented its evolutionary optimization. To boost KE59’s evolvability, stabilizing consensus mutations were included in the libraries throughout the directed evolution process. The libraries were also screened with less activated substrates. Sixteen rounds of mutation and selection led to > 2,000-fold increase in catalytic efficiency, mainly via higher k cₐₜ values. The best KE59 variants exhibited k cₐₜ/ K ₘ values up to 0.6 10 ⁶ M ⁻¹s ⁻¹, and k cₐₜ/ k ᵤₙcₐₜ values of ≤ 10 ⁷ almost regardless of substrate reactivity. Biochemical, structural, and molecular dynamics (MD) simulation studies provided insights regarding the optimization of KE59. Overall, the directed evolution of three different designed Kemp eliminases, KE07, KE70, and KE59, demonstrates that computational designs are highly evolvable and can be optimized to high catalytic efficiencies.
The ability to tune the light‐absorption properties of chlorophylls by their protein environment is the key to the robustness and high efficiency of photosynthetic light‐harvesting proteins. ...Unfortunately, the intricacy of the natural complexes makes it very difficult to identify and isolate specific protein–pigment interactions that underlie the spectral‐tuning mechanisms. Herein we identify and demonstrate the tuning mechanism of chlorophyll spectra in type II water‐soluble chlorophyll binding proteins from Brassicaceae (WSCPs). By comparing the molecular structures of two natural WSCPs we correlate a shift in the chlorophyll red absorption band with deformation of its tetrapyrrole macrocycle that is induced by changing the position of a nearby tryptophan residue. We show by a set of reciprocal point mutations that this change accounts for up to 2/3 of the observed spectral shift between the two natural variants.
A change for the redder: A new crystal structure of type IIa water‐soluble chlorophyll binding protein (WSCP) from cauliflower is compared to the type IIb WSCP structure. It reveals how ring deformation is exploited for red shifting the red‐most chlorophyll absorption peak. A single mutation of asparagine to alanine changes the hydrogen‐bonding pattern of a tryptophan residue repositioning it to perturb the planarity of the chlorophyll macrocycle.
Cellular lineage tracking provides a means to observe population makeup at the clonal level, allowing exploration of heterogeneity, evolutionary and developmental processes and individual clones' ...relative fitness. It has thus contributed significantly to understanding microbial evolution, organ differentiation and cancer heterogeneity, among others. Its use, however, is limited because existing methods are highly specific, expensive, labour-intensive, and, critically, do not allow the repetition of experiments. To address these issues, we developed gUMI-BEAR (genomic Unique Molecular Identifier Barcoded Enriched Associated Regions), a modular, cost-effective method for tracking populations at high resolution. We first demonstrate the system's application and resolution by applying it to track tens of thousands of Saccharomyces cerevisiae lineages growing together under varying environmental conditions applied across multiple generations, revealing fitness differences and lineage-specific adaptations. Then, we demonstrate how gUMI-BEAR can be used to perform parallel screening of a huge number of randomly generated variants of the Hsp82 gene. We further show how our method allows isolation of variants, even if their frequency in the population is low, thus enabling unsupervised identification of modifications that lead to a behaviour of interest.