The ability of naturally occurring proteins to change conformation in response to environmental changes is critical to biological function. Although there have been advances in the de novo design of stable proteins with a single, deep free-energy minimum, the design of conformational switches remains challenging. We present a general strategy to design pH-responsive protein conformational changes by precisely preorganizing histidine residues in buried hydrogen-bond networks. We design homotrimers and heterodimers that are stable above pH 6.5 but undergo cooperative, large-scale conformational changes when the pH is lowered and electrostatic and steric repulsion builds up as the network histidine residues become protonated. The transition pH and cooperativity can be controlled through the number of histidine-containing networks and the strength of the surrounding hydrophobic interactions. Upon disassembly, the designed proteins disrupt lipid membranes both in vitro and after being endocytosed in mammalian cells. Our results demonstrate that environmentally triggered conformational changes can now be programmed by de novo protein design.
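The link between pH, histidine protonation and cooperativity described in this abstract can be illustrated with a textbook Hill-type form of the Henderson-Hasselbalch relation. This is only a sketch, not the authors' model: the function name, the default pKa of 6.0 (a typical value for a free histidine side chain; buried residues can differ substantially) and the Hill coefficient are all assumptions introduced here for illustration.

```python
def fraction_protonated(ph: float, pka: float = 6.0, hill: float = 1.0) -> float:
    """Fraction of titratable sites that are protonated at a given pH.

    Hill-type Henderson-Hasselbalch relation. `pka` and `hill` are
    illustrative assumptions, not values taken from the abstract.
    A larger `hill` gives a sharper (more cooperative) transition.
    """
    return 1.0 / (1.0 + 10.0 ** (hill * (ph - pka)))


if __name__ == "__main__":
    # Compare a non-cooperative (hill=1) and a cooperative (hill=3) titration.
    for ph in (5.0, 5.5, 6.0, 6.5, 7.0):
        print(ph,
              round(fraction_protonated(ph, hill=1.0), 3),
              round(fraction_protonated(ph, hill=3.0), 3))
```

In this toy picture, raising the Hill coefficient narrows the pH window over which the sites switch from mostly neutral to mostly protonated, which is the qualitative behaviour the abstract refers to as cooperativity.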
Transmembrane channels and pores have key roles in fundamental biological processes and in biotechnological applications such as DNA nanopore sequencing, resulting in considerable interest in the design of pore-containing proteins. Synthetic amphiphilic peptides have been found to form ion channels, and there have been recent advances in de novo membrane protein design and in redesigning naturally occurring channel-containing proteins. However, the de novo design of stable, well-defined transmembrane protein pores that are capable of conducting ions selectively or are large enough to enable the passage of small-molecule fluorophores remains an outstanding challenge. Here we report the computational design of protein pores formed by two concentric rings of α-helices that are stable and monodisperse in both their water-soluble and their transmembrane forms. Crystal structures of the water-soluble forms of a 12-helical pore and a 16-helical pore closely match the computational design models. Patch-clamp electrophysiology experiments show that, when expressed in insect cells, the transmembrane form of the 12-helix pore enables the passage of ions across the membrane with high selectivity for potassium over sodium; ion passage is blocked by specific chemical modification at the pore entrance. When incorporated into liposomes using in vitro protein synthesis, the transmembrane form of the 16-helix pore, but not the 12-helix pore, enables the passage of biotinylated Alexa Fluor 488. A cryo-electron microscopy structure of the 16-helix transmembrane pore closely matches the design model. The ability to produce structurally and functionally well-defined transmembrane pores opens the door to the creation of designer channels and pores for a wide variety of applications.
The regular arrangements of β-strands around a central axis in β-barrels and of α-helices in coiled coils contrast with the irregular tertiary structures of most globular proteins, and have fascinated structural biologists since they were first discovered. Simple parametric models have been used to design a wide range of α-helical coiled-coil structures, but to date there has been no success with β-barrels. Here we show that accurate de novo design of β-barrels requires considerable symmetry-breaking to achieve continuous hydrogen-bond connectivity and eliminate backbone strain. We then build ensembles of β-barrel backbone models with cavity shapes that match the fluorogenic compound DFHBI, and use a hierarchical grid-based search method to simultaneously optimize the rigid-body placement of DFHBI in these cavities and the identities of the surrounding amino acids to achieve high shape and chemical complementarity. The designs have high structural accuracy and bind and fluorescently activate DFHBI in vitro and in Escherichia coli, yeast and mammalian cells. This de novo design of small-molecule binding activity, using backbones custom-built to bind the ligand, should enable the design of increasingly sophisticated ligand-binding proteins, sensors and catalysts that are not limited by the backbone geometries available in known protein structures.
Precise cell targeting is challenging because most mammalian cell types lack a single surface marker that distinguishes them from other cells. A solution would be to target cells using specific combinations of proteins present on their surfaces. In this study, we design colocalization-dependent protein switches (Co-LOCKR) that perform AND, OR, and NOT Boolean logic operations. These switches activate through a conformational change only when all conditions are met, generating rapid, transcription-independent responses at single-cell resolution within complex cell populations. We implement AND gates to redirect T cell specificity against tumor cells expressing two surface antigens while avoiding off-target recognition of single-antigen cells, and three-input switches that add NOT or OR logic to avoid or include cells expressing a third antigen. Thus, de novo designed proteins can perform computations on the surface of cells, integrating multiple distinct binding interactions into a single output.
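The Boolean targeting logic in this abstract can be written down as a small truth-table sketch. It models only the logic, not the protein switches themselves. The antigen labels `Ag1`, `Ag2`, `Ag3` and the gate function names are hypothetical, and the exact composition of the three-input gates (AND plus a NOT veto, AND plus an OR alternative) is an assumption about how the third input is combined.

```python
from typing import AbstractSet, Callable

# A gate maps the set of antigens present on a cell surface to "activate or not".
Gate = Callable[[AbstractSet[str]], bool]


def and_gate(a: str, b: str) -> Gate:
    """Activate only when both antigens are present on the same cell."""
    return lambda surface: a in surface and b in surface


def and_not_gate(a: str, b: str, veto: str) -> Gate:
    """Two-antigen AND that is switched off by a third (veto) antigen."""
    return lambda surface: a in surface and b in surface and veto not in surface


def and_or_gate(a: str, b: str, c: str) -> Gate:
    """Assumed form: antigen `a` together with either `b` or `c`."""
    return lambda surface: a in surface and (b in surface or c in surface)


if __name__ == "__main__":
    # Hypothetical cell types described by the antigens they display.
    cells = {
        "single-antigen": {"Ag1"},
        "double-antigen": {"Ag1", "Ag2"},
        "triple-antigen": {"Ag1", "Ag2", "Ag3"},
    }
    gate = and_not_gate("Ag1", "Ag2", "Ag3")
    for name, surface in cells.items():
        print(name, gate(surface))
```

With the AND-NOT gate above, only the hypothetical double-antigen cell evaluates to true, mirroring the idea of sparing cells that carry a third antigen.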
To create new enzymes and biosensors from scratch, precise control over the structure of small-molecule binding sites is of paramount importance, but systematically designing arbitrary protein pocket shapes and sizes remains an outstanding challenge. Using the NTF2-like structural superfamily as a model system, we developed an enumerative algorithm for creating a virtually unlimited number of de novo proteins supporting diverse pocket structures. The enumerative algorithm was tested and refined through feedback from two rounds of large-scale experimental testing, involving in total the assembly of synthetic genes encoding 7,896 designs and assessment of their stability on the yeast cell surface, detailed biophysical characterization of 64 designs, and crystal structures of 5 designs. The refined algorithm generates proteins that remain folded at high temperatures and exhibit more pocket diversity than naturally occurring NTF2-like proteins. We expect this approach to transform the design of small-molecule sensors and enzymes by enabling the creation of binding and active site geometries that are much better suited to specific design challenges than those accessible by repurposing the limited number of naturally occurring NTF2-like proteins.
A systematic and robust approach to generating complex protein nanomaterials would have broad utility. We develop a hierarchical approach to designing multi-component protein assemblies from two classes of modular building blocks: designed helical repeat proteins (DHRs) and helical bundle oligomers (HBs). We first rigidly fuse DHRs to HBs to generate a large library of oligomeric building blocks. We then generate assemblies with cyclic, dihedral, and point group symmetries from these building blocks using architecture-guided rigid helical fusion with new software named WORMS. X-ray crystallography and cryo-electron microscopy characterization show that the hierarchical design approach can accurately generate a wide range of assemblies, including a 43 nm diameter icosahedral nanocage. The computational methods and building block sets described here provide a very general route to de novo designed protein nanomaterials.
Specificity of interactions between two DNA strands, or between protein and DNA, is often achieved by varying bases or side chains coming off the DNA or protein backbone: for example, the bases participating in Watson-Crick pairing in the double helix, or the side chains contacting DNA in TALEN-DNA complexes. By contrast, specificity of protein-protein interactions usually involves backbone shape complementarity, which is less modular and hence harder to generalize. Coiled-coil heterodimers are an exception, but the restricted geometry of interactions across the heterodimer interface (primarily at the heptad a and d positions) limits the number of orthogonal pairs that can be created simply by varying side-chain interactions. Here we show that protein-protein interaction specificity can be achieved using extensive and modular side-chain hydrogen-bond networks. We used the Crick generating equations to produce millions of four-helix backbones with varying degrees of supercoiling around a central axis, identified those accommodating extensive hydrogen-bond networks, and used Rosetta to connect pairs of helices with short loops and to optimize the remainder of the sequence. Of 97 such designs expressed in Escherichia coli, 65 formed constitutive heterodimers, and the crystal structures of four designs were in close agreement with the computational models and confirmed the designed hydrogen-bond networks. In cells, six heterodimers were fully orthogonal, and in vitro (following mixing of 32 chains from 16 heterodimer designs, denaturation in 5 M guanidine hydrochloride and reannealing) almost all of the interactions observed by native mass spectrometry were between the designed cognate pairs. The ability to design orthogonal protein heterodimers should enable sophisticated protein-based control logic for synthetic biology, and illustrates that nature has not fully explored the possibilities for programmable biomolecular interaction modalities.
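As a rough illustration of what generating backbones from the Crick equations involves, the sketch below places Cα-like points on a supercoiled helix using one commonly quoted form of the Crick parameterization. It is not the authors' pipeline. The function name and all default values (superhelical radius, helix radius, angular frequencies, rise per residue) are illustrative assumptions, and sign and handedness conventions differ between published formulations.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float, float]


def crick_ca_trace(n_res: int,
                   r0: float = 5.0,        # superhelical radius (A), assumed
                   r1: float = 2.26,       # helix radius at C-alpha (A), assumed
                   w0_deg: float = -2.85,  # superhelical twist per residue, assumed
                   w1_deg: float = 102.85, # helical twist per residue, assumed
                   rise: float = 1.51,     # rise per residue along the helix (A), assumed
                   phi0_deg: float = 0.0,  # superhelical phase (positions the helix)
                   phi1_deg: float = 0.0   # helical phase (rotates the helix)
                   ) -> List[Point]:
    """Return n_res points on a supercoiled helix (simplified Crick form)."""
    w0, w1 = math.radians(w0_deg), math.radians(w1_deg)
    phi0, phi1 = math.radians(phi0_deg), math.radians(phi1_deg)
    # The pitch angle follows from requiring a fixed rise per residue.
    sin_alpha = r0 * w0 / rise
    if abs(sin_alpha) >= 1.0:
        raise ValueError("r0 * w0 / rise must lie strictly between -1 and 1")
    alpha = math.asin(sin_alpha)
    coords: List[Point] = []
    for t in range(n_res):
        a0 = w0 * t + phi0  # angle around the superhelical axis
        a1 = w1 * t + phi1  # angle around the local helix axis
        x = (r0 * math.cos(a0) + r1 * math.cos(a0) * math.cos(a1)
             - r1 * math.cos(alpha) * math.sin(a0) * math.sin(a1))
        y = (r0 * math.sin(a0) + r1 * math.sin(a0) * math.cos(a1)
             + r1 * math.cos(alpha) * math.cos(a0) * math.sin(a1))
        # rise * cos(alpha) equals w0 * r0 / tan(alpha) without dividing by zero.
        z = rise * math.cos(alpha) * t - r1 * math.sin(alpha) * math.sin(a1)
        coords.append((x, y, z))
    return coords


if __name__ == "__main__":
    # Four helices spaced 90 degrees apart around the central axis.
    bundle = [crick_ca_trace(28, phi0_deg=90.0 * k) for k in range(4)]
    print(len(bundle), len(bundle[0]), bundle[0][0])
```

Sampling backbones then amounts to sweeping parameters such as `r0`, `w0_deg` and the phases. The abstract states only that the degree of supercoiling was varied, so which parameters were actually scanned is not specified here.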
Online citizen science projects such as GalaxyZoo, Eyewire and Phylo have proven very successful for data collection, annotation and processing, but for the most part have harnessed human pattern-recognition skills rather than human creativity. An exception is the game EteRNA, in which game players learn to build new RNA structures by exploring the discrete two-dimensional space of Watson-Crick base pairing possibilities. Building new proteins, however, is a more challenging task to present in a game, as both the representation and evaluation of a protein structure are intrinsically three-dimensional. We posed the challenge of de novo protein design in the online protein-folding game Foldit. Players were presented with a fully extended peptide chain and challenged to craft a folded protein structure and an amino acid sequence encoding that structure. After many iterations of player design, analysis of the top-scoring solutions and subsequent game improvement, Foldit players can now, starting from an extended polypeptide chain, generate a diversity of protein structures and sequences that encode them in silico. One hundred forty-six Foldit player designs with sequences unrelated to naturally occurring proteins were encoded in synthetic genes; 56 were found to be expressed and soluble in Escherichia coli, and to adopt stable monomeric folded structures in solution. The diversity of these structures is unprecedented in de novo protein design, representing 20 different folds, including a new fold not observed in natural proteins. High-resolution structures were determined for four of the designs, and are nearly identical to the player models. This work makes explicit the considerable implicit knowledge that contributes to success in de novo protein design, and shows that citizen scientists can discover creative new solutions to outstanding scientific challenges such as the protein design problem.
We describe the computational design of proteins that bind the potent analgesic fentanyl. Our approach employs a fast docking algorithm to find shape-complementary ligand placement in protein scaffolds, followed by design of the surrounding residues to optimize binding affinity. Co-crystal structures of the highest affinity binder reveal a highly preorganized binding site, and an overall architecture and ligand placement in close agreement with the design model. We use the designs to generate plant sensors for fentanyl by coupling ligand binding to design stability. The method should be generally useful for detecting toxic hydrophobic compounds in the environment.
Computational design of a synthetic PD-1 agonist. Bryan, Cassie M; Rocklin, Gabriel J; Bick, Matthew J ...
Proceedings of the National Academy of Sciences - PNAS, 07/2021, Volume 118, Issue 29. Journal article, peer reviewed, open access.
Programmed cell death protein-1 (PD-1) expressed on activated T cells inhibits T cell function and proliferation to prevent an excessive immune response, and disease can result if this delicate balance is shifted in either direction. Tumor cells often take advantage of this pathway by overexpressing the PD-1 ligand PD-L1 to evade destruction by the immune system. Alternatively, if there is a decrease in function of the PD-1 pathway, unchecked activation of the immune system and autoimmunity can result. Using a combination of computation and experiment, we designed a hyperstable 40-residue miniprotein, PD-MP1, that specifically binds murine and human PD-1 at the PD-L1 interface with a Kd of ∼100 nM. The apo crystal structure shows that the binder folds as designed with a backbone RMSD of 1.3 Å to the design model. Trimerization of PD-MP1 resulted in a PD-1 agonist that strongly inhibits murine T cell activation. This small, hyperstable PD-1 binding protein was computationally designed with an all-beta interface, and the trimeric agonist could contribute to treatments for autoimmune and inflammatory diseases.