The coronavirus disease 2019 (COVID-19) pandemic has led to accelerated efforts to develop therapeutics and vaccines. A key target of these efforts is the spike (S) protein, which is metastable and ...difficult to produce recombinantly. We characterized 100 structure-guided spike designs and identified 26 individual substitutions that increased protein yields and stability. Testing combinations of beneficial substitutions resulted in the identification of HexaPro, a variant with six beneficial proline substitutions exhibiting higher expression than its parental construct (by a factor of 10) as well as the ability to withstand heat stress, storage at room temperature, and three freeze-thaw cycles. A cryo-electron microscopy structure of HexaPro at a resolution of 3.2 angstroms confirmed that it retains the prefusion spike conformation. High-yield production of a stabilized prefusion spike protein will accelerate the development of vaccines and serological diagnostics for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).
The molecular composition and binding epitopes of the immunoglobulin G (IgG) antibodies that circulate in blood plasma following SARS-CoV-2 infection are unknown. Proteomic deconvolution of the IgG ...repertoire to the spike glycoprotein in convalescent subjects revealed that the response is directed predominantly (>80%) against epitopes residing outside the receptor-binding domain (RBD). In one subject, just four IgG lineages accounted for 93.5% of the response, including an N-terminal domain (NTD)-directed antibody that was protective against lethal viral challenge. Genetic, structural, and functional characterization of a multi-donor class of "public" antibodies revealed an NTD epitope that is recurrently mutated among emerging SARS-CoV-2 variants of concern. These data show that "public" NTD-directed and other non-RBD plasma antibodies are prevalent and have implications for SARS-CoV-2 protection and antibody escape.
Rabbits have been used extensively as a model system for the elucidation of the mechanism of immunoglobulin diversification and for the production of antibodies. We employed Next Generation ...Sequencing to analyze Ig germline V and J gene usage, CDR3 length and amino acid composition, and gene conversion frequencies within the functional (transcribed) IgG repertoire of the New Zealand white rabbit (Oryctolagus cuniculus). Several previously unannotated rabbit heavy chain variable (VH) and light chain variable (VL) germline elements were deduced bioinformatically using multidimensional scaling and k-means clustering methods. We estimated the gene conversion frequency in the rabbit at 23% of IgG sequences with a mean gene conversion tract length of 59±36 bp. Sequencing and gene conversion analysis of the chicken, human, and mouse repertoires revealed that gene conversion occurs much more extensively in the chicken (frequency 70%, tract length 79±57 bp), was observed to a small, yet statistically significant extent in humans, but was virtually absent in mice.
Each B-cell receptor consists of a pair of heavy and light chains. High-throughput sequencing can identify large numbers of heavy- and light-chain variable regions (V(H) and V(L)) in a given B-cell ...repertoire, but information about endogenous pairing of heavy and light chains is lost after bulk lysis of B-cell populations. Here we describe a way to retain this pairing information. In our approach, single B cells (>5 × 10(4) capacity per experiment) are deposited in a high-density microwell plate (125 pl/well) and lysed in situ. mRNA is then captured on magnetic beads, reverse transcribed and amplified by emulsion V(H):V(L) linkage PCR. The linked transcripts are analyzed by Illumina high-throughput sequencing. We validated the fidelity of V(H):V(L) pairs identified by this approach and used the method to sequence the repertoire of three human cell subsets-peripheral blood IgG(+) B cells, peripheral plasmablasts isolated after tetanus toxoid immunization and memory B cells isolated after seasonal influenza vaccination.
Most vaccines confer protection via the elicitation of serum antibodies, yet more than 100 y after the discovery of antibodies, the molecular composition of the human serum antibody repertoire to an ...antigen remains unknown. Using high-resolution liquid chromatography tandem MS proteomic analyses of serum antibodies coupled with next-generation sequencing of the V gene repertoire in peripheral B cells, we have delineated the human serum IgG and B-cell receptor repertoires following tetanus toxoid (TT) booster vaccination. We show that the TT ⁺ serum IgG repertoire comprises ∼100 antibody clonotypes, with three clonotypes accounting for >40% of the response. All 13 recombinant IgGs examined bound to vaccine antigen with K d ∼ 10 ⁻⁸–10 ⁻¹⁰ M. Five of 13 IgGs recognized the same linear epitope on TT, occluding the binding site used by the toxin for cell entry, suggesting a possible explanation for the mechanism of protection conferred by the vaccine. Importantly, only a small fraction (<5%) of peripheral blood plasmablast clonotypes (CD3 ⁻CD14 ⁻CD19 ⁺CD27 ⁺⁺CD38 ⁺⁺CD20 ⁻TT ⁺) at the peak of the response (day 7), and an even smaller fraction of memory B cells, were found to encode antibodies that could be detected in the serological memory response 9 mo postvaccination. This suggests that only a small fraction of responding peripheral B cells give rise to the bone marrow long-lived plasma cells responsible for the production of biologically relevant amounts of vaccine-specific antibodies (near or above the K d). Collectively, our results reveal the nature and dynamics of the serological response to vaccination with direct implications for vaccine design and evaluation.
The low stability of natural proteins often limits their use in therapeutic, industrial, and research applications. The scale and throughput of methods such as circular dichroism, fluorescence ...spectroscopy, and calorimetry severely limit the number of variants that can be examined. Here we demonstrate a high-throughput thermal scanning (HTTS) method for determining the approximate stabilities of protein variants at high throughput and low cost. The method is based on binding to a hydrophobic dye akin to ANS, which fluoresces upon binding to molten globules and thermal denaturation intermediates. No inherent properties of the protein, such as enzymatic activity or presence of an intrinsic fluorophore, are required. Very small sample sizes are analyzed using a real-time PCR machine, enabling the use of high-throughput purification. We show that the apparent T M values obtained from HTTS are approximately linearly related to those from CD thermal denaturation for a series of four-helix bundle hydrophobic core variants. We demonstrate similar results for a small set of TIM barrel variants. This inexpensive, general, and scaleable approach enables the search for conservative, stable mutants of biotechnologically important proteins and provides a method for statistical correlation of sequence−stability relationships.
•Next-generation sequencing of peripheral B cells (BCR-seq) in health and disease states.•Serum antibody proteomics (Ig-seq) complements BCR-seq in analyzing endpoint functionality.•Advances in the ...pairing and NGS of antibody heavy chain and light chain (VH:VL) sequences.•Combined, BCR-seq and Ig-seq provide a clear advance in antibody discovery.
Recent developments of high-throughput technologies are enabling the molecular-level analysis and bioinformatic mining of antibody-mediated (humoral) immunity in humans at an unprecedented level. These approaches explore either the sequence space of B-cell receptor repertoires using next-generation deep sequencing (BCR-seq), or the amino acid identities of antibody in blood using protein mass spectrometry (Ig-seq), or both. Generalizable principles about the molecular composition of the protective humoral immune response are being defined, and as such, the field could supersede traditional methods for the development of diagnostics, vaccines, and antibody therapeutics. Three key challenges remain and have driven recent advances: (1) incorporation of innovative techniques for paired BCR-seq to ascertain the complete antibody variable-domain VH:VL clonotype, (2) integration of proteomic Ig-seq with BCR-seq to reveal how the serum antibody repertoire compares with the antibody repertoire encoded by circulating B cells, and (3) a demand to link antibody sequence data to functional meaning (binding and protection).
Monoclonal antibodies have revolutionized the treatment of human diseases, which has made them the fastest-growing class of therapeutics, with global sales expected to reach $346.6 billion USD by ...2028. Advances in antibody engineering and development have led to the creation of increasingly sophisticated antibody-based therapeutics (e.g. bispecific antibodies and chimeric antigen receptor T cells). However, approaches for antibody discovery have remained comparatively grounded in conventional yet reliable in vitro assays. Breakthrough developments in high-throughput single B-cell sequencing and immunoglobulin proteomic serology, however, have enabled the identification of high-affinity antibodies directly from endogenous B cells or circulating immunoglobulin produced in vivo. Moreover, advances in artificial intelligence offer vast potential for antibody discovery and design with large-scale repertoire datasets positioned as the optimal source of training data for such applications. We highlight advances and recent trends in how these technologies are being applied to antibody repertoire analysis.
•Functional screens integrate with single cell BCR-sequencing and transcriptomics.•Bottom-up and top-down MS enable comprehensive profiling of circulating antibodies.•Antibody repertoire studies drive artificial intelligence aimed at therapeutic design.
We have developed and validated a methodology for determining the antibody composition of the polyclonal serum response after immunization. Pepsin-digested serum IgGs were subjected to standard ...antigen-affinity chromatography, and resulting elution, wash, and flow-through fractions were analyzed by bottom-up, liquid chromatography–high-resolution tandem mass spectrometry. Identification of individual monoclonal antibodies required the generation of a database of IgG variable gene (V-gene) sequences constructed by NextGen sequencing of mature B cells. Antibody V-gene sequences are characterized by short complementarity determining regions (CDRs) of high diversity adjacent to framework regions shared across thousands of IgGs, greatly complicating the identification of antigen-specific IgGs from proteomically observed peptides. By mapping peptides marking unique V H CDRH3 sequences, we identified a set of V-genes heavily enriched in the affinity chromatography elution, constituting the serum polyclonal response. After booster immunization in a rabbit, we find that the antigen-specific serum immune response is oligoclonal, comprising antibodies encoding 34 different CDRH3s that group into 30 distinct antibody V H clonotypes. Of these 34 CDRH3s, 12 account for ∼60% of the antigen-specific CDRH3 peptide mass spectral counts. For comparison, antibodies with 18 different CDRH3s (12 clonotypes) were represented in the antigen-specific IgG fraction from an unimmunized rabbit that fortuitously displayed a moderate titer for BSA. Proteomically identified antibodies were synthesized and shown to display subnanomolar affinities. The ability to deconvolute the polyclonal serum response is likely to be of key importance for analyzing antibody responses after vaccination and for more completely understanding adaptive immune responses in health and disease.