HIV-1 envelope glycoprotein (Env) glycosylation is important because individual glycans are components of multiple broadly neutralizing antibody epitopes, while shielding other sites that might ...otherwise be immunogenic. The glycosylation on Env is influenced by a variety of factors, including the genotype of the protein, the cell line used for its expression, and the details of the construct design. Here, we used a mass spectrometry (MS)-based approach to map the complete glycosylation profile at every site in multiple HIV-1 Env trimers, accomplishing two goals. (i) We determined which glycosylation sites contain conserved glycan profiles across many trimeric Envs. (ii) We identified the variables that impact Env's glycosylation profile at sites with divergent glycosylation. Over half of the gp120 glycosylation sites on 11 different trimeric Envs have a conserved glycan profile, indicating that a native consensus glycosylation profile does indeed exist among trimers. We showed that some soluble gp120s and gp140s exhibit highly divergent glycosylation profiles compared to trimeric Env. We also assessed the impact of several variables on Env glycosylation: truncating the full-length Env; producing Env, instead of the more virologically relevant T lymphocytes, in CHO cells; and purifying Env with different chromatographic platforms, including nickel-nitrilotriacetic acid (Ni-NTA), 2G12, and PGT151 affinity. This report provides the first consensus glycosylation profile of Env trimers, which should serve as a useful benchmark for HIV-1 vaccine developers. This report also defines the sites where glycosylation may be impacted when Env trimers are truncated or produced in CHO cells.
A protective HIV-1 vaccine will likely include a recombinant version of the viral envelope glycoprotein (Env). Env is highly glycosylated, and yet vaccine developers have lacked guidance on how to assess whether their immunogens have optimal glycosylation. The following important questions are still unanswered. (i) What is the "target" glycosylation profile, when the goal is to generate a natively glycosylated protein? (ii) What variables exert the greatest influence on Env glycosylation? We identified numerous sites on Env where the glycosylation profile does not deviate in 11 different Env trimers, and we investigated the impact on the divergent glycosylation profiles of changing the genotype of the Env sequence, the construct design, the purification method, and the producer cell type. The data presented here give vaccine developers a "glycosylation target" for their immunogens, and they show how protein production variables can impact Env glycosylation.
Glycoproteins are biologically significant large molecules that participate in numerous cellular activities. In order to obtain site-specific protein glycosylation information, intact glycopeptides, ...with the glycan attached to the peptide sequence, are characterized by tandem mass spectrometry (MS/MS) methods such as collision-induced dissociation (CID) and electron transfer dissociation (ETD). While several emerging automated tools are developed, no consensus is present in the field about the best way to determine the reliability of the tools and/or provide the false discovery rate (FDR). A common approach to calculate FDRs for glycopeptide analysis, adopted from the target-decoy strategy in proteomics, employs a decoy database that is created based on the target protein sequence database. Nonetheless, this approach is not optimal in measuring the confidence of N-linked glycopeptide matches, because the glycopeptide data set is considerably smaller compared to that of peptides, and the requirement of a consensus sequence for N-glycosylation further limits the number of possible decoy glycopeptides tested in a database search. To address the need to accurately determine FDRs for automated glycopeptide assignments, we developed GlycoPep Evaluator (GPE), a tool that helps to measure FDRs in identifying glycopeptides without using a decoy database. GPE generates decoy glycopeptides de novo for every target glycopeptide, in a 1:20 target-to-decoy ratio. The decoys, along with target glycopeptides, are scored against the ETD data, from which FDRs can be calculated accurately based on the number of decoy matches and the ratio of the number of targets to decoys, for small data sets. GPE is freely accessible for download and can work with any search engine that interprets ETD data of N-linked glycopeptides. The software is provided at https://desairegroup.ku.edu/research.
Lysyl oxidase-like 2 (LOXL2) catalyzes the oxidative deamination of peptidyl lysines and hydroxylysines to promote extracellular matrix remodeling. Aberrant activity of LOXL2 has been associated with ...organ fibrosis and tumor metastasis. The lysine tyrosylquinone (LTQ) cofactor is derived from Lys653 and Tyr689 in the amine oxidase domain via post-translational modification. Based on the similarity in hydrodynamic radius and radius of gyration, we recently proposed that the overall structures of the mature LOXL2 (containing LTQ) and the precursor LOXL2 (no LTQ) are very similar. In this study, we conducted a mass spectrometry-based disulfide mapping analysis of recombinant LOXL2 in three forms: a full-length LOXL2 (fl-LOXL2) containing a nearly stoichiometric amount of LTQ, Δ1-2SRCR-LOXL2 (SRCR1 and SRCR2 are truncated) in the precursor form, and Δ1-3SRCR-LOXL2 (SRCR1, SRCR2, SRCR3 are truncated) in a mixture of the precursor and the mature forms. We detected a set of five disulfide bonds that is conserved in both the precursor and the mature recombinant LOXL2s. In addition, we detected a set of four alternative disulfide bonds in low abundance that is not associated with the mature LOXL2. These results suggest that the major set of five disulfide bonds is retained post-LTQ formation.
Interactions between innate antiviral factors at mucosal surfaces and HIV-1 virions contribute to the natural inefficiency of HIV-1 transmission and are a platform to inform the development of ...vaccine and nonvaccine strategies to block mucosal HIV-1 transmission. Tenascin-C (TNC) is a large, hexameric extracellular matrix glycoprotein identified in breast milk and genital fluids that broadly neutralizes HIV-1 via interaction with the HIV-1 Envelope (Env) variable 3 (V3) loop. In this report, we characterize the specific determinants of the interaction between TNC and the HIV-1 Env. We observed that TNC binding and neutralization of HIV-1 is dependent on the TNC fibrinogen-like globe (fbg) and fibronectin-type III (fn) domains, oligomerization, and its newly-mapped glycan structure. Moreover, we observed that TNC-mediated neutralization is also dependent on Env V3 residues 321/322 and 326/327, which surround the IGDIR motif of the V3 loop, as well the N332 glycan, which is critical to the broadly neutralizing activity of glycan-dependent V3-specific antibodies such as PGT128. Our results demonstrate a striking parallel between innate and adaptive immune mechanisms of broad HIV neutralization and provide further insight into the host protein-virus interactions responsible for the natural inefficiency of mucosal HIV-1 transmission.
Glycosylation plays an essential role in regulating protein function by modulating biological, structural, and therapeutic properties. However, due to its inherent heterogeneity and diversity, the ...comprehensive analysis of protein glycosylation remains a challenge. As part of our continuing effort in the analysis of glycosylation profiles of recombinant HIV-1 envelope-based immunogens, we evaluated and compared the host-cell specific glycosylation pattern of recombinant HIV-1 surface glycoprotein, gp120, derived from clade C transmitted/founder virus 1086.C expressed in Chinese hamster ovary (CHO) and human embryonic kidney containing T antigen (293T) cell lines. We used an integrated glycopeptide-based mass mapping workflow that includes a partial deglycosylation step described in our previous study with the inclusion of a fragmentation technique, electron transfer dissociation (ETD), to complement collision-induced dissociation. The inclusion of ETD facilitated the analysis by providing additional validation for glycopeptide identification and expanding the identified glycopeptides to include coverage of O-linked glycosylation. The site-specific glycosylation analysis shows that the transmitted/founder 1086.C gp120 expressed in CHO and 293T displayed distinct similarities and differences. For N-linked glycosylation, two sites (N386 and N392) in the V4 region were populated with high mannose glycans in the CHO cell-derived 1086.C gp120, while these sites had a mixture of high mannose and processed glycans in the 293T cell-derived 1086.C gp120. Compositional analysis of O-linked glycans revealed that 293T cell-derived 1086.C gp120 consisted of core 1, 2, and 4 type O-linked glycans, while CHO cell-derived 1086.C exclusively consisted of core 1 type O-linked glycans. Overall, glycosylation site occupancy of the CHO and 293T cell-derived 1086.C gp120 showed a high degree of similarity except for one site at N88 in the C1 region. This site was partially occupied in 293T-gp120 but fully occupied in CHO-gp120. Site-specific glycopeptide analysis of transmitted/founder 1086.C gp120 expressed in CHO cells revealed the presence of phosphorylated glycans, while 293T cell-produced 1086.C gp120 glycans were not phosphorylated. While the influence of phosphorylated glycans on immunogenicity is unclear, distinguishing host-cell specific variations in glycosylation profiles provide insights into the similarity (or difference) in recombinant vaccine products. While these differences had minimal effect on envelope antigenicity, they may be important in considering immunogenicity and functional capacities of recombinant envelope proteins produced in different expression systems.
We have investigated factors that influence the production of native-like soluble, recombinant trimers based on the env genes of two isolates of human immunodeficiency virus type 1 (HIV-1), ...specifically 92UG037.8 (clade A) and CZA97.012 (clade C). When the recombinant trimers based on the env genes of isolates 92UG037.8 and CZA97.012 were made according to the SOSIP.664 design and purified by affinity chromatography using broadly neutralizing antibodies (bNAbs) against quaternary epitopes (PGT145 and PGT151, respectively), the resulting trimers are highly stable and they are fully native-like when visualized by negative-stain electron microscopy. They also have a native-like (i.e., abundant) oligomannose glycan composition and display multiple bNAb epitopes while occluding those for nonneutralizing antibodies. In contrast, uncleaved, histidine-tagged Foldon (Fd) domain-containing gp140 proteins (gp140UNC-Fd-His), based on the same env genes, very rarely form native-like trimers, a finding that is consistent with their antigenic and biophysical properties and glycan composition. The addition of a 20-residue flexible linker (FL20) between the gp120 and gp41 ectodomain (gp41ECTO) subunits to make the uncleaved 92UG037.8 gp140-FL20 construct is not sufficient to create a native-like trimer, but a small percentage of native-like trimers were produced when an I559P substitution in gp41ECTO was also present. The further addition of a disulfide bond (SOS) to link the gp120 and gp41 subunits in the uncleaved gp140-FL20-SOSIP protein increases native-like trimer formation to ∼20 to 30%. Analysis of the disulfide bond content shows that misfolded gp120 subunits are abundant in uncleaved CZA97.012 gp140UNC-Fd-His proteins but very rare in native-like trimer populations. The design and stabilization method and the purification strategy are, therefore, all important influences on the quality of trimeric Env proteins and hence their suitability as vaccine components.
Soluble, recombinant multimeric proteins based on the HIV-1 env gene are current candidate immunogens for vaccine trials in humans. These proteins are generally designed to mimic the native trimeric envelope glycoprotein (Env) that is the target of virus-neutralizing antibodies on the surfaces of virions. The underlying hypothesis is that an Env-mimetic protein may be able to induce antibodies that can neutralize the virus broadly and potently enough for a vaccine to be protective. Multiple different designs for Env-mimetic trimers have been put forth. Here, we used the CZA97.012 and 92UG037.8 env genes to compare some of these designs and determine which ones best mimic virus-associated Env trimers. We conclude that the most widely used versions of CZA97.012 and 92UG037.8 oligomeric Env proteins do not resemble the trimeric Env glycoprotein on HIV-1 viruses, which has implications for the design and interpretation of ongoing or proposed clinical trials of these proteins.
Studying protein O-glycosylation remains an analytical challenge. Different from N-linked glycans, the O-glycosylation site is not within a known consensus sequence. Additionally, O-glycans are ...heterogeneous with numerous potential modification sites. Electron transfer dissociation (ETD) is the method of choice in analyzing these glycopeptides since the glycan side chain remains intact in ETD, and the glycosylation site can be localized on the basis of the c and z fragment ions. Nonetheless, new software is necessary for interpreting O-glycopeptide ETD spectra in order to expedite the analysis workflow. To address the urgent need, we studied the fragmentation of O-glycopeptides in ETD and found useful rules that facilitate their identification. By implementing the rules into an algorithm to score potential assignments against ETD-MS/MS data, we applied the method to glycopeptides generated from various O-glycosylated proteins including mucin, erythropoietin, fetuin, and an HIV envelope protein, 1086.C gp120. The site-specific O-glycopeptide composition was correctly assigned in every case, proving the merits of our method in analyzing glycopeptide ETD data. The algorithm described herein can be easily incorporated into other automated glycomics tools.
A successful HIV-1 vaccine will likely need to elicit broadly neutralizing antibodies (bNAbs) that target the envelope glycoprotein (Env) spike on the virus. Native-like recombinant Env trimers of ...the SOSIP design now serve as a platform for achieving this challenging goal. However, SOSIP trimers usually do not bind efficiently to the inferred germline precursors of bNAbs (gl-bNAbs). We hypothesized that the inherent flexibilities of the V1 and V2 variable loops in the Env trimer contribute to the poor recognition of gl-bNAb epitopes at the trimer apex that extensively involve V2 residues. To reduce local V2 flexibility and improve the binding of V2-dependent bNAbs and gl-bNAbs, we designed BG505 SOSIP.664 trimer variants containing newly created disulfide bonds intended to stabilize the V2 loop in an optimally antigenic configuration. The first variant, I184C/E190C, contained a new disulfide bond within the V2 loop, whereas the second variant, E153C/R178C, had a new disulfide bond that cross-linked V2 and V1. The resulting engineered native-like trimer variants were both more reactive with and were neutralized by V2 bNAbs and gl-bNAbs, a finding that may be valuable in the design of germline targeting and boosting trimer immunogens to create an antigenic conformation optimal for HIV vaccine development.
MALDI-TOF MS has shown great utility for rapidly identifying microbial species. It can be used to successfully type bacteria and fungi from a variety of sources more rapidly and cost-effectively than ...traditional methods. One area where improvements are necessary is in the typing of highly similar samples, such as those samples from the same genus but different species or samples from within a single species but from different strains. One promising way to address this current limitation is by using advanced machine learning techniques. In this work, we adapt a newly developed machine learning tool, the Aristotle Classifier, to bacterial classification of MALDI-TOF MS data. This tool was originally developed for classifying glycomics and glycoproteomics data, so we modified it to be well-suited for assigning mass spectral data from bacterial proteins. The classifier exceeds existing benchmarks in classifying bacteria, and it shows particularly strong performance when the samples to be identified are highly similar. The combination of mass spectrometry data and tools like the Aristotle Classifier could ameliorate the ambiguities associated with challenging bacterial classification problems.