Abstract
Motivation
Protein solubility plays a vital role in pharmaceutical research and production yield. For a given protein, the extent of its solubility can represent the quality of its function, ...and is ultimately defined by its sequence. Thus, it is imperative to develop novel, highly accurate in silico sequence-based protein solubility predictors. In this work we propose, DeepSol, a novel Deep Learning-based protein solubility predictor. The backbone of our framework is a convolutional neural network that exploits k-mer structure and additional sequence and structural features extracted from the protein sequence.
Results
DeepSol outperformed all known sequence-based state-of-the-art solubility prediction methods and attained an accuracy of 0.77 and Matthew's correlation coefficient of 0.55. The superior prediction accuracy of DeepSol allows to screen for sequences with enhanced production capacity and can more reliably predict solubility of novel proteins.
Availability and implementation
DeepSol's best performing models and results are publicly deposited at https://doi.org/10.5281/zenodo.1162886 (Khurana and Mall, 2018).
Supplementary information
Supplementary data are available at Bioinformatics online.
The SARS-CoV-2 spike employs mobile receptor-binding domains (RBDs) to engage the human ACE2 receptor and to facilitate virus entry, which can occur through low-pH-endosomal pathways. To understand ...how ACE2 binding and low pH affect spike conformation, we determined cryo-electron microscopy structures—at serological and endosomal pH—delineating spike recognition of up to three ACE2 molecules. RBDs freely adopted “up” conformations required for ACE2 interaction, primarily through RBD movement combined with smaller alterations in neighboring domains. In the absence of ACE2, single-RBD-up conformations dominated at pH 5.5, resolving into a solitary all-down conformation at lower pH. Notably, a pH-dependent refolding region (residues 824–858) at the spike-interdomain interface displayed dramatic structural rearrangements and mediated RBD positioning through coordinated movements of the entire trimer apex. These structures provide a foundation for understanding prefusion-spike mechanics governing endosomal entry; we suggest that the low pH all-down conformation potentially facilitates immune evasion from RBD-up binding antibody.
Display omitted
•Determine cryo-EM structures of SARS-CoV-2 spike along its endosomal entry pathway•Reveal structural basis by which a pH-dependent switch mediates RBD positioning•Show spike to exclusively adopt an all-RBD-down conformation at low pH•Suggest low-pH all-RBD-down conformation to provide a basis for immune evasion
Zhou et al. determine 12 structures of the SARS-CoV-2 spike, bound by ACE2 receptor and ligand free, that reveal a pH-dependent switch to mediate positioning of spike receptor-binding domains (RBDs). At low pH, the spike adopts an all-RBD-down conformation, which provides a potential means of immune evasion from RBD-up-recognizing antibody.
Numerous antibodies that neutralize SARS-CoV-2 have been identified, and these generally target either the receptor-binding domain (RBD) or the N-terminal domain (NTD) of the viral spike. While ...RBD-directed antibodies have been extensively studied, far less is known about NTD-directed antibodies. Here, we report cryo-EM and crystal structures for seven potent NTD-directed neutralizing antibodies in complex with spike or isolated NTD. These structures defined several antibody classes, with at least one observed in multiple convalescent donors. The structures revealed that all seven antibodies target a common surface, bordered by glycans N17, N74, N122, and N149. This site—formed primarily by a mobile β-hairpin and several flexible loops—was highly electropositive, located at the periphery of the spike, and the largest glycan-free surface of NTD facing away from the viral membrane. Thus, in contrast to neutralizing RBD-directed antibodies that recognize multiple non-overlapping epitopes, potent NTD-directed neutralizing antibodies appear to target a single supersite.
Display omitted
•Structures of seven NTD-directed neutralizing antibody complexes with spike or NTD•Structures define distinct recognition classes, one observed in multiple donors•Supersite is glycan free, electropositive, with mobile β-hairpin and flexible loops•Most potent NTD-directed neutralizing antibodies may target this supersite
Cerutti et al. report structural analysis of seven potent neutralizing antibodies targeting the N-terminal domain of SARS-CoV-2 spike. All antibodies recognize a common glycan-free, electropositive surface comprised of a mobile β-hairpin and flexible loops. While RBD-directed antibodies recognize non-overlapping epitopes, these findings indicate that NTD-directed antibodies predominantly target a single supersite.
Since the start of the COVID-19 pandemic, SARS-CoV-2 has caused millions of deaths worldwide. Although a number of vaccines have been deployed, the continual evolution of the receptor-binding domain ...(RBD) of the virus has challenged their efficacy. In particular, the emerging variants B.1.1.7, B.1.351 and P.1 (first detected in the UK, South Africa and Brazil, respectively) have compromised the efficacy of sera from patients who have recovered from COVID-19 and immunotherapies that have received emergency use authorization
. One potential alternative to avert viral escape is the use of camelid VHHs (variable heavy chain domains of heavy chain antibody (also known as nanobodies)), which can recognize epitopes that are often inaccessible to conventional antibodies
. Here, we isolate anti-RBD nanobodies from llamas and from mice that we engineered to produce VHHs cloned from alpacas, dromedaries and Bactrian camels. We identified two groups of highly neutralizing nanobodies. Group 1 circumvents antigenic drift by recognizing an RBD region that is highly conserved in coronaviruses but rarely targeted by human antibodies. Group 2 is almost exclusively focused to the RBD-ACE2 interface and does not neutralize SARS-CoV-2 variants that carry E484K or N501Y substitutions. However, nanobodies in group 2 retain full neutralization activity against these variants when expressed as homotrimers, and-to our knowledge-rival the most potent antibodies against SARS-CoV-2 that have been produced to date. These findings suggest that multivalent nanobodies overcome SARS-CoV-2 mutations through two separate mechanisms: enhanced avidity for the ACE2-binding domain and recognition of conserved epitopes that are largely inaccessible to human antibodies. Therefore, although new SARS-CoV-2 mutants will continue to emerge, nanobodies represent promising tools to prevent COVID-19 mortality when vaccines are compromised.
Respiratory syncytial virus (RSV) is the leading cause of hospitalisation for children under 5 years of age. We sought to engineer a viral antigen that provides greater protection than currently ...available vaccines and focused on antigenic site φ, a metastable site specific to the prefusion state of the RSV fusion (F) glycoprotein, as this site is targeted by extremely potent RSV-neutralizing antibodies. Structure-based design yielded stabilized versions of RSV F that maintained antigenic site φ when exposed to extremes of pH, osmolality, and temperature. Six RSV F crystal structures provided atomic-level data on how introduced cysteine residues and filled hydrophobic cavities improved stability. Immunization with site φ—stabilized variants of RSV F in mice and macaques elicited levels of RSV-specific neutralizing activity many times the protective threshold.
Respiratory syncytial virus (RSV) is estimated to claim more lives among infants <1 year old than any other single pathogen, except malaria, and poses a substantial global health burden. Viral entry ...is mediated by a type I fusion glycoprotein (F) that transitions from a metastable prefusion (pre-F) to a stable postfusion (post-F) trimer. A highly neutralization-sensitive epitope, antigenic site Ø, is found only on pre-F. We determined what fraction of neutralizing (NT) activity in human sera is dependent on antibodies specific for antigenic site Ø or other antigenic sites on F in healthy subjects from ages 7 to 93 years. Adsorption of individual sera with stabilized pre-F protein removed >90% of NT activity and depleted binding antibodies to both F conformations. In contrast, adsorption with post-F removed ~30% of NT activity, and binding antibodies to pre-F were retained. These findings were consistent across all age groups. Protein competition neutralization assays with pre-F mutants in which sites Ø or II were altered to knock out binding of antibodies to the corresponding sites showed that these sites accounted for ~35 and <10% of NT activity, respectively. Binding competition assays with monoclonal antibodies (mAbs) indicated that the amount of site Ø-specific antibodies correlated with NT activity, whereas the magnitude of binding competed by site II mAbs did not correlate with neutralization. Our results indicate that RSV NT activity in human sera is primarily derived from pre-F-specific antibodies, and therefore, inducing or boosting NT activity by vaccination will be facilitated by using pre-F antigens that preserve site Ø.
Abstract
Motivation
Protein solubility can be a decisive factor in both research and production efficiency, and in silico sequence-based predictors that can accurately estimate solubility outcomes ...are highly sought.
Results
In this study, we present a novel approach termed PRotein SolubIlity Predictor (PaRSnIP), which uses a gradient boosting machine algorithm as well as an approximation of sequence and structural features of the protein of interest. Based on an independent test set, PaRSnIP outperformed other state-of-the-art sequence-based methods by more than 9% in accuracy and 0.17 in Matthew's correlation coefficient, with an overall accuracy of 74% and Matthew's correlation coefficient of 0.48. Additionally, PaRSnIP provides importance scores for all features used in training. We observed higher fractions of exposed residues to associate positively with protein solubility and tripeptide stretches with multiple histidines to associate negatively with solubility. The improved prediction accuracy of PaRSnIP should enable it to predict protein solubility with greater reliability and to screen for sequence variants with enhanced manufacturability.
Availability and implementation
PaRSnIP software is available for download under GitHub (https://github.com/RedaRawi/PaRSnIP).
Supplementary information
Supplementary data are available at Bioinformatics online.
The emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants in the Omicron lineage has resulted in diminished Coronavirus Disease 2019 (COVID-19) vaccine efficacy and ...persistent transmission. In this study, we evaluated the immunogenicity and protective efficacy of two, recently authorized, bivalent COVID-19 vaccines that contain two mRNAs encoding Wuhan-1 and either BA.1 (mRNA-1273.214) or BA.4/5 (mRNA-1273.222) spike proteins. As a primary two-dose immunization series in mice, both bivalent vaccines induced greater neutralizing antibody responses against Omicron variants than the parental, monovalent mRNA-1273 vaccine. When administered to mice as a booster at 7 months after the primary vaccination series with mRNA-1273, the bivalent vaccines induced broadly neutralizing antibody responses. Whereas most anti-Omicron receptor binding domain antibodies in serum induced by mRNA-1273, mRNA-1273.214 and mRNA-1273.222 boosters cross-reacted with the antecedent Wuhan-1 spike antigen, the mRNA-1273.214 and mRNA-1273.222 bivalent vaccine boosters also induced unique BA.1-specific and BA.4/5-specific responses, respectively. Although boosting with parental or bivalent mRNA vaccines substantially improved protection against BA.5 compared to mice receiving two vaccine doses, the levels of infection, inflammation and pathology in the lung were lowest in animals administered the bivalent mRNA vaccines. Thus, boosting with bivalent Omicron-based mRNA-1273.214 or mRNA-1273.222 vaccines enhances immunogenicity and confers protection in mice against a currently circulating SARS-CoV-2 strain.
Broadly neutralizing antibodies (bNAbs) represent a promising alternative to antiretroviral drugs for HIV-1 prevention and treatment. Selected antibodies to the CD4-binding site bolster envelope ...trimer binding via quaternary contacts. Here, we rationally engraft a new paratope, i.e., the extended heavy-chain framework region 3 (FR3) loop of VRC03, which mediates quaternary interaction, onto several potent bNAbs, enabling them to reach an adjacent gp120 protomer. The interactive quaternary surface is delineated by solving the crystal structure of two FR3 loop-chimeric antibodies. Chimerization enhances the neutralizing activity of several potent bNAbs against a majority of global HIV-1 strains. Compared to unmodified antibodies, chimeric antibodies display lower autoreactivity and prolonged in vivo half-life in huFcRn mice and rhesus macaques. Thus, paratope engraftment may be used to expand the epitope repertory of natural antibodies, improving their functionality for disease prevention and treatment.
Motivation: The binding sites of proteins generally contain smaller regions that provide major contributions to the binding free energy and hence are the prime targets in drug design. Screening ...libraries of fragment-sized compounds by NMR or X-ray crystallography demonstrates that such ‘hot spot’ regions bind a large variety of small organic molecules, and that a relatively high ‘hit rate’ is predictive of target sites that are likely to bind drug-like ligands with high affinity. Our goal is to determine the ‘hot spots’ computationally rather than experimentally. Results: We have developed the FTMAP algorithm that performs global search of the entire protein surface for regions that bind a number of small organic probe molecules. The search is based on the extremely efficient fast Fourier transform (FFT) correlation approach which can sample billions of probe positions on dense translational and rotational grids, but can use only sums of correlation functions for scoring and hence is generally restricted to very simple energy expressions. The novelty of FTMAP is that we were able to incorporate and represent on grids a detailed energy expression, resulting in a very accurate identification of low-energy probe clusters. Overlapping clusters of different probes are defined as consensus sites (CSs). We show that the largest CS is generally located at the most important subsite of the protein binding site, and the nearby smaller CSs identify other important subsites. Mapping results are presented for elastase whose structure has been solved in aqueous solutions of eight organic solvents, and we show that FTMAP provides very similar information. The second application is to renin, a long-standing pharmaceutical target for the treatment of hypertension, and we show that the major CSs trace out the shape of the first approved renin inhibitor, aliskiren. Availability: FTMAP is available as a server at http://ftmap.bu.edu/. Contact: vajda@bu.edu Supplementary information: Supplementary Material is available at Bioinformatics online.