The solubility of proteins correlates with a variety of their properties, including function, production yield, pharmacokinetics, and formulation at high concentrations. High solubility is therefore ...a key requirement for the development of protein-based reagents for applications in life sciences, biotechnology, diagnostics, and therapeutics. Accurate solubility measurements, however, remain challenging and resource intensive, which limits their throughput and hence their applicability at the early stages of development pipelines, when long-lists of candidates are typically available in minute amounts. Here, we present an automated method based on the titration of a crowding agent (polyethylene glycol, PEG) to quantitatively assess relative solubility of proteins using about 200 µg of purified material. Our results demonstrate that this method is accurate and economical in material requirement and costs of reagents, which makes it suitable for high-throughput screening. This approach is freely-shared and based on a low cost, open-source liquid-handling robot. We anticipate that this method will facilitate the assessment of the developability of proteins and make it substantially more accessible.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Abstract
Motivation
Antibodies play essential roles in the immune system of vertebrates and are powerful tools in research and diagnostics. While hypervariable regions of antibodies, which are ...responsible for binding, can be readily identified from their amino acid sequence, it remains challenging to accurately pinpoint which amino acids will be in contact with the antigen (the paratope).
Results
In this work, we present a sequence-based probabilistic machine learning algorithm for paratope prediction, named Parapred. Parapred uses a deep-learning architecture to leverage features from both local residue neighbourhoods and across the entire sequence. The method significantly improves on the current state-of-the-art methodology, and only requires a stretch of amino acid sequence corresponding to a hypervariable region as an input, without any information about the antigen. We further show that our predictions can be used to improve both speed and accuracy of a rigid docking algorithm.
Availability and implementation
The Parapred method is freely available as a webserver at http://www-mvsoftware.ch.cam.ac.uk/and for download at https://github.com/eliberis/parapred.
Supplementary information
Supplementary information is available at Bioinformatics online.
Protein aggregation is a complex process resulting in the formation of heterogeneous mixtures of aggregate populations that are closely linked to neurodegenerative conditions, such as Alzheimer's ...disease. Here, we find that soluble aggregates formed at different stages of the aggregation process of amyloid beta (Aβ42) induce the disruption of lipid bilayers and an inflammatory response to different extents. Further, by using gradient ultracentrifugation assay, we show that the smaller aggregates are those most potent at inducing membrane permeability and most effectively inhibited by antibodies binding to the C-terminal region of Aβ42. By contrast, we find that the larger soluble aggregates are those most effective at causing an inflammatory response in microglia cells and more effectively inhibited by antibodies targeting the N-terminal region of Aβ42. These findings suggest that different toxic mechanisms driven by different soluble aggregated species of Aβ42 may contribute to the onset and progression of Alzheimer's disease.
Antibodies are powerful tools in life sciences research, as well as in diagnostic and therapeutic applications, because of their ability to bind given molecules with high affinity and specificity. ...Using current methods, however, it is laborious and sometimes difficult to generate antibodies to target specific epitopes within a protein, in particular if these epitopes are not effective antigens. Here we present a method to rationally design antibodies to enable them to bind virtually any chosen disordered epitope in a protein. The procedure consists in the sequence-based design of one or more complementary peptides targeting a selected disordered epitope and the subsequent grafting of such peptides on an antibody scaffold. We illustrate the method by designing six single-domain antibodies to bind different epitopes within three disease-related intrinsically disordered proteins and peptides (α-synuclein, Aβ42, and IAPP). Our results show that all these designed antibodies bind their targets with good affinity and specificity. As an example of an application, we show that one of these antibodies inhibits the aggregation of α-synuclein at substoichiometric concentrations and that binding occurs at the selected epitope. Taken together, these results indicate that the design strategy that we propose makes it possible to obtain antibodies targeting given epitopes in disordered proteins or protein regions.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK
Antibodies represent essential tools in research and diagnostics and are rapidly growing in importance as therapeutics. Commonly used methods to obtain novel antibodies typically yield several ...candidates capable of engaging a given target. The development steps that follow, however, are usually performed with only one or few candidates since they can be resource demanding, thereby increasing the risk of failure of the overall antibody discovery program. In particular, insufficient solubility, which may lead to aggregation under typical storage conditions, often hinders the ability of a candidate antibody to be developed and manufactured. Here we show that the selection of soluble lead antibodies from an initial library screening can be greatly facilitated by a fast computational prediction of solubility that requires only the amino acid sequence as input. We quantitatively validate this approach on a panel of nine distinct monoclonal antibodies targeting nerve growth factor (NGF), for which we compare the predicted and measured solubilities finding a very close match, and we further benchmark our predictions with published experimental data on aggregation hotspots and solubility of mutational variants of one of these antibodies.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Extensive amounts of information about protein sequences are becoming available, as demonstrated by the over 79 million entries in the UniProt database. Yet, it is still challenging to obtain ...proteome-wide experimental information on the structural properties associated with these sequences. Fast computational predictors of secondary structure and of intrinsic disorder of proteins have been developed in order to bridge this gap. These two types of predictions, however, have remained largely separated, often preventing a clear characterization of the structure and dynamics of proteins. Here, we introduce a computational method to predict secondary-structure populations from amino acid sequences, which simultaneously characterizes structure and disorder in a unified statistical mechanics framework. To develop this method, called s2D, we exploited recent advances made in the analysis of NMR chemical shifts that provide quantitative information about the probability distributions of secondary-structure elements in disordered states. The results that we discuss show that the s2D method predicts secondary-structure populations with an average error of about 14%. A validation on three datasets of mostly disordered, mostly structured and partly structured proteins, respectively, shows that its performance is comparable to or better than that of existing predictors of intrinsic disorder and of secondary structure. These results indicate that it is possible to perform rapid and quantitative sequence-based characterizations of the structure and dynamics of proteins through the predictions of the statistical distributions of their ordered and disordered regions.
Display omitted
•A sequence-based predictor is derived from NMR measurements of disordered proteins.•Order and disorder of proteins are predicted simultaneously.•Statistical weights are assigned to the ordered and disordered regions of proteins.•Structure and dynamics of proteins are characterized from their sequences.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UL, UM, UPCLJ, UPUK
To function effectively proteins must avoid aberrant aggregation, and hence they are expected to be expressed at concentrations safely below their solubility limits. By analyzing proteome-wide mass ...spectrometry data of Caenorhabditis elegans, however, we show that the levels of about three-quarters of the nearly 4,000 proteins analyzed in adult animals are close to their intrinsic solubility limits, indeed exceeding them by about 10% on average. We next asked how aging and functional self-assembly influence these solubility limits. We found that despite the fact that the total quantity of proteins within the cellular environment remains approximately constant during aging, protein aggregation sharply increases between days 6 and 12 of adulthood, after the worms have reproduced, as individual proteins lose their stoichiometric balances and the cellular machinery that maintains solubility undergoes functional decline. These findings reveal that these proteins are highly prone to undergoing concentration-dependent phase separation, which on aging is rationalized in a decrease of their effective solubilities, in particular for proteins associated with translation, growth, reproduction, and the chaperone system.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK
Abstract
The MobiDB (URL: mobidb.bio.unipd.it) database of protein disorder and mobility annotations has been significantly updated and upgraded since its last major renewal in 2014. Several curated ...datasets for intrinsic disorder and folding upon binding have been integrated from specialized databases. The indirect evidence has also been expanded to better capture information available in the PDB, such as high temperature residues in X-ray structures and overall conformational diversity. Novel nuclear magnetic resonance chemical shift data provides an additional experimental information layer on conformational dynamics. Predictions have been expanded to provide new types of annotation on backbone rigidity, secondary structure preference and disordered binding regions. MobiDB 3.0 contains information for the complete UniProt protein set and synchronization has been improved by covering all UniParc sequences. An advanced search function allows the creation of a wide array of custom-made datasets for download and further analysis. A large amount of information and cross-links to more specialized databases are intended to make MobiDB the central resource for the scientific community working on protein intrinsic disorder and mobility.
Antibody drugs should exhibit not only high-binding affinity for their target antigens but also favorable physicochemical drug-like properties. Such drug-like biophysical properties are essential for ...the successful development of antibody drug products. The traditional approaches used in antibody drug development require significant experimentation to produce, optimize, and characterize many candidates. Therefore, it is attractive to integrate new methods that can optimize the process of selecting antibodies with both desired target-binding and drug-like biophysical properties. Here, we summarize a selection of techniques that can complement the conventional toolbox used to de-risk antibody drug development. These techniques can be integrated at different stages of the antibody development process to reduce the frequency of physicochemical liabilities in antibody libraries during initial discovery and to co-optimize multiple antibody features during early-stage antibody engineering and affinity maturation. Moreover, we highlight biophysical and computational approaches that can be used to predict physical degradation pathways relevant for long-term storage and in-use stability to reduce the need for extensive experimentation.
Biologics, such as antibodies and enzymes, are crucial in research, biotechnology, diagnostics, and therapeutics. Often, biologics with suitable functionality are discovered, but their development is ...impeded by developability issues. Stability and solubility are key biophysical traits underpinning developability potential, as they determine aggregation, correlate with production yield and poly-specificity, and are essential to access parenteral and oral delivery. While advances for the optimisation of individual traits have been made, the co-optimization of multiple traits remains highly problematic and time-consuming, as mutations that improve one property often negatively impact others. In this work, we introduce a fully automated computational strategy for the simultaneous optimisation of conformational stability and solubility, which we experimentally validate on six antibodies, including two approved therapeutics. Our results on 42 designs demonstrate that the computational procedure is highly effective at improving developability potential, while not affecting antigen-binding. We make the method available as a webserver at www-cohsoftware.ch.cam.ac.uk.