Abs are immune system proteins that recognize noxious molecules for elimination. Their sequence diversity and binding versatility have made Abs the primary class of biopharmaceuticals. Recently, it ...has become possible to query their immense natural diversity using next-generation sequencing of Ig gene repertoires (Ig-seq). However, Ig-seq outputs are currently fragmented across repositories and tend to be presented as raw nucleotide reads, which means nontrivial effort is required to reuse the data for analysis. To address this issue, we have collected Ig-seq outputs from 55 studies, covering more than half a billion Ab sequences across diverse immune states, organisms (primarily human and mouse), and individuals. We have sorted, cleaned, annotated, translated, and numbered these sequences and make the data available via our Observed Antibody Space (OAS) resource at http://antibodymap.org The data within OAS will be regularly updated with newly released Ig-seq datasets. We believe OAS will facilitate data mining of immune repertoires for improved understanding of the immune system and development of better biotherapeutics.
SAbPred is a server that makes predictions of the properties of antibodies focusing on their structures. Antibody informatics tools can help improve our understanding of immune responses to disease ...and aid in the design and engineering of therapeutic molecules. SAbPred is a single platform containing multiple applications which can: number and align sequences; automatically generate antibody variable fragment homology models; annotate such models with estimated accuracy alongside sequence and structural properties including potential developability issues; predict paratope residues; and predict epitope patches on protein antigens. The server is available at http://opig.stats.ox.ac.uk/webapps/sabpred.
The adaptive immune system uses two main types of antigen receptors: T-cell receptors (TCRs) and antibodies. While both proteins share a globally similar β-sandwich architecture, TCRs are specialized ...to recognize peptide antigens in the binding groove of the major histocompatibility complex, while antibodies can bind an almost infinite range of molecules. For both proteins, the main determinants of target recognition are the complementarity-determining region (CDR) loops. Five of the six CDRs adopt a limited number of backbone conformations, known as the "canonical classes"; the remaining CDR (β3in TCRs and H3 in antibodies) is more structurally diverse. In this paper, we first update the definition of canonical forms in TCRs, build an auto-updating sequence-based prediction tool (available at http://opig.stats.ox.ac.uk/resources) and demonstrate its application on large scale sequencing studies. Given the global similarity of TCRs and antibodies, we then examine the structural similarity of their CDRs. We find that TCR and antibody CDRs tend to have different length distributions, and where they have similar lengths, they mostly occupy distinct structural spaces. In the rare cases where we found structural similarity, the underlying sequence patterns for the TCR and antibody version are different. Finally, where multiple structures have been solved for the same CDR sequence, the structural variability in TCR loops is higher than that in antibodies, suggesting TCR CDRs are more flexible. These structural differences between TCR and antibody CDRs may be important to their different biological functions.
Abstract
Motivation
Canonical forms of the antibody complementarity-determining regions (CDRs) were first described in 1987 and have been redefined on multiple occasions since. The canonical forms ...are often used to approximate the antibody binding site shape as they can be predicted from sequence. A rapid predictor would facilitate the annotation of CDR structures in the large amounts of repertoire data now becoming available from next generation sequencing experiments.
Results
SCALOP annotates CDR canonical forms for antibody sequences, supported by an auto-updating database to capture the latest cluster information. Its accuracy is comparable to that of a standard structural predictor but it is 800 times faster. The auto-updating nature of SCALOP ensures that it always attains the best possible coverage.
Availability and implementation
SCALOP is available as a web application and for download under a GPLv3 license at opig.stats.ox.ac.uk/webapps/scalop.
Supplementary information
Supplementary data are available at Bioinformatics online.
Every human possesses millions of distinct antibodies. It is now possible to analyze this diversity
next-generation sequencing of immunoglobulin genes (Ig-seq). This technique produces large volume ...sequence snapshots of B-cell receptors that are indicative of the antibody repertoire. In this paper, we enrich these large-scale sequence datasets with structural information. Enriching a sequence with its structural data allows better approximation of many vital features, such as its binding site and specificity. Here, we describe the structural annotation of antibodies pipeline that maps the outputs of large Ig-seq experiments to known antibody structures. We demonstrate the viability of our protocol on five separate Ig-seq datasets covering ca. 35 m unique amino acid sequences from ca. 600 individuals. Despite the great theoretical diversity of antibodies, we find that the majority of sequences coming from such studies can be reliably mapped to an existing structure.
AlphaFold2 has hallmarked a generational improvement in protein structure prediction. In particular, advances in antibody structure prediction have provided a highly translatable impact on drug ...discovery. Though AlphaFold2 laid the groundwork for all proteins, antibody-specific applications require adjustments tailored to these molecules, which has resulted in a handful of deep learning antibody structure predictors. Herein, we review the recent advances in antibody structure prediction and relate them to their role in advancing biologics discovery.
Samuel Demharter, Nicholas Pearce Affiliation: Doctoral Training Centre for Systems Approaches to Biomedical Science, University of Oxford, Oxford, United Kingdom ORCID ...http://orcid.org/0000-0002-6693-8603 Kylie Beattie Affiliation: Doctoral Training Centre for Systems Approaches to Biomedical Science, University of Oxford, Oxford, United Kingdom Isabel Frost Affiliation: Doctoral Training Centre for Systems Biology, University of Oxford, Oxford, United Kingdom Jinwoo Leem Affiliation: Doctoral Training Centre for Systems Approaches to Biomedical Science, University of Oxford, Oxford, United Kingdom ORCID http://orcid.org/0000-0002-7817-3644 Alistair Martin Affiliation: Doctoral Training Centre for Life Sciences Interface, University of Oxford, Oxford, United Kingdom Robert Oppenheimer Affiliation: Doctoral Training Centre for Synthetic Biology, University of Oxford, Oxford, United Kingdom Cristian Regep Affiliation: Doctoral Training Centre for Life Sciences Interface, University of Oxford, Oxford, United Kingdom Tammo Rukat Affiliation: Doctoral Training Centre for Systems Approaches to Biomedical Science, University of Oxford, Oxford, United Kingdom ORCID http://orcid.org/0000-0002-6186-0077 Alexander Skates Affiliation: Doctoral Training Centre for Systems Approaches to Biomedical Science, University of Oxford, Oxford, United Kingdom Nicola Trendel Affiliation: Doctoral Training Centre for Systems Approaches to Biomedical Science, University of Oxford, Oxford, United Kingdom David J. Gavaghan Affiliation: Doctoral Training Centre for Life Sciences Interface, University of Oxford, Oxford, United Kingdom Charlotte M. Deane Affiliation: Doctoral Training Centre for Systems Approaches to Biomedical Science, University of Oxford, Oxford, United Kingdom ORCID http://orcid.org/0000-0003-1388-2252 Bernhard Knapp * E-mail: samuel.demharter@dtc.ox.ac.uk (SD); bknapp@uic.es (BK) Current address: Systems Biology (EP/G03706X/1), Systems Approaches to Biomedical Sciences (EP/L016044/1, EP/G037280/1), Life Sciences Interface (EP/F500394/1, EP/I017909/1), Biomedical Imaging (EP/L016052/1), Synthetic Biology (EP/L016494/1), Synthesis for Biology...
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Structural antibody database (SAbDab; http://opig.stats.ox.ac.uk/webapps/sabdab) is an online resource containing all the publicly available antibody structures annotated and presented in a ...consistent fashion. The data are annotated with several properties including experimental information, gene details, correct heavy and light chain pairings, antigen details and, where available, antibody-antigen binding affinity. The user can select structures, according to these attributes as well as structural properties such as complementarity determining region loop conformation and variable domain orientation. Individual structures, datasets and the complete database can be downloaded.
Computational modeling of antibody structures plays a critical role in therapeutic antibody design. Several antibody modeling pipelines exist, but no freely available methods currently model ...nanobodies, provide estimates of expected model accuracy, or highlight potential issues with the antibody's experimental development. Here, we describe our automated antibody modeling pipeline, ABodyBuilder, designed to overcome these issues. The algorithm itself follows the standard 4 steps of template selection, orientation prediction, complementarity-determining region (CDR) loop modeling, and side chain prediction. ABodyBuilder then annotates the 'confidence' of the model as a probability that a component of the antibody (e.g., CDRL3 loop) will be modeled within a root-mean square deviation threshold. It also flags structural motifs on the model that are known to cause issues during in vitro development. ABodyBuilder was tested on 4 separate datasets, including the 11 antibodies from the Antibody Modeling Assessment-II competition. ABodyBuilder builds models that are of similar quality to other methodologies, with sub-Angstrom predictions for the 'canonical' CDR loops. Its ability to model nanobodies, and rapidly generate models (∼30 seconds per model) widens its potential usage. ABodyBuilder can also help users in decision-making for the development of novel antibodies because it provides model confidence and potential sequence liabilities. ABodyBuilder is freely available at http://opig.stats.ox.ac.uk/webapps/abodybuilder .
Antibodies are versatile proteins with both the capacity to bind a broad range of targets and a proven track record as some of the most successful therapeutics. However, the development of novel ...antibody therapeutics is a lengthy and costly process. It is challenging to predict the functional and biophysical properties of antibodies from their amino acid sequence alone, requiring numerous experiments for full characterization. Machine learning, specifically deep representation learning, has emerged as a family of methods that can complement wet lab approaches and accelerate the overall discovery and engineering process. Here, we review advances in antibody sequence representation learning, and how this has improved antibody structure prediction and facilitated antibody optimization. We discuss challenges in the development and implementation of such models, such as the lack of publicly available, well-curated antibody function data and highlight opportunities for improvement. These and future advances in machine learning for antibody sequences have the potential to increase the success rate in developing new therapeutics, resulting in broader access to transformative medicines and improved patient outcomes.