Effective countermeasures against the recent emergence and rapid expansion of the 2019 novel coronavirus (SARS-CoV-2) require the development of data and tools to understand and monitor its spread ...and immune responses to it. However, little information is available about the targets of immune responses to SARS-CoV-2. We used the Immune Epitope Database and Analysis Resource (IEDB) to catalog available data related to other coronaviruses. This includes SARS-CoV, which has high sequence similarity to SARS-CoV-2 and is the best-characterized coronavirus in terms of epitope responses. We identified multiple specific regions in SARS-CoV-2 that have high homology to the SARS-CoV virus. Parallel bioinformatic predictions identified a priori potential B and T cell epitopes for SARS-CoV-2. The independent identification of the same regions using two approaches reflects the high probability that these regions are promising targets for immune recognition of SARS-CoV-2. These predictions can facilitate effective vaccine design against this virus of high priority.
Display omitted
•Ten experimentally defined regions within SARS-CoV have high homology with SARS-CoV-2•Parallel bioinformatics predicted potential B and T cell epitopes for SARS-CoV-2•Independent approaches identified the same immunodominant regions•The conserved immune regions have implications for vaccine design against multiple CoVs
Grifoni et al. identify potential targets for immune responses to the 2019 novel coronavirus (SARS-CoV-2) by sequence homology with closely related SARS-CoV and by a priori epitope prediction using bioinformatics approaches. This analysis provides essential information for understanding human immune responses to this virus and for evaluating diagnostic and vaccine candidates.
MHC class II binding predictions are widely used to identify epitope candidates in infectious agents, allergens, cancer and autoantigens. The vast majority of prediction algorithms for human MHC ...class II to date have targeted HLA molecules encoded in the DR locus. This reflects a significant gap in knowledge as HLA DP and DQ molecules are presumably equally important, and have only been studied less because they are more difficult to handle experimentally.
In this study, we aimed to narrow this gap by providing a large scale dataset of over 17,000 HLA-peptide binding affinities for a set of 11 HLA DP and DQ alleles. We also expanded our dataset for HLA DR alleles resulting in a total of 40,000 MHC class II binding affinities covering 26 allelic variants. Utilizing this dataset, we generated prediction tools utilizing several machine learning algorithms and evaluated their performance.
We found that 1) prediction methodologies developed for HLA DR molecules perform equally well for DP or DQ molecules. 2) Prediction performances were significantly increased compared to previous reports due to the larger amounts of training data available. 3) The presence of homologous peptides between training and testing datasets should be avoided to give real-world estimates of prediction performance metrics, but the relative ranking of different predictors is largely unaffected by the presence of homologous peptides, and predictors intended for end-user applications should include all training data for maximum performance. 4) The recently developed NN-align prediction method significantly outperformed all other algorithms, including a naïve consensus based on all prediction methods. A new consensus method dropping the comparably weak ARB prediction method could outperform the NN-align method, but further research into how to best combine MHC class II binding predictions is required.
T cells in atherosclerosis Tse, Kevin; Tse, Harley; Sidney, John ...
International immunology,
11/2013, Volume:
25, Issue:
11
Journal Article
Peer reviewed
Open access
Atherosclerosis is a chronic inflammatory disease of the artery wall. Atherosclerotic lesions contain monocytes, macrophages, smooth muscle cells and T lymphocytes. Here, we review the role of ...T-lymphocyte subsets in atherosclerosis. Among CD4⁺T cells, T(h)1 cells are pro-atherogenic, T(reg) cells are athero-protective and the role of T(h)2 and T(h)17 cells remains unclear. The role of follicular helper T cells in atherosclerosis remains unknown, as is the role of CD8⁺T cells. NKT cells bind glycolipid antigens and exert a pro-atherogenic role. The antigen specificity of T-cell responses in atherosclerosis is poorly understood. In order to enable antigen-specific prevention or therapy, a better understanding of these mechanisms is needed.
Identification of human leukocyte antigen (HLA)-bound peptides by liquid chromatography-tandem mass spectrometry (LC-MS/MS) is poised to provide a deep understanding of rules underlying antigen ...presentation. However, a key obstacle is the ambiguity that arises from the co-expression of multiple HLA alleles. Here, we have implemented a scalable mono-allelic strategy for profiling the HLA peptidome. By using cell lines expressing a single HLA allele, optimizing immunopurifications, and developing an application-specific spectral search algorithm, we identified thousands of peptides bound to 16 different HLA class I alleles. These data enabled the discovery of subdominant binding motifs and an integrative analysis quantifying the contribution of factors critical to epitope presentation, such as protein cleavage and gene expression. We trained neural-network prediction algorithms with our large dataset (>24,000 peptides) and outperformed algorithms trained on datasets of peptides with measured affinities. We thus demonstrate a strategy for systematically learning the rules of endogenous antigen presentation.
Display omitted
•24,000 HLA class I peptides were identified through a scalable MS-based pipeline.•Mono-allelic data revealed binding motifs that were validated biochemically.•Comprehensive analyses provide an updated portrait of antigen processing rules.•Neural networks were trained for 16 alleles and outperform standard by 2-fold.
HLA class I binding prediction has traditionally been based on biochemical binding experiments. Abelin and colleagues present an LC-MS/MS-based workflow and analytical framework that greatly accelerates gains in prediction performance. Key advances include the discovery of sequence motifs and improved quantification of the roles of gene expression and proteasomal processing.
Dengue virus (DENV) is a rapidly spreading pathogen with unusual pathogenesis, and correlates of protection from severe dengue disease and vaccine efficacy have not yet been established. Although ...DENV-specific CD8⁺ T-cell responses have been extensively studied, the breadth and specificity of CD4⁺ T-cell responses remains to be defined. Here we define HLA-restricted CD4⁺ T-cell epitopes resulting from natural infection with dengue virus in a hyperepidemic setting. Ex vivo flow-cytometric analysis of DENV-specific CD4⁺ T cells revealed that the virus-specific cells were highly polarized, with a strong bias toward a CX3CR1⁺ Eomesodermin⁺ perforin⁺ granzyme B⁺ CD45RA⁺ CD4 CTL phenotype. Importantly, these cells correlated with a protective HLA DR allele, and we demonstrate that these cells have direct ex vivo DENV-specific cytolytic activity. We speculate that cytotoxic dengue-specific CD4⁺ T cells may play a role in the control of dengue infection in vivo, and this immune correlate may be a key target for dengue virus vaccine development.
In an epitope-based vaccine setting, the use of conserved epitopes would be expected to provide broader protection across multiple strains, or even species, than epitopes derived from highly variable ...genome regions. Conversely, in a diagnostic and disease monitoring setting, epitopes that are specific to a given pathogen strain, for example, can be used to monitor responses to that particular infectious strain. In both cases, concrete information pertaining to the degree of conservancy of the epitope(s) considered is crucial.
To assist in the selection of epitopes with the desired degree of conservation, we have developed a new tool to determine the variability of epitopes within a given set of protein sequences. The tool was implemented as a component of the Immune Epitope Database and Analysis Resources (IEDB), and is directly accessible at http://tools.immuneepitope.org/tools/conservancy.
An epitope conservancy analysis tool was developed to analyze the variability or conservation of epitopes. The tool is user friendly, and is expected to aid in the design of epitope-based vaccines and diagnostics.
The identification of MHC class II restricted peptide epitopes is an important goal in immunological research. A number of computational tools have been developed for this purpose, but there is a ...lack of large-scale systematic evaluation of their performance. Herein, we used a comprehensive dataset consisting of more than 10,000 previously unpublished MHC-peptide binding affinities, 29 peptide/MHC crystal structures, and 664 peptides experimentally tested for CD4+ T cell responses to systematically evaluate the performances of publicly available MHC class II binding prediction tools. While in selected instances the best tools were associated with AUC values up to 0.86, in general, class II predictions did not perform as well as historically noted for class I predictions. It appears that the ability of MHC class II molecules to bind variable length peptides, which requires the correct assignment of peptide binding cores, is a critical factor limiting the performance of existing prediction tools. To improve performance, we implemented a consensus prediction approach that combines methods with top performances. We show that this consensus approach achieved best overall performance. Finally, we make the large datasets used publicly available as a benchmark to facilitate further development of MHC class II binding peptide prediction methods.
The proteasome generates the epitopes presented on human leukocyte antigen (HLA) class I molecules that elicit CD8⁺ T cell responses. Reports of proteasome-generated spliced epitopes exist, but they ...have been regarded as rare events. Here, however, we show that the proteasome-generated spliced peptide pool accounts for one-third of the entire HLA class I immunopeptidome in terms of diversity and one-fourth in terms of abundance. This pool also represents a unique set of antigens, possessing particular and distinguishing features. We validated this observation using a range of complementary experimental and bioinformatics approaches, as well as multiple cell types. The widespread appearance and abundance of proteasome-catalyzed peptide splicing events has implications for immunobiology and autoimmunity theories and may provide a previously untapped source of epitopes for use in vaccines and cancer immunotherapy.
Experts in peptide:MHC binding studies are often able to estimate the impact of a single residue substitution based on a heuristic understanding of amino acid similarity in an experimental context. ...Our aim is to quantify this measure of similarity to improve peptide:MHC binding prediction methods. This should help compensate for holes and bias in the sequence space coverage of existing peptide binding datasets.
Here, a novel amino acid similarity matrix (PMBEC) is directly derived from the binding affinity data of combinatorial peptide mixtures. Like BLOSUM62, this matrix captures well-known physicochemical properties of amino acid residues. However, PMBEC differs markedly from existing matrices in cases where residue substitution involves a reversal of electrostatic charge. To demonstrate its usefulness, we have developed a new peptide:MHC class I binding prediction method, using the matrix as a Bayesian prior. We show that the new method can compensate for missing information on specific residues in the training data. We also carried out a large-scale benchmark, and its results indicate that prediction performance of the new method is comparable to that of the best neural network based approaches for peptide:MHC class I binding.
A novel amino acid similarity matrix has been derived for peptide:MHC binding interactions. One prominent feature of the matrix is that it disfavors substitution of residues with opposite charges. Given that the matrix was derived from experimentally determined peptide:MHC binding affinity measurements, this feature is likely shared by all peptide:protein interactions. In addition, we have demonstrated the usefulness of the matrix as a Bayesian prior in an improved scoring-matrix based peptide:MHC class I prediction method. A software implementation of the method is available at: http://www.mhc-pathway.net/smmpmbec.
Class I major histocompatibility complex (MHC) molecules bind, and present to T cells, short peptides derived from intracellular processing of proteins. The peptide repertoire of a specific molecule ...is to a large extent determined by the molecular structure accommodating so-called main anchor positions of the presented peptide. These receptors are extremely polymorphic, and much of the polymorphism influences the peptide-binding repertoire. However, despite this polymorphism, class I molecules can be clustered into sets of molecules that bind largely overlapping peptide repertoires. Almost a decade ago we introduced this concept of clustering human leukocyte antigen (HLA) alleles and defined nine different groups, denominated as supertypes, on the basis of their main anchor specificity. The utility of this original supertype classification, as well several other subsequent arrangements derived by others, has been demonstrated in a large number of epitope identification studies.
Following our original approach, in the present report we provide an updated classification of HLA-A and -B class I alleles into supertypes. The present analysis incorporates the large amount of class I MHC binding data and sequence information that has become available in the last decade. As a result, over 80% of the 945 different HLA-A and -B alleles examined to date can be assigned to one of the original nine supertypes. A few alleles are expected to be associated with repertoires that overlap multiple supertypes. Interestingly, the current analysis did not identify any additional supertype specificities.
As a result of this updated analysis, HLA supertype associations have been defined for over 750 different HLA-A and -B alleles. This information is expected to facilitate epitope identification and vaccine design studies, as well as investigations into disease association and correlates of immunity. In addition, the approach utilized has been made more transparent, allowing others to utilize the classification approach going forward.