In this study a systematic attempt has been made to integrate various approaches in order to predict allergenic proteins with high accuracy. The dataset used for testing and training consists of 578 ...allergens and 700 non-allergens obtained from A. K. Bjorklund, D. Soeria-Atmadja, A. Zorzet, U. Hammerling and M. G. Gustafsson (2005) Bioinformatics, 21, 39-50. First, we developed methods based on support vector machine using amino acid and dipeptide composition and achieved an accuracy of 85.02 and 84.00%, respectively. Second, a motif-based method has been developed using MEME/MAST software that achieved sensitivity of 93.94 with 33.34% specificity. Third, a database of known IgE epitopes was searched and this predicted allergenic proteins with 17.47% sensitivity at specificity of 98.14%. Fourth, we predicted allergenic proteins by performing BLAST search against allergen representative peptides. Finally hybrid approaches have been developed, which combine two or more than two approaches. The performance of all these algorithms has been evaluated on an independent dataset of 323 allergens and on 101 725 non-allergens obtained from Swiss-Prot. A web server AlgPred has been developed for the predicting allergenic proteins and for mapping IgE epitopes on allergenic proteins (http://www.imtech.res.in/raghava/algpred/). AlgPred is available at www.imtech.res.in/raghava/algpred/.
Laterosporulin10 (LS10) is a defensin like peptide from Brevibacillus sp. strain SKDU10 that inhibited microbial pathogens. However, in this study, anticancer activity of LS10 was examined against ...different cancer cell lines and compared with normal cells. LS10 displayed cytotoxicity against cancer cells like MCF-7, HEK293T, HT1080, HeLa and H1299 at below 10 μM concentration, but not against prostate epithelium cells RWPE-1. Additionally, no hemolysis was observed at significantly higher concentration compared to IC
values observed for different cancer cell lines. Release of lactate dehydrogenase from cancer cell lines at 15 μM concentration upon 120 min treatment indicated the lytic ability of LS10. Accordingly, electron microscopy experiments also confirmed the necrotic effect of LS10 at 15 μM concentration against cancer cells. Furthermore, flow cytometry analysis of treated cancer cell lines revealed that LS10 induce apoptosis even at 2.5 μM concentration. Nevertheless, RWPE-1 cells remained viable even at 20 μM concentration. These results provide evidence that LS10 is an anticancer bacteriocin, which causes apoptotic and necrotic death of cancer cells at lower and higher concentrations, respectively. Taken all results together, the present study signifies that LS10 is an anticancer peptide that could be further developed for therapeutic applications.
Shigellosis or bacillary dysentery is an important cause of diarrhea, with the majority of the cases occurring in developing countries. Considering the high disease burden, increasing antibiotic ...resistance, serotype-specific immunity and the post-infectious sequelae associated with shigellosis, there is a pressing need of an effective vaccine against multiple serotypes of the pathogen. In the present study, we used bio-informatics approach to identify antigens shared among multiple serotypes of Shigella spp. This approach led to the identification of many immunogenic peptides. The five most promising peptides based on MHC binding efficiency were a putative lipoprotein (EL PGI I), a putative heat shock protein (EL PGI II), Spa32 (EL PGI III), IcsB (EL PGI IV) and a hypothetical protein (EL PGI V). These peptides were synthesized and the immunogenicity was evaluated in BALB/c mice by ELISA and cytokine assays. The putative heat shock protein (HSP) and the hypothetical protein elicited good humoral response, whereas putative lipoprotein, Spa32 and IcsB elicited good T-cell response as revealed by increased IFN-γ and TNF-α cytokine levels. The patient sera from confirmed cases of shigellosis were also evaluated for the presence of peptide specific antibodies with significant IgG and IgA antibodies against the HSP and the hypothetical protein, bestowing them as potential future vaccine candidates. The antigens reported in this study are novel and have not been tested as vaccine candidates against Shigella. This study offers time and cost-effective way of identifying unprecedented immunogenic antigens to be used as potential vaccine candidates. Moreover, this approach should easily be extendable to find new potential vaccine candidates for other pathogenic bacteria.
Recently, high profile cases of financial statement fraud have been dominating the news. This paper uses data mining techniques such as Multilayer Feed Forward Neural Network (MLFF), Support Vector ...Machines (SVM), Genetic Programming (GP), Group Method of Data Handling (GMDH), Logistic Regression (LR), and Probabilistic Neural Network (PNN) to identify companies that resort to financial statement fraud. Each of these techniques is tested on a dataset involving 202 Chinese companies and compared with and without feature selection. PNN outperformed all the techniques without feature selection, and GP and PNN outperformed others with feature selection and with marginally equal accuracies.
Use of therapeutic peptides in cancer therapy has been receiving considerable attention in the recent years. Present study describes the development of computational models for predicting and ...discovering novel anticancer peptides. Preliminary analysis revealed that Cys, Gly, Ile, Lys, and Trp are dominated at various positions in anticancer peptides. Support vector machine models were developed using amino acid composition and binary profiles as input features on main dataset that contains experimentally validated anticancer peptides and random peptides derived from SwissProt database. In addition, models were developed on alternate dataset that contains antimicrobial peptides instead of random peptides. Binary profiles-based model achieved maximum accuracy 91.44% with MCC 0.83. We have developed a webserver, which would be helpful in: (i) predicting minimum mutations required for improving anticancer potency; (ii) virtual screening of peptides for discovering novel anticancer peptides, and (iii) scanning natural proteins for identification of anticancer peptides (http://crdd.osdd.net/raghava/anticp/).
ProPred is a graphical web tool for predicting MHC class II binding regions in antigenic protein sequences. The server implement matrix based prediction algorithm, employing amino-acid/position ...coefficient table deduced from literature. The predicted binders can be visualized either as peaks in graphical interface or as colored residues in HTML interface. This server might be a useful tool in locating the promiscuous binding regions that can bind to several HLA-DR alleles. Availability: The server is available at http://www.imtech.res.in/raghava/propred/ Contact: raghava@imtech.res.in Supplementary information: http://www.imtech.res.in/raghava/propred/page3.html * To whom correspondence should be addressed.
Automated prediction of subcellular localization of proteins is an important step in the functional annotation of genomes. The existing subcellular localization prediction methods are based on either ...amino acid composition or N-terminal characteristics of the proteins. In this paper, support vector machine (SVM) has been used to predict the subcellular location of eukaryotic proteins from their different features such as amino acid composition, dipeptide composition and physico-chemical properties. The SVM module based on dipeptide composition performed better than the SVM modules based on amino acid composition or physico-chemical properties. In addition, PSI-BLAST was also used to search the query sequence against the dataset of proteins (experimentally annotated proteins) to predict its subcellular location. In order to improve the prediction accuracy, we developed a hybrid module using all features of a protein, which consisted of an input vector of 458 dimensions (400 dipeptide compositions, 33 properties, 20 amino acid compositions of the protein and 5 from PSI-BLAST output). Using this hybrid approach, the prediction accuracies of nuclear, cytoplasmic, mitochondrial and extracellular proteins reached 95.3, 85.2, 68.2 and 88.9%, respectively. The overall prediction accuracy of SVM modules based on amino acid composition, physico-chemical properties, dipeptide composition and the hybrid approach was 78.1, 77.8, 82.9 and 88.0%, respectively. The accuracy of all the modules was evaluated using a 5-fold cross-validation technique. Assigning a reliability index (reliability index ≥3), 73.5% of prediction can be made with an accuracy of 96.4%. Based on the above approach, an online web server ESLpred was developed, which is available at http://www.imtech.res.in/raghava/eslpred/.
The secretion of Interleukin-4 (IL4) is the characteristic of T-helper 2 responses. IL4 is a cytokine produced by CD4+ T cells in response to helminthes and other extracellular parasites. It has a ...critical role in guiding antibody class switching, hematopoiesis and inflammation, and the development of appropriate effector T-cell responses. In this study, it is the first time an attempt has been made to understand whether it is possible to predict IL4 inducing peptides. The data set used in this study comprises 904 experimentally validated IL4 inducing and 742 noninducing MHC class II binders. Our analysis revealed that certain types of residues are preferred at certain positions in IL4 inducing peptides. It was also observed that IL4 inducing and noninducing epitopes differ in compositional and motif pattern. Based on our analysis we developed classification models where the hybrid method of amino acid pairs and motif information performed the best with maximum accuracy of 75.76% and MCC of 0.51. These results indicate that it is possible to predict IL4 inducing peptides with reasonable precession. These models would be useful in designing the peptides that may induce desired Th2 response.
Antibacterial peptides are important components of the innate immune system, used by the host to protect itself from different types of pathogenic bacteria. Over the last few decades, the search for ...new drugs and drug targets has prompted an interest in these antibacterial peptides. We analyzed 486 antibacterial peptides, obtained from antimicrobial peptide database APD, in order to understand the preference of amino acid residues at specific positions in these peptides.
It was observed that certain types of residues are preferred over others in antibacterial peptides, particularly at the N and C terminus. These observations encouraged us to develop a method for predicting antibacterial peptides in proteins from their amino acid sequence. First, the N-terminal residues were used for predicting antibacterial peptides using Artificial Neural Network (ANN), Quantitative Matrices (QM) and Support Vector Machine (SVM), which resulted in an accuracy of 83.63%, 84.78% and 87.85%, respectively. Then, the C-terminal residues were used for developing prediction methods, which resulted in an accuracy of 77.34%, 82.03% and 85.16% using ANN, QM and SVM, respectively. Finally, ANN, QM and SVM models were developed using N and C terminal residues, which achieved an accuracy of 88.17%, 90.37% and 92.11%, respectively. All the models developed in this study were evaluated using five-fold cross validation technique. These models were also tested on an independent or blind dataset.
Among antibacterial peptides, there is preference for certain residues at N and C termini, which helps to demarcate them from non-antibacterial peptides. Both the termini play a crucial role in imparting the antibacterial property to these peptides. Among the methods developed, SVM shows the best performance in predicting antibacterial peptides followed by QM and ANN, in that order. AntiBP (Antibacterial peptides) will help in discovering efficacious antibacterial peptides, which we hope will prove to be a boon to combat the dreadful antibiotic resistant bacteria. A user friendly web server has also been developed to help the biological community, which is accessible at http://www.imtech.res.in/raghava/antibp/.