Nitrotyrosine is a product of tyrosine nitration mediated by reactive nitrogen species. As an indicator of cell damage and inflammation, protein nitrotyrosine serves to reveal biological change ...associated with various diseases or oxidative stress. Accurate identification of nitrotyrosine site provides the important foundation for further elucidating the mechanism of protein nitrotyrosination. However, experimental identification of nitrotyrosine sites through traditional methods are laborious and expensive. In silico prediction of nitrotyrosine sites based on protein sequence information are thus highly desired. Here, we report a novel predictor, NTyroSite, for accurate prediction of nitrotyrosine sites using sequence evolutionary information. The generated features were optimized using a Wilcoxon-rank sum test. A random forest classifier was then trained using these features to build the predictor. The final NTyroSite predictor achieved an area under a receiver operating characteristics curve (AUC) score of 0.904 in a 10-fold cross-validation test. It also significantly outperformed other existing implementations in an independent test. Meanwhile, for a better understanding of our prediction model, the predominant rules and informative features were extracted from the NTyroSite model to explain the prediction results. We expect that the NTyroSite predictor may serve as a useful computational resource for high-throughput nitrotyrosine site prediction. The online interface of the software is publicly available at https://biocomputer.bio.cuhk.edu.hk/NTyroSite/.
The pandemic threat of COVID-19 has severely destroyed human life as well as the economy around the world. Although, the vaccination has reduced the outspread, but people are still suffering due to ...the unstable RNA sequence patterns of SARS-CoV-2 which demands supplementary drugs. To explore novel drug target proteins, in this study, a transcriptomics RNA-Seq data generated from SARS-CoV-2 infection and control samples were analyzed. We identified 109 differentially expressed genes (DEGs) that were utilized to identify 10 hub-genes/proteins (TLR2, USP53, GUCY1A2, SNRPD2, NEDD9, IGF2, CXCL2, KLF6, PAG1 and ZFP36) by the protein-protein interaction (PPI) network analysis. The GO functional and KEGG pathway enrichment analyses of hub-DEGs revealed some important functions and signaling pathways that are significantly associated with SARS-CoV-2 infections. The interaction network analysis identified 5 TFs proteins and 6 miRNAs as the key regulators of hub-DEGs. Considering 10 hub-proteins and 5 key TFs-proteins as drug target receptors, we performed their docking analysis with the SARS-CoV-2 3CL protease-guided top listed 90 FDA approved drugs. We found Torin-2, Rapamycin, Radotinib, Ivermectin, Thiostrepton, Tacrolimus and Daclatasvir as the top ranked seven candidate drugs. We investigated their resistance performance against the already published COVID-19 causing top-ranked 11 independent and 8 protonated receptor proteins by molecular docking analysis and found their strong binding affinities, which indicates that the proposed drugs are effective against the state-of-the-arts alternatives independent receptor proteins also. Finally, we investigated the stability of top three drugs (Torin-2, Rapamycin and Radotinib) by using 100 ns MD-based MM-PBSA simulations with the two top-ranked proposed receptors (TLR2, USP53) and independent receptors (IRF7, STAT1), and observed their stable performance. Therefore, the proposed drugs might play a vital role for the treatment against different variants of SARS-CoV-2 infections.
Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2) is one of the most severe global pandemic due to its high pathogenicity and death rate starting from the end of 2019. Though there are ...some vaccines available against SAER-CoV-2 infections, we are worried about their effectiveness, due to its unstable sequence patterns. Therefore, beside vaccines, globally effective supporting drugs are also required for the treatment against SARS-CoV-2 infection. To explore commonly effective repurposable drugs for the treatment against different variants of coronavirus infections, in this article, an attempt was made to explore host genomic biomarkers guided repurposable drugs for SARS-CoV-1 infections and their validation with SARS-CoV-2 infections by using the integrated bioinformatics approaches. At first, we identified 138 differentially expressed genes (DEGs) between SARS-CoV-1 infected and control samples by analyzing high throughput gene-expression profiles to select drug target key receptors. Then we identified top-ranked 11 key DEGs (SMAD4, GSK3B, SIRT1, ATM, RIPK1, PRKACB, MED17, CCT2, BIRC3, ETS1 and TXN) as hub genes (HubGs) by protein-protein interaction (PPI) network analysis of DEGs highlighting their functions, pathways, regulators and linkage with other disease risks that may influence SARS-CoV-1 infections. The DEGs-set enrichment analysis significantly detected some crucial biological processes (immune response, regulation of angiogenesis, apoptotic process, cytokine production and programmed cell death, response to hypoxia and oxidative stress), molecular functions (transcription factor binding and oxidoreductase activity) and pathways (transcriptional mis-regulation in cancer, pathways in cancer, chemokine signaling pathway) that are associated with SARS-CoV-1 infections as well as SARS-CoV-2 infections by involving HubGs. The gene regulatory network (GRN) analysis detected some transcription factors (FOXC1, GATA2, YY1, FOXL1, TP53 and SRF) and micro-RNAs (hsa-mir-92a-3p, hsa-mir-155-5p, hsa-mir-106b-5p, hsa-mir-34a-5p and hsa-mir-19b-3p) as the key transcriptional and post- transcriptional regulators of HubGs, respectively. We also detected some chemicals (Valproic Acid, Cyclosporine, Copper Sulfate and arsenic trioxide) that may regulates HubGs. The disease-HubGs interaction analysis showed that our predicted HubGs are also associated with several other diseases including different types of lung diseases. Then we considered 11 HubGs mediated proteins and their regulatory 6 key TFs proteins as the drug target proteins (receptors) and performed their docking analysis with the SARS-CoV-2 3CL protease-guided top listed 90 anti-viral drugs out of 3410. We found Rapamycin, Tacrolimus, Torin-2, Radotinib, Danoprevir, Ivermectin and Daclatasvir as the top-ranked 7 candidate-drugs with respect to our proposed target proteins for the treatment against SARS-CoV-1 infections. Then, we validated these 7 candidate-drugs against the already published top-ranked 11 target proteins associated with SARS-CoV-2 infections by molecular docking simulation and found their significant binding affinity scores with our proposed candidate-drugs. Finally, we validated all of our findings by the literature review. Therefore, the proposed candidate-drugs might play a vital role for the treatment against different variants of SARS-CoV-2 infections with comorbidities, since the proposed HubGs are also associated with several comorbidities.
Integrated bioinformatics and statistical approaches are now playing the vital role in identifying potential molecular biomarkers more accurately in presence of huge number of alternatives for ...disease diagnosis, prognosis and therapies by reducing time and cost compared to the wet-lab based experimental procedures. Breast cancer (BC) is one of the leading causes of cancer related deaths for women worldwide. Several dry-lab and wet-lab based studies have identified different sets of molecular biomarkers for BC. But they did not compare their results to each other so much either computationally or experimentally. In this study, an attempt was made to propose a set of molecular biomarkers that might be more effective for BC diagnosis, prognosis and therapies, by using the integrated bioinformatics and statistical approaches. At first, we identified 190 differentially expressed genes (DEGs) between BC and control samples by using the statistical LIMMA approach. Then we identified 13 DEGs (AKR1C1, IRF9, OAS1, OAS3, SLCO2A1, NT5E, NQO1, ANGPT1, FN1, ATF6B, HPGD, BCL11A, and TP53INP1) as the key genes (KGs) by protein-protein interaction (PPI) network analysis. Then we investigated the pathogenetic processes of DEGs highlighting KGs by GO terms and KEGG pathway enrichment analysis. Moreover, we disclosed the transcriptional and post-transcriptional regulatory factors of KGs by their interaction network analysis with the transcription factors (TFs) and micro-RNAs. Both supervised and unsupervised learning's including multivariate survival analysis results confirmed the strong prognostic power of the proposed KGs. Finally, we suggested KGs-guided computationally more effective seven candidate drugs (NVP-BHG712, Nilotinib, GSK2126458, YM201636, TG-02, CX-5461, AP-24534) compared to other published drugs by cross-validation with the state-of-the-art alternatives top-ranked independent receptor proteins. Thus, our findings might be played a vital role in breast cancer diagnosis, prognosis and therapies.
HIF1A gene polymorphisms have been confirmed the association with cancer risk through the statistical meta-analysis based on single genetic association (SGA) studies. A good number SGA studies also ...investigated the association of HIF1A gene with several other diseases, but no researcher yet performed statistical meta-analysis to confirm this association more accurately. Therefore, in this paper, we performed a statistical meta-analysis to draw a consensus decision about the association of HIF1A gene polymorphisms with several diseases except cancers giving the weight on large sample size. This meta-analysis was performed based on 41 SGA study's findings, where the polymorphisms rs11549465 (1772 C/T) and rs11549467 (1790 G/A) of HIF1A gene were analyzed based on 11544 and 7426 cases and 11494 and 7063 control samples, respectively. Our results showed that the 1772 C/T polymorphism is not significantly associated with overall disease risks. The 1790 G/A polymorphism was significantly associated with overall diseases under recessive model (AA vs. AG + GG), which indicates that the A allele is responsible for overall diseases though it is recessive. The subgroup analysis based on ethnicity showed the significant association of 1772 C/T polymorphism with overall disease for Caucasian population under the all genetic models, which indicates that the C allele controls overall diseases. The ethnicity subgroup showed the significant association of 1790 G/A polymorphism with overall disease for Asian population under the recessive model (AA vs. AG + GG), which indicates that the A allele is responsible for overall diseases. The subgroup analysis based on disease types showed that 1772 C/T is significantly associated with chronic obstructive pulmonary disease (COPD) under two genetic models (C vs. T and CC vs. CT + TT), skin disease under two genetic models (CC vs. TT and CC + CT vs. TT), and diabetic complications under three genetic models (C vs. T, CT vs. TT and CC + CT vs. TT), where C allele is high risk factor for skin disease and diabetic complications (since, ORs > 1), but low risk factor for COPD (since, ORs < 1). Also the 1790 G/A variant significantly associated with the subgroup of cardiovascular disease (CVD) under homozygote model, diabetic complications under allelic and homozygote models, and other disease under four genetic models, where the A is high risk factor for diabetic complications and low risk factor for CVD. Thus, this study provided more evidence that the HIF1A gene is significantly associated with COPD, CVD, skin disease and diabetic complications. These might be the severe comorbidities and risk factors for multiple cancers due to the effect of HIF1A gene and need further investigations accumulating large number of studies.
The detoxification efflux carriers (DTX) are a significant group of multidrug efflux transporter family members that play diverse functions in all kingdoms of living organisms. However, genome-wide ...identification and characterization of DTX family transporters have not yet been performed in banana, despite its importance as an economic fruit plant. Therefore, a detailed genome-wide analysis of DTX family transporters in banana (Musa acuminata) was conducted using integrated bioinformatics and systems biology approaches. In this study, a total of 37 DTX transporters were identified in the banana genome and divided into four groups (I, II, III, and IV) based on phylogenetic analysis. The gene structures, as well as their proteins' domains and motifs, were found to be significantly conserved. Gene ontology (GO) annotation revealed that the predicted DTX genes might play a vital role in protecting cells and membrane-bound organelles through detoxification mechanisms and the removal of drug molecules from banana cells. Gene regulatory analyses identified key transcription factors (TFs), cis-acting elements, and post-transcriptional regulators (miRNAs) of DTX genes, suggesting their potential roles in banana. Furthermore, the changes in gene expression levels due to pathogenic infections and non-living factor indicate that banana DTX genes play a role in responses to both biotic and abiotic stresses. The results of this study could serve as valuable tools to improve banana quality by protecting them from a range of environmental stresses.
Bioinformatics analysis has been playing a vital role in identifying potential genomic biomarkers more accurately from an enormous number of candidates by reducing time and cost compared to the ...wet-lab-based experimental procedures for disease diagnosis, prognosis, and therapies. Cervical cancer (CC) is one of the most malignant diseases seen in women worldwide. This study aimed at identifying potential key genes (KGs), highlighting their functions, signaling pathways, and candidate drugs for CC diagnosis and targeting therapies. Four publicly available microarray datasets of CC were analyzed for identifying differentially expressed genes (DEGs) by the LIMMA approach through GEO2R online tool. We identified 116 common DEGs (cDEGs) that were utilized to identify seven KGs (AURKA, BRCA1, CCNB1, CDK1, MCM2, NCAPG2, and TOP2A) by the protein-protein interaction (PPI) network analysis. The GO functional and KEGG pathway enrichment analyses of KGs revealed some important functions and signaling pathways that were significantly associated with CC infections. The interaction network analysis identified four TFs proteins and two miRNAs as the key transcriptional and post-transcriptional regulators of KGs. Considering seven KGs-based proteins, four key TFs proteins, and already published top-ranked seven KGs-based proteins (where five KGs were common with our proposed seven KGs) as drug target receptors, we performed their docking analysis with the 80 meta-drug agents that were already published by different reputed journals as CC drugs. We found Paclitaxel, Vinorelbine, Vincristine, Docetaxel, Everolimus, Temsirolimus, and Cabazitaxel as the top-ranked seven candidate drugs. Finally, we investigated the binding stability of the top-ranked three drugs (Paclitaxel, Vincristine, Vinorelbine) by using 100 ns MD-based MM-PBSA simulations with the three top-ranked proposed receptors (AURKA, CDK1, TOP2A) and observed their stable performance. Therefore, the proposed drugs might play a vital role in the treatment against CC.
The pandemic of COVID-19 is a severe threat to human life and the global economy. Despite the success of vaccination efforts in reducing the spread of the virus, the situation remains largely ...uncontrolled due to the random mutation in the RNA sequence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which demands different variants of effective drugs. Disease-causing gene-mediated proteins are usually used as receptors to explore effective drug molecules. In this study, we analyzed two different RNA-Seq and one microarray gene expression profile datasets by integrating EdgeR, LIMMA, weighted gene co-expression network and robust rank aggregation approaches, which revealed SARS-CoV-2 infection causing eight hub-genes (HubGs) including HubGs; REL, AURKA, AURKB, FBXL3, OAS1, STAT4, MMP2 and IL6 as the host genomic biomarkers. Gene Ontology and pathway enrichment analyses of HubGs significantly enriched some crucial biological processes, molecular functions, cellular components and signaling pathways that are associated with the mechanisms of SARS-CoV-2 infections. Regulatory network analysis identified top-ranked 5 TFs (SRF, PBX1, MEIS1, ESR1 and MYC) and 5 miRNAs (hsa-miR-106b-5p, hsa-miR-20b-5p, hsa-miR-93-5p, hsa-miR-106a-5p and hsa-miR-20a-5p) as the key transcriptional and post-transcriptional regulators of HubGs. Then, we conducted a molecular docking analysis to determine potential drug candidates that could interact with HubGs-mediated receptors. This analysis resulted in the identification of top-ranked ten drug agents, including Nilotinib, Tegobuvir, Digoxin, Proscillaridin, Olysio, Simeprevir, Hesperidin, Oleanolic Acid, Naltrindole and Danoprevir. Finally, we investigated the binding stability of the top-ranked three drug molecules Nilotinib, Tegobuvir and Proscillaridin with the three top-ranked proposed receptors (AURKA, AURKB, OAS1) by using 100 ns MD-based MM-PBSA simulations and observed their stable performance. Therefore, the findings of this study might be useful resources for diagnosis and therapies of SARS-CoV-2 infections.
The outbreak of SARS-CoV-2, also known as the COVID-19 pandemic, is still a critical risk factor for both human life and the global economy. Although, several promising therapies have been introduced ...in the literature to inhibit SARS-CoV-2, most of them are synthetic drugs that may have some adverse effects on the human body. Therefore, the main objective of this study was to carry out an in-silico investigation into the medicinal properties of Petiveria alliacea L. (P. alliacea L.)-mediated phytocompounds for the treatment of SARS-CoV-2 infections since phytochemicals have fewer adverse effects compared to synthetic drugs. To explore potential phytocompounds from P. alliacea L. as candidate drug molecules, we selected the infection-causing main protease (Mpro) of SARS-CoV-2 as the receptor protein. The molecular docking analysis of these receptor proteins with the different phytocompounds of P. alliacea L. was performed using AutoDock Vina. Then, we selected the three top-ranked phytocompounds (myricitrin, engeletin, and astilbin) as the candidate drug molecules based on their highest binding affinity scores of −8.9, −8.7 and −8.3 (Kcal/mol), respectively. Then, a 100 ns molecular dynamics (MD) simulation study was performed for their complexes with Mpro using YASARA software, computed RMSD, RMSF, PCA, DCCM, MM/PBSA, and free energy landscape (FEL), and found their almost stable binding performance. In addition, biological activity, ADME/T, DFT, and drug-likeness analyses exhibited the suitable pharmacokinetics properties of the selected phytocompounds. Therefore, the results of this study might be a useful resource for formulating a safe treatment plan for SARS-CoV-2 infections after experimental validation in wet-lab and clinical trials.
Interval mapping approaches have been playing significant role for quantitative trait locus (QTL) mapping to discover genetic architecture of diseases or traits with molecular markers. Composite ...interval mapping (CIM) is one of the superior approaches of the interval mapping for discovering both linked and unlinked putative QTL positions. However, estimators of this approach are not robust against phenotypic outliers. As a result, it fails to detect true QTL positions in presence of outliers. In this study, we investigated the performance of β-Composite Interval Mapping (BetaCIM) for detecting both linked and unlinked important QTLs positions from the robustness points of views. Performance of this approach depends on the value of tuning parameter β. It reduces to the classical CIM approach for β →0. We described and formulated the cross-validation procedure for selecting trait specific optimum value of β. It was observed that the optimum value of β depends on both amount of contaminated observations and their scatteredness. BetaCIM approach discover similar QTL positions as classical IM/CIM in absence of phenotypic outliers, but gives better results in presence of phenotypic outliers in terms of detecting true QTLs and effects estimation. We formulated the generalized forms of robust QTL analysis and developed an R-package named "BetaCIM" by implementing this robust approach. Left and right kidney weight data sets of mouse intercross population (129 S1/SvlmJ × A/J) were analyzed by using BetaCIM, CIM, and IM approaches. For right kidney weight (RKW) CIM and BetaCIM provided similar LOD score profile, and both approaches identified 3 QTL positions. IM approach also identified 3 QTL positions. For left kidney weight (LKW), there was evidence of one outlying observation; and in this case the BetaCIM approach identified 2 QTL positions. However, none of the QTLs were significant by CIM and IM approaches at 5% level of significance. Gene expression ontology (GEO) search showed that the candidate genes (Otof and A330033J07Rik) of the identified QTLs for LKW were expressed in kidney. Both simulation and real data analysis results showed that BetaCIM approach improves the performance over the existing methods in presence of phenotypic outliers. Otherwise, it keeps almost equal performance.