•Machine learning was used to develop models to predict COVID-19 positive patient.•Features were extracted from patient data using string matching algorithms.•Constructed a novel dataset from ...unstructured hospitalized patient information.•Used descriptive statistical analysis for frequency calculation of patient symptoms.•Identified significant symptoms of COVID-19 patients using five different ML models.
The recent outbreak of the respiratory ailment COVID-19 caused by novel coronavirus SARS-Cov2 is a severe and urgent global concern. In the absence of effective treatments, the main containment strategy is to reduce the contagion by the isolation of infected individuals; however, isolation of unaffected individuals is highly undesirable. To help make rapid decisions on treatment and isolation needs, it would be useful to determine which features presented by suspected infection cases are the best predictors of a positive diagnosis. This can be done by analyzing patient characteristics, case trajectory, comorbidities, symptoms, diagnosis, and outcomes. We developed a model that employed supervised machine learning algorithms to identify the presentation features predicting COVID-19 disease diagnoses with high accuracy. Features examined included details of the individuals concerned, e.g., age, gender, observation of fever, history of travel, and clinical details such as the severity of cough and incidence of lung infection. We implemented and applied several machine learning algorithms to our collected data and found that the XGBoost algorithm performed with the highest accuracy (>85%) to predict and select features that correctly indicate COVID-19 status for all age groups. Statistical analyses revealed that the most frequent and significant predictive symptoms are fever (41.1%), cough (30.3%), lung infection (13.1%) and runny nose (8.43%). While 54.4% of people examined did not develop any symptoms that could be used for diagnosis, our work indicates that for the remainder, our predictive model could significantly improve the prediction of COVID-19 status, including at early stages of infection.
Amyloid proteins have the ability to form insoluble fibril aggregates that have important pathogenic effects in many tissues. Such amyloidoses are prominently associated with common diseases such as ...type 2 diabetes, Alzheimer's disease, and Parkinson's disease. There are many types of amyloid proteins, and some proteins that form amyloid aggregates when in a misfolded state. It is difficult to identify such amyloid proteins and their pathogenic properties, but a new and effective approach is by developing effective bioinformatics tools. While several machine learning (ML)-based models for in silico identification of amyloid proteins have been proposed, their predictive performance is limited. In this study, we present AMYPred-FRL, a novel meta-predictor that uses a feature representation learning approach to achieve more accurate amyloid protein identification. AMYPred-FRL combined six well-known ML algorithms (extremely randomized tree, extreme gradient boosting, k-nearest neighbor, logistic regression, random forest, and support vector machine) with ten different sequence-based feature descriptors to generate 60 probabilistic features (PFs), as opposed to state-of-the-art methods developed by a single feature-based approach. A logistic regression recursive feature elimination (LR-RFE) method was used to find the optimal m number of 60 PFs in order to improve the predictive performance. Finally, using the meta-predictor approach, the 20 selected PFs were fed into a logistic regression method to create the final hybrid model (AMYPred-FRL). Both cross-validation and independent tests showed that AMYPred-FRL achieved superior predictive performance than its constituent baseline models. In an extensive independent test, AMYPred-FRL outperformed the existing methods by 5.5% and 16.1%, respectively, with accuracy and MCC of 0.873 and 0.710. To expedite high-throughput prediction, a user-friendly web server of AMYPred-FRL is freely available at http://pmlabstack.pythonanywhere.com/AMYPred-FRL . It is anticipated that AMYPred-FRL will be a useful tool in helping researchers to identify new amyloid proteins.
COVID-19, caused by SARS-CoV2 infection, varies greatly in its severity but presents with serious respiratory symptoms with vascular and other complications, particularly in older adults. The disease ...can be spread by both symptomatic and asymptomatic infected individuals. Uncertainty remains over key aspects of the virus infectiousness (particularly the newly emerging variants) and the disease has had severe economic impacts globally. For these reasons, COVID-19 is the subject of intense and widespread discussion on social media platforms including Facebook and Twitter. These public forums substantially influence public opinions and in some cases can exacerbate the widespread panic and misinformation spread during the crisis. Thus, this work aimed to design an intelligent clustering-based classification and topic extracting model named TClustVID that analyzes COVID-19-related public tweets to extract significant sentiments with high accuracy. We gathered COVID-19 Twitter datasets from the IEEE Dataport repository and employed a range of data preprocessing methods to clean the raw data, then applied tokenization and produced a word-to-index dictionary. Thereafter, different classifications were employed on these datasets which enabled the exploration of the performance of traditional classification and TClustVID. Our analysis found that TClustVID showed higher performance compared to traditional methodologies that are determined by clustering criteria. Finally, we extracted significant topics from the clusters, split them into positive, neutral and negative sentiments, and identified the most frequent topics using the proposed model. This approach is able to rapidly identify commonly prevailing aspects of public opinions and attitudes related to COVID-19 and infection prevention strategies spreading among different populations.
Multiple myeloma is largely incurable, despite development of therapies that target myeloma cell-intrinsic pathways. Disease relapse is thought to originate from dormant myeloma cells, localized in ...specialized niches, which resist therapy and repopulate the tumour. However, little is known about the niche, and how it exerts cell-extrinsic control over myeloma cell dormancy and reactivation. In this study, we track individual myeloma cells by intravital imaging as they colonize the endosteal niche, enter a dormant state and subsequently become activated to form colonies. We demonstrate that dormancy is a reversible state that is switched 'on' by engagement with bone-lining cells or osteoblasts, and switched 'off' by osteoclasts remodelling the endosteal niche. Dormant myeloma cells are resistant to chemotherapy that targets dividing cells. The demonstration that the endosteal niche is pivotal in controlling myeloma cell dormancy highlights the potential for targeting cell-extrinsic mechanisms to overcome cell-intrinsic drug resistance and prevent disease relapse.
Fast and accurate identification of phage virion proteins (PVPs) would greatly aid facilitation of antibacterial drug discovery and development. Although, several research efforts based on machine ...learning (ML) methods have been made for in silico identification of PVPs, these methods have certain limitations. Therefore, in this study, we propose a new computational approach, termed SCORPION, (StaCking-based Predictior fOR Phage VIrion PrOteiNs), to accurately identify PVPs using only protein primary sequences. Specifically, we explored comprehensive 13 different feature descriptors from different aspects (i.e., compositional information, composition-transition-distribution information, position-specific information and physicochemical properties) with 10 popular ML algorithms to construct a pool of optimal baseline models. These optimal baseline models were then used to generate probabilistic features (PFs) and considered as a new feature vector. Finally, we utilized a two-step feature selection strategy to determine the optimal PF feature vector and used this feature vector to develop a stacked model (SCORPION). Both tenfold cross-validation and independent test results indicate that SCORPION achieves superior predictive performance than its constitute baseline models and existing methods. We anticipate SCORPION will serve as a useful tool for the cost-effective and large-scale screening of new PVPs. The source codes and datasets for this work are available for downloading in the GitHub repository ( https://github.com/saeed344/SCORPION ).
In bone, depletion of osteoclasts reduces bone formation in vivo, as does osteal macrophage depletion. How osteoclasts and macrophages promote the action of bone forming osteoblasts is, however, ...unclear. Since recruitment and differentiation of multi-potential stromal cells/mesenchymal stem cells (MSC) generates new active osteoblasts, we investigated whether human osteoclasts and macrophages (generated from cord blood-derived hematopoietic progenitors) induce osteoblastic maturation in adipose tissue-derived MSC. When treated with an osteogenic stimulus (ascorbate, dexamethasone and β-glycerophosphate) these MSC form matrix-mineralising, alkaline phosphatase-expressing osteoblastic cells. Cord blood-derived progenitors were treated with macrophage colony stimulating factor (M-CSF) to form immature proliferating macrophages, or with M-CSF plus receptor activator of NFκB ligand (RANKL) to form osteoclasts; culture medium was conditioned for 3 days by these cells to study their production of osteoblastic factors. Both osteoclast- and macrophage-conditioned medium (CM) greatly enhanced MSC osteoblastic differentiation in both the presence and absence of osteogenic medium, evident by increased alkaline phosphatase levels within 4 days and increased mineralisation within 14 days. These CM effects were completely ablated by antibodies blocking gp130 or oncostatin M (OSM), and OSM was detectable in both CM. Recombinant OSM very potently stimulated osteoblastic maturation of these MSC and enhanced bone morphogenetic protein-2 (BMP-2) actions on MSC. To determine the influence of macrophage activation on this OSM-dependent activity, CM was collected from macrophage populations treated with M-CSF plus IL-4 (to induce alternative activation) or with GM-CSF, IFNγ and LPS to cause classical activation. CM from IL-4 treated macrophages stimulated osteoblastic maturation in MSC, while CM from classically-activated macrophages did not. Thus, macrophage-lineage cells, including osteoclasts but not classically activated macrophages, can strongly drive MSC-osteoblastic commitment in OSM-dependent manner. This supports the notion that eliciting gp130-dependent signals in human MSC would be a useful approach to increase bone formation.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Alzheimer's disease (AD) is the commonest progressive neurodegenerative condition in humans, and is currently incurable. A wide spectrum of comorbidities, including other neurodegenerative diseases, ...are frequently associated with AD. How AD interacts with those comorbidities can be examined by analysing gene expression patterns in affected tissues using bioinformatics tools. We surveyed public data repositories for available gene expression data on tissue from AD subjects and from people affected by neurodegenerative diseases that are often found as comorbidities with AD. We then utilized large set of gene expression data, cell-related data and other public resources through an analytical process to identify functional disease links. This process incorporated gene set enrichment analysis and utilized semantic similarity to give proximity measures. We identified genes with abnormal expressions that were common to AD and its comorbidities, as well as shared gene ontology terms and molecular pathways. Our methodological pipeline was implemented in the R platform as an open-source package and available at the following link: https://github.com/unchowdhury/AD_comorbidity. The pipeline was thus able to identify factors and pathways that may constitute functional links between AD and these common comorbidities by which they affect each others development and progression. This pipeline can also be useful to identify key pathological factors and therapeutic targets for other diseases and disease interactions.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Machine learning and data mining-based approaches to prediction and detection of heart disease would be of great clinical utility, but are highly challenging to develop. In most countries there is a ...lack of cardiovascular expertise and a significant rate of incorrectly diagnosed cases which could be addressed by developing accurate and efficient early-stage heart disease prediction by analytical support of clinical decision-making with digital patient records. This study aimed to identify machine learning classifiers with the highest accuracy for such diagnostic purposes. Several supervised machine-learning algorithms were applied and compared for performance and accuracy in heart disease prediction. Feature importance scores for each feature were estimated for all applied algorithms except MLP and KNN. All the features were ranked based on the importance score to find those giving high heart disease predictions. This study found that using a heart disease dataset collected from Kaggle three-classification based on k-nearest neighbor (KNN), decision tree (DT) and random forests (RF) algorithms the RF method achieved 100% accuracy along with 100% sensitivity and specificity. Thus, we found that a relatively simple supervised machine learning algorithm can be used to make heart disease predictions with very high accuracy and excellent potential utility.
•This research aims to find out some classifiers with the highest accuracy to predict heart disease.•Several supervised machine-learning algorithms were applied and compared their performance and accuracy.•Feature importance score for each feature is estimated for all the applied algorithms except MLP and KNN.•All the features are ranked based on the feature importance score to find highly predictive features.•KNN, DT, RF gained 100% accuracy along with 100% sensitivity and specificity.
Assessment of the cognitive functions and state of clinical subjects is an important aspect of e-health care delivery, and in the development of novel human-machine interfaces. A subject can display ...a range of emotions that significantly influence cognition, and emotion classification through the analysis of physiological signals is a key means of detecting emotion. Electroencephalography (EEG) signals have become a common focus of such development compared to other physiological signals because EEG employs simple and subject-acceptable methods for obtaining data that can be used for emotion analysis. We have therefore reviewed published studies that have used EEG signal data to identify possible interconnections between emotion and brain activity. We then describe theoretical conceptualization of basic emotions, and interpret the prevailing techniques that have been adopted for feature extraction, selection, and classification. Finally, we have compared the outcomes of these recent studies and discussed the likely future directions and main challenges for researchers developing EEG-based emotion analysis methods.
•This article reviews and compares existing studies on emotion detection using EEG signals.•This paper presents standard datasets, emotion elicitation materials, EEG devices, and the influences of artifacts on brain waves.•We then describe theoretical conceptualization of basic emotions, and interpret the prevailing techniques that have been adopted for feature extraction, selection, and classification.•We compared the outcomes of recent studies and discussed future directions and main challenges for developing methods for EEG-based emotion identification.
Coronavirus disease-19 (COVID-19) is a severe respiratory viral disease first reported in late 2019 that has spread worldwide. Although some wealthy countries have made significant progress in ...detecting and containing this disease, most underdeveloped countries are still struggling to identify COVID-19 cases in large populations. With the rising number of COVID-19 cases, there are often insufficient COVID-19 diagnostic kits and related resources in such countries. However, other basic diagnostic resources often do exist, which motivated us to develop Deep Learning models to assist clinicians and radiologists to provide prompt diagnostic support to the patients. In this study, we have developed a deep learning-based COVID-19 case detection model trained with a dataset consisting of chest CT scans and X-ray images. A modified ResNet50V2 architecture was employed as deep learning architecture in the proposed model. The dataset utilized to train the model was collected from various publicly available sources and included four class labels: confirmed COVID-19, normal controls and confirmed viral and bacterial pneumonia cases. The aggregated dataset was preprocessed through a sharpening filter before feeding the dataset into the proposed model. This model attained an accuracy of 96.452% for four-class cases (COVID-19/Normal/Bacterial pneumonia/Viral pneumonia), 97.242% for three-class cases (COVID-19/Normal/Bacterial pneumonia) and 98.954% for two-class cases (COVID-19/Viral pneumonia) using chest X-ray images. The model acquired a comprehensive accuracy of 99.012% for three-class cases (COVID-19/Normal/Community-acquired pneumonia) and 99.99% for two-class cases (Normal/COVID-19) using CT-scan images of the chest. This high accuracy presents a new and potentially important resource to enable radiologists to identify and rapidly diagnose COVID-19 cases with only basic but widely available equipment.
•Performing an effective preprocessing technique on collected images to make the dataset of images fit for the proposed deep learning model properly.•Developing an extended ResNet50V2 based deep learning model where finetuning has been performed to rapidly detect and diagnose COVID-19 infected cases with higher accuracy.•Classifying COVID-19 images from normal and typical pneumonia cases by considering two, three and four class categories respectively.•Conducting comparative performance analysis with o state-of-the-art research by using chest CT-scan and X-ray image datasets.