Sleep is critical to a person's physical and mental health, but there are few studies systematically assessing risk factors for sleep disorders.
The objective of this study was to identify risk ...factors for a sleep disorder through machine-learning and assess this methodology.
A retrospective, cross-sectional cohort study using the publicly available National Health and Nutrition Examination Survey (NHANES) was conducted in patients who completed the demographic, dietary, exercise, and mental health questionnaire and had laboratory and physical exam data.
A physician diagnosis of insomnia was the outcome of this study. Univariate logistic models, with insomnia as the outcome, were used to identify covariates that were associated with insomnia. Covariates that had a p<0.0001 on univariate analysis were included within the final machine-learning model. The machine learning model XGBoost was used due to its prevalence within the literature as well as its increased predictive accuracy in healthcare prediction. Model covariates were ranked according to the cover statistic to identify risk factors for insomnia. Shapely Additive Explanations (SHAP) were utilized to visualize the relationship between these potential risk factors and insomnia.
Of the 7,929 patients that met the inclusion criteria in this study, 4,055 (51% were female, 3,874 (49%) were male. The mean age was 49.2 (SD = 18.4), with 2,885 (36%) White patients, 2,144 (27%) Black patients, 1,639 (21%) Hispanic patients, and 1,261 (16%) patients of another race. The machine learning model had 64 out of a total of 684 features that were found to be significant on univariate analysis (P<0.0001 used). These were fitted into the XGBoost model and an AUROC = 0.87, Sensitivity = 0.77, Specificity = 0.77 were observed. The top four highest ranked features by cover, a measure of the percentage contribution of the covariate to the overall model prediction, were the Patient Health Questionnaire depression survey (PHQ-9) (Cover = 31.1%), age (Cover = 7.54%), physician recommendation of exercise (Cover = 3.86%), weight (Cover = 2.99%), and waist circumference (Cover = 2.70%).
Machine learning models can effectively predict risk for a sleep disorder using demographic, laboratory, physical exam, and lifestyle covariates and identify key risk factors.
Machine learning methods are widely used within the medical field. However, the reliability and efficacy of these models is difficult to assess, making it difficult for researchers to identify which ...machine-learning model to apply to their dataset. We assessed whether variance calculations of model metrics (e.g., AUROC, Sensitivity, Specificity) through bootstrap simulation and SHapely Additive exPlanations (SHAP) could increase model transparency and improve model selection. Data from the England National Health Services Heart Disease Prediction Cohort was used. After comparison of model metrics for XGBoost, Random Forest, Artificial Neural Network, and Adaptive Boosting, XGBoost was used as the machine-learning model of choice in this study. Boost-strap simulation (N = 10,000) was used to empirically derive the distribution of model metrics and covariate Gain statistics. SHapely Additive exPlanations (SHAP) to provide explanations to machine-learning output and simulation to evaluate the variance of model accuracy metrics. For the XGBoost modeling method, we observed (through 10,000 completed simulations) that the AUROC ranged from 0.771 to 0.947, a difference of 0.176, the balanced accuracy ranged from 0.688 to 0.894, a 0.205 difference, the sensitivity ranged from 0.632 to 0.939, a 0.307 difference, and the specificity ranged from 0.595 to 0.944, a 0.394 difference. Among 10,000 simulations completed, we observed that the gain for Angina ranged from 0.225 to 0.456, a difference of 0.231, for Cholesterol ranged from 0.148 to 0.326, a difference of 0.178, for maximum heart rate (MaxHR) ranged from 0.081 to 0.200, a range of 0.119, and for Age ranged from 0.059 to 0.157, difference of 0.098. Use of simulations to empirically evaluate the variability of model metrics and explanatory algorithms to observe if covariates match the literature are necessary for increased transparency, reliability, and utility of machine learning methods. These variance statistics, combined with model accuracy statistics can help researchers identify the best model for a given dataset.
Identification of associations between the obese category of weight in the general US population will continue to advance our understanding of the condition and allow clinicians, providers, ...communities, families, and individuals make more informed decisions. This study aims to improve the prediction of the obese category of weight and investigate its relationships with factors, ultimately contributing to healthier lifestyle choices and timely management of obesity.
Questionnaires that included demographic, dietary, exercise and health information from the US National Health and Nutrition Examination Survey (NHANES 2017-2020) were utilized with BMI 30 or higher defined as obesity. A machine learning model, XGBoost predicted the obese category of weight and Shapely Additive Explanations (SHAP) visualized the various covariates and their feature importance. Model statistics including Area under the receiver operator curve (AUROC), sensitivity, specificity, positive predictive value, negative predictive value and feature properties such as gain, cover, and frequency were measured. SHAP explanations were created for transparent and interpretable analysis.
There were 6,146 adults (age > 18) that were included in the study with average age 58.39 (SD = 12.94) and 3122 (51%) females. The machine learning model had an Area under the receiver operator curve of 0.8295. The top four covariates include waist circumference (gain = 0.185), GGT (gain = 0.101), platelet count (gain = 0.059), AST (gain = 0.057), weight (gain = 0.049), HDL cholesterol (gain = 0.032), and ferritin (gain = 0.034).
In conclusion, the utilization of machine learning models proves to be highly effective in accurately predicting the obese category of weight. By considering various factors such as demographic information, laboratory results, physical examination findings, and lifestyle factors, these models successfully identify crucial risk factors associated with the obese category of weight.
There is a continual push for developing accurate predictors for Intensive Care Unit (ICU) admitted heart failure (HF) patients and in-hospital mortality.
The study aimed to utilize transparent ...machine learning and create hierarchical clustering of key predictors based off of model importance statistics gain, cover, and frequency.
Inclusion criteria of complete patient information for in-hospital mortality in the ICU with HF from the MIMIC-III database were randomly divided into a training (n = 941, 80%) and test (n = 235, 20%). A grid search was set to find hyperparameters. Machine Learning with XGBoost were used to predict mortality followed by feature importance with Shapely Additive Explanations (SHAP) and hierarchical clustering of model metrics with a dendrogram and heat map.
Of the 1,176 heart failure ICU patients that met inclusion criteria for the study, 558 (47.5%) were males. The mean age was 74.05 (SD = 12.85). XGBoost model had an area under the receiver operator curve of 0.662. The highest overall SHAP explanations were urine output, leukocytes, bicarbonate, and platelets. Average urine output was 1899.28 (SD = 1272.36) mL/day with the hospital mortality group having 1345.97 (SD = 1136.58) mL/day and the group without hospital mortality having 1986.91 (SD = 1271.16) mL/day. The average leukocyte count in the cohort was 10.72 (SD = 5.23) cells per microliter. For the hospital mortality group the leukocyte count was 13.47 (SD = 7.42) cells per microliter and for the group without hospital mortality the leukocyte count was 10.28 (SD = 4.66) cells per microliter. The average bicarbonate value was 26.91 (SD = 5.17) mEq/L. Amongst the group with hospital mortality the average bicarbonate value was 24.00 (SD = 5.42) mEq/L. Amongst the group without hospital mortality the average bicarbonate value was 27.37 (SD = 4.98) mEq/L. The average platelet value was 241.52 platelets per microliter. For the group with hospital mortality the average platelet value was 216.21 platelets per microliter. For the group without hospital mortality the average platelet value was 245.47 platelets per microliter. Cluster 1 of the dendrogram grouped the temperature, platelets, urine output, Saturation of partial pressure of Oxygen (SPO2), Leukocyte count, lymphocyte count, bicarbonate, anion gap, respiratory rate, PCO2, BMI, and age as most similar in having the highest aggregate gain, cover, and frequency metrics.
Machine Learning models that incorporate dendrograms and heat maps can offer additional summaries of model statistics in differentiating factors between in patient ICU mortality in heart failure patients.
Asthma attacks are a major cause of morbidity and mortality in vulnerable populations, and identification of associations with asthma attacks is necessary to improve public awareness and the timely ...delivery of medical interventions.
The study aimed to identify feature importance of factors associated with asthma in a representative population of US adults.
A cross-sectional analysis was conducted using a modern, nationally representative cohort, the National Health and Nutrition Examination Surveys (NHANES 2017-2020). All adult patients greater than 18 years of age (total of 7,922 individuals) with information on asthma attacks were included in the study. Univariable regression was used to identify significant nutritional covariates to be included in a machine learning model and feature importance was reported. The acquisition and analysis of the data were authorized by the National Center for Health Statistics Ethics Review Board.
7,922 patients met the inclusion criteria in this study. The machine learning model had 55 out of a total of 680 features that were found to be significant on univariate analysis (P<0.0001 used). In the XGBoost model the model had an Area Under the Receiver Operator Characteristic Curve (AUROC) = 0.737, Sensitivity = 0.960, NPV = 0.967. The top five highest ranked features by gain, a measure of the percentage contribution of the covariate to the overall model prediction, were Octanoic Acid intake as a Saturated Fatty Acid (SFA) (gm) (Gain = 8.8%), Eosinophil percent (Gain = 7.9%), BMXHIP-Hip Circumference (cm) (Gain = 7.2%), BMXHT-standing height (cm) (Gain = 6.2%) and HS C-Reactive Protein (mg/L) (Gain 6.1%).
Machine Learning models can additionally offer feature importance and additional statistics to help identify associations with asthma attacks.
Coronary artery disease (CAD) is the leading cause of death in both developed and developing nations. The objective of this study was to identify risk factors for coronary artery disease through ...machine-learning and assess this methodology. A retrospective, cross-sectional cohort study using the publicly available National Health and Nutrition Examination Survey (NHANES) was conducted in patients who completed the demographic, dietary, exercise, and mental health questionnaire and had laboratory and physical exam data. Univariate logistic models, with CAD as the outcome, were used to identify covariates that were associated with CAD. Covariates that had a p<0.0001 on univariate analysis were included within the final machine-learning model. The machine learning model XGBoost was used due to its prevalence within the literature as well as its increased predictive accuracy in healthcare prediction. Model covariates were ranked according to the Cover statistic to identify risk factors for CAD. Shapely Additive Explanations (SHAP) explanations were utilized to visualize the relationship between these potential risk factors and CAD. Of the 7,929 patients that met the inclusion criteria in this study, 4,055 (51%) were female, 2,874 (49%) were male. The mean age was 49.2 (SD = 18.4), with 2,885 (36%) White patients, 2,144 (27%) Black patients, 1,639 (21%) Hispanic patients, and 1,261 (16%) patients of other race. A total of 338 (4.5%) of patients had coronary artery disease. These were fitted into the XGBoost model and an AUROC = 0.89, Sensitivity = 0.85, Specificity = 0.87 were observed (Fig 1). The top four highest ranked features by cover, a measure of the percentage contribution of the covariate to the overall model prediction, were age (Cover = 21.1%), Platelet count (Cover = 5.1%), family history of heart disease (Cover = 4.8%), and Total Cholesterol (Cover = 4.1%). Machine learning models can effectively predict coronary artery disease using demographic, laboratory, physical exam, and lifestyle covariates and identify key risk factors.
Abstract
Heterologous prime-boost COVID-19 vaccine strategy may facilitate mass COVID-19 immunization. We reported early immunogenicity and safety outcomes of heterologous immunization with a viral ...vector vaccine (ChAdOx1) and a spike-2P subunit vaccine (MVC-COV1901) in a participant-blinded, randomized, non-inferiority trial (NCT05054621). A total of 100 healthy adults aged 20–70 years having the first dose of ChAdOx1 were 1:1 randomly assigned to receive a booster dose either with ChAdOx1 (
n
= 50) or MVC-COV1901 (
n
= 50) at an interval of 4–6 or 8–10 weeks. At day 28 post-boosting, the neutralizing antibody geometric mean titer against wild-type SARS-CoV-2 in MVC-COV1901 recipients (236 IU/mL) was superior to that in ChAdOx1 recipients (115 IU/mL), with a GMT ratio of 2.1 (95% CI, 1.4 to 2.9). Superiority in the neutralizing antibody titer against Delta variant was also found for heterologous MVC-COV1901 immunization with a GMT ratio of 2.6 (95% CI, 1.8 to 3.8). Both spike-specific antibody-secreting B and T cell responses were substantially enhanced by the heterologous schedule. Heterologous boosting was particularly prominent at a short prime-boost interval. No serious adverse events occurred across all groups. The findings support the use of heterologous prime-boost with ChAdOx1 and protein-based subunit vaccines.
Ascites refers to the abnormal accumulation of fluid in the peritoneum resulting from an underlying pathology, such as metastatic cancer. Among all cancers, advanced-stage epithelial ovarian cancer ...is most frequently associated with the production of malignant ascites and is the leading cause of death from gynecologic malignancies. Despite decades of evidence showing that the accumulation of peritoneal fluid portends the poorest outcomes for cancer patients, the role of malignant ascites in promoting metastasis and therapy resistance remains poorly understood. This review summarizes the current understanding of malignant ascites, with a focus on ovarian cancer. The first section provides an overview of heterogeneity in ovarian cancer and the pathophysiology of malignant ascites. Next, analytical methods used to characterize the cellular and acellular components of malignant ascites, as well the role of these components in modulating cell biology, are discussed. The review then provides a perspective on the pressures and forces that tumors are subjected to in the presence of malignant ascites and the impact of physical stress on therapy resistance. Treatment options for malignant ascites, including surgical, pharmacological and photochemical interventions are then discussed to highlight challenges and opportunities at the interface of drug discovery, device development and physical sciences in oncology.
A central enigma in epigenetics is how epigenetic factors are guided to specific genomic sites for their function. Previously, we reported that a Piwi-piRNA complex associates with the ...piRNA-complementary site in the Drosophila genome and regulates its epigenetic state. Here, we report that Piwi-piRNA complexes bind to numerous piRNA-complementary sequences throughout the genome, implicating piRNAs as a major mechanism that guides Piwi and Piwi-associated epigenetic factors to program the genome. To test this hypothesis, we demonstrate that inserting piRNA-complementary sequences to an ectopic site leads to Piwi, HP1a, and Su(var)3-9 recruitment to the site as well as H3K9me2/3 enrichment and reduced RNA polymerase II association, indicating that piRNA is both necessary and sufficient to recruit Piwi and epigenetic factors to specific genomic sites. Piwi deficiency drastically changed the epigenetic landscape and polymerase II profile throughout the genome, revealing the Piwi-piRNA mechanism as a major epigenetic programming mechanism in Drosophila.
Display omitted
► Piwi-piRNA complexes bind to numerous piRNA-complementary sequences in the genome ► piRNA is necessary and sufficient to recruit Piwi, HP1, and HMT to its target site ► Piwi deficiency drastically changes the epigenetic landscape in the genome ► The Piwi-piRNA mechanism is a major epigenetic programming mechanism in Drosophila
Huang et al. characterize genomic recruitment of epigenetic factors guided by piRNAs. They find that Piwi/piRNAs represent a major epigenetic mechanism exerting strong global effects. This work addresses a central enigma in epigenetics—how epigenetic factors, most of which do not bind DNA, are guided to specific genomic sites.
Internal tandem duplication (ITD) mutation in Fms-like tyrosine kinase 3 gene (FLT3/ITD) represents an unfavorable genetic change in acute myeloid leukemia (AML) and is associated with poor ...prognosis. Metabolic alterations have been involved in tumor progression and attracted interest as a target for therapeutic intervention. However, few studies analyzed the adaptations of cellular metabolism in the context of FLT3/ITD mutation. Here, we report that FLT3/ITD causes a significant increase in aerobic glycolysis through AKT-mediated upregulation of mitochondrial hexokinase (HK2), and renders the leukemia cells highly dependent on glycolysis and sensitive to pharmacological inhibition of glycolytic activity. Inhibition of glycolysis preferentially causes severe ATP depletion and massive cell death in FLT3/ITD leukemia cells. Glycolytic inhibitors significantly enhances the cytotoxicity induced by FLT3 tyrosine kinase inhibitor sorafenib. Importantly, such combination provides substantial therapeutic benefit in a murine model bearing FLT3/ITD leukemia. Our study suggests that FLT3/ITD mutation promotes Warburg effect, and such metabolic alteration can be exploited to develop effective therapeutic strategy for treatment of AML with FLT3/ITD mutation via metabolic intervention.