Dealing with missing data in data analysis is inevitable. Although powerful imputation methods that address this problem exist, there is still much room for improvement. In this study, we examined ...single imputation based on deep autoencoders, motivated by the apparent success of deep learning to efficiently extract useful dataset features. We have developed a consistent framework for both training and imputation. Moreover, we benchmarked the results against state-of-the-art imputation methods on different data sizes and characteristics. The work was not limited to the one-type variable dataset; we also imputed missing data with multi-type variables, e.g., a combination of binary, categorical, and continuous attributes. To evaluate the imputation methods, we randomly corrupted the complete data, with varying degrees of corruption, and then compared the imputed and original values. In all experiments, the developed autoencoder obtained the smallest error for all ranges of initial data corruption.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
There is an urgent need for biomarkers in plasma to identify Alzheimer's disease (AD). It has previously been shown that a signature of 18 plasma proteins can identify AD during pre-dementia and ...dementia stages (Ray et al, Nature Medicine, 2007). We quantified the same 18 proteins in plasma from 174 controls, 142 patients with AD, and 88 patients with other dementias. Only three of these proteins (EGF, PDGF-BB and MIP-1δ) differed significantly in plasma between controls and AD. The 18 proteins could classify patients with AD from controls with low diagnostic precision (area under the ROC curve was 63%). Moreover, they could not distinguish AD from other dementias. In conclusion, independent validation of results is important in explorative biomarker studies.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Heart transplantation is life saving for patients with end-stage heart disease. However, a number of factors influence how well recipients and donor organs tolerate this procedure. The main objective ...of this study was to develop and validate a flexible risk model for prediction of survival after heart transplantation using the largest transplant registry in the world.
We developed a flexible, non-linear artificial neural networks model (IHTSA) and classification and regression tree to comprehensively evaluate the impact of recipient-donor variables on survival over time. We analyzed 56,625 heart-transplanted adult patients, corresponding to 294,719 patient-years. We compared the discrimination power with three existing scoring models, donor risk index (DRI), risk-stratification score (RSS) and index for mortality prediction after cardiac transplantation (IMPACT). The accuracy of the model was excellent (C-index 0.600 95% CI: 0.595-0.604) with predicted versus actual 1-year, 5-year and 10-year survival rates of 83.7% versus 82.6%, 71.4%-70.8%, and 54.8%-54.3% in the derivation cohort; 83.7% versus 82.8%, 71.5%-71.1%, and 54.9%-53.8% in the internal validation cohort; and 84.5% versus 84.4%, 72.9%-75.6%, and 57.5%-57.5% in the external validation cohort. The IHTSA model showed superior or similar discrimination in all of the cohorts. The receiver operating characteristic area under the curve to predict one-year mortality was for the IHTSA: 0.650 (95% CI: 0.640-0.655), DRI 0.56 (95% CI: 0.56-0.57), RSS 0.61 (95% CI: 0.60-0.61), and IMPACT 0.61 (0.61-0.62), respectively. The decision-tree showed that recipients matched to a donor younger than 38 years had additional expected median survival time of 2.8 years. Furthermore, the number of suitable donors could be increased by up to 22%.
We show that the IHTSA model can be used to predict both short-term and long-term mortality with high accuracy globally. The model also estimates the expected benefit to the individual patient.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Surgical sentinel lymph node biopsy (SLNB) is routinely used to reliably stage axillary lymph nodes in early breast cancer (BC). However, SLNB may be associated with postoperative arm morbidities. ...For most patients with BC undergoing SLNB, the findings are benign, and the procedure is currently questioned. A decision-support tool for the prediction of benign sentinel lymph nodes based on preoperatively available data has been developed using artificial neural network modelling.
This was a retrospective geographical and temporal validation study of the noninvasive lymph node staging (NILS) model, based on preoperatively available data from 586 women consecutively diagnosed with primary BC at two sites. Ten preoperative clinicopathological characteristics from each patient were entered into the web-based calculator, and the probability of benign lymph nodes was predicted. The performance of the NILS model was assessed in terms of discrimination with the area under the receiver operating characteristic curve (AUC) and calibration, that is, comparison of the observed and predicted event rates of benign axillary nodal status (N0) using calibration slope and intercept. The primary endpoint was axillary nodal status (discrimination, benign N0 vs. metastatic axillary nodal status N+) determined by the NILS model compared to nodal status by definitive pathology.
The mean age of the women in the cohort was 65 years, and most of them (93%) had luminal cancers. Approximately three-fourths of the patients had no metastases in SLNB (N0 74% and 73%, respectively). The AUC for the predicted probabilities for the whole cohort was 0.6741 (95% confidence interval: 0.6255-0.7227). More than one in four patients (n = 151, 26%) were identified as candidates for SLNB omission when applying the predefined cut-off for lymph node-negative status from the development cohort. The NILS model showed the best calibration in patients with a predicted high probability of healthy axilla.
The performance of the NILS model was satisfactory. In approximately every fourth patient, SLNB could potentially be omitted. Considering the shift from postoperatively to preoperatively available predictors in this validation study, we have demonstrated the robustness of the NILS model. The clinical usability of the web interface will be evaluated before its clinical implementation.
Registered in the ISRCTN registry with study ID ISRCTN14341750. Date of registration 23/11/2018.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
In the present study, we aimed to evaluate the performance of machine learning (ML) models for identification of acute myocardial infarction (AMI) or death within 30 days among emergency department ...(ED) chest pain patients.
Using data from 9519 consecutive ED chest pain patients, we created ML models based on logistic regression or artificial neural networks. Model inputs included sex, age, ECG and the first blood tests at patient presentation: High sensitivity TnT (hs-cTnT), glucose, creatinine, and hemoglobin. For a safe rule-out, the models were adapted to achieve a sensitivity > 99% and a negative predictive value (NPV) > 99.5% for 30-day AMI/death. For rule-in, we set the models to achieve a specificity > 90% and a positive predictive value (PPV) of > 70%. The models were also compared with the 0 h arm of the European Society of Cardiology algorithm (ESC 0 h); An initial hs-cTnT < 5 ng/L for rule-out and ≥ 52 ng/L for rule-in. A convolutional neural network was the best model and identified 55% of the patients for rule-out and 5.3% for rule-in, while maintaining the required sensitivity, specificity, NPV and PPV levels. ESC 0 h failed to reach these performance levels.
An ML model based on age, sex, ECG and blood tests at ED arrival can identify six out of ten chest pain patients for safe early rule-out or rule-in with no need for serial blood tests. Future studies should attempt to improve these ML models further, e.g. by including additional input data.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
6.
The skåne emergency medicine (SEM) cohort Ekelund, Ulf; Ohlsson, Bodil; Melander, Olle ...
Scandinavian journal of trauma, resuscitation and emergency medicine,
04/2024, Volume:
32, Issue:
1
Journal Article
Peer reviewed
Open access
In the European Union alone, more than 100 million people present to the emergency department (ED) each year, and this has increased steadily year-on-year by 2-3%. Better patient management decisions ...have the potential to reduce ED crowding, the number of diagnostic tests, the use of inpatient beds, and healthcare costs.
We have established the Skåne Emergency Medicine (SEM) cohort for developing clinical decision support systems (CDSS) based on artificial intelligence or machine learning as well as traditional statistical methods. The SEM cohort consists of 325 539 unselected unique patients with 630 275 visits from January 1st, 2017 to December 31st, 2018 at eight EDs in the region Skåne in southern Sweden. Data on sociodemographics, previous diseases and current medication are available for each ED patient visit, as well as their chief complaint, test results, disposition and the outcome in the form of subsequent diagnoses, treatments, healthcare costs and mortality within a follow-up period of at least 30 days, and up to 3 years.
The SEM cohort provides a platform for CDSS research, and we welcome collaboration. In addition, SEM's large amount of real-world patient data with almost complete short-term follow-up will allow research in epidemiology, patient management, diagnostics, prognostics, ED crowding, resource allocation, and social medicine.
To implement artificial neural network (ANN) algorithms for noninvasive lymph node staging (NILS) to a decision support tool and facilitate the option to omit surgical axillary staging in breast ...cancer patients with low-risk of nodal metastasis.
The NILS tool is a further development of an ANN prototype for the prediction of nodal status. Training and internal validation of the original algorithm included 15 clinical and tumor-related variables from a consecutive cohort of 800 breast cancer cases. The updated NILS tool included 10 top-ranked input variables from the original prototype. A workflow with four ANN pathways was additionally developed to allow different combinations of missing preoperative input values. Predictive performances were assessed by area under the receiver operating characteristics curves (AUC) and sensitivity/specificity values at defined cut-points. Clinical utility was presented by estimating possible sentinel lymph node biopsy (SLNB) reduction rates. The principles of user-centered design were applied to develop an interactive web-interface to predict the patient's probability of healthy lymph nodes. A technical validation of the interface was performed using data from 100 test patients selected to cover all combinations of missing histopathological input values.
ANN algorithms for the prediction of nodal status have been implemented into the web-based NILS tool for personalized, noninvasive nodal staging in breast cancer. The estimated probability of healthy lymph nodes using the interface showed a complete concordance with estimations from the reference algorithm except in two cases that had been wrongly included (ineligible for the technical validation). NILS predictive performance to distinguish node-negative from node-positive disease, also with missing values, displayed AUC ranged from 0.718 (95% CI, 0.687-0.748) to 0.735 (95% CI, 0.704-0.764), with good calibration. Sensitivity 90% and specificity 34% were demonstrated. The potential to abstain from axillary surgery was observed in 26% of patients using the NILS tool, acknowledging a false negative rate of 10%, which is clinically accepted for the standard SLNB technique.
The implementation of NILS into a web-interface are expected to provide the health care with decision support and facilitate preoperative identification of patients who could be good candidates to avoid unnecessary surgical axillary staging.
We investigate a new method to place patients into risk groups in censored survival data. Properties such as median survival time, and end survival rate, are implicitly improved by optimizing the ...area under the survival curve. Artificial neural networks (ANN) are trained to either maximize or minimize this area using a genetic algorithm, and combined into an ensemble to predict one of low, intermediate, or high risk groups. Estimated patient risk can influence treatment choices, and is important for study stratification. A common approach is to sort the patients according to a prognostic index and then group them along the quartile limits. The Cox proportional hazards model (Cox) is one example of this approach. Another method of doing risk grouping is recursive partitioning (Rpart), which constructs a decision tree where each branch point maximizes the statistical separation between the groups. ANN, Cox, and Rpart are compared on five publicly available data sets with varying properties. Cross-validation, as well as separate test sets, are used to validate the models. Results on the test sets show comparable performance, except for the smallest data set where Rpart's predicted risk groups turn out to be inverted, an example of crossing survival curves. Cross-validation shows that all three models exhibit crossing of some survival curves on this small data set but that the ANN model manages the best separation of groups in terms of median survival time before such crossings. The conclusion is that optimizing the area under the survival curve is a viable approach to identify risk groups. Training ANNs to optimize this area combines two key strengths from both prognostic indices and Rpart. First, a desired minimum group size can be specified, as for a prognostic index. Second, the ability to utilize non-linear effects among the covariates, which Rpart is also able to do.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Molecular mechanisms employed by individual multipotent cells at the point of lineage commitment remain largely uncharacterized. Current paradigms span from instructive to noise-driven mechanisms. Of ...considerable interest is also whether commitment involves a limited set of genes or the entire transcriptional program, and to what extent gene expression configures multiple trajectories into commitment. Importantly, the transient nature of the commitment transition confounds the experimental capture of committing cells. We develop a computational framework that simulates stochastic commitment events, and affords mechanistic exploration of the fate transition. We use a combined modeling approach guided by gene expression classifier methods that infers a time-series of stochastic commitment events from experimental growth characteristics and gene expression profiling of individual hematopoietic cells captured immediately before and after commitment. We define putative regulators of commitment and probabilistic rules of transition through machine learning methods, and employ clustering and correlation analyses to interrogate gene regulatory interactions in multipotent cells. Against this background, we develop a Monte Carlo time-series stochastic model of transcription where the parameters governing promoter status, mRNA production and mRNA decay in multipotent cells are fitted to experimental static gene expression distributions. Monte Carlo time is converted to physical time using cell culture kinetic data. Probability of commitment in time is a function of gene expression as defined by a logistic regression model obtained from experimental single-cell expression data. Our approach should be applicable to similar differentiating systems where single cell data is available. Within our system, we identify robust model solutions for the multipotent population within physiologically reasonable values and explore model predictions with regard to molecular scenarios of entry into commitment. The model suggests distinct dependencies of different commitment-associated genes on mRNA dynamics and promoter activity, which globally influence the probability of lineage commitment.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Abstract Background/Aims: Artificial neural networks (ANNs) are non-linear pattern recognition techniques, which can be used as a tool in medical decision-making. The aim of this study was to ...construct and validate an ANN model for early prediction of the severity of acute pancreatitis (AP). Methods: Patients treated for AP from 2002 to 2005 (n = 139) and from 2007 to 2009 (n = 69) were analyzed to develop and validate the ANN model. Severe AP was defined according to the Atlanta criteria. Results: ANNs selected 6 of 23 potential risk variables as relevant for severity prediction, including duration of pain until arrival at the emergency department, creatinine, hemoglobin, alanine aminotransferase, heart rate, and white blood cell count. The discriminatory power for prediction of progression to a severe course, determined from the area under the receiver-operating characteristic curve, was 0.92 for the ANN model, 0.84 for the logistic regression model (p = 0.030), and 0.63 for the APACHE II score (p < 0.001). The numbers of correctly classified patients for a sensitivity of 50 and 75% were significantly higher for the ANN model than for logistic regression (p = 0.002) and APACHE II (p!0.001). Conclusion: TheANN model identified 6 risk variables available at the time of admission, including duration of pain, a finding not being presented as a risk factor before. The severity classification developed proved to be superior to APACHE II.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UL, UM, UPCLJ, UPUK