Background Severe, symptomatic aortic stenosis (AS) is associated with poor prognoses. However, early detection of AS is difficult because of the long asymptomatic period experienced by many ...patients, during which screening tools are ineffective. The aim of this study was to develop and validate a deep learning-based algorithm, combining a multilayer perceptron and convolutional neural network, for detecting significant AS using ECGs. Methods and Results This retrospective cohort study included adult patients who had undergone both ECG and echocardiography. A deep learning-based algorithm was developed using 39 371 ECGs. Internal validation of the algorithm was performed with 6453 ECGs from one hospital, and external validation was performed with 10 865 ECGs from another hospital. The end point was significant AS (beyond moderate). We used demographic information, features, and 500-Hz, 12-lead ECG raw data as predictive variables. In addition, we identified which region had the most significant effect on the decision-making of the algorithm using a sensitivity map. During internal and external validation, the areas under the receiver operating characteristic curve of the deep learning-based algorithm using 12-lead ECG for detecting significant AS were 0.884 (95% CI, 0.880-0.887) and 0.861 (95% CI, 0.858-0.863), respectively; those using a single-lead ECG signal were 0.845 (95% CI, 0.841-0.848) and 0.821 (95% CI, 0.816-0.825), respectively. The sensitivity map showed the algorithm focused on the T wave of the precordial lead to determine the presence of significant AS. Conclusions The deep learning-based algorithm demonstrated high accuracy for significant AS detection using both 12-lead and single-lead ECGs.
Background
Heart disease (HD) is the leading cause of global death; there are several mortality prediction models of HD for identifying critically‐ill patients and for guiding decision making. The ...existing models, however, cannot be used during initial treatment or screening. This study aimed to derive and validate an echocardiography‐based mortality prediction model for HD using deep learning (DL).
Methods
In this multicenter retrospective cohort study, the subjects were admitted adult (age ≥ 18 years) HD patients who underwent echocardiography. The outcome was in‐hospital mortality. We extracted predictor variables from echocardiography reports using text mining. We developed deep learning‐based prediction model using derivation data of a hospital A. And we conducted external validation using echocardiography report of hospital B. We conducted subgroup analysis of coronary heart disease (CHD) and heart failure (HF) patients of hospital B and compared DL with the currently used predictive models (eg, Global Registry of Acute Coronary Events (GRACE) score, Thrombolysis in Myocardial Infarction score (TIMI), Meta‐Analysis Global Group in Chronic Heart Failure (MAGGIC) score, and Get With The Guidelines‐Heart Failure (GWTG‐HF) score).
Results
The study subjects comprised 25 776 patients with 1026 mortalities. The areas under the receiver operating characteristic curve (AUROC) of the DL model were 0.912, 0.898, 0.958, and 0.913 for internal validation, external validation, CHD, and HF, respectively, and these results significantly outperformed other comparison models.
Conclusions
This echocardiography‐based deep learning model predicted in‐hospital mortality among HD patients more accurately than existing prediction models and other machine learning models.
Rapid diagnosis of myocardial infarction (MI) using electrocardiography (ECG) is the cornerstone of effective treatment and prevention of mortality; however, conventional interpretation methods has ...low reliability for detecting MI and is difficulty to apply to limb 6-lead ECG based life type or wearable devices. We developed and validated a deep learning-based artificial intelligence algorithm (DLA) for detecting MI using 6-lead ECG. A total of 412,461 ECGs were used to develop a variational autoencoder (VAE) that reconstructed precordial 6-lead ECG using limb 6-lead ECG. Data from 9536, 1301, and 1768 ECGs of adult patients who underwent coronary angiography within 24 h from each ECG were used for development, internal and external validation, respectively. During internal and external validation, the area under the receiver operating characteristic curves of the DLA with VAE using a 6-lead ECG were 0.880 and 0.854, respectively, and the performances were preserved by the territory of the coronary lesion. Our DLA successfully detected MI using a 12-lead ECG or a 6-lead ECG. The results indicate that MI could be detected not only with a conventional 12 lead ECG but also with a life type 6-lead ECG device that employs our DLA.
This study aimed to develop and validate deep-learning-based artificial intelligence algorithm for predicting mortality of AHF (DAHF).
12,654 dataset from 2165 patients with AHF in two hospitals were ...used as train data for DAHF development, and 4759 dataset from 4759 patients with AHF in 10 hospitals enrolled to the Korean AHF registry were used as performance test data. The endpoints were in-hospital, 12-month, and 36-month mortality. We compared the DAHF performance with the Get with the Guidelines-Heart Failure (GWTG-HF) score, Meta-Analysis Global Group in Chronic Heart Failure (MAGGIC) score, and other machine-learning models by using the test data. Area under the receiver operating characteristic curve of the DAHF were 0.880 (95% confidence interval, 0.876-0.884) for predicting in-hospital mortality; these results significantly outperformed those of the GWTG-HF (0.728 0.720-0.737) and other machine-learning models. For predicting 12- and 36-month endpoints, DAHF (0.782 and 0.813) significantly outperformed MAGGIC score (0.718 and 0.729). During the 36-month follow-up, the high-risk group, defined by the DAHF, had a significantly higher mortality rate than the low-risk group(p<0.001).
DAHF predicted the in-hospital and long-term mortality of patients with AHF more accurately than the existing risk scores and other machine-learning models.
Conventional risk stratification models for mortality of acute myocardial infarction (AMI) have potential limitations. This study aimed to develop and validate deep-learning-based risk stratification ...for the mortality of patients with AMI (DAMI).
The data of 22,875 AMI patients from the Korean working group of the myocardial infarction (KorMI) registry were exclusively divided into 12,152 derivation data of 36 hospitals and 10,723 validation data of 23 hospitals. The predictor variables were the initial demographic and laboratory data. The endpoints were in-hospital mortality and 12-months mortality. We compared the DAMI performance with the global registry of acute coronary event (GRACE) score, acute coronary treatment and intervention outcomes network (ACTION) score, and the thrombolysis in myocardial infarction (TIMI) score using the validation data.
In-hospital mortality for the study subjects was 4.4% and 6-month mortality after survival upon discharge was 2.2%. The areas under the receiver operating characteristic curves (AUCs) of the DAMI were 0.905 95% confidence interval 0.902-0.909 and 0.870 0.865-0.876 for the ST elevation myocardial infarction (STEMI) and non ST elevation myocardial infarction (NSTEMI) patients, respectively; these results significantly outperformed those of the GRACE (0.851 0.846-0.856, 0.810 0.803-0.819), ACTION (0.852 0.847-0.857, 0.806 0.799-0.814 and TIMI score (0.781 0.775-0.787, 0.5930.585-0.603). DAMI predicted 30.9% of patients more accurately than the GRACE score. As secondary outcome, during the 6-month follow-up, the high risk group, defined by the DAMI, has a significantly higher mortality rate than the low risk group (17.1% vs. 0.5%, p < 0.001).
The DAMI predicted in-hospital mortality and 12-month mortality of AMI patients more accurately than the existing risk scores and other machine-learning methods.
Abstract
Aims
Although left ventricular hypertrophy (LVH) has a high incidence and clinical importance, the conventional diagnosis criteria for detecting LVH using electrocardiography (ECG) has not ...been satisfied. We aimed to develop an artificial intelligence (AI) algorithm for detecting LVH.
Methods and results
This retrospective cohort study involved the review of 21 286 patients who were admitted to two hospitals between October 2016 and July 2018 and underwent 12-lead ECG and echocardiography within 4 weeks. The patients in one hospital were divided into a derivation and internal validation dataset, while the patients in the other hospital were included in only an external validation dataset. An AI algorithm based on an ensemble neural network (ENN) combining convolutional and deep neural network was developed using the derivation dataset. And we visualized the ECG area that the AI algorithm used to make the decision. The area under the receiver operating characteristic curve of the AI algorithm based on ENN was 0.880 (95% confidence interval 0.877–0.883) and 0.868 (0.865–0.871) during the internal and external validations. These results significantly outperformed the cardiologist’s clinical assessment with Romhilt-Estes point system and Cornell voltage criteria, Sokolov-Lyon criteria, and interpretation of ECG machine. At the same specificity, the AI algorithm based on ENN achieved 159.9%, 177.7%, and 143.8% higher sensitivities than those of the cardiologist’s assessment, Sokolov-Lyon criteria, and interpretation of ECG machine.
Conclusion
An AI algorithm based on ENN was highly able to detect LVH and outperformed cardiologists, conventional methods, and other machine learning techniques.
Out-of-hospital cardiac arrest (OHCA) is a major healthcare burden, and prognosis is critical in decision-making for treatment and the withdrawal of life-sustaining therapy. This study aimed to ...develop and validate a deep-learning-based out-of-hospital cardiac arrest prognostic system (DCAPS) for predicting neurologic recovery and survival to discharge.
The study subjects were patients from the Korea OHCA registry who experienced return of spontaneous circulation (ROSC) after OHCA. A total of 36,190 patients were exclusively divided into a set of 28,045 subjects for derivation data and 8,145 subjects for validation data. We used information available for the time of ROSC as predictor variables, and the endpoints were neurologic recovery (cerebral performance category 1 or 2) and survival to discharge. The DCAPS was developed using the derivation data and represented the favorability of prognosis with a score between 0 and 100.
The area under the receiver operating characteristic curve (AUROC) of DCAPS for predicting neurologic recovery for the validation data was 0.953 95% confidence interval 0.952–0.954; these results significantly outperformed those of logistic regression (0.947 0.943–0.948), random forest (0.943 0.942–0.945), support vector machine (0.930 0.929–0.932), and conventional methods of a previous study (0.817 0.815–0.820). The AUROC of the DCAPS for survival to discharge was 0.901 0.900–0.903, and this result significantly outperformed those of other models as well.
The DCAPS predicted neurologic recovery and survival to discharge of OHCA patients accurately and outperformed the conventional method and other machine-learning methods.
Early detection and intervention of atrial fibrillation (AF) is a cornerstone for effective treatment and prevention of mortality. Diverse deep learning models (DLMs) have been developed, but they ...could not be applied in clinical practice owing to their lack of interpretability. We developed an explainable DLM to detect AF using ECG and validated its performance using diverse formats of ECG.
We conducted a retrospective study. The Sejong ECG dataset comprising 128,399 ECGs was used to develop and internally validated the explainable DLM. DLM was developed with two feature modules, which could describe the reason for DLM decisions. DLM was external validated using data from 21,837, 10,605, and 8528 ECGs from PTB-XL, Chapman, and PhysioNet non-restricted datasets, respectively. The predictor variables were digitally stored ECGs, and the endpoints were AFs.
During internal and external validation of the DLM, the area under the receiver operating characteristic curves (AUCs) of the DLM using a 12‑lead ECG in detecting AF were 0.997–0.999. The AUCs of the DLM with VAE using a 6‑lead and single‑lead ECG were 0.990–0.999. The AUCs of explainability about features such as rhythm irregularity and absence of P-wave were 0.961–0.993 and 0.983–0.993, respectively.
Our DLM successfully detected AF using diverse ECGs and described the reason for this decision. The results indicated that an explainable artificial intelligence methodology could be adopted to the DLM using ECG and enhance the transparency of the DLM for its application in clinical practice.
•Explainable deep learning model detected atrial fibrillation using electrocardiogram.•Explainable deep learning model could describe the reason for this decision.•Explainable model enhances the transparency for its application in clinical practice.
Introduction
The detection and monitoring of electrolyte imbalance is essential for appropriate management of many metabolic diseases; however, there is no tool that detects such imbalances reliably ...and noninvasively. In this study, we developed a deep learning model (DLM) using electrocardiography (ECG) for detecting electrolyte imbalance and validated its performance in a multicenter study.
Methods and Results
This retrospective cohort study included two hospitals: 92,140 patients who underwent a laboratory electrolyte examination and an ECG within 30 min were included in this study. A DLM was developed using 83,449 ECGs of 48,356 patients; the internal validation included 12,091 ECGs of 12,091 patients. We conducted an external validation with 31,693 ECGs of 31,693 patients from another hospital, and the result was electrolyte imbalance detection. During internal, the area under the receiving operating characteristic curve (AUC) of a DLM using a 12‐lead ECG for detecting hyperkalemia, hypokalemia, hypernatremia, hyponatremia, hypercalcemia, and hypocalcemia were 0.945, 0.866, 0.944, 0.885, 0.905, and 0.901, respectively. The values during external validation of the AUC of hyperkalemia, hypokalemia, hypernatremia, hyponatremia, hypercalcemia, and hypocalcemia were 0.873, 0.857, 0.839, 0.856, 0.831, and 0.813 respectively. The DLM helped to visualize the important ECG region for detecting each electrolyte imbalance, and it showed how the P wave, QRS complex, or T wave differs in importance in detecting each electrolyte imbalance.
Conclusion
The proposed DLM demonstrated high performance in detecting electrolyte imbalance. These results suggest that a DLM can be used for detecting and monitoring electrolyte imbalance using ECG on a daily basis.
Screening and early diagnosis for heart failure (HF) are critical. However, conventional screening diagnostic methods have limitations, and electrocardiography (ECG)-based HF identification may be ...helpful. This study aimed to develop and validate a deep-learning algorithm for ECG-based HF identification (DEHF).
The study involved 2 hospitals and 55,163 ECGs of 22,765 patients who performed echocardiography within 4 weeks were study subjects. ECGs were divided into derivation and validation data. Demographic and ECG features were used as predictive variables. The primary endpoint was detection of HF with reduced ejection fraction (HFrEF; ejection fraction EF≤40%), and the secondary endpoint was HF with mid-range to reduced EF (≤50%). We developed the DEHF using derivation data and the algorithm representing the risk of HF between 0 and 1. We confirmed accuracy and compared logistic regression (LR) and random forest (RF) analyses using validation data.
The area under the receiver operating characteristic curves (AUROCs) of DEHF for identification of HFrEF were 0.843 (95% confidence interval, 0.840-0.845) and 0.889 (0.887-0.891) for internal and external validation, respectively, and these results significantly outperformed those of LR (0.800 0.797-0.803, 0.847 0.844-0.850) and RF (0.807 0.804-0.810, 0.853 0.850-0.855) analyses. The AUROCs of deep learning for identification of the secondary endpoint was 0.821 (0.819-0.823) and 0.850 (0.848-0.852) for internal and external validation, respectively, and these results significantly outperformed those of LR and RF.
The deep-learning algorithm accurately identified HF using ECG features and outperformed other machine-learning methods.