This study evaluated the added predictive value of combining clinical information and myocardial perfusion single-photon emission computed tomography (SPECT) imaging (MPI) data using machine learning ...(ML) to predict major adverse cardiac events (MACE).
Traditionally, prognostication by MPI has relied on visual or quantitative analysis of images without objective consideration of the clinical data. ML permits a large number of variables to be considered in combination and at a level of complexity beyond the human clinical reader.
A total of 2,619 consecutive patients (48% men; 62 ± 13 years of age) who underwent exercise (38%) or pharmacological stress (62%) with high-speed SPECT MPI were monitored for MACE. Twenty-eight clinical variables, 17 stress test variables, and 25 imaging variables (including total perfusion deficit TPD) were recorded. Areas under the receiver-operating characteristic curve (AUC) for MACE prediction were compared among: 1) ML with all available data (ML-combined); 2) ML with only imaging data (ML-imaging); 3) 5-point scale visual diagnosis (physician MD diagnosis); and 4) automated quantitative imaging analysis (stress TPD and ischemic TPD). ML involved automated variable selection by information gain ranking, model building with a boosted ensemble algorithm, and 10-fold stratified cross validation.
During follow-up (3.2 ± 0.6 years), 239 patients (9.1%) had MACE. MACE prediction was significantly higher for ML-combined than ML-imaging (AUC: 0.81 vs. 0.78; p < 0.01). ML-combined also had higher predictive accuracy compared with MD diagnosis, automated stress TPD, and automated ischemic TPD (AUC: 0.81 vs. 0.65 vs. 0.73 vs. 0.71, respectively; p < 0.01 for all). Risk reclassification for ML-combined compared with visual MD diagnosis was 26% (p < 0.001).
ML combined with both clinical and imaging data variables was found to have high predictive accuracy for 3-year risk of MACE and was superior to existing visual or automated perfusion assessments. ML could allow integration of clinical and imaging data for personalized MACE risk computations in patients undergoing SPECT MPI.
Display omitted
Combined analysis of SPECT myocardial perfusion imaging (MPI) performed with a solid-state camera on patients in 2 positions (semiupright, supine) is routinely used to mitigate attenuation artifacts. ...We evaluated the prediction of obstructive disease from combined analysis of semiupright and supine stress MPI by deep learning (DL) as compared with standard combined total perfusion deficit (TPD).
1,160 patients without known coronary artery disease (64% male) were studied. Patients underwent stress
Tc-sestamibi MPI with new-generation solid-state SPECT scanners in 4 different centers. All patients had on-site clinical reads and invasive coronary angiography correlations within 6 mo of MPI. Obstructive disease was defined as at least 70% narrowing of the 3 major coronary arteries and at least 50% for the left main coronary artery. Images were quantified at Cedars-Sinai. The left ventricular myocardium was segmented using standard clinical nuclear cardiology software. The contour placement was verified by an experienced technologist. Combined stress TPD was computed using sex- and camera-specific normal limits. DL was trained using polar distributions of normalized radiotracer counts, hypoperfusion defects, and hypoperfusion severities and was evaluated for prediction of obstructive disease in a novel leave-one-center-out cross-validation procedure equivalent to external validation. During the validation procedure, 4 DL models were trained using data from 3 centers and then evaluated on the 1 center left aside. Predictions for each center were merged to have an overall estimation of the multicenter performance.
718 (62%) patients and 1,272 of 3,480 (37%) arteries had obstructive disease. The area under the receiver operating characteristics curve for prediction of disease on a per-patient and per-vessel basis by DL was higher than for combined TPD (per-patient, 0.81 vs. 0.78; per-vessel, 0.77 vs. 0.73;
< 0.001). With the DL cutoff set to exhibit the same specificity as the standard cutoff for combined TPD, per-patient sensitivity improved from 61.8% (TPD) to 65.6% (DL) (
< 0.05), and per-vessel sensitivity improved from 54.6% (TPD) to 59.1% (DL) (
< 0.01). With the threshold matched to the specificity of a normal clinical read (56.3%), DL had a sensitivity of 84.8%, versus 82.6% for an on-site clinical read (
= 0.3).
DL improves automatic interpretation of MPI as compared with current quantitative methods.
The study evaluated the automatic prediction of obstructive disease from myocardial perfusion imaging (MPI) by deep learning as compared with total perfusion deficit (TPD).
Deep convolutional neural ...networks trained with a large multicenter population may provide improved prediction of per-patient and per-vessel coronary artery disease from single-photon emission computed tomography MPI.
A total of 1,638 patients (67% men) without known coronary artery disease, undergoing stress
Tc-sestamibi or tetrofosmin MPI with new generation solid-state scanners in 9 different sites, with invasive coronary angiography performed within 6 months of MPI, were studied. Obstructive disease was defined as ≥70% narrowing of coronary arteries (≥50% for left main artery). Left ventricular myocardium was segmented using clinical nuclear cardiology software and verified by an expert reader. Stress TPD was computed using sex- and camera-specific normal limits. Deep learning was trained using raw and quantitative polar maps and evaluated for prediction of obstructive stenosis in a stratified 10-fold cross-validation procedure.
A total of 1,018 (62%) patients and 1,797 of 4,914 (37%) arteries had obstructive disease. Area under the receiver-operating characteristic curve for disease prediction by deep learning was higher than for TPD (per patient: 0.80 vs. 0.78; per vessel: 0.76 vs. 0.73: p < 0.01). With deep learning threshold set to the same specificity as TPD, per-patient sensitivity improved from 79.8% (TPD) to 82.3% (deep learning) (p < 0.05), and per-vessel sensitivity improved from 64.4% (TPD) to 69.8% (deep learning) (p < 0.01).
Deep learning has the potential to improve automatic interpretation of MPI as compared with current clinical methods.
Optimal risk stratification with machine learning (ML) from myocardial perfusion imaging (MPI) includes both clinical and imaging data. While most imaging variables can be derived automatically, ...clinical variables require manual collection, which is time-consuming and prone to error. We determined the fewest manually input and imaging variables required to maintain the prognostic accuracy for major adverse cardiac events (MACE) in patients undergoing a single-photon emission computed tomography (SPECT) MPI.
This study included 20 414 patients from the multicentre REFINE SPECT registry and 2984 from the University of Calgary for training and external testing of the ML models, respectively. ML models were trained using all variables (ML-All) and all image-derived variables (including age and sex, ML-Image). Next, ML models were sequentially trained by incrementally adding manually input and imaging variables to baseline ML models based on their importance ranking. The fewest variables were determined as the ML models (ML-Reduced, ML-Minimum, and ML-Image-Reduced) that achieved comparable prognostic performance to ML-All and ML-Image. Prognostic accuracy of the ML models was compared with visual diagnosis, stress total perfusion deficit (TPD), and traditional multivariable models using area under the receiver-operating characteristic curve (AUC). ML-Minimum (AUC 0.798) obtained comparable prognostic accuracy to ML-All (AUC 0.799, P = 0.19) by including 12 of 40 manually input variables and 11 of 58 imaging variables. ML-Reduced achieved comparable accuracy (AUC 0.796) with a reduced set of manually input variables and all imaging variables. In external validation, the ML models also obtained comparable or higher prognostic accuracy than traditional multivariable models.
Reduced ML models, including a minimum set of manually collected or imaging variables, achieved slightly lower accuracy compared to a full ML model but outperformed standard interpretation methods and risk models. ML models with fewer collected variables may be more practical for clinical implementation.
Abstract
Aims
To optimize per-vessel prediction of early coronary revascularization (ECR) within 90 days after fast single-photon emission computed tomography (SPECT) myocardial perfusion imaging ...(MPI) using machine learning (ML) and introduce a method for a patient-specific explanation of ML results in a clinical setting.
Methods and results
A total of 1980 patients with suspected coronary artery disease (CAD) underwent stress/rest 99mTc-sestamibi/tetrofosmin MPI with new-generation SPECT scanners were included. All patients had invasive coronary angiography within 6 months after SPECT MPI. ML utilized 18 clinical, 9 stress test, and 28 imaging variables to predict per-vessel and per-patient ECR with 10-fold cross-validation. Area under the receiver operator characteristics curve (AUC) of ML was compared with standard quantitative analysis total perfusion deficit (TPD) and expert interpretation. ECR was performed in 958 patients (48%). Per-vessel, the AUC of ECR prediction by ML (AUC 0.79, 95% confidence interval (CI) 0.77, 0.80) was higher than by regional stress TPD (0.71, 0.70, 0.73), combined-view stress TPD (AUC 0.71, 95% CI 0.69, 0.72), or ischaemic TPD (AUC 0.72, 95% CI 0.71, 0.74), all P < 0.001. Per-patient, the AUC of ECR prediction by ML (AUC 0.81, 95% CI 0.79, 0.83) was higher than that of stress TPD, combined-view TPD, and ischaemic TPD, all P < 0.001. ML also outperformed nuclear cardiologists’ expert interpretation of MPI for the prediction of early revascularization performance. A method to explain ML prediction for an individual patient was also developed.
Conclusion
In patients with suspected CAD, the prediction of ECR by ML outperformed automatic MPI quantitation by TPDs (per-vessel and per-patient) or nuclear cardiologists’ expert interpretation (per-patient).
Explainable artificial intelligence (AI) can be integrated within standard clinical software to facilitate the acceptance of the diagnostic findings during clinical interpretation.
This study sought ...to develop and evaluate a novel, general purpose, explainable deep learning model (coronary artery disease–deep learning CAD-DL) for the detection of obstructive CAD following single-photon emission computed tomography (SPECT) myocardial perfusion imaging (MPI).
A total of 3,578 patients with suspected CAD undergoing SPECT MPI and invasive coronary angiography within a 6-month interval from 9 centers were studied. CAD-DL computes the probability of obstructive CAD from stress myocardial perfusion, wall motion, and wall thickening maps, as well as left ventricular volumes, age, and sex. Myocardial regions contributing to the CAD-DL prediction are highlighted to explain the findings to the physician. A clinical prototype was integrated using a standard clinical workstation. Diagnostic performance by CAD-DL was compared to automated quantitative total perfusion deficit (TPD) and reader diagnosis.
In total, 2,247 patients (63%) had obstructive CAD. In 10-fold repeated testing, the area under the receiver-operating characteristic curve (AUC) (95% CI) was higher according to CAD-DL (AUC: 0.83 95% CI: 0.82-0.85) than stress TPD (AUC: 0.78 95% CI: 0.77-0.80) or reader diagnosis (AUC: 0.71 95% CI: 0.69-0.72; P < 0.0001 for both). In external testing, the AUC in 555 patients was higher according to CAD-DL (AUC: 0.80 95% CI: 0.76-0.84) than stress TPD (AUC: 0.73 95% CI: 0.69-0.77) or reader diagnosis (AUC: 0.65 95% CI: 0.61-0.69; P < 0.001 for all). The present model can be integrated within standard clinical software and generates results rapidly (<12 seconds on a standard clinical workstation) and therefore could readily be incorporated into a typical clinical workflow.
The deep-learning model significantly surpasses the diagnostic accuracy of standard quantitative analysis and clinical visual reading for MPI. Explainable artificial intelligence can be integrated within standard clinical software to facilitate acceptance of artificial intelligence diagnosis of CAD following MPI.
Display omitted
This study compared the ability of automated myocardial perfusion imaging analysis to predict major adverse cardiac events (MACE) to that of visual analysis.
Quantitative analysis has not been ...compared with clinical visual analysis in prognostic studies.
A total of 19,495 patients from the multicenter REFINE SPECT (REgistry of Fast Myocardial Perfusion Imaging with NExt generation SPECT) study (64 ± 12 years of age, 56% males) undergoing stress Tc-99m-labeled single-photon emission computed tomography (SPECT) myocardial perfusion imaging were followed for 4.5 ± 1.7 years for MACE. Perfusion abnormalities were assessed visually and categorized as normal, probably normal, equivocal, or abnormal. Stress total perfusion deficit (TPD), quantified automatically, was categorized as TPD = 0%, TPD >0% to <1%, ≤1% to <3%, ≤3% to <5%, ≤5% to ≤10%, or TPD >10%. MACE consisted of death, nonfatal myocardial infarction, unstable angina, or late revascularization (>90 days). Kaplan-Meier and Cox proportional hazards analyses were performed to test the performance of visual and quantitative assessments in predicting MACE.
During follow-up examinations, 2,760 (14.2%) MACE occurred. MACE rates increased with worsening of visual assessments, that is, the rate for normal MACE was 2.0%, 3.2% for probably normal, 4.2% for equivocal, and 7.4% for abnormal (all p < 0.001). MACE rates increased with increasing stress TPD from 1.3% for the TPD category of 0% to 7.8% for the TPD category of >10% (p < 0.0001). The adjusted hazard ratio (HR) for MACE increased even in equivocal assessment (HR: 1.56; 95% confidence interval CI: 1.37 to 1.78) and in the TPD category of ≤3% to <5% (HR: 1.74; 95% CI: 1.41 to 2.14; all p < 0.001). The rate of MACE in patients visually assessed as normal still increased from 1.3% (TPD = 0%) to 3.4% (TPD ≥5%) (p < 0.0001).
Quantitative analysis allows precise granular risk stratification in comparison to visual reading, even for cases with normal clinical reading.
Artificial intelligence may improve accuracy of myocardial perfusion imaging (MPI) but will likely be implemented as an aid to physician interpretation rather than an autonomous tool. Deep learning ...(DL) has high standalone diagnostic accuracy for obstructive coronary artery disease (CAD), but its influence on physician interpretation is unknown. We assessed whether access to explainable DL predictions improves physician interpretation of MPI.
We selected a representative cohort of patients who underwent MPI with reference invasive coronary angiography. Obstructive CAD, defined as stenosis ≥50% in the left main artery or ≥70% in other coronary segments, was present in half of the patients. We used an explainable DL model (CAD-DL), which was previously developed in a separate population from different sites. Three physicians interpreted studies first with clinical history, stress, and quantitative perfusion, then with all the data plus the DL results. Diagnostic accuracy was assessed using area under the receiver-operating-characteristic curve (AUC).
In total, 240 patients with a median age of 65 y (interquartile range 58-73) were included. The diagnostic accuracy of physician interpretation with CAD-DL (AUC 0.779) was significantly higher than that of physician interpretation without CAD-DL (AUC 0.747,
= 0.003) and stress total perfusion deficit (AUC 0.718,
< 0.001). With matched specificity, CAD-DL had higher sensitivity when operating autonomously compared with readers without DL results (
< 0.001), but not compared with readers interpreting with DL results (
= 0.122). All readers had numerically higher accuracy with CAD-DL, with AUC improvement 0.02-0.05, and interpretation with DL resulted in overall net reclassification improvement of 17.2% (95% CI 9.2%-24.4%,
< 0.001).
Explainable DL predictions lead to meaningful improvements in physician interpretation; however, the improvement varied across the readers, reflecting the acceptance of this new technology. This technique could be implemented as an aid to physician diagnosis, improving the diagnostic accuracy of MPI.
We aim to establish a multicenter registry collecting clinical, imaging, and follow-up data for patients who undergo myocardial perfusion imaging (MPI) with the latest generation SPECT scanners.
...REFINE SPECT (REgistry of Fast Myocardial Perfusion Imaging with NExt generation SPECT) uses a collaborative design with multicenter contribution of clinical data and images into a comprehensive clinical-imaging database. All images are processed by quantitative software. Over 290 individual imaging variables are automatically extracted from each image dataset and merged with clinical variables. In the prognostic cohort, patient follow-up is performed for major adverse cardiac events. In the diagnostic cohort (patients with correlating invasive angiography), angiography and revascularization results within 6 months are obtained.
To date, collected prognostic data include scans from 20,418 patients in 5 centers (57% male, 64.0 ± 12.1 years) who underwent exercise (48%) or pharmacologic stress (52%). Diagnostic data include 2079 patients in 9 centers (67% male, 64.7 ± 11.2 years) who underwent exercise (39%) or pharmacologic stress (61%).
The REFINE SPECT registry will provide a resource for collaborative projects related to the latest generation SPECT-MPI. It will aid in the development of new artificial intelligence tools for automated diagnosis and prediction of prognostic outcomes.
Abstract
Aims
Single-photon emission computed tomography (SPECT) myocardial perfusion imaging (MPI) stress-only protocols reduce radiation exposure and cost but require clinicians to make immediate ...decisions regarding rest scan cancellation. We developed a machine learning (ML) approach for automatic rest scan cancellation and evaluated its prognostic safety.
Methods and results
In total, 20 414 patients from a solid-state SPECT MPI international multicentre registry with clinical data and follow-up for major adverse cardiac events (MACE) were used to train ML for MACE prediction as a continuous probability (ML score), using 10-fold repeated hold-out testing to separate test from training data. Three ML score thresholds (ML1, ML2, and ML3) were derived by matching the cancellation rates achieved by physician interpretation and two clinical selection rules. Annual MACE rates were compared in patients selected for rest scan cancellation between approaches. Patients selected for rest scan cancellation with ML had lower annualized MACE rates than those selected by physician interpretation or clinical selection rules (ML1 vs. physician interpretation: 1.4 ± 0.1% vs. 2.1 ± 0.1%; ML2 vs. clinical selection: 1.5 ± 0.1% vs. 2.0 ± 0.1%; ML3 vs. stringent clinical selection: 0.6 ± 0.1% vs. 1.7 ± 0.1%, all P < 0.0001) at matched cancellation rates (60 ± 0.7, 64 ± 0.7, and 30 ± 0.6%). Annualized all-cause mortality rates in populations recommended for rest cancellation by physician interpretation, clinical selection approaches were higher (1.3%, 1.2%, and 1.0%, respectively) compared with corresponding ML thresholds (0.6%, 0.6%, and 0.2%).
Conclusion
ML, using clinical and stress imaging data, can be used to automatically recommend cancellation of rest SPECT MPI scans, while ensuring higher prognostic safety than current clinical approaches.