Purpose
Considerable progress has been made in the assessment and management of non-small cell lung cancer (NSCLC) patients based on mutation status in the epidermal growth factor receptor (EGFR) and ...Kirsten rat sarcoma viral oncogene (KRAS). At the same time, NSCLC management through KRAS and EGFR mutation profiling faces challenges. In the present work, we aimed to evaluate a comprehensive radiomics framework that enabled prediction of EGFR and KRAS mutation status in NSCLC patients based on radiomic features from low-dose computed tomography (CT), contrast-enhanced diagnostic quality CT (CTD), and positron emission tomography (PET) imaging modalities and use of machine learning algorithms.
Methods
Our study involved NSCLC patients including 150 PET, low-dose CT, and CTD images. Radiomic features from original and preprocessed (including 64 bin discretizing, Laplacian-of-Gaussian (LOG), and Wavelet) images were extracted. Conventional clinically used standard uptake value (SUV) parameters and metabolic tumor volume (MTV) were also obtained from PET images. Highly correlated features were pre-eliminated, and false discovery rate (FDR) correction was performed with the resulting q-values reported for univariate analysis. Six feature selection methods and 12 classifiers were then used for multivariate prediction of gene mutation status (provided by polymerase chain reaction (PCR)) in patients. We performed 10-fold cross-validation for model tuning to improve robustness, and our developed models were assessed on an independent validation set with 68 patients (common in all three imaging modalities). The average area under the receiver operator characteristic curve (AUC) was utilized for performance evaluation.
Results
The best predictive power for conventional PET parameters was achieved by SUV
peak
(AUC 0.69,
p
value = 0.0002) and MTV (AUC 0.55,
p
value = 0.0011) for EGFR and KRAS, respectively. Univariate analysis of extracted radiomics features improved AUC performance to 0.75 (q-value 0.003, Short-Run Emphasis feature of GLRLM from LOG preprocessed image of PET with sigma value 1.5) and 0.71 (q-value 0.00005, Large Dependence Low Gray-Level Emphasis feature of GLDM in LOG preprocessed image of CTD with sigma value 5) for EGFR and KRAS, respectively. Furthermore, multivariate machine learning-based AUC performances were significantly improved to 0.82 for EGFR (LOG preprocessed image of PET with sigma 3 with variance threshold (VT) feature selector and stochastic gradient descent (SGD) classifier (q-value = 4.86E-05) and 0.83 for KRAS (LOG preprocessed image of CT with sigma 3.5 with select model (SM) feature selector and SGD classifier (q-value = 2.81E−09).
Conclusion
Our work demonstrated that non-invasive and reliable radiomics analysis can be successfully used to predict EGFR and KRAS mutation status in NSCLC patients. We demonstrated that radiomic features extracted from different image-feature sets could be used for EGFR and KRAS mutation status prediction in NSCLC patients and showed improved predictive power relative to conventional image-derived metrics.
Purpose
To identify optimal classification methods for computed tomography (CT) radiomics-based preoperative prediction of clear cell renal cell carcinoma (ccRCC) grade.
Materials and methods
...Seventy-one ccRCC patients (31 low grade and 40 high grade) were included in this study. Tumors were manually segmented on CT images followed by the application of three image preprocessing techniques (Laplacian of Gaussian, wavelet filter, and discretization of the intensity values) on delineated tumor volumes. Overall, 2530 radiomics features (tumor shape and size, intensity statistics, and texture) were extracted from each segmented tumor volume. Univariate analysis was performed to assess the association between each feature and the histological condition. Multivariate analysis involved the use of machine learning (ML) algorithms and the following three feature selection algorithms: the least absolute shrinkage and selection operator, Student’s
t
test, and minimum Redundancy Maximum Relevance. These selected features were then used to construct three classification models (SVM, random forest, and logistic regression) to discriminate high from low-grade ccRCC at nephrectomy. Lastly, multivariate model performance was evaluated on the bootstrapped validation cohort using the area under the receiver operating characteristic curve (AUC) metric.
Results
The univariate analysis demonstrated that among the different image sets, 128 bin-discretized images have statistically significant different texture parameters with a mean AUC of 0.74 ± 3 (
q
value < 0.05). The three ML-based classifiers showed proficient discrimination between high and low-grade ccRCC. The AUC was 0.78 for logistic regression, 0.62 for random forest, and 0.83 for the SVM model, respectively.
Conclusion
CT radiomic features can be considered as a useful and promising noninvasive methodology for preoperative evaluation of ccRCC Fuhrman grades.
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OBVAL, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
This study aimed to predict methylation status of the O6 methylguanine-DNA methyltransferase (MGMT) gene promoter status by using magnetic resonance imaging radiomics features, as well as univariate ...and multivariate analysis.
Eighty-two patients who had an MGMT methylation status were included in this study. Tumors were manually segmented in the 4 regions of magnetic resonance images, 1) whole tumor, 2) active/enhanced region, 3) necrotic regions, and 4) edema regions. About 7000 radiomics features were extracted for each patient. Feature selection and classifier were used to predict MGMT status through different machine learning algorithms. The area under the curve (AUC) of the receiver operating characteristic curve was used for model evaluations.
Regarding univariate analysis, the Inverse Variance feature From Gray Level Co-occurrence Matrix in whole tumor segment with 4.5 mm Sigma of Laplacian of Gaussian filter with AUC of 0.71 (P value = 0.002) was found to be the best predictor. For multivariate analysis, the Decision Tree classifier with Select from Model feature selector and LOG (Laplacian of Gaussian) filter in edema region had the highest performance (AUC, 0.78), followed by Ada-Boost classifier with Select from Model feature selector and LOG filter in edema region (AUC, 0.74).
This study showed that radiomics using machine learning algorithms is a feasible noninvasive approach to predict MGMT methylation status in patients with glioblastoma multiforme cancer.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
To investigate the impact of harmonization on the performance of CT, PET, and fused PET/CT radiomic features toward the prediction of mutations status, for epidermal growth factor receptor (EGFR) and ...Kirsten rat sarcoma viral oncogene (KRAS) genes in non-small cell lung cancer (NSCLC) patients.
Radiomic features were extracted from tumors delineated on CT, PET, and wavelet fused PET/CT images obtained from 136 histologically proven NSCLC patients. Univariate and multivariate predictive models were developed using radiomic features before and after ComBat harmonization to predict EGFR and KRAS mutation statuses. Multivariate models were built using minimum redundancy maximum relevance feature selection and random forest classifier. We utilized 70/30% splitting patient datasets for training/testing, respectively, and repeated the procedure 10 times. The area under the receiver operator characteristic curve (AUC), accuracy, sensitivity, and specificity were used to assess model performance. The performance of the models (univariate and multivariate), before and after ComBat harmonization was compared using statistical analyses.
While the performance of most features in univariate modeling was significantly improved for EGFR prediction, most features did not show any significant difference in performance after harmonization in KRAS prediction. Average AUCs of all multivariate predictive models for both EGFR and KRAS were significantly improved (q-value < 0.05) following ComBat harmonization. The mean ranges of AUCs increased following harmonization from 0.87–0.90 to 0.92–0.94 for EGFR, and from 0.85–0.90 to 0.91–0.94 for KRAS. The highest performance was achieved by harmonized F_R0.66_W0.75 model with AUC of 0.94, and 0.93 for EGFR and KRAS, respectively.
Our results demonstrated that regarding univariate modelling, while ComBat harmonization had generally a better impact on features for EGFR compared to KRAS status prediction, its effect is feature-dependent. Hence, no systematic effect was observed. Regarding the multivariate models, ComBat harmonization significantly improved the performance of all radiomics models toward more successful prediction of EGFR and KRAS mutation statuses in lung cancer patients. Thus, by eliminating the batch effect in multi-centric radiomic feature sets, harmonization is a promising tool for developing robust and reproducible radiomics using vast and variant datasets.
Display omitted
•We examine the impact of harmonization on multimodality PET/CT radiomics for EGFR and KRAS mutation prediction in NSCLC.•Harmonization had overall a better impact on features for EGFR prediction compared to KRAS, in univariate analysis.•Harmonization effect is feature dependent without a systematic effect on performance, in univariate analysis.•Harmonization significantly improved the performance of multivariate models for EGFR and KRAS mutation prediction.•Harmonization is promising for batch-effect elimination, for robust and reproducible biological status capturing.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
The aim of this work was to assess the robustness of cardiac SPECT radiomic features against changes in imaging settings, including acquisition, and reconstruction parameters.
Four commercial SPECT ...and SPECT/CT cameras were used to acquire images of a static cardiac phantom mimicking typical myorcardial perfusion imaging using 185 MBq of 99mTc. The effects of different image acquisition and reconstruction parameters, including number of views, view matrix size, attenuation correction, as well as image reconstruction related parameters (algorithm, number of iterations, number of subsets, type of post-reconstruction filter, and its associated parameters, including filter order and cut-off frequency) were studied. In total, 5,063 transverse views were reconstructed by varying the aforementioned factors. Eighty-seven radiomic features including first-, second-, and high-order textures were extracted from these images. To assess reproducibility and repeatability, the coefficient of variation (COV), as a widely adopted metric, was measured for each of the radiomic features over the different imaging settings.
The Inverse Difference Moment Normalized (IDMN) and Inverse Difference Normalized (IDN) features from the Gray Level Co-occurrence Matrix (GLCM), Run Percentage (RP) from the Gray Level Co-occurrence Matrix (GLRLM), Zone Entropy (ZE) from the Gray Level Size Zone Matrix (GLSZM), and Dependence Entropy (DE) from the Gray Level Dependence Matrix (GLDM) feature sets were the only features that exhibited high reproducibility (COV ≤ 5%) against changes in all imaging settings. In addition, Large Area Low Gray Level Emphasis (LALGLE), Small Area Low Gray Level Emphasis (SALGLE) and Low Gray Level Zone Emphasis (LGLZE) from GLSZM, and Small Dependence Low Gray Level Emphasis (SDLGLE) from GLDM feature sets turned out to be less reproducible (COV > 20%) against changes in imaging settings. The GLRLM (31.88%) and GLDM feature set (54.2%) had the highest (COV < 5%) and lowest (COV > 20%) number of the reproducible features, respectively. Matrix size had the largest impact on feature variability as most of the features were not repeatable when matrix size was modified with 82.8% of them having a COV > 20%.
The repeatability and reproducibility of SPECT/CT cardiac radiomic features under different imaging settings is feature-dependent. Different image acquisition and reconstruction protocols have variable effects on radiomic features. The radiomic features exhibiting low COV are potential candidates for future clinical studies.
Full text
Available for:
EMUNI, FZAB, GEOZS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Abstract
This study intends to predict in-hospital and 6-month mortality, as well as 30-day and 90-day hospital readmission, using Machine Learning (ML) approach via conventional features. A total of ...737 patients remained after applying the exclusion criteria to 1101 heart failure patients. Thirty-four conventional features were collected for each patient. First, the data were divided into train and test cohorts with a 70–30% ratio. Then train data were normalized using the Z-score method, and its mean and standard deviation were applied to the test data. Subsequently, Boruta, RFE, and MRMR feature selection methods were utilized to select more important features in the training set. In the next step, eight ML approaches were used for modeling. Next, hyperparameters were optimized using tenfold cross-validation and grid search in the train dataset. All model development steps (normalization, feature selection, and hyperparameter optimization) were performed on a train set without touching the hold-out test set. Then, bootstrapping was done 1000 times on the hold-out test data. Finally, the obtained results were evaluated using four metrics: area under the ROC curve (AUC), accuracy (ACC), specificity (SPE), and sensitivity (SEN). The RFE-LR (AUC: 0.91, ACC: 0.84, SPE: 0.84, SEN: 0.83) and Boruta-LR (AUC: 0.90, ACC: 0.85, SPE: 0.85, SEN: 0.83) models generated the best results in terms of in-hospital mortality. In terms of 30-day rehospitalization, Boruta-SVM (AUC: 0.73, ACC: 0.81, SPE: 0.85, SEN: 0.50) and MRMR-LR (AUC: 0.71, ACC: 0.68, SPE: 0.69, SEN: 0.63) models performed the best. The best model for 3-month rehospitalization was MRMR-KNN (AUC: 0.60, ACC: 0.63, SPE: 0.66, SEN: 0.53) and regarding 6-month mortality, the MRMR-LR (AUC: 0.61, ACC: 0.63, SPE: 0.44, SEN: 0.66) and MRMR-NB (AUC: 0.59, ACC: 0.61, SPE: 0.48, SEN: 0.63) models outperformed the others. Reliable models were developed in 30-day rehospitalization and in-hospital mortality using conventional features and ML techniques. Such models can effectively personalize treatment, decision-making, and wiser budget allocation. Obtained results in 3-month rehospitalization and 6-month mortality endpoints were not astonishing and further experiments with additional information are needed to fetch promising results in these endpoints.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
This study aimed to investigate the value of clinical, radiomic features extracted from gross tumor volumes (GTVs) delineated on CT images, dose distributions (Dosiomics), and fusion of CT and dose ...distributions to predict outcomes in head and neck cancer (HNC) patients.
A cohort of 240 HNC patients from five different centers was obtained from The Cancer Imaging Archive. Seven strategies, including four non-fusion (Clinical, CT, Dose, DualCT-Dose), and three fusion algorithms (latent low-rank representation referred (LLRR),Wavelet, weighted least square (WLS)) were applied. The fusion algorithms were used to fuse the pre-treatment CT images and 3-dimensional dose maps. Overall, 215 radiomics and Dosiomics features were extracted from the GTVs, alongside with seven clinical features incorporated. Five feature selection (FS) methods in combination with six machine learning (ML) models were implemented. The performance of the models was quantified using the concordance index (CI) in one-center-leave-out 5-fold cross-validation for overall survival (OS) prediction considering the time-to-event.
The mean CI and Kaplan-Meier curves were used for further comparisons. The CoxBoost ML model using the Minimal Depth (MD) FS method and the glmnet model using the Variable hunting (VH) FS method showed the best performance with CI = 0.73 ± 0.15 for features extracted from LLRR fused images. In addition, both glmnet-Cindex and Coxph-Cindex classifiers achieved a CI of 0.72 ± 0.14 by employing the dose images (+ incorporated clinical features) only.
Our results demonstrated that clinical features, Dosiomics and fusion of dose and CT images by specific ML-FS models could predict the overall survival of HNC patients with acceptable accuracy. Besides, the performance of ML methods among the three different strategies was almost comparable.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Objectives
This study aims to use ultrasound derived features as biomarkers to assess the malignancy of thyroid nodules in patients who were candidates for FNA according to the ACR TI-RADS ...guidelines.
Methods
Two hundred and ten patients who met the selection criteria were enrolled in the study and subjected to ultrasound-guided FNA of thyroid nodules. Different radiomics features were extracted from sonographic images, including intensity, shape, and texture feature sets. Least Absolute Shrinkage and Selection Operator (LASSO), Minimum Redundancy Maximum Relevance (MRMR), and Random Forests/Extreme Gradient Boosting Machine (XGBoost) algorithms were used for feature selection and classification of the univariate and multivariate modeling, respectively. Evaluation of models performed using accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve (AUC).
Results
In the univariate analysis, Gray Level Run Length Matrix - Run-Length Non-Uniformity (GLRLM-RLNU) and gray-level zone length matrix - Run-Length Non-Uniformity (GLZLM-GLNU) (both with an AUC of 0.67) were top-performing for predicting nodules malignancy. In the multivariate analysis of the training dataset, the AUC of all combinations of feature selection algorithms and classifiers was 0.99, and the highest sensitivity was for XGBoost classifier and MRMR feature selection algorithms (0.99). Finally, the test dataset was used to evaluate our model in which XGBoost classifier with MRMR and LASSO feature selection algorithms had the highest performance (AUC = 0.95).
Conclusions
Ultrasound-extracted features can be used as non-invasive biomarkers for thyroid nodules’ malignancy prediction.
Key Points
Ultrasound imaging features can be used for thyroid nodules’ malignancy prediction.
The quantitative features of ultrasound images can be used as non-invasive biomarkers.
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Abstract
This study aimed to investigate the diagnostic performance of machine learning-based radiomics analysis to diagnose coronary artery disease status and risk from rest/stress Myocardial ...Perfusion Imaging (MPI) single-photon emission computed tomography (SPECT). A total of 395 patients suspicious of coronary artery disease who underwent 2-day stress-rest protocol MPI SPECT were enrolled in this study. The left ventricle myocardium, excluding the cardiac cavity, was manually delineated on rest and stress images to define a volume of interest. Added to clinical features (age, sex, family history, diabetes status, smoking, and ejection fraction), a total of 118 radiomics features, were extracted from rest and stress MPI SPECT images to establish different feature sets, including Rest-, Stress-, Delta-, and Combined-radiomics (all together) feature sets. The data were randomly divided into 80% and 20% subsets for training and testing, respectively. The performance of classifiers built from combinations of three feature selections, and nine machine learning algorithms was evaluated for two different diagnostic tasks, including 1) normal/abnormal (no CAD vs. CAD) classification, and 2) low-risk/high-risk CAD classification. Different metrics, including the area under the ROC curve (AUC), accuracy (ACC), sensitivity (SEN), and specificity (SPE), were reported for models’ evaluation. Overall, models built on the Stress feature set (compared to other feature sets), and models to diagnose the second task (compared to task 1 models) revealed better performance. The Stress-mRMR-KNN (feature set-feature selection-classifier) reached the highest performance for task 1 with AUC, ACC, SEN, and SPE equal to 0.61, 0.63, 0.64, and 0.6, respectively. The Stress-Boruta-GB model achieved the highest performance for task 2 with AUC, ACC, SEN, and SPE of 0.79, 0.76, 0.75, and 0.76, respectively. Diabetes status from the clinical feature family, and dependence count non-uniformity normalized, from the NGLDM family, which is representative of non-uniformity in the region of interest were the most frequently selected features from stress feature set for CAD risk classification. This study revealed promising results for CAD risk classification using machine learning models built on MPI SPECT radiomics. The proposed models are helpful to alleviate the labor-intensive MPI SPECT interpretation process regarding CAD status and can potentially expedite the diagnostic process.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Purpose
This study aimed to examine the robustness of positron emission tomography (PET) radiomic features extracted via different segmentation methods before and after ComBat harmonization in ...patients with non-small cell lung cancer (NSCLC).
Methods
We included 120 patients (positive recurrence = 46 and negative recurrence = 74) referred for PET scanning as a routine part of their care. All patients had a biopsy-proven NSCLC. Nine segmentation methods were applied to each image, including manual delineation, K-means (KM), watershed, fuzzy-C-mean, region-growing, local active contour (LAC), and iterative thresholding (IT) with 40, 45, and 50% thresholds. Diverse image discretizations, both without a filter and with different wavelet decompositions, were applied to PET images. Overall, 6741 radiomic features were extracted from each image (749 radiomic features from each segmented area). Non-parametric empirical Bayes (NPEB) ComBat harmonization was used to harmonize the features. Linear Support Vector Classifier (LinearSVC) with L1 regularization For feature selection and Support Vector Machine classifier (SVM) with fivefold nested cross-validation was performed using StratifiedKFold with ‘n_splits’ set to 5 to predict recurrence in NSCLC patients and assess the impact of ComBat harmonization on the outcome.
Results
From 749 extracted radiomic features, 206 (27%) and 389 (51%) features showed excellent reliability (ICC ≥ 0.90) against segmentation method variation before and after NPEB ComBat harmonization, respectively. Among all, 39 features demonstrated poor reliability, which declined to 10 after ComBat harmonization. The 64 fixed bin widths (without any filter) and wavelets (LLL)-based radiomic features set achieved the best performance in terms of robustness against diverse segmentation techniques before and after ComBat harmonization. The first-order and GLRLM and also first-order and NGTDM feature families showed the largest number of robust features before and after ComBat harmonization, respectively. In terms of predicting recurrence in NSCLC, our findings indicate that using ComBat harmonization can significantly enhance machine learning outcomes, particularly improving the accuracy of watershed segmentation, which initially had fewer reliable features than manual contouring. Following the application of ComBat harmonization, the majority of cases saw substantial increase in sensitivity and specificity.
Conclusion
Radiomic features are vulnerable to different segmentation methods. ComBat harmonization might be considered a solution to overcome the poor reliability of radiomic features.
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ