Radiomics relies on the extraction of a wide variety of quantitative image-based features to provide decision support. Magnetic resonance imaging (MRI) contributes to the personalization of patient ...care but suffers from being highly dependent on acquisition and reconstruction parameters. Today, there are no guidelines regarding the optimal pre-processing of MR images in the context of radiomics, which is crucial for the generalization of published image-based signatures. This study aims to assess the impact of three different intensity normalization methods (Nyul, WhiteStripe, Z-Score) typically used in MRI together with two methods for intensity discretization (fixed bin size and fixed bin number). The impact of these methods was evaluated on first- and second-order radiomics features extracted from brain MRI, establishing a unified methodology for future radiomics studies. Two independent MRI datasets were used. The first one (DATASET1) included 20 institutional patients with WHO grade II and III gliomas who underwent post-contrast 3D axial T1-weighted (T1w-gd) and axial T2-weighted fluid attenuation inversion recovery (T2w-flair) sequences on two different MR devices (1.5 T and 3.0 T) with a 1-month delay. Jensen-Shannon divergence was used to compare pairs of intensity histograms before and after normalization. The stability of first-order and second-order features across the two acquisitions was analysed using the concordance correlation coefficient and the intra-class correlation coefficient. The second dataset (DATASET2) was extracted from the public TCIA database and included 108 patients with WHO grade II and III gliomas and 135 patients with WHO grade IV glioblastomas. The impact of normalization and discretization methods was evaluated based on a tumour grade classification task (balanced accuracy measurement) using five well-established machine learning algorithms. Intensity normalization highly improved the robustness of first-order features and the performances of subsequent classification models. For the T1w-gd sequence, the mean balanced accuracy for tumour grade classification was increased from 0.67 (95% CI 0.61-0.73) to 0.82 (95% CI 0.79-0.84, P = .006), 0.79 (95% CI 0.76-0.82, P = .021) and 0.82 (95% CI 0.80-0.85, P = .005), respectively, using the Nyul, WhiteStripe and Z-Score normalization methods compared to no normalization. The relative discretization makes unnecessary the use of intensity normalization for the second-order radiomics features. Even if the bin number for the discretization had a small impact on classification performances, a good compromise was obtained using the 32 bins considering both T1w-gd and T2w-flair sequences. No significant improvements in classification performances were observed using feature selection. A standardized pre-processing pipeline is proposed for the use of radiomics in MRI of brain tumours. For models based on first- and second-order features, we recommend normalizing images with the Z-Score method and adopting an absolute discretization approach. For second-order feature-based signatures, relative discretization can be used without prior normalization. In both cases, 32 bins for discretization are recommended. This study may pave the way for the multicentric development and validation of MR-based radiomics biomarkers.
Objectives
To differentiate Glioblastomas (GBM) and Brain Metastases (BM) using a radiomic features-based Machine Learning (ML) classifier trained from post-contrast three-dimensional T1-weighted ...(post-contrast 3DT1) MR imaging, and compare its performance in medical diagnosis
versus
human experts, on a testing cohort.
Methods
We enrolled 143 patients (71 GBM and 72 BM) in a retrospective bicentric study from January 2010 to May 2019 to train the classifier. Post-contrast 3DT1 MR images were performed on a 3-Tesla MR unit and 100 radiomic features were extracted. Selection and optimization of the Machine Learning (ML) classifier was performed using a nested cross-validation. Sensitivity, specificity, balanced accuracy, and area under the receiver operating characteristic curve (AUC) were calculated as performance metrics. The model final performance was cross-validated, then evaluated on a test set of 37 patients, and compared to human blind reading using a McNemar’s test.
Results
The ML classifier had a mean 95% confidence interval sensitivity of 85% 77; 94, a specificity of 87% 78; 97, a balanced accuracy of 86% 80; 92, and an AUC of 92% 87; 97 with cross-validation. Sensitivity, specificity, balanced accuracy and AUC were equal to 75, 86, 80 and 85% on the test set. Sphericity 3D radiomic index highlighted the highest coefficient in the logistic regression model. There were no statistical significant differences observed between the performance of the classifier and the experts’ blinded examination.
Conclusions
The proposed diagnostic support system based on radiomic features extracted from post-contrast 3DT1 MR images helps in differentiating solitary BM from GBM with high diagnosis performance and generalizability.
Textural and shape analysis is gaining considerable interest in medical imaging, particularly to identify parameters characterizing tumor heterogeneity and to feed radiomic models. Here, we present a ...free, multiplatform, and easy-to-use freeware called LIFEx, which enables the calculation of conventional, histogram-based, textural, and shape features from PET, SPECT, MR, CT, and US images, or from any combination of imaging modalities. The application does not require any programming skills and was developed for medical imaging professionals. The goal is that independent and multicenter evidence of the usefulness and limitations of radiomic features for characterization of tumor heterogeneity and subsequent patient management can be gathered. Many options are offered for interactive textural index calculation and for increasing the reproducibility among centers. The software already benefits from a large user community (more than 800 registered users), and interactions within that community are part of the development strategy.
This study presents a user-friendly, multi-platform freeware to extract radiomic features from PET, SPECT, MR, CT, and US images, or any combination of imaging modalities.
.
This study aims to evaluate the impact of key parameters on the pseudo computed tomography (pCT) quality generated from magnetic resonance imaging (MRI) with a 3-dimensional (3D) convolutional neural ...network.
Four hundred two brain tumor cases were retrieved, yielding associations between 182 computed tomography (CT) and T1-weighted MRI (T1) scans, 180 CT and contrast-enhanced T1-weighted MRI (T1-Gd) scans, and 40 CT, T1, and T1-Gd scans. A 3D CNN was used to map T1 or T1-Gd onto CT scans and evaluate the importance of different components. First, the training set size's influence on testing set accuracy was assessed. Moreover, we evaluated the MRI sequence impact, using T1-only and T1-Gd-only cohorts. We then investigated 4 MRI standardization approaches (histogram-based, zero-mean/unit-variance, white stripe, and no standardization) based on training, validation, and testing cohorts composed of 242, 81, and 79 patients cases, respectively, as well as a bias field correction influence. Finally, 2 networks, namely HighResNet and 3D UNet, were compared to evaluate the architecture's impact on the pCT quality. The mean absolute error, gamma indices, and dose-volume histograms were used as evaluation metrics.
Generating models using all the available cases for training led to higher pCT quality. The T1 and T1-Gd models had a maximum difference in gamma index means of 0.07 percentage point. The mean absolute error obtained with white stripe was 78 ± 22 Hounsfield units, which slightly outperformed histogram-based, zero-mean/unit-variance, and no standardization (P < .0001). Regarding the network architectures, 3%/3 mm gamma indices of 99.83% ± 0.19% and 99.74% ± 0.24% were obtained for HighResNet and 3D UNet, respectively.
Our best pCTs were generated using more than 200 samples in the training data set. Training with T1 only and T1-Gd only did not significantly affect performance. Regardless of the preprocessing applied, the dosimetry quality remained equivalent and relevant for potential use in clinical practice.
Radiomics extracts high-throughput quantitative data from medical images to contribute to precision medicine. Radiomic shape features have been shown to correlate with patient outcomes. However, how ...radiomic shape features vary in function of tumor complexity and tumor volume, as well as with method used for meshing and voxel resampling, remains unknown. The aims of this study are to create tumor models with varying degrees of complexity, or spiculatedness, and evaluate their relationship with quantitatively extracted shape features. Twenty-eight tumor models were mathematically created using spherical harmonics with the spiculatedness degree d being increased by increments of 3 (d = 11 to d = 92). Models were 3D printed with identical bases of 5 cm, imaged with a CT scanner with two different slice thicknesses, and semi-automatically delineated. Resampling of the resulting masks on a 1 × 1 × 1 mm
grid was performed, and the voxel size of each model was then calculated to eliminate volume differences. Four MATLAB-based algorithms (isosurface (M1), isosurface filter (M2), isosurface remeshing (M3), and boundary (M4)) were used to extract nine 3D features (Volume, Surface area, Surface-to-volume, Compactness1, Compactness2, Compactness3, Spherical Disproportion, Sphericity and Fractional Concavity). To quantify the impact of 3D printing, acquisition, segmentation and meshing, features were computed directly from the stereolithography (STL) file format that was used for 3D printing, and compared to those computed. Changes in feature values between 0.6 and 2 mm slice acquisitions were also compared. Spearman's rank-order correlation coefficients were computed to determine the relationship of each shape feature with spiculatedness for each of the four meshing algorithms. Percent changes were calculated between shape features extracted from the original and resampled contoured images to evaluate the influence of spatial resampling. Finally, the percent change in shape features when the volume was changed from 25% to 150% of their original volume was quantified for three distinct tumor models and compared to the percent change observed when modifying the spiculatedness of the model from d = 11 to d = 92. Values extracted using isosurface remeshing method are the closest to the STL reference ones, with mean differences less than 10.8% (Compactness2) for all features. Seven of the eight features had strong significant correlations with tumor model complexity irrespective of the meshing algorithm (r > 0.98, p < 10
), with fractional concavity having the lowest correlation coefficient (r = 0.83, p < 10
, M2). Comparisons of features extracted from the 0.6 and 2 mm slice thicknesses showed that mean differences were from 2.1% (Compactness3) to 12.7% (Compactness2) for the isosurface remeshing method. Resampling on a 1 × 1 × 1 mm
grid resulted in between 1.3% (Compactness3) to 9.5% (Fractional Concavity) mean changes in feature values. Compactness2, Compactness3, Spherical Disproportion, Sphericity and Fractional Concavity were the features least affected by volume changes. Compactness1 had a 90.4% change with volume, which was greater than the change between the least and most spiculated models. This is the first methodological study that directly demonstrates the relationship of tumor spiculatedness with radiomic shape features, that also produced 3D tumor models, which may serve as reference phantoms for future radiomic studies. Surface Area, Surface-to-volume, and Spherical Disproportion had direct relationships with spiculatedness while the three formulas for Compactness, Sphericity and Fractional Concavity had inverse relationships. The features Compactness2, Compactness3, Spherical Disproportion, and Sphericity should be prioritized as these have minimal variations with volume changes, slice thickness and resampling.
Purpose
We investigated whether a score combining baseline neutrophilia and a PET biomarker could predict outcome in patients with locally advanced cervical cancer (LACC).
Methods
Patients ...homogeneously treated with definitive chemoradiation plus image-guided adaptive brachytherapy (IGABT) between 2006 and 2013 were analyzed retrospectively. We divided patients into two groups depending on the PET device used: a training set (TS) and a validation set (VS). Primary tumors were semi-automatically delineated on PET images, and 11 radiomics features were calculated (LIFEx software). A PET radiomic index was selected using the time-dependent area under the curve (td-AUC) for 3-year local control (LC). We defined the neutrophil SUV grade (NSG = 0, 1 or 2) score as the number of risk factors among (i) neutrophilia (neutrophil count >7 G/L) and (ii) high risk defined from the PET radiomic index. The NSG prognostic value was evaluated for LC and overall survival (OS).
Results
Data from 108 patients were analyzed. Estimated 3-year LC was 72% in the TS (
n
= 69) and 65% in the VS (
n
= 39). In the TS, SUV
peak
was selected as the most LC-predictive biomarker (td-AUC = 0.75), and was independent from neutrophilia (
p
= 0.119). Neutrophilia (HR = 2.6), high-risk SUV
peak
(SUV
peak
> 10, HR = 4.4) and NSG = 2 (HR = 9.2) were associated with low probability of LC in TS. In multivariate analysis, NSG = 2 was independently associated with low probability of LC (HR = 7.5,
p
< 0.001) and OS (HR = 5.8,
p
= 0.001) in the TS. Results obtained in the VS (HR = 5.2 for OS and 3.5 for LC,
p
< 0.02) were promising.
Conclusion
This innovative scoring approach combining baseline neutrophilia and a PET biomarker provides an independent prognostic factor to consider for further clinical investigations.
e21169
Background: Brain metastases (BMs) incidence is high in patients with metastatic non-small cell lung cancer (NSCLC). Immune Checkpoints Inhibitors (ICIs) are now standard of care for ...metastatic NSCLC in the first-line or beyond settings. However, less than half of patients will have a tumor response, and the determinants of BMs response to ICIs remain unknown. This study aims to evaluate the value of radiomics to predict control/progression of BMs at a lesion level, and outcomes at a patient level in the BM+ NSCLC population treated by ICIs. Methods: We conducted a retrospective multicenter (5 European centers) study including consecutive patients with NSCLC BMs and available baseline MRI before ICIs (evaluation cohort) or chemotherapy (control cohort) with a 2:1 ratio, between 2011 and 2021. After modified RANO (Response Assessment in Neuro-Oncology) assessment of individual BMs, we developed a radiomic model to predict BMs control/progression to ICIs at the first brain follow-up imaging. The ICIs cohort was split into two datasets used for (i) model training with 5-fold cross-validation and (ii) model testing. The Chemotherapy cohort was used to ensure the specificity of our model to ICIs-treated patients. Results: Ninety-four and 49 patients were included in the ICIs and Chemotherapy cohorts respectively, of which 56 (59.6%, N = 227 BMs) and 39 (79.6%, N = 192 BMs) patients had available brain follow-up imaging. The ICIs cohort was specifically enriched in radiomic features which were significantly associated to BMs progression. Our final model, based on extreme gradient boosting (xgboost) on BMs > 10mm from the training set (N = 39 BMs), could predict individual BMs progression to ICIs with an area under curve (AUC) of 0.77 (p-value = 0.029, 95% CI 0.56-0.98) in the test set (N = 20 BMs). We further generated a radiomic score to stratify BM+ NSCLC ICIs-treated patients between High-Risk or Low-Risk groups according to the predicted individual BMs progression. High-risk patients were associated with worse overall survival (OS) (median OS of 6.3 months, 95% CI 3.02-10.49) compared to Low-risk patients (median OS of 11.87 months, 95% CI 7.02-21.90, p = 0.042). The prognostic value of the radiomic score on OS was validated in a multivariate analysis in the ICIs cohort (Table). Conclusions: To our knowledge, this is the first study to explore the value of radiomics in the prediction of BMs response to immunotherapy. Prospective evaluation will confirm the generalizability of our model in clinical practice.Table: see text
The development and clinical adoption of quantitative imaging biomarkers (radiomics) has established the need for the identification of parameters altering radiomics reproducibility. The aim of this ...study was to assess the impact of magnetic field strength on magnetic resonance imaging (MRI) radiomics features in neuroradiology clinical practice.
T1 3D SPGR sequence was acquired on two phantoms and 10 healthy volunteers with two clinical MR devices from the same manufacturer using two different magnetic fields (1.5 and 3T). Phantoms varied in terms of gadolinium concentrations and textural heterogeneity. 27 regions of interest were segmented (phantom: 21, volunteers: 6) using the LIFEX software. 34 features were analyzed.
In the phantom dataset, 10 (67%) out of 15 radiomics features were significantly different when measured at 1.5T or 3T (student's t-test, p < 0.05). Gray levels resampling, and pixel size also influence part of texture features. These findings were validated in healthy volunteers.
According to daily used protocols for clinical examinations, radiomic features extracted on 1.5T should not be used interchangeably with 3T when evaluating texture features. Such confounding factor should be adjusted when adapting the results of a study to a different platform, or when designing a multicentric trial.