FDG-PET-derived textural features describing intra-tumor heterogeneity are increasingly investigated as imaging biomarkers. As part of the process of quantifying heterogeneity, image intensities ...(SUVs) are typically resampled into a reduced number of discrete bins. We focused on the implications of the manner in which this discretization is implemented. Two methods were evaluated: (1) R(D), dividing the SUV range into D equally spaced bins, where the intensity resolution (i.e. bin size) varies per image; and (2) R(B), maintaining a constant intensity resolution B. Clinical feasibility was assessed on 35 lung cancer patients, imaged before and in the second week of radiotherapy. Forty-four textural features were determined for different D and B for both imaging time points. Feature values depended on the intensity resolution and out of both assessed methods, R(B) was shown to allow for a meaningful inter- and intra-patient comparison of feature values. Overall, patients ranked differently according to feature values–which was used as a surrogate for textural feature interpretation–between both discretization methods. Our study shows that the manner of SUV discretization has a crucial effect on the resulting textural features and the interpretation thereof, emphasizing the importance of standardized methodology in tumor texture analysis.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Purpose
Modern computed tomography (CT) scanners have an extended field‐of‐view (eFoV) for reconstructing images up to the bore size, which is relevant for patients with higher BMI or non‐isocentric ...positioning due to fixation devices. However, the accuracy of the image reconstruction in eFoV is not well known since truncated data are used. This study introduces a new deep learning‐based algorithm for extended field‐of‐view reconstruction and evaluates the accuracy of the eFoV reconstruction focusing on aspects relevant for radiotherapy.
Methods
A life‐size three‐dimensional (3D) printed thorax phantom, based on a patient CT for which eFoV was necessary, was manufactured and used as reference. The phantom has holes allowing the placement of tissue mimicking inserts used to evaluate the Hounsfield unit (HU) accuracy. CT images of the phantom were acquired using different configurations aiming to evaluate geometric and HU accuracy in the eFoV. Image reconstruction was performed using a state‐of‐the‐art reconstruction algorithm (HDFoV), commercially available, and the novel deep learning‐based approach (HDeepFoV). Five patient cases were selected to evaluate the performance of both algorithms on patient data. There is no ground truth for patients so the reconstructions were qualitatively evaluated by five physicians and five medical physicists.
Results
The phantom geometry reconstructed with HDFoV showed boundary deviations from 1.0 to 2.5 cm depending on the volume of the phantom outside the regular scan field of view. HDeepFoV showed a superior performance regardless of the volume of the phantom within eFOV with a maximum boundary deviation below 1.0 cm. The maximum HU (absolute) difference for soft issue inserts is below 79 and 41 HU for HDFoV and HDeepFoV, respectively. HDeepFoV has a maximum deviation of −18 HU for an inhaled lung insert while HDFoV reached a 229 HU difference. The qualitative evaluation of patient cases shows that the novel deep learning approach produces images that look more realistic and have fewer artifacts.
Conclusion
To be able to reconstruct images outside the sFoV of the CT scanner there is no alternative than to use some kind of extrapolated data. In our study, we proposed and investigated a new deep learning‐based algorithm and compared it to a commercial solution for eFoV reconstruction. The deep learning‐based algorithm showed superior performance in quantitative evaluations based on phantom data and in qualitative assessments of patient data.
Full text
Available for:
BFBNIB, FZAB, GIS, IJS, KILJ, NLZOH, NUK, OILJ, SAZU, SBCE, SBMB, UL, UM, UPUK
Several individual clinical and preclinical studies have shown the possibility of evaluating tumor hypoxia by using noninvasive positron emission tomography (PET). The current study compared 3 ...hypoxia PET tracers frequently used in the clinic, 18FFMISO, 18FFAZA, and 18FHX4, in a preclinical tumor model. Tracer uptake was evaluated for the optimal time point for imaging, tumor-to-blood ratios (TBR), spatial reproducibility, and sensitivity to oxygen modification.
PET/computed tomography (CT) images of rhabdomyosarcoma R1-bearing WAG/Rij rats were acquired at multiple time points post injection (p.i.) with one of the hypoxia tracers. TBR values were calculated, and reproducibility was investigated by voxel-to-voxel analysis, represented as correlation coefficients (R) or Dice similarity coefficient of the high-uptake volume. Tumor oxygen modifications were induced by exposure to either carbogen/nicotinamide treatment or 7% oxygen breathing.
TBR was stabilized and maximal at 2 hours p.i. for 18FFAZA (4.0 ± 0.5) and at 3 hours p.i. for 18FHX4 (7.2 ± 0.7), whereas 18FFMISO showed a constant increasing TBR (9.0 ± 0.8 at 6 hours p.i.). High spatial reproducibility was observed by voxel-to-voxel comparisons and Dice similarity coefficient calculations on the 30% highest uptake volume for both 18FFMISO (R = 0.86; Dice coefficient = 0.76) and 18FHX4 (R = 0.76; Dice coefficient = 0.70), whereas 18FFAZA was less reproducible (R = 0.52; Dice coefficient = 0.49). Modifying the hypoxic fraction resulted in enhanced mean standardized uptake values for both 18FHX4 and 18FFAZA upon 7% oxygen breathing. Only 18FFMISO uptake was found to be reversible upon exposure to nicotinamide and carbogen.
This study indicates that each tracer has its own strengths and, depending on the question to be answered, a different tracer can be put forward.
Full text
Available for:
GEOZS, IJS, NUK, OILJ, UL, UM, UPUK
Abstract
Purpose. Besides basic measurements as maximum standardized uptake value (SUV)max or SUVmean derived from 18F-FDG positron emission tomography (PET) scans, more advanced quantitative imaging ...features (i.e. "Radiomics" features) are increasingly investigated for treatment monitoring, outcome prediction, or as potential biomarkers. With these prospected applications of Radiomics features, it is a requisite that they provide robust and reliable measurements. The aim of our study was therefore to perform an integrated stability analysis of a large number of PET-derived features in non-small cell lung carcinoma (NSCLC), based on both a test-retest and an inter-observer setup. Methods. Eleven NSCLC patients were included in the test-retest cohort. Patients underwent repeated PET imaging within a one day interval, before any treatment was delivered. Lesions were delineated by applying a threshold of 50% of the maximum uptake value within the tumor. Twenty-three NSCLC patients were included in the inter-observer cohort. Patients underwent a diagnostic whole body PET-computed tomography (CT). Lesions were manually delineated based on fused PET-CT, using a standardized clinical delineation protocol. Delineation was performed independently by five observers, blinded to each other. Fifteen first order statistics, 39 descriptors of intensity volume histograms, eight geometric features and 44 textural features were extracted. For every feature, test-retest and inter-observer stability was assessed with the intra-class correlation coefficient (ICC) and the coefficient of variability, normalized to mean and range. Similarity between test-retest and inter-observer stability rankings of features was assessed with Spearman's rank correlation coefficient. Results. Results showed that the majority of assessed features had both a high test-retest (71%) and inter-observer (91%) stability in terms of their ICC. Overall, features more stable in repeated PET imaging were also found to be more robust against inter-observer variability. Conclusion. Results suggest that further research of quantitative imaging features is warranted with respect to more advanced applications of PET imaging as being used for treatment monitoring, outcome prediction or imaging biomarkers.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, UILJ, UKNU, UL, UM, UPUK
Oropharyngeal squamous cell carcinoma (OPSCC) is one of the fastest growing disease sites of head and neck cancers. A recently described radiomic signature, based exclusively on pre-treatment ...computed tomography (CT) imaging of the primary tumor volume, was found to be prognostic in independent cohorts of lung and head and neck cancer patients treated in the Netherlands. Here, we further validate this signature in a large and independent North American cohort of OPSCC patients, also considering CT artifacts.
A total of 542 OPSCC patients were included for which we determined the prognostic index (PI) of the radiomic signature. We tested the signature model fit in a Cox regression and assessed model discrimination with Harrell's c-index. Kaplan-Meier survival curves between high and low signature predictions were compared with a log-rank test. Validation was performed in the complete cohort (PMH1) and in the subset of patients without (PMH2) and with (PMH3) visible CT artifacts within the delineated tumor region.
We identified 267 (49%) patients without and 275 (51%) with visible CT artifacts. The calibration slope (β) on the PI in a Cox proportional hazards model was 1.27 (H0: β = 1, p = 0.152) in the PMH1 (n = 542), 0.855 (H0: β = 1, p = 0.524) in the PMH2 (n = 267) and 1.99 (H0: β = 1, p = 0.002) in the PMH3 (n = 275) cohort. Harrell's c-index was 0.628 (p = 2.72e-9), 0.634 (p = 2.7e-6) and 0.647 (p = 5.35e-6) for the PMH1, PMH2 and PMH3 cohort, respectively. Kaplan-Meier survival curves were significantly different (p < 0.05) between high and low radiomic signature model predictions for all cohorts.
Overall, the signature validated well using all CT images as-is, demonstrating a good model fit and preservation of discrimination. Even though CT artifacts were shown to be of influence, the signature had significant prognostic power regardless if patients with CT artifacts were included.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, UILJ, UKNU, UL, UM, UPUK
Our hypothesis was that pretreatment inflammation in the lung makes pulmonary tissue more susceptible to radiation damage. The relationship between pretreatment (18)Ffluorodeoxyglucose ((18)FFDG) ...uptake in the lungs (as a surrogate for inflammation) and the delivered radiation dose and radiation-induced lung toxicity (RILT) was investigated.
We retrospectively studied a prospectively obtained cohort of 101 non-small-cell lung cancer patients treated with (chemo)radiation therapy (RT). (18)FFDG-positron emission tomography-computed tomography (PET-CT) scans used for treatment planning were studied. Different parameters were used to describe (18)FFDG uptake patterns in the lungs, excluding clinical target volumes, and the interaction with radiation dose. An increase in the dyspnea grade of 1 (Common Terminology Criteria for Adverse Events version 3.0) or more points compared to the pre-RT score was used as an endpoint for analysis of RILT. The effect of (18)FFDG and CT-based variables, dose, and other patient or treatment characteristics that effected RILT was studied using logistic regression.
Increased lung density and pretreatment (18)FFDG uptake were related to RILT after RT with univariable logistic regression. The 95th percentile of the (18)FFDG uptake in the lungs remained significant in multivariable logistic regression (p = 0.016; odds ratio OR = 4.3), together with age (p = 0.029; OR = 1.06), and a pre-RT dyspnea score of ≥1 (p = 0.005; OR = 0.20). Significant interaction effects were demonstrated among the 80th, 90th, and 95th percentiles and the relative lung volume receiving more than 2 and 5 Gy.
The risk of RILT increased with the 95th percentile of the (18)FFDG uptake in the lungs, excluding clinical tumor volume (OR = 4.3). The effect became more pronounced as the fraction of the 5%, 10%, and 20% highest standardized uptake value voxels that received more than 2 Gy to 5 Gy increased. Therefore, the risk of RILT may be decreased by applying sophisticated radiotherapy techniques to avoid areas in the lung with high (18)FFDG uptake.
Full text
Available for:
GEOZS, IJS, NUK, OILJ, UL, UM, UPUK
Purpose
Automated techniques for estimating the contours of organs and structures in medical images have become more widespread and a variety of measures are available for assessing their quality. ...Quantitative measures of geometric agreement, for example, overlap with a gold‐standard delineation, are popular but may not predict the level of clinical acceptance for the contouring method. Therefore, surrogate measures that relate more directly to the clinical judgment of contours, and to the way they are used in routine workflows, need to be developed. The purpose of this study is to propose a method (inspired by the Turing Test) for providing contour quality measures that directly draw upon practitioners’ assessments of manual and automatic contours. This approach assumes that an inability to distinguish automatically produced contours from those of clinical experts would indicate that the contours are of sufficient quality for clinical use. In turn, it is anticipated that such contours would receive less manual editing prior to being accepted for clinical use. In this study, an initial assessment of this approach is performed with radiation oncologists and therapists.
Methods
Eight clinical observers were presented with thoracic organ‐at‐risk contours through a web interface and were asked to determine if they were automatically generated or manually delineated. The accuracy of the visual determination was assessed, and the proportion of contours for which the source was misclassified recorded. Contours of six different organs in a clinical workflow were for 20 patient cases. The time required to edit autocontours to a clinically acceptable standard was also measured, as a gold standard of clinical utility. Established quantitative measures of autocontouring performance, such as Dice similarity coefficient with respect to the original clinical contour and the misclassification rate accessed with the proposed framework, were evaluated as surrogates of the editing time measured.
Results
The misclassification rates for each organ were: esophagus 30.0%, heart 22.9%, left lung 51.2%, right lung 58.5%, mediastinum envelope 43.9%, and spinal cord 46.8%. The time savings resulting from editing the autocontours compared to the standard clinical workflow were 12%, 25%, 43%, 77%, 46%, and 50%, respectively, for these organs. The median Dice similarity coefficients between the clinical contours and the autocontours were 0.46, 0.90, 0.98, 0.98, 0.94, and 0.86, respectively, for these organs.
Conclusions
A better correspondence with time saving was observed for the misclassification rate than the quantitative contour measures explored. From this, we conclude that the inability to accurately judge the source of a contour indicates a reduced need for editing and therefore a greater time saving overall. Hence, task‐based assessments of contouring performance may be considered as an additional way of evaluating the clinical utility of autosegmentation methods.
Full text
Available for:
BFBNIB, FZAB, GIS, IJS, KILJ, NLZOH, NUK, OILJ, SAZU, SBCE, SBMB, UL, UM, UPUK
•AI is emerging as a tool for different applications in the radiotherapy workflow.•Overview of commonly used AI-based applications in the radiotherapy workflow.•Recommendations on the clinical ...implementation in the field of radiotherapy.•Contouring, planning and synthetic CT are discussed in depth.
Artificial Intelligence (AI) is currently being introduced into different domains, including medicine. Specifically in radiation oncology, machine learning models allow automation and optimization of the workflow. A lack of knowledge and interpretation of these AI models can hold back wide-spread and full deployment into clinical practice. To facilitate the integration of AI models in the radiotherapy workflow, generally applicable recommendations on implementation and quality assurance (QA) of AI models are presented. For commonly used applications in radiotherapy such as auto-segmentation, automated treatment planning and synthetic computed tomography (sCT) the basic concepts are discussed in depth. Emphasis is put on the commissioning, implementation and case-specific and routine QA of AI models needed for a methodical introduction in clinical practice.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UL, UM, UPCLJ, UPUK, ZRSKP
Atlas-based automatic segmentation is used in radiotherapy planning to accelerate the delineation of organs at risk (OARs). Atlas selection has been proposed as a way to improve the accuracy and ...execution time of segmentation, assuming that, the more similar the atlas is to the patient, the better the results will be. This paper presents an analysis of atlas selection methods in the context of radiotherapy treatment planning. For a range of commonly contoured OARs, a thorough comparison of a large class of typical atlas selection methods has been performed. For this evaluation, clinically contoured CT images of the head and neck (N = 316) and thorax (N = 280) were used. The state-of-the-art intensity and deformation similarity-based atlas selection methods were found to compare poorly to perfect atlas selection. Counter-intuitively, atlas selection methods based on a fixed set of representative atlases outperformed atlas selection methods based on the patient image. This study suggests that atlas-based segmentation with currently available selection methods compares poorly to the potential best performance, hampering the clinical utility of atlas-based segmentation. Effective atlas selection remains an open challenge in atlas-based segmentation for radiotherapy planning.
Contouring of organs at risk (OARs) is an important but time consuming part of radiotherapy treatment planning. The aim of this study was to investigate whether using institutional created ...software-generated contouring will save time if used as a starting point for manual OAR contouring for lung cancer patients.
Twenty CT scans of stage I–III NSCLC patients were used to compare user adjusted contours after an atlas-based and deep learning contour, against manual delineation. The lungs, esophagus, spinal cord, heart and mediastinum were contoured for this study. The time to perform the manual tasks was recorded.
With a median time of 20 min for manual contouring, the total median time saved was 7.8 min when using atlas-based contouring and 10 min for deep learning contouring. Both atlas based and deep learning adjustment times were significantly lower than manual contouring time for all OARs except for the left lung and esophagus of the atlas based contouring.
User adjustment of software generated contours is a viable strategy to reduce contouring time of OARs for lung radiotherapy while conforming to local clinical standards. In addition, deep learning contouring shows promising results compared to existing solutions.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UL, UM, UPCLJ, UPUK, ZRSKP