The European Directive 2013/59/Euratom requires member states of the European Union to ensure justification and optimisation of radiological procedures and store information on patient exposure for ...analysis and quality assurance. The EuroSafe Imaging campaign of the European Society of Radiology created a working group (WG) on “Dose Management” with the aim to provide European recommendations on the implementation of dose management systems (DMS) in clinical practice. The WG follows Action 4: “Promote dose management systems to establish local, national, and European diagnostic reference levels (DRL)” of the EuroSafe Imaging Call for Action 2018. DMS are designed for medical practitioners, radiographers, medical physics experts (MPE) and other health professionals involved in imaging to support their tasks and duties of radiation protection in accordance with local and national requirements. The WG analysed requirements and critical points when installing a DMS and classified the individual functions at different performance levels.
Key Points
•
DMS are very helpful software tools for monitoring patient exposure, optimisation, compliance with DRLs and quality assurance.
•
DMS can help to fulfil dosimetric aspects of the European Directive 2013/59/Euratom.
•
The EuroSafe WG analyses DMS requirements and gives recommendations for users.
Background
Radiomics is expected to improve the management of metastatic colorectal cancer (CRC). We aimed at evaluating the impact of liver lesion contouring as a source of variability on radiomic ...features (RFs).
Methods
After Ethics Committee approval, 70 liver metastases in 17 CRC patients were segmented on contrast-enhanced computed tomography scans by two residents and checked by experienced radiologists. RFs from grey level co-occurrence and run length matrices were extracted from three-dimensional (3D) regions of interest (ROIs) and the largest two-dimensional (2D) ROIs. Inter-reader variability was evaluated with Dice coefficient and Hausdorff distance, whilst its impact on RFs was assessed using mean relative change (MRC) and intraclass correlation coefficient (ICC). For the main lesion of each patient, one reader also segmented a circular ROI on the same image used for the 2D ROI.
Results
The best inter-reader contouring agreement was observed for 2D ROIs according to both Dice coefficient (median 0.85, interquartile range 0.78–0.89) and Hausdorff distance (0.21 mm, 0.14–0.31 mm). Comparing RF values, MRC ranged 0–752% for 2D and 0–1567% for 3D. For 24/32 RFs (75%), MRC was lower for 2D than for 3D. An ICC > 0.90 was observed for more RFs for 2D (53%) than for 3D (34%). Only 2/32 RFs (6%) showed a variability between 2D and circular ROIs higher than inter-reader variability.
Conclusions
A 2D contouring approach may help mitigate overall inter-reader variability, albeit stable RFs can be extracted from both 3D and 2D segmentations of CRC liver metastases.
Background
Radiation dose in computed tomography (CT) has become a topic of high interest due to the increasing numbers of CT examinations performed worldwide. Hence, dose tracking and organ dose ...calculation software are increasingly used. We evaluated the organ dose variability associated with the use of different software applications or calculation methods.
Methods
We tested four commercial software applications on CT protocols actually in use in our hospital: CT-Expo, NCICT, NCICTX, and Virtual Dose. We compared dose coefficients, estimated organ doses and effective doses obtained by the four software applications by varying exposure parameters. Our results were also compared with estimates reported by the software authors.
Results
All four software applications showed dependence on tube voltage and volume CT dose index, while only CT-Expo was also dependent on other exposure parameters, in particular scanner model and pitch caused a variability till 50%. We found a disagreement between our results and those reported by the software authors (up to 600%), mainly due to a different extent of examined body regions. The relative range of the comparison of the four software applications was within 35% for most organs inside the scan region, but increased over the 100% for organs partially irradiated and outside the scan region. For effective doses, this variability was less evident (ranging from 9 to 36%).
Conclusions
The two main sources of organ dose variability were the software application used and the scan region set. Dose estimate must be related to the process used for its calculation.
The evaluation of radiation burden in vivo is crucial in modern radiology as stated also in the European Directive 2013/59/Euratom—Basic Safety Standard. Although radiation dose monitoring can impact ...the justification and optimization of radiological procedure, as well as effective patient communication, standardization of radiation monitoring software is far to be achieved. Toward this goal, the Italian Association of Medical Physics (AIFM) published a report describing the state of the art and standard guidelines in radiation dose monitoring system quality assurance. This article reports the AIFM statement about radiation dose monitoring systems (RDMSs) summarizing the different critical points of the systems related to Medical Physicist Expert (MPE) activities before, during, and after their clinical implementation. In particular, the article describes the general aspects of radiation dose data management, radiation dose monitoring systems, data integrity, and data responsibilities. Furthermore, the acceptance tests that need to be implemented and the most relevant dosimetric data for each radiological modalities are reported under the MPE responsibility.
In the scope of the European Commission Initiative on Breast Cancer (ECIBC) the Monitoring and Evaluation (M&E) subgroup was tasked to identify breast cancer screening programme (BCSP) performance ...indicators, including their acceptable and desirable levels, which are associated with breast cancer (BC) mortality. This paper documents the methodology used for the indicator selection.
The indicators were identified through a multi-stage process. First, a scoping review was conducted to identify existing performance indicators. Second, building on existing frameworks for making well-informed health care choices, a specific conceptual framework was developed to guide the indicator selection. Third, two group exercises including a rating and ranking survey were conducted for indicator selection using pre-determined criteria, such as: relevance, measurability, accurateness, ethics and understandability. The selected indicators were mapped onto a BC screening pathway developed by the M&E subgroup to illustrate the steps of BC screening common to all EU countries.
A total of 96 indicators were identified from an initial list of 1325 indicators. After removing redundant and irrelevant indicators and adding those missing, 39 candidate indicators underwent the rating and ranking exercise. Based on the results, the M&E subgroup selected 13 indicators: screening coverage, participation rate, recall rate, breast cancer detection rate, invasive breast cancer detection rate, cancers > 20 mm, cancers ≤10 mm, lymph node status, interval cancer rate, episode sensitivity, time interval between screening and first treatment, benign open surgical biopsy rate, and mastectomy rate.
This systematic approach led to the identification of 13 BCSP candidate performance indicators to be further evaluated for their association with BC mortality.
Highlights • A guideline for quality control of cone-beam computed tomography is now complete. • The guideline describes objective tests for radiation output and image quality. • The tests are valid ...for all kinds of cone-beam computed tomography systems. • Test frequencies and action levels were consensed by EFOMP, ESTRO and IAEA.
Growing evidence suggests that artificial intelligence tools could help radiologists in differentiating COVID-19 pneumonia from other types of viral (non-COVID-19) pneumonia. To test this hypothesis, ...an R-AI classifier capable of discriminating between COVID-19 and non-COVID-19 pneumonia was developed using CT chest scans of 1031 patients with positive swab for SARS-CoV-2 (
= 647) and other respiratory viruses (
= 384). The model was trained with 811 CT scans, while 220 CT scans (
= 151 COVID-19;
= 69 non-COVID-19) were used for independent validation. Four readers were enrolled to blindly evaluate the validation dataset using the CO-RADS score. A pandemic-like high suspicion scenario (CO-RADS 3 considered as COVID-19) and a low suspicion scenario (CO-RADS 3 considered as non-COVID-19) were simulated. Inter-reader agreement and performance metrics were calculated for human readers and R-AI classifier. The readers showed good agreement in assigning CO-RADS score (Gwet's AC2 = 0.71,
< 0.001). Considering human performance, accuracy = 78% and accuracy = 74% were obtained in the high and low suspicion scenarios, respectively, while the AI classifier achieved accuracy = 79% in distinguishing COVID-19 from non-COVID-19 pneumonia on the independent validation dataset. The R-AI classifier performance was equivalent or superior to human readers in all comparisons. Therefore, a R-AI classifier may support human readers in the difficult task of distinguishing COVID-19 from other types of viral pneumonia on CT imaging.
Background
To develop a pipeline for automatic extraction of quantitative metrics and radiomic features from lung computed tomography (CT) and develop artificial intelligence (AI) models supporting ...differential diagnosis between coronavirus disease 2019 (COVID-19) and other viral pneumonia (non-COVID-19).
Methods
Chest CT of 1,031 patients (811 for model building; 220 as independent validation set (IVS) with positive swab for severe acute respiratory syndrome coronavirus-2 (647 COVID-19) or other respiratory viruses (384 non-COVID-19) were segmented automatically. A Gaussian model, based on the HU histogram distribution describing well-aerated and ill portions, was optimised to calculate quantitative metrics (QM,
n
= 20) in both lungs (2L) and four geometrical subdivisions (GS) (upper front, lower front, upper dorsal, lower dorsal;
n
= 80). Radiomic features (RF) of first (RF1,
n
= 18) and second (RF2,
n
= 120) order were extracted from 2L using PyRadiomics tool. Extracted metrics were used to develop four multilayer-perceptron classifiers, built with different combinations of QM and RF:
Model1
(RF1-2L); Model2 (QM-2L, QM-GS); Model3 (RF1-2L, RF2-2L); Model4 (RF1-2L, QM-2L, GS-2L, RF2-2L).
Results
The classifiers showed accuracy from 0.71 to 0.80 and area under the receiving operating characteristic curve (AUC) from 0.77 to 0.87 in differentiating COVID-19
versus
non-COVID-19 pneumonia. Best results were associated with
Model3
(
AUC
0.867 ± 0.008) and
Model4
(
AUC
0.870 ± 0.011. For the IVS, the AUC values were 0.834 ± 0.008 for
Model3
and 0.828 ± 0.011 for
Model4
.
Conclusions
Four AI-based models for classifying patients as COVID-19 or non-COVID-19 viral pneumonia showed good diagnostic performances that could support clinical decisions.
Background
The role of computed tomography (CT) in the diagnosis and characterization of coronavirus disease 2019 (COVID-19) pneumonia has been widely recognized. We evaluated the performance of a ...software for quantitative analysis of chest CT, the
LungQuant
system, by comparing its results with independent visual evaluations by a group of 14 clinical experts. The aim of this work is to evaluate the ability of the automated tool to extract quantitative information from lung CT, relevant for the design of a diagnosis support model.
Methods
LungQuant
segments both the lungs and lesions associated with COVID-19 pneumonia (ground-glass opacities and consolidations) and computes derived quantities corresponding to qualitative characteristics used to clinically assess COVID-19 lesions. The comparison was carried out on 120 publicly available CT scans of patients affected by COVID-19 pneumonia. Scans were scored for four qualitative metrics: percentage of lung involvement, type of lesion, and two disease distribution scores. We evaluated the agreement between the
LungQuant
output and the visual assessments through receiver operating characteristics area under the curve (AUC) analysis and by fitting a nonlinear regression model.
Results
Despite the rather large heterogeneity in the qualitative labels assigned by the clinical experts for each metric, we found good agreement on the metrics compared to the
LungQuant
output. The AUC values obtained for the four qualitative metrics were 0.98, 0.85, 0.90, and 0.81.
Conclusions
Visual clinical evaluation could be complemented and supported by computer-aided quantification, whose values match the average evaluation of several independent clinical experts.
Key points
We conducted a multicenter evaluation of the deep learning-based
LungQuant
automated software.
We translated qualitative assessments into quantifiable metrics to characterize coronavirus disease 2019 (COVID-19) pneumonia lesions.
Comparing the software output to the clinical evaluations, results were satisfactory despite heterogeneity of the clinical evaluations.
An automatic quantification tool may contribute to improve the clinical workflow of COVID-19 pneumonia.