•Strategies how to develop AI applications as clinical decision support systems are provided.•We focus on differences between radiomic machine learning and deep learning application domains.•Pros and ...cons, recommendations and references to software tools are provided.
Artificial intelligence (AI) models are playing an increasing role in biomedical research and healthcare services. This review focuses on challenges points to be clarified about how to develop AI applications as clinical decision support systems in the real-world context.
A narrative review has been performed including a critical assessment of articles published between 1989 and 2021 that guided challenging sections.
We first illustrate the architectural characteristics of machine learning (ML)/radiomics and deep learning (DL) approaches. For ML/radiomics, the phases of feature selection and of training, validation, and testing are described. DL models are presented as multi-layered artificial/convolutional neural networks, allowing us to directly process images. The data curation section includes technical steps such as image labelling, image annotation (with segmentation as a crucial step in radiomics), data harmonization (enabling compensation for differences in imaging protocols that typically generate noise in non-AI imaging studies) and federated learning. Thereafter, we dedicate specific sections to: sample size calculation, considering multiple testing in AI approaches; procedures for data augmentation to work with limited and unbalanced datasets; and the interpretability of AI models (the so-called black box issue). Pros and cons for choosing ML versus DL to implement AI applications to medical imaging are finally presented in a synoptic way.
Biomedicine and healthcare systems are one of the most important fields for AI applications and medical imaging is probably the most suitable and promising domain. Clarification of specific challenging points facilitates the development of such systems and their translation to clinical practice.
Objective
The objectives of our study were to assess the association of radiomic and genomic data with histology and patient outcome in non-small cell lung cancer (NSCLC).
Methods
In this ...retrospective single-centre observational study, we selected 151 surgically treated patients with adenocarcinoma or squamous cell carcinoma who performed baseline 18F FDG PET/CT. A subgroup of patients with cancer tissue samples at the Institutional Biobank (
n
= 74/151) was included in the genomic analysis. Features were extracted from both PET and CT images using an in-house tool. The genomic analysis included detection of genetic variants, fusion transcripts, and gene expression. Generalised linear model (GLM) and machine learning (ML) algorithms were used to predict histology and tumour recurrence.
Results
Standardised uptake value (SUV) and kurtosis (among the PET and CT radiomic features, respectively), and the expression of
TP63
,
EPHA10
,
FBN2
, and
IL1RAP
were associated with the histotype. No correlation was found between radiomic features/genomic data and relapse using GLM. The ML approach identified several radiomic/genomic rules to predict the histotype successfully. The ML approach showed a modest ability of PET radiomic features to predict relapse, while it identified a robust gene expression signature able to predict patient relapse correctly. The best-performing ML radiogenomic rule predicting the outcome resulted in an area under the curve (AUC) of 0.87.
Conclusions
Radiogenomic data may provide clinically relevant information in NSCLC patients regarding the histotype, aggressiveness, and progression. Gene expression analysis showed potential new biomarkers and targets valuable for patient management and treatment. The application of ML allows to increase the efficacy of radiogenomic analysis and provides novel insights into cancer biology.
Background
We aimed to train and test a deep learning classifier to support the diagnosis of coronavirus disease 2019 (COVID-19) using chest x-ray (CXR) on a cohort of subjects from two hospitals in ...Lombardy, Italy.
Methods
We used for training and validation an ensemble of ten convolutional neural networks (CNNs) with mainly bedside CXRs of 250 COVID-19 and 250 non-COVID-19 subjects from two hospitals (Centres 1 and 2). We then tested such system on bedside CXRs of an independent group of 110 patients (74 COVID-19, 36 non-COVID-19) from one of the two hospitals. A retrospective reading was performed by two radiologists in the absence of any clinical information, with the aim to differentiate COVID-19 from non-COVID-19 patients. Real-time polymerase chain reaction served as the reference standard.
Results
At 10-fold cross-validation, our deep learning model classified COVID-19 and non-COVID-19 patients with 0.78 sensitivity (95% confidence interval CI 0.74–0.81), 0.82 specificity (95% CI 0.78–0.85), and 0.89 area under the curve (AUC) (95% CI 0.86–0.91). For the independent dataset, deep learning showed 0.80 sensitivity (95% CI 0.72–0.86) (59/74), 0.81 specificity (29/36) (95% CI 0.73–0.87), and 0.81 AUC (95% CI 0.73–0.87). Radiologists’ reading obtained 0.63 sensitivity (95% CI 0.52–0.74) and 0.78 specificity (95% CI 0.61–0.90) in Centre 1 and 0.64 sensitivity (95% CI 0.52–0.74) and 0.86 specificity (95% CI 0.71–0.95) in Centre 2.
Conclusions
This preliminary experience based on ten CNNs trained on a limited training dataset shows an interesting potential of deep learning for COVID-19 diagnosis. Such tool is in training with new CXRs to further increase its performance.
Alzheimer's Disease (AD) is the most common neurodegenerative disease, with 10% prevalence in the elder population. Conventional Machine Learning (ML) was proven effective in supporting the diagnosis ...of AD, while very few studies investigated the performance of deep learning and transfer learning in this complex task. In this paper, we evaluated the potential of ensemble transfer-learning techniques, pretrained on generic images and then transferred to structural brain MRI, for the early diagnosis and prognosis of AD, with respect to a fusion of conventional-ML approaches based on Support Vector Machine directly applied to structural brain MRI. Specifically, more than 600 subjects were obtained from the ADNI repository, including AD, Mild Cognitive Impaired converting to AD (MCIc), Mild Cognitive Impaired not converting to AD (MCInc), and cognitively-normal (CN) subjects. We used T1-weighted cerebral-MRI studies to train: (1) an ensemble of five transfer-learning architectures pretrained on generic images; (2) a 3D Convolutional Neutral Network (CNN) trained from scratch on MRI volumes; and (3) a fusion of two conventional-ML classifiers derived from different feature extraction/selection techniques coupled to SVM. The AD-vs-CN, MCIc-vs-CN, MCIc-vs-MCInc comparisons were investigated. The ensemble transfer-learning approach was able to effectively discriminate AD from CN with 90.2% AUC, MCIc from CN with 83.2% AUC, and MCIc from MCInc with 70.6% AUC, showing comparable or slightly lower results with the fusion of conventional-ML systems (AD from CN with 93.1% AUC, MCIc from CN with 89.6% AUC, and MCIc from MCInc with AUC in the range of 69.1-73.3%). The deep-learning network trained from scratch obtained lower performance than either the fusion of conventional-ML systems and the ensemble transfer-learning, due to the limited sample of images used for training. These results open new prospective on the use of transfer learning combined with neuroimages for the automatic early diagnosis and prognosis of AD, even if pretrained on generic images.
Molecular/genomic profiling is the most accurate method to assess prognosis of endometrial cancer patients. Radiomic profiling allows for the extraction of mineable high-dimensional data from ...clinical radiological images, thus providing noteworthy information regarding tumor tissues. Interestingly, the adoption of radiomics shows important results for screening, diagnosis and prognosis, across various radiological systems and oncologic specialties. The central hypothesis of the prospective trial is that combining radiomic features with molecular features might allow for the identification of various classes of risks for endometrial cancer, e.g., predicting unfavorable molecular/genomic profiling. The rationale for the proposed research is that once validated, radiomics applied to ultrasonographic images would be an effective, innovative and inexpensive method for tailoring operative and postoperative treatment modalities in endometrial cancer. Patients with newly diagnosed endometrial cancer will have ultrasonographic evaluation and radiomic analysis of the ultrasonographic images. We will correlate radiomic features with molecular/genomic profiling to classify prognosis.
Background
To evaluate the performance of a decision support system (DSS) based on radiomics and machine learning in predicting the risk of malignancy of ovarian masses (OMs) from transvaginal ...ultrasonography (TUS) and serum CA-125.
Methods
A total of 274 consecutive patients who underwent TUS (by different examiners and with different ultrasound machines) and surgery, with suspicious OMs and known CA-125 serum level were used to train and test a DSS. The DSS was used to predict the risk of malignancy of these masses (very low
versus
medium-high risk), based on the US appearance (solid, liquid, or mixed) and radiomic features (morphometry and regional texture features) within the masses, on the shadow presence (yes/no), and on the level of serum CA-125. Reproducibility of results among the examiners, and performance accuracy, sensitivity, specificity, and area under the curve were tested in a real-world clinical setting.
Results
The DSS showed a mean 88% accuracy, 99% sensitivity, and 77% specificity for the 239 patients used for training, cross-validation, and testing, and a mean 91% accuracy, 100% sensitivity, and 80% specificity for the 35 patients used for independent testing.
Conclusions
This DSS is a promising tool in women diagnosed with OMs at TUS, allowing to predict the individual risk of malignancy, supporting clinical decision making.
The exhibition “The Ladies of Art”, held at the Palazzo Reale in Milan in 2021, focused on the history of women artists during the 16th and 17th centuries. As part of the exhibition, a series of ...diagnostic analyses have been carried out on two paintings, thanks to the collaboration of several research institutions. The considered artworks consist of two paintings on parchments, realized by Giovanna Garzoni at the Savoy Court in the 17th century. Diagnostic analyses were performed using different, complementary, and non-invasive techniques: high-resolution multiband imaging, hyperspectral imaging, optical microscopy, X-ray fluorescence, and FORS spectrometry, combined with advanced post-processing techniques, in order to map and distinguish the pigments and the underdrawing of both the paintings. This research is the first conducted on these paintings and one of the few on the entire works of this important miniaturist. The results showed an incredibly meticulous painting technique, with a detailed metal point underdrawing and painstaking brushstrokes to describe the details with a high degree of realism. Precious materials, such as gold and lapislazuli, were identified and mapped. The findings of this work represent a new contribution of knowledge, which helps to lessen the lack of information for systematic studies on the artistic production of G. Garzoni.
Purpose
To determine diagnostic performance of MRI radiomics-based machine learning for classification of deep-seated lipoma and atypical lipomatous tumor (ALT) of the extremities.
Material and ...methods
This retrospective study was performed at three tertiary sarcoma centers and included 150 patients with surgically treated and histology-proven lesions. The training-validation cohort consisted of 114 patients from centers 1 and 2 (n = 64 lipoma, n = 50 ALT). The external test cohort consisted of 36 patients from center 3 (n = 24 lipoma, n = 12 ALT). 3D segmentation was manually performed on T1- and T2-weighted MRI. After extraction and selection of radiomic features, three machine learning classifiers were trained and validated using nested fivefold cross-validation. The best-performing classifier according to previous analysis was evaluated and compared to an experienced musculoskeletal radiologist in the external test cohort.
Results
Eight features passed feature selection and were incorporated into the machine learning models. After training and validation (74% ROC-AUC), the best-performing classifier (Random Forest) showed 92% sensitivity and 33% specificity in the external test cohort with no statistical difference compared to the radiologist (
p
= 0.474).
Conclusion
MRI radiomics-based machine learning may classify deep-seated lipoma and ALT of the extremities with high sensitivity and negative predictive value, thus potentially serving as a non-invasive screening tool to reduce unnecessary referral to tertiary tumor centers.
Radiomics allows the extraction quantitative features from imaging, as imaging biomarkers of disease. The objective of this exploratory study is to implement a reproducible radiomic-pipeline for the ...extraction of a magnetic resonance imaging (MRI) signature for prostate cancer (PCa) aggressiveness. One hundred and two consecutive patients performing preoperative prostate multiparametric magnetic resonance imaging (mpMRI) and radical prostatectomy were enrolled. Multiparametric images, including T2-weighted (T2w), diffusion-weighted and dynamic contrast-enhanced images, were acquired at 1.5 T. Ninety-three imaging features (Ifs) were extracted from segmentation of index lesion. Ifs were ranked based on a stability rank and redundant Ifs were excluded. Using unsupervised hierarchical clustering, patients were grouped on the basis of similar radiomic patterns, whose association with Gleason Grade Group (GGG), extracapsular extension (ECE), and nodal involvement (pN) was tested. Signatures composed by IFs from T2w-images and Apparent Diffusion Coefficient (ADC) maps were tested for the prediction of GGG, ECE, and pN. T2w radiomic pattern was associated with pN, ECE, and GGG (
= 0.027, 0.05, 0.03) and ADC radiomic pattern was associated with GGG (
= 0.004). The best performance was reached by the signature combing IFs from multiparametric images (0.88, 0.89, and 0.84 accuracy for GGG, pN, and ECE). A reliable multiparametric MRI radiomic signature was extracted, potentially able to predict PCa aggressiveness, to be further validated on an independent sample.