We propose a novel Computer-Aided Detection (CAD) system for pulmonary nodules using multi-view convolutional networks (ConvNets), for which discriminative features are automatically learnt from the ...training data. The network is fed with nodule candidates obtained by combining three candidate detectors specifically designed for solid, subsolid, and large nodules. For each candidate, a set of 2-D patches from differently oriented planes is extracted. The proposed architecture comprises multiple streams of 2-D ConvNets, for which the outputs are combined using a dedicated fusion method to get the final classification. Data augmentation and dropout are applied to avoid overfitting. On 888 scans of the publicly available LIDC-IDRI dataset, our method reaches high detection sensitivities of 85.4% and 90.1% at 1 and 4 false positives per scan, respectively. An additional evaluation on independent datasets from the ANODE09 challenge and DLCST is performed. We showed that the proposed multi-view ConvNets is highly suited to be used for false positive reduction of a CAD system.
•A novel objective evaluation framework for nodule detection algorithms using the largest publicly available LIDC-IDRI data set.•The impact of combining individual systems on the detection ...performance was investigated.•The combination of classical candidate detectors and a combination of deep learning architectures generates excellent results, better than any individual system.•Our observer study has shown that CAD detects nodules that were missed by expert readers.•We released this set of additional nodules for further development of CAD systems.
Display omitted
Automatic detection of pulmonary nodules in thoracic computed tomography (CT) scans has been an active area of research for the last two decades. However, there have only been few studies that provide a comparative performance evaluation of different systems on a common database. We have therefore set up the LUNA16 challenge, an objective evaluation framework for automatic nodule detection algorithms using the largest publicly available reference database of chest CT scans, the LIDC-IDRI data set. In LUNA16, participants develop their algorithm and upload their predictions on 888 CT scans in one of the two tracks: 1) the complete nodule detection track where a complete CAD system should be developed, or 2) the false positive reduction track where a provided set of nodule candidates should be classified. This paper describes the setup of LUNA16 and presents the results of the challenge so far. Moreover, the impact of combining individual systems on the detection performance was also investigated. It was observed that the leading solutions employed convolutional networks and used the provided set of nodule candidates. The combination of these solutions achieved an excellent sensitivity of over 95% at fewer than 1.0 false positives per scan. This highlights the potential of combining algorithms to improve the detection performance. Our observer study with four expert readers has shown that the best system detects nodules that were missed by expert readers who originally annotated the LIDC-IDRI data. We released this set of additional nodules for further development of CAD systems.
Objectives
To benchmark the performance of state-of-the-art computer-aided detection (CAD) of pulmonary nodules using the largest publicly available annotated CT database (LIDC/IDRI), and to show ...that CAD finds lesions not identified by the LIDC’s four-fold double reading process.
Methods
The LIDC/IDRI database contains 888 thoracic CT scans with a section thickness of 2.5 mm or lower. We report performance of two commercial and one academic CAD system. The influence of presence of contrast, section thickness, and reconstruction kernel on CAD performance was assessed. Four radiologists independently analyzed the false positive CAD marks of the best CAD system.
Results
The updated commercial CAD system showed the best performance with a sensitivity of 82 % at an average of 3.1 false positive detections per scan. Forty-five false positive CAD marks were scored as nodules by all four radiologists in our study.
Conclusions
On the largest publicly available reference database for lung nodule detection in chest CT, the updated commercial CAD system locates the vast majority of pulmonary nodules at a low false positive rate. Potential for CAD is substantiated by the fact that it identifies pulmonary nodules that were not marked during the extensive four-fold LIDC annotation process.
Key Points
•
CAD systems should be validated on public, heterogeneous databases.
•
The LIDC/IDRI database is an excellent database for benchmarking nodule CAD.
•
CAD can identify the majority of pulmonary nodules at a low false positive rate.
•
CAD can identify nodules missed by an extensive two-stage annotation process.
Convolutional neural networks (CNNs) have emerged as the most powerful technique for a range of different tasks in computer vision. Recent work suggested that CNN features are generic and can be used ...for classification tasks outside the exact domain for which the networks were trained. In this work we use the features from one such network, OverFeat, trained for object detection in natural images, for nodule detection in computed tomography scans. We use 865 scans from the publicly available LIDC data set, read by four thoracic radiologists. Nodule candidates are generated by a state-of-the-art nodule detection system. We extract 2D sagittal, coronal and axial patches for each nodule candidate and extract 4096 features from the penultimate layer of OverFeat and classify these with linear support vector machines. We show for various configurations that the off-the-shelf CNN features perform surprisingly well, but not as good as the dedicated detection system. When both approaches are combined, significantly better results are obtained than either approach alone. We conclude that CNN features have great potential to be used for detection tasks in volumetric medical data.
Abstract
Accurate identification of emphysema subtypes and severity is crucial for effective management of COPD and the study of disease heterogeneity. Manual analysis of emphysema subtypes and ...severity is laborious and subjective. To address this challenge, we present a deep learning-based approach for automating the Fleischner Society’s visual score system for emphysema subtyping and severity analysis. We trained and evaluated our algorithm using 9650 subjects from the COPDGene study. Our algorithm achieved the predictive accuracy at 52%, outperforming a previously published method’s accuracy of 45%. In addition, the agreement between the predicted scores of our method and the visual scores was good, where the previous method obtained only moderate agreement. Our approach employs a regression training strategy to generate categorical labels while simultaneously producing high-resolution localized activation maps for visualizing the network predictions. By leveraging these dense activation maps, our method possesses the capability to compute the percentage of emphysema involvement per lung in addition to categorical severity scores. Furthermore, the proposed method extends its predictive capabilities beyond centrilobular emphysema to include paraseptal emphysema subtypes.
Abstract
We present spectroscopic confirmation of candidate strong gravitational lenses using the Keck Observatory and Very Large Telescope as part of our
ASTRO 3D Galaxy Evolution with Lenses
(
AGEL
...) survey. We confirm that (1) search methods using convolutional neural networks (CNNs) with visual inspection successfully identify strong gravitational lenses and (2) the lenses are at higher redshifts relative to existing surveys due to the combination of deeper and higher-resolution imaging from DECam and spectroscopy spanning optical to near-infrared wavelengths. We measure 104 redshifts in 77 systems selected from a catalog in the
DES
and
DECaLS
imaging fields (
r
≤ 22 mag). Combining our results with published redshifts, we present redshifts for 68 lenses and establish that CNN-based searches are highly effective for use in future imaging surveys with a success rate of at least 88% (defined as 68/77). We report 53 strong lenses with spectroscopic redshifts for both the deflector and source (
z
src
>
z
defl
), and 15 lenses with a spectroscopic redshift for either the deflector (
z
defl
> 0.21) or source (
z
src
≥ 1.34). For the 68 lenses, the deflectors and sources have average redshifts and standard deviations of 0.58 ± 0.14 and 1.92 ± 0.59 respectively, and corresponding redshift ranges of
z
defl
= 0.21–0.89 and
z
src
= 0.88–3.55. The
AGEL
systems include 41 deflectors at
z
defl
≥ 0.5 that are ideal for follow-up studies to track how mass density profiles evolve with redshift. Our goal with
AGEL
is to spectroscopically confirm ∼100 strong gravitational lenses that can be observed from both hemispheres throughout the year. The
AGEL
survey is a resource for refining automated all-sky searches and addressing a range of questions in astrophysics and cosmology.