Infrared (IR) modalities represent the only currently viable mass fever screening approaches for outbreaks of infectious disease pandemics such as Ebola virus disease and severe acute respiratory ...syndrome. Non-contact IR thermometers (NCITs) and IR thermographs (IRTs) have been used for fever screening in public areas such as airports. While NCITs remain a more popular choice than IRTs, there has been increasing evidences in the literature that IRTs can provide great accuracy in estimating body temperature if qualified systems are used and appropriate procedures are consistently applied. In this study, we addressed the issue of IRT qualification by implementing and evaluating a battery of test methods for objective, quantitative assessment of IRT performance based on a recent international standard (IEC 80601-2-59). We tested two commercial IRTs to evaluate their stability and drift, image uniformity, minimum resolvable temperature difference, and radiometric temperature laboratory accuracy. Based on these tests, we illustrated how experimental and data processing procedures could affect results, and suggested methods for clarifying and optimizing test methods. Overall, the insights into thermograph standardization and acquisition methods provided by this study may improve the utility of IR thermography and aid in comparing IRT performance, thus improving the potential for producing high quality disease pandemic countermeasures.
Infrared thermographs (IRTs) implemented according to standardized best practices have shown strong potential for detecting elevated body temperatures (EBT), which may be useful in clinical settings ...and during infectious disease epidemics. However, optimal IRT calibration methods have not been established and the clinical performance of these devices relative to the more common non-contact infrared thermometers (NCITs) remains unclear. In addition to confirming the findings of our preliminary analysis of clinical study results, the primary intent of this study was to compare methods for IRT calibration and identify best practices for assessing the performance of IRTs intended to detect EBT. A key secondary aim was to compare IRT clinical accuracy to that of NCITs. We performed a clinical thermographic imaging study of more than 1000 subjects, acquiring temperature data from several facial locations that, along with reference oral temperatures, were used to calibrate two IRT systems based on seven different regression methods. Oral temperatures imputed from facial data were used to evaluate IRT clinical accuracy based on metrics such as clinical bias (Δcb), repeatability, root-mean-square difference, and sensitivity/specificity. We proposed several calibration approaches designed to account for the non-uniform data density across the temperature range and a constant offset approach tended to show better ability to detect EBT. As in our prior study, inner canthi or full-face maximum temperatures provided the highest clinical accuracy. With an optimal calibration approach, these methods achieved a Δcb between ±0.03 °C with standard deviation (σΔcb) less than 0.3 °C, and sensitivity/specificity between 84% and 94%. Results of forehead-center measurements with NCITs or IRTs indicated reduced performance. An analysis of the complete clinical data set confirms the essential findings of our preliminary evaluation, with minor differences. Our findings provide novel insights into methods and metrics for the clinical accuracy assessment of IRTs. Furthermore, our results indicate that calibration approaches providing the highest clinical accuracy in the 37-38.5 °C range may be most effective for measuring EBT. While device performance depends on many factors, IRTs can provide superior performance to NCITs.
Significance: Infrared thermographs (IRTs) have been used for fever screening during infectious disease epidemics, including severe acute respiratory syndrome, Ebola virus disease, and coronavirus ...disease 2019 (COVID-19). Although IRTs have significant potential for human body temperature measurement, the literature indicates inconsistent diagnostic performance, possibly due to wide variations in implemented methodology. A standardized method for IRT fever screening was recently published, but there is a lack of clinical data demonstrating its impact on IRT performance.
Aim: Perform a clinical study to assess the diagnostic effectiveness of standardized IRT-based fever screening and evaluate the effect of facial measurement location.
Approach: We performed a clinical study of 596 subjects. Temperatures from 17 facial locations were extracted from thermal images and compared with oral thermometry. Statistical analyses included calculation of receiver operating characteristic (ROC) curves and area under the curve (AUC) values for detection of febrile subjects.
Results: Pearson correlation coefficients for IRT-based and reference (oral) temperatures were found to vary strongly with measurement location. Approaches based on maximum temperatures in either inner canthi or full-face regions indicated stronger discrimination ability than maximum forehead temperature (AUC values of 0.95 to 0.97 versus 0.86 to 0.87, respectively) and other specific facial locations. These values are markedly better than the vast majority of results found in prior human studies of IRT-based fever screening.
Conclusion: Our findings provide clinical confirmation of the utility of consensus approaches for fever screening, including the use of inner canthi temperatures, while also indicating that full-face maximum temperatures may provide an effective alternate approach.
Established medical imaging technologies such as magnetic resonance imaging and computed tomography rely on well-validated tissue-simulating phantoms for standardized testing of device image quality. ...The availability of high-quality phantoms for optical-acoustic diagnostics such as photoacoustic tomography (PAT) will facilitate standardization and clinical translation of these emerging approaches. Materials used in prior PAT phantoms do not provide a suitable combination of long-term stability and realistic acoustic and optical properties. Therefore, we have investigated the use of custom polyvinyl chloride plastisol (PVCP) formulations for imaging phantoms and identified a dual-plasticizer approach that provides biologically relevant ranges of relevant properties. Speed of sound and acoustic attenuation were determined over a frequency range of 4 to 9 MHz and optical absorption and scattering over a wavelength range of 400 to 1100 nm. We present characterization of several PVCP formulations, including one designed to mimic breast tissue. This material is used to construct a phantom comprised of an array of cylindrical, hemoglobin-filled inclusions for evaluation of penetration depth. Measurements with a custom near-infrared PAT imager provide quantitative and qualitative comparisons of phantom and tissue images. Results indicate that our PVCP material is uniquely suitable for PAT system image quality evaluation and may provide a practical tool for device validation and intercomparison.
We have investigated the potential for contrast-enhanced near-infrared fluorescence imaging of tissue on a mobile phone platform. Charge-coupled device- and phone-based cameras were used to image ...molded and three-dimensional-printed tissue phantoms, and an ex vivo animal model. Quantitative and qualitative evaluations of image quality demonstrate the viability of this approach and elucidate variations in performance due to wavelength, pixel color, and image processing.
Fever screening based on infrared (IR) thermographs (IRTs) is an approach that has been implemented during infectious disease pandemics, such as Ebola and Severe Acute Respiratory Syndrome. A ...recently published international standard indicates that regions medially adjacent to the inner canthi provide accurate estimates of core body temperature and are preferred sites for fever screening. Therefore, rapid, automated identification of the canthi regions within facial IR images may greatly facilitate rapid fever screening of asymptomatic travelers. However, it is more difficult to accurately identify the canthi regions from IR images than from visible images that are rich with exploitable features. In this study, we developed and evaluated techniques for multi-modality image registration (MMIR) of simultaneously captured visible and IR facial images for fever screening. We used free form deformation (FFD) models based on edge maps to improve registration accuracy after an affine transformation. Two widely used FFD models in medical image registration based on the Demons and cubic B-spline algorithms were qualitatively compared. The results showed that the Demons algorithm outperformed the cubic B-spline algorithm, likely due to overfitting of outliers by the latter method. The quantitative measure of registration accuracy, obtained through selected control point correspondence, was within 2.8 ± 1.2 mm, which enables accurate and automatic localization of canthi regions in the IR images for temperature measurement.
Optical coherence tomography (OCT) is a rapidly growing imaging modality, particularly in the field of ophthalmology. Accurate early diagnosis of diseases requires consistent and validated imaging ...performance. In contrast to more well-established medical imaging modalities, no standardized test methods currently exist for OCT quality assurance. We developed a retinal phantom which mimics the thickness and near-infrared optical properties of each anatomical retinal layer as well as the surface topography of the foveal pit. The fabrication process involves layer-by-layer spin coating of nanoparticle-embedded silicone films followed by laser micro-etching to modify the surface topography. The thickness of each layer and dimensions of the foveal pit are measured with high precision. The phantom is embedded into a commercially available, water-filled model eye to simulate ocular dispersion and emmetropic refraction, and for ease of use with clinical OCT systems. The phantom was imaged with research and clinical OCT systems to assess image quality and software accuracy. Our results indicate that this phantom may serve as a useful tool to evaluate and standardize OCT performance.
Significance: Expanded use of fluorescence-guided surgery with devices approved for use with indocyanine green (ICG) has led to a range of commercial systems available. There is a compelling need to ...be able to independently characterize system performance and allow for cross-system comparisons.
Aim: The goal of this work is to expand on previous proposed fluorescence imaging standard designs to develop a long-term stable phantom that spectrally matches ICG characteristics and utilizes 3D printing technology for incorporating tissue-equivalent materials.
Approach: A batch of test targets was created to assess ICG concentration sensitivity in the 0.3- to 1000-nM range, tissue-equivalent depth sensitivity down to 6 mm, and spatial resolution with a USAF test chart. Comparisons were completed with a range of systems that have significantly different imaging capabilities and applications, including the Li-Cor® Odyssey, Li-Cor® Pearl, PerkinElmer® Solaris, and Stryker® Spy Elite.
Results: Imaging of the ICG-matching phantoms with all four commercially available systems showed the ability to benchmark system performance and allow for cross-system comparisons. The fluorescence tests were able to assess differences in the detectable concentrations of ICG with sensitivity differences >10× for preclinical and clinical systems. Furthermore, the tests successfully assessed system differences in the depth-signal decay rate, as well as resolution performance and image artifacts. The manufacturing variations, photostability, and mechanical design of the tests showed promise in providing long-term stable standards for fluorescence imaging.
Conclusions: The presented ICG-matching phantom provides a major step toward standardizing performance characterization and cross-system comparisons for devices approved for use with ICG. The developed hybrid manufacturing platform can incorporate long-term stable fluorescing agents with 3D printed tissue-equivalent material. Further, long-term testing of the phantom and refinements to the manufacturing process are necessary for future implementation as a widely adopted fluorescence imaging standard.
Standardization of fluorescence molecular imaging (FMI) is critical for ensuring quality control in guiding surgical procedures. To accurately evaluate system performance, two metrics, the ...signal-to-noise ratio (SNR) and contrast, are widely employed. However, there is currently no consensus on how these metrics can be computed.SignificanceStandardization of fluorescence molecular imaging (FMI) is critical for ensuring quality control in guiding surgical procedures. To accurately evaluate system performance, two metrics, the signal-to-noise ratio (SNR) and contrast, are widely employed. However, there is currently no consensus on how these metrics can be computed.We aim to examine the impact of SNR and contrast definitions on the performance assessment of FMI systems.AimWe aim to examine the impact of SNR and contrast definitions on the performance assessment of FMI systems.We quantified the SNR and contrast of six near-infrared FMI systems by imaging a multi-parametric phantom. Based on approaches commonly used in the literature, we quantified seven SNRs and four contrast values considering different background regions and/or formulas. Then, we calculated benchmarking (BM) scores and respective rank values for each system.ApproachWe quantified the SNR and contrast of six near-infrared FMI systems by imaging a multi-parametric phantom. Based on approaches commonly used in the literature, we quantified seven SNRs and four contrast values considering different background regions and/or formulas. Then, we calculated benchmarking (BM) scores and respective rank values for each system.We show that the performance assessment of an FMI system changes depending on the background locations and the applied quantification method. For a single system, the different metrics can vary up to ∼ 35 dB (SNR), ∼ 8.65 a . u . (contrast), and ∼ 0.67 a . u . (BM score).ResultsWe show that the performance assessment of an FMI system changes depending on the background locations and the applied quantification method. For a single system, the different metrics can vary up to ∼ 35 dB (SNR), ∼ 8.65 a . u . (contrast), and ∼ 0.67 a . u . (BM score).The definition of precise guidelines for FMI performance assessment is imperative to ensure successful clinical translation of the technology. Such guidelines can also enable quality control for the already clinically approved indocyanine green-based fluorescence image-guided surgery.ConclusionsThe definition of precise guidelines for FMI performance assessment is imperative to ensure successful clinical translation of the technology. Such guidelines can also enable quality control for the already clinically approved indocyanine green-based fluorescence image-guided surgery.