Having gained a tremendous amount of popularity since its introduction in 2006, tract-based spatial statistics (TBSS) can now be considered as the standard approach for voxel-based analysis (VBA) of ...diffusion tensor imaging (DTI) data. Aiming to improve the sensitivity, objectivity, and interpretability of multi-subject DTI studies, TBSS includes a skeletonization step that alleviates residual image misalignment and obviates the need for data smoothing. Although TBSS represents an elegant and user-friendly framework that tackles numerous concerns existing in conventional VBA methods, it has limitations of its own, some of which have already been detailed in recent literature. In this work, we present general methodological considerations on TBSS and report on pitfalls that have not been described previously. In particular, we have identified specific assumptions of TBSS that may not be satisfied under typical conditions. Moreover, we demonstrate that the existence of such violations can severely affect the reliability of TBSS results. With TBSS being used increasingly, it is of paramount importance to acquaint TBSS users with these concerns, such that a well-informed decision can be made as to whether and how to pursue a TBSS analysis. Finally, in addition to raising awareness by providing our new insights, we provide constructive suggestions that could improve the validity and increase the impact of TBSS drastically.
•We investigate tract-based spatial statistics (TBSS) considering potential pitfalls.•TBSS is not tract-specific and we show how this may falsify results.•User defined parameters strongly influence the final TBSS-derived results.•We provide suggestions that improve the validity and increase the impact of TBSS.
Abstract The intravoxel incoherent motion (IVIM) theory provides a framework for the separation of perfusion and diffusion effects in diffusion-weighted imaging (DWI). To measure the three free IVIM ...parameters, DWIs with several diffusion weightings b must be acquired. To date, the used b value distributions are chosen heuristically and vary greatly among researchers. In this work, optimal b value distributions for the three parameter fit are determined using Monte-Carlo simulations for the measurement of a low, medium and high IVIM perfusion regime. The first 16 b values of a b value distribution, which was optimized to be appropriate for all three regimes, are {0, 40, 1000, 240, 10, 750, 90, 390, 170, 10, 620, 210, 100, 0, 530 and 970} in units of seconds per square meter. This distribution performed well for all organs and outperformed a distribution frequently used in the literature. In case of limited acquisition time, the b values should be chosen in the given order, but at least 10 b values should be used for current clinical settings. The overall parameter estimation quality depends strongly and nonlinearly on the signal-to-noise ratio (SNR): it is essential that the SNR is considerably higher than a critical SNR. This critical SNR is about 8 for medium and high IVIM perfusion and 50 for the low IVIM perfusion regime. Initial in vivo IVIM measurements were performed in the abdomen and were in keeping with the numerically simulated results.
To use the "apparent diffusion coefficient" (Dapp) as a quantitative imaging parameter, well-suited test fluids are essential. In this study, the previously proposed aqueous solutions of ...polyvinylpyrrolidone (PVP) were examined and temperature calibrations were obtained. For example, at a temperature of 20°C, Dapp ranged from 1.594 (95% CI: 1.593, 1.595) μm2/ms to 0.3326 (95% CI: 0. 3304, 0.3348) μm2/ms for PVP-concentrations ranging from 10% (w/w) to 50% (w/w) using K30 polymer lengths. The temperature dependence of Dapp was found to be so strong that a negligence seems not advisable. The temperature dependence is descriptively modelled by an exponential function exp(c2 (T - 20°C)) and the determined c2 values are reported, which can be used for temperature calibration. For example, we find the value 0.02952 K-1 for 30% (w/w) PVP-concentration and K30 polymer length. In general, aqueous PVP solutions were found to be suitable to produce easily applicable and reliable Dapp-phantoms.
Magnetic resonance (MR) diffusion-weighted imaging (DWI) is often used to detect focal liver lesions (FLLs), though DWI image quality can be limited in the left liver lobe owing to the pulsatile ...motion of the nearby heart. Flow-compensated (FloCo) diffusion encoding has been shown to reduce this pulsation artifact. The purpose of this prospective study was to intra-individually compare DWI of the liver acquired with conventional monopolar and FloCo diffusion encoding for assessing metastatic FLLs in non-cirrhotic patients. Forty patients with known or suspected multiple metastatic FLLs were included and measured at 1.5 T field strength with a conventional (monopolar) and a FloCo diffusion encoding EPI sequence (single refocused; b-values, 50 and 800 s/mm2). Two board-certified radiologists analyzed the DWI images independently. They issued Likert-scale ratings (1 = worst, 5 = best) for pulsation artifact severity and counted the difference of lesions visible at b = 800 s/mm² separately for small and large FLLs (i.e., < 1 cm or > 1 cm) and separately for left and right liver lobe. Differences between the two diffusion encodings were assessed with the Wilcoxon signed-rank test. Both readers found a reduction in pulsation artifact in the liver with FloCo encoding (p < 0.001 for both liver lobes). More small lesions were detected with FloCo diffusion encoding in both liver lobes (left lobe: six and seven additional lesions by readers 1 and 2, respectively; right lobe: five and seven additional lesions for readers 1 and 2, respectively). Both readers found one additional large lesion in the left liver lobe. Thus, flow-compensated diffusion encoding appears more effective than monopolar diffusion encoding for the detection of liver metastases.
Purpose
To investigate and to provide guidance for sample size selection based on the current practice in MR technical development studies in which healthy volunteers are examined.
Methods
All ...original articles published in Magnetic Resonance in Medicine between 2017 and 2019 were investigated and categorized according to technique, anatomical region, and magnetic field strength. The number of examined healthy volunteers (ie, the sample size) was collected and evaluated, whereas the number of patients was not considered. Papers solely measuring patients, animals, phantoms, specimens, or studies using existing data, for example, from an open databank, or consisting only of theoretical work or simulations were excluded.
Results
The median sample size of the 882 included studies was 6. There were some peaks in the sample size distribution (eg, 1, 5, and 10). In 49.9%, 82.1%, and 95.6% of the studies, the sample size was smaller or equal to 5, 10, and 20, respectively.
Conclusion
We observed a large variance in sample sizes reflecting the variety of studies published in Magnetic Resonance in Medicine. Therefore, it can be concluded that it is current practice to balance the need for statistical power with the demand to minimize experiments involving healthy humans, often by choosing small sample sizes between 1 and 10. Naturally, this observation does not release an investigator from ensuring that sufficient data are acquired to reach statistical conclusions.
Purpose
To enable a fast and automatic deep learning–based QSM reconstruction of tissues with diverse chemical shifts, relevant to most regions outside the brain.
Methods
A UNET was trained to ...reconstruct susceptibility maps using synthetically generated, unwrapped, multi‐echo phase data as input. The RMS error with respect to synthetic validation data was computed. The method was tested on two in vivo knee and two pelvis data sets. Comparisons were made to a conventional fat–water separation pipeline by applying a commonly used graph‐cut algorithm, both without and with an extended mask for background field removal (FWS‐CONV‐QSM and FWS‐MASK‐CONV‐QSM, respectively). Several regions of interest were segmented and compared. Furthermore, the approach was tested on a prostate cancer patient receiving low‐dose‐rate brachytherapy, to detect and localize the seeds by MRI.
Results
The RMS error was 0.292 ppm with FWS‐CONV‐QSM and 0.123 ppm for the UNET approach. Susceptibility maps were reconstructed much faster (< 10 s) and completely automatically (no background masking needed) by the UNET compared with the other applied techniques (5 min 51 s and 22 min 44 s for CONV‐QSM and FWS‐MASK‐CONV‐QSM, respectively. Background artifacts, fat–water swaps, and hypointense artifacts between I‐125 seeds of a patient receiving low‐dose brachytherapy in the prostate were largely reduced in the UNET approach.
Conclusions
Deep learning–based QSM reconstruction, trained solely with synthetic data, is well‐suited to rapidly reconstructing high‐quality susceptibility maps in the presence of fat without needing masking for background field removal.
Purpose
To assess radiomics as a tool to determine how well lesions found suspicious on breast cancer screening X‐ray mammography can be categorized into malignant and benign with unenhanced magnetic ...resonance (MR) mammography with diffusion‐weighted imaging and T2‐weighted sequences.
Materials and Methods
From an asymptomatic screening cohort, 50 women with mammographically suspicious findings were examined with contrast‐enhanced breast MRI (ceMRI) at 1.5T. Out of this protocol an unenhanced, abbreviated diffusion‐weighted imaging protocol (ueMRI) including T2‐weighted, (T2w), diffusion‐weighted imaging (DWI), and DWI with background suppression (DWIBS) sequences and corresponding apparent diffusion coefficient (ADC) maps were extracted. From ueMRI‐derived radiomic features, three Lasso‐supervised machine‐learning classifiers were constructed and compared with the clinical performance of a highly experienced radiologist: 1) univariate mean ADC model, 2) unconstrained radiomic model, 3) constrained radiomic model with mandatory inclusion of mean ADC.
Results
The unconstrained and constrained radiomic classifiers consisted of 11 parameters each and achieved differentiation of malignant from benign lesions with a .632 + bootstrap receiver operating characteristics (ROC) area under the curve (AUC) of 84.2%/85.1%, compared to 77.4% for mean ADC and 95.9%/95.9% for the experienced radiologist using ceMRI/ueMRI.
Conclusion
In this pilot study we identified two ueMRI radiomics classifiers that performed well in the differentiation of malignant from benign lesions and achieved higher performance than the mean ADC parameter alone. Classification was lower than the almost perfect performance of a highly experienced breast radiologist. The potential of radiomics to provide a training‐independent diagnostic decision tool is indicated. A performance reaching the human expert would be highly desirable and based on our results is considered possible when the concept is extended in larger cohorts with further development and validation of the technique.
Level of Evidence: 1
Technical Efficacy: Stage 2
J. MAGN. RESON. IMAGING 2017;46:604–616