To investigate the effect of enriching mild cognitive impairment (MCI) clinical trials using combined markers of amyloid pathology and neurodegeneration.
We evaluate an implementation of the recent ...National Institute for Aging-Alzheimer's Association (NIA-AA) diagnostic criteria for MCI due to Alzheimer disease (AD) as inclusion criteria in clinical trials and assess the effect of enrichment with amyloid (A+), neurodegeneration (N+), and their combination (A+N+) on the rate of clinical progression, required sample sizes, and estimates of trial time and cost.
Enrichment based on an individual marker (A+ or N+) substantially improves all assessed trial characteristics. Combined enrichment (A+N+) further improves these results with a reduction in required sample sizes by 45% to 60%, depending on the endpoint.
Operationalizing the NIA-AA diagnostic criteria for clinical trial screening has the potential to substantially improve the statistical power of trials in MCI due to AD by identifying a more rapidly progressing patient population.
Artificial intelligence (AI) methods have been applied to medical imaging for several decades, but in the last few years, the number of publications and the number of AI-enabled medical devices ...coming on the market have significantly increased. While some AI-enabled approaches are proving very valuable, systematic reviews of the AI imaging field identify significant weaknesses in a significant proportion of the literature. Medical device regulators have recently become more proactive in publishing guidance documents and recognizing standards that will require that the development and validation of AI-enabled medical devices need to be more rigorous than required for tradition "rule-based" software. In particular, developers are required to better identify and mitigate risks (such as bias) that arise in AI-enabled devices, and to ensure that the devices are validated in a realistic clinical setting to ensure their output is clinically meaningful. While this evolving regulatory landscape will mean that device developers will take longer to bring novel AI-based medical imaging devices to market, such additional rigour is necessary to address existing weaknesses in the field and ensure that patients and healthcare professionals can trust AI-enabled devices. There would also be benefits in the academic community taking into account this regulatory framework, to improve the quality of the literature and make it easier for academically developed AI tools to make the transition to medical devices that impact healthcare.
Abstract Background The promise of Alzheimer’s disease biomarkers has led to their incorporation in new diagnostic criteria and in therapeutic trials; however, significant barriers exist to ...widespread use. Chief among these is the lack of internationally accepted standards for quantitative metrics. Hippocampal volumetry is the most widely studied quantitative magnetic resonance imaging measure in Alzheimer’s disease and thus represents the most rational target for an initial effort at standardization. Methods and Results The authors of this position paper propose a path toward this goal. The steps include the following: (1) Establish and empower an oversight board to manage and assess the effort, (2) adopt the standardized definition of anatomic hippocampal boundaries on magnetic resonance imaging arising from the European Alzheimer’s Disease Centers–Alzheimer’s Disease Neuroimaging Initiative hippocampal harmonization effort as a reference standard, (3) establish a scientifically appropriate, publicly available reference standard data set based on manual delineation of the hippocampus in an appropriate sample of subjects (Alzheimer’s Disease Neuroimaging Initiative), and (4) define minimum technical and prognostic performance metrics for validation of new measurement techniques using the reference standard data set as a benchmark. Conclusions Although manual delineation of the hippocampus is the best available reference standard, practical application of hippocampal volumetry will require automated methods. Our intent was to establish a mechanism for credentialing automated software applications to achieve internationally recognized accuracy and prognostic performance standards that lead to the systematic evaluation and then widespread acceptance and use of hippocampal volumetry. The standardization and assay validation process outlined for hippocampal volumetry was envisioned as a template that could be applied to other imaging biomarkers.
There is great interest in using automatic computational neuroanatomy tools to study ageing and neurodegenerative disease. Voxel-based morphometry (VBM) is one of the most widely used of such ...techniques. VBM performs voxel-wise statistical analysis of smoothed spatially normalised segmented Magnetic Resonance Images. There are several reasons why the analysis should include only voxels within a certain mask. We show that one of the most commonly used strategies for defining this mask runs a major risk of excluding from the analysis precisely those voxels where the subjects' brains were most vulnerable to atrophy. We investigate the issues related to mask construction, and recommend the use of alternative strategies which greatly decrease this danger of false negatives.
Measures of overlap of labelled regions of images, such as the Dice and Tanimoto coefficients, have been extensively used to evaluate image registration and segmentation algorithms. Modern studies ...can include multiple labels defined on multiple images yet most evaluation schemes report one overlap per labelled region, simply averaged over multiple images. In this paper, common overlap measures are generalized to measure the total overlap of ensembles of labels defined on multiple test images and account for fractional labels using fuzzy set theory. This framework allows a single "figure-of-merit" to be reported which summarises the results of a complex experiment by image pair, by label or overall. A complementary measure of error, the overlap distance, is defined which captures the spatial extent of the nonoverlapping part and is related to the Hausdorff distance computed on grey level images. The generalized overlap measures are validated on synthetic images for which the overlap can be computed analytically and used as similarity measures in nonrigid registration of three-dimensional magnetic resonance imaging (MRI) brain images. Finally, a pragmatic segmentation ground truth is constructed by registering a magnetic resonance atlas brain to 20 individual scans, and used with the overlap measures to evaluate publicly available brain segmentation algorithms
Purpose
A standard MRI system phantom has been designed and fabricated to assess scanner performance, stability, comparability and assess the accuracy of quantitative relaxation time imaging. The ...phantom is unique in having traceability to the International System of Units, a high level of precision, and monitoring by a national metrology institute. Here, we describe the phantom design, construction, imaging protocols, and measurement of geometric distortion, resolution, slice profile, signal‐to‐noise ratio (SNR), proton‐spin relaxation times, image uniformity and proton density.
Methods
The system phantom, designed by the International Society of Magnetic Resonance in Medicine ad hoc committee on Standards for Quantitative MR, is a 200 mm spherical structure that contains a 57‐element fiducial array; two relaxation time arrays; a proton density/SNR array; resolution and slice‐profile insets. Standard imaging protocols are presented, which provide rapid assessment of geometric distortion, image uniformity, T1 and T2 mapping, image resolution, slice profile, and SNR.
Results
Fiducial array analysis gives assessment of intrinsic geometric distortions, which can vary considerably between scanners and correction techniques. This analysis also measures scanner/coil image uniformity, spatial calibration accuracy, and local volume distortion. An advanced resolution analysis gives both scanner and protocol contributions. SNR analysis gives both temporal and spatial contributions.
Conclusions
A standard system phantom is useful for characterization of scanner performance, monitoring a scanner over time, and to compare different scanners. This type of calibration structure is useful for quality assurance, benchmarking quantitative MRI protocols, and to transition MRI from a qualitative imaging technique to a precise metrology with documented accuracy and uncertainty.
Abstract Background Regulatory qualification of a biomarker for a defined context of use provides scientifically robust assurances to sponsors and regulators that accelerate appropriate adoption of ...biomarkers into drug development. Methods The Coalition Against Major Diseases submitted a dossier to the Scientific Advice Working Party of the European Medicines Agency requesting a qualification opinion on the use of hippocampal volume as a biomarker for enriching clinical trials in subjects with mild cognitive impairment, incorporating a scientific rationale, a literature review and a de novo analysis of Alzheimer's Disease Neuroimaging Initiative data. Results The literature review and de novo analysis were consistent with the proposed context of use, and the Committee for Medicinal Products for Human Use released an opinion in November 2011. Conclusions We summarize the scientific rationale and the data that supported the first qualification of an imaging biomarker by the European Medicines Agency.
Abstract The objective of this study was to evaluate the effect of computational algorithm, measurement variability, and cut point on hippocampal volume (HCV)-based patient selection for clinical ...trials in mild cognitive impairment (MCI). We used normal control and amnestic MCI subjects from the Alzheimer's Disease Neuroimaging Initiative 1 (ADNI-1) as normative reference and screening cohorts. We evaluated the enrichment performance of 4 widely used hippocampal segmentation algorithms (FreeSurfer, Hippocampus Multi-Atlas Propagation and Segmentation (HMAPS), Learning Embeddings Atlas Propagation (LEAP), and NeuroQuant) in terms of 2-year changes in Mini-Mental State Examination (MMSE), Alzheimer's Disease Assessment Scale-Cognitive Subscale (ADAS-Cog), and Clinical Dementia Rating Sum of Boxes (CDR-SB). We modeled the implications for sample size, screen fail rates, and trial cost and duration. HCV based patient selection yielded reduced sample sizes (by ∼40%–60%) and lower trial costs (by ∼30%–40%) across a wide range of cut points. These results provide a guide to the choice of HCV cut point for amnestic MCI clinical trials, allowing an informed tradeoff between statistical and practical considerations.