Alzheimer's disease (AD) is characterized by the progressive alterations seen in brain images which give rise to the onset of various sets of symptoms. The variability in the dynamics of changes in ...both brain images and cognitive impairments remains poorly understood. This paper introduces AD Course Map a spatiotemporal atlas of Alzheimer's disease progression. It summarizes the variability in the progression of a series of neuropsychological assessments, the propagation of hypometabolism and cortical thinning across brain regions and the deformation of the shape of the hippocampus. The analysis of these variations highlights strong genetic determinants for the progression, like possible compensatory mechanisms at play during disease progression. AD Course Map also predicts the patient's cognitive decline with a better accuracy than the 56 methods benchmarked in the open challenge TADPOLE. Finally, AD Course Map is used to simulate cohorts of virtual patients developing Alzheimer's disease. AD Course Map offers therefore new tools for exploring the progression of AD and personalizing patients care.
•We reviewed the state-of-the-art on classification of AD based on CNN and T1 MRI.•We unveiled data leakage, leading to biased results, in some reviewed studies.•We proposed a framework for ...reproducible evaluation of AD classification methods.•We demonstrated the use of the proposed framework on three public datasets.•We assessed generalizability both within a dataset and between datasets.
Numerous machine learning (ML) approaches have been proposed for automatic classification of Alzheimer's disease (AD) from brain imaging data. In particular, over 30 papers have proposed to use convolutional neural networks (CNN) for AD classification from anatomical MRI. However, the classification performance is difficult to compare across studies due to variations in components such as participant selection, image preprocessing or validation procedure. Moreover, these studies are hardly reproducible because their frameworks are not publicly accessible and because implementation details are lacking. Lastly, some of these papers may report a biased performance due to inadequate or unclear validation or model selection procedures. In the present work, we aim to address these limitations through three main contributions. First, we performed a systematic literature review. We identified four main types of approaches: i) 2D slice-level, ii) 3D patch-level, iii) ROI-based and iv) 3D subject-level CNN. Moreover, we found that more than half of the surveyed papers may have suffered from data leakage and thus reported biased performance. Our second contribution is the extension of our open-source framework for classification of AD using CNN and T1-weighted MRI. The framework comprises previously developed tools to automatically convert ADNI, AIBL and OASIS data into the BIDS standard, and a modular set of image preprocessing procedures, classification architectures and evaluation procedures dedicated to deep learning. Finally, we used this framework to rigorously compare different CNN architectures. The data was split into training/validation/test sets at the very beginning and only the training/validation sets were used for model selection. To avoid any overfitting, the test sets were left untouched until the end of the peer-review process. Overall, the different 3D approaches (3D-subject, 3D-ROI, 3D-patch) achieved similar performances while that of the 2D slice approach was lower. Of note, the different CNN approaches did not perform better than a SVM with voxel-based features. The different approaches generalized well to similar populations but not to datasets with different inclusion criteria or demographical characteristics. All the code of the framework and the experiments is publicly available: general-purpose tools have been integrated into the Clinica software (www.clinica.run) and the paper-specific code is available at: https://github.com/aramis-lab/AD-DL.
Display omitted
Previous studies have reported a possible prodrome in multiple sclerosis (MS) defined by nonspecific symptoms including mood disorder or genitourinary symptoms and increased health care use detected ...several years before diagnosis. This study aimed to evaluate agnostically the associations between diseases and symptoms diagnosed in primary care and the risk of MS relative to controls and 2 other autoimmune inflammatory diseases with similar population characteristics, namely lupus and Crohn disease (CD).
A case-control study was conducted using electronic health records from the Health Improvement Network database in the United Kingdom and France. We agnostically assessed the associations between 113 diseases and symptoms in the 5 years before and after diagnosis in patients with subsequent diagnosis of MS. Individuals with a diagnosis of MS were compared with individuals without MS and individuals with 2 other autoimmune diseases, CD and lupus.
The study population consisted of patients with MS (n = 20,174), patients without MS (n = 54,790), patients with CD (n = 30,477), and patients with lupus (n = 7,337). Twelve
codes were significantly positively associated with the risk of MS compared with controls without MS. After considering
codes suggestive of neurologic symptoms as the first diagnosis of MS, 5
codes remained significantly associated with MS: depression (UK: odds ratio 1.22, 95% CI 1.11-1.34), sexual dysfunction (1.47, 1.11-1.95), constipation (1.5, 1.27-1.78), cystitis (1.21, 1.05-1.39), and urinary tract infections of unspecified site (1.38, 1.18-1.61). However, none of these conditions was selectively associated with MS in comparisons with both lupus and CD. All 5
codes identified were still associated with MS during the 5 years after diagnosis.
We identified 5 health conditions associated with subsequent MS diagnosis, which may be considered not only prodromal but also early-stage symptoms. However, these health conditions overlap with prodrome of 2 other autoimmune diseases; hence, they lack specificity to MS.
Alzheimer's Disease (AD) is a heterogeneous disease that disproportionately affects women and people with the APOE-
ε
4 susceptibility gene. We aim to describe the not-well-understood influence of ...both risk factors on the dynamics of brain atrophy in AD and healthy aging. Regional cortical thinning and brain atrophy were modeled over time using non-linear mixed-effect models and the FreeSurfer software with t1-MRI scans from the Alzheimer's Disease Neuroimaging Initiative (
N
= 1,502 subjects, 6,728 images in total). Covariance analysis was used to disentangle the effect of sex and APOE genotype on the regional onset age and pace of atrophy, while correcting for educational level. A map of the regions mostly affected by neurodegeneration is provided. Results were confirmed on gray matter density data from the SPM software. Women experience faster atrophic rates in the temporal, frontal, parietal lobes and limbic system and earlier onset in the amygdalas, but slightly later onset in the postcentral and cingulate gyri as well as all regions of the basal ganglia and thalamus. APOE-
ε
4 genotypes leads to earlier and faster atrophy in the temporal, frontal, parietal lobes, and limbic system in AD patients, but not in healthy patients. Higher education was found to slightly delay atrophy in healthy patients, but not for AD patients. A cohort of amyloid positive patients with MCI showed a similar impact of sex as in the healthy cohort, while APOE-
ε
4 showed similar associations as in the AD cohort. Female sex is as strong a risk factor for AD as APOE−ε4 genotype regarding neurodegeneration. Women experience a sharper atrophy in the later stages of the disease, although not a significantly earlier onset. These findings may have important implications for the development of targeted intervention.
A large number of papers have introduced novel machine learning and feature extraction methods for automatic classification of Alzheimer's disease (AD). However, while the vast majority of these ...works use the public dataset ADNI for evaluation, they are difficult to reproduce because different key components of the validation are often not readily available. These components include selected participants and input data, image preprocessing and cross-validation procedures. The performance of the different approaches is also difficult to compare objectively. In particular, it is often difficult to assess which part of the method (e.g. preprocessing, feature extraction or classification algorithms) provides a real improvement, if any. In the present paper, we propose a framework for reproducible and objective classification experiments in AD using three publicly available datasets (ADNI, AIBL and OASIS). The framework comprises: i) automatic conversion of the three datasets into a standard format (BIDS); ii) a modular set of preprocessing pipelines, feature extraction and classification methods, together with an evaluation framework, that provide a baseline for benchmarking the different components. We demonstrate the use of the framework for a large-scale evaluation on 1960 participants using T1 MRI and FDG PET data. In this evaluation, we assess the influence of different modalities, preprocessing, feature types (regional or voxel-based features), classifiers, training set sizes and datasets. Performances were in line with the state-of-the-art. FDG PET outperformed T1 MRI for all classification tasks. No difference in performance was found for the use of different atlases, image smoothing, partial volume correction of FDG PET images, or feature type. Linear SVM and L2-logistic regression resulted in similar performance and both outperformed random forests. The classification performance increased along with the number of subjects used for training. Classifiers trained on ADNI generalized well to AIBL and OASIS. All the code of the framework and the experiments is publicly available: general-purpose tools have been integrated into the Clinica software (www.clinica.run) and the paper-specific code is available at: https://gitlab.icm-institute.org/aramislab/AD-ML.
This paper proposes an original approach for the statistical analysis of longitudinal shape data. The proposed method allows the characterization of typical growth patterns and subject-specific shape ...changes in repeated time-series observations of several subjects. This can be seen as the extension of usual longitudinal statistics of scalar measurements to high-dimensional shape or image data. The method is based on the estimation of continuous subject-specific growth trajectories and the comparison of such temporal shape changes across subjects. Differences between growth trajectories are decomposed into morphological deformations, which account for shape changes independent of the time, and time warps, which account for different rates of shape changes over time. Given a longitudinal shape data set, we estimate a mean growth scenario representative of the population, and the variations of this scenario both in terms of shape changes and in terms of change in growth speed. Then, intrinsic statistics are derived in the space of spatiotemporal deformations, which characterize the typical variations in shape and in growth speed within the studied population. They can be used to detect systematic developmental delays across subjects. In the context of neuroscience, we apply this method to analyze the differences in the growth of the hippocampus in children diagnosed with autism, developmental delays and in controls. Result suggest that group differences may be better characterized by a different speed of maturation rather than shape differences at a given age. In the context of anthropology, we assess the differences in the typical growth of the endocranium between chimpanzees and bonobos. We take advantage of this study to show the robustness of the method with respect to change of parameters and perturbation of the age estimates.
Variability in neurodegenerative disease progression poses great challenges for the evaluation of potential treatments. Identifying the persons who will experience significant progression in the ...short term is key for the implementation of trials with smaller sample sizes. We apply here disease course mapping to forecast biomarker progression for individual carriers of the pathological CAG repeat expansions responsible for Huntington disease. We used data from two longitudinal studies (TRACK-HD and TRACK-ON) to synchronize temporal progression of 15 clinical and imaging biomarkers from 290 participants with Huntington disease. We used then the resulting HD COURSE MAP to forecast clinical endpoints from the baseline data of 11,510 participants from ENROLL-HD, an external validation cohort. We used such forecasts to select participants at risk for progression and compute the power of trials for such an enriched population. HD COURSE MAP forecasts biomarkers 5 years after the baseline measures with a maximum mean absolute error of 10 points for the total motor score and 2.15 for the total functional capacity. This allowed reducing sample sizes in trial up to 50% including participants with a higher risk for progression ensuring a more homogeneous group of participants.
Background
Studies showed the impact of sex and onset site (spinal or bulbar) on disease onset and survival in ALS. However, they mainly result from cross-sectional or survival analysis, and the ...interaction of sex and onset site on the different proxies of disease trajectory has not been fully investigated.
Methods
We selected all patients with repeated observations in the PRO-ACT database. We divided them into four groups depending on their sex and onset site. We estimated a multivariate disease progression model, named ALS Course Map, to investigate the combined temporal changes of the four sub-scores of the revised ALS functional rating scale (ALSFRSr), the forced vital capacity (FVC), and the body mass index (BMI). We then compared the progression rate, the estimated age at onset, and the relative progression of the outcomes across each group.
Results
We included 1438 patients from the PRO-ACT database. They were 51% men with spinal onset, 12% men with bulbar onset, 26% women with spinal onset, and 11% women with bulbar onset. We showed a significant influence of both sex and onset site on the ALSFRSr progression. The BMI decreased 8.9 months earlier (95% CI 3.9, 13.8) in women than men, after correction for the onset site. Among patients with bulbar onset, FVC was impaired 2.6 months earlier (95% CI 0.6, 4.6) in women.
Conclusion
Using a multivariable disease modelling approach, we showed that sex and onset site are important drivers of the progression of motor function, BMI, and FVC decline.
•We systematically and quantitatively review 234 experiments from 111 articles predicting the future progression to Alzheimer’s disease, reporting their characteristics in terms of algorithm, input ...features, methodological issues and performance measures.•We show that the best performances were achieved using cognition or fluorodeoxyglucose-positron emission tomography, whereas T1 magnetic resonance imaging lead to relatively lower performance.•We identify methodological issues regarding misuse of the test set in 26.5% of articles.•We show that short-term predictions are likely not to perform better than predicting that subjects stay stable over time.•We propose several guidelines for the development of methods aiming to predict the future, such as the need to pre-register the time-to-prediction and a careful choice of the control group that needs to be followed for the same period of time.
Display omitted
We performed a systematic review of studies focusing on the automatic prediction of the progression of mild cognitive impairment to Alzheimer’s disease (AD) dementia, and a quantitative analysis of the methodological choices impacting performance. This review included 172 articles, from which 234 experiments were extracted. For each of them, we reported the used data set, the feature types, the algorithm type, performance and potential methodological issues. The impact of these characteristics on the performance was evaluated using a multivariate mixed effect linear regressions. We found that using cognitive, fluorodeoxyglucose-positron emission tomography or potentially electroencephalography and magnetoencephalography variables significantly improved predictive performance compared to not including them, whereas including other modalities, in particular T1 magnetic resonance imaging, did not show a significant effect. The good performance of cognitive assessments questions the wide use of imaging for predicting the progression to AD and advocates for exploring further fine domain-specific cognitive assessments. We also identified several methodological issues, including the absence of a test set, or its use for feature selection or parameter tuning in nearly a fourth of the papers. Other issues, found in 15% of the studies, cast doubts on the relevance of the method to clinical practice. We also highlight that short-term predictions are likely not to be better than predicting that subjects stay stable over time. These issues highlight the importance of adhering to good practices for the use of machine learning as a decision support system for the clinical practice.
Multiple sclerosis (MS) is a demyelinating and inflammatory disease of the central nervous system (CNS). The demyelination process can be repaired by the generation of a new sheath of myelin around ...the axon, a process termed remyelination. In MS patients, the demyelination–remyelination cycles are highly dynamic. Over the years, magnetic resonance imaging (MRI) has been increasingly used in the diagnosis of MS and it is currently the most useful paraclinical tool to assess this diagnosis. However, conventional MRI pulse sequences are not specific for pathological mechanisms such as demyelination and remyelination. Recently, positron emission tomography (PET) with radiotracer 11CPIB has become a promising tool to measure in-vivo myelin content changes which is essential to push forward our understanding of mechanisms involved in the pathology of MS, and to monitor individual patients in the context of clinical trials focused on repair therapies. However, PET imaging is invasive due to the injection of a radioactive tracer. Moreover, it is an expensive imaging test and not offered in the majority of medical centers in the world. In this work, by using multisequence MRI, we thus propose a method to predict the parametric map of 11CPIB PET, from which we derived the myelin content changes in a longitudinal analysis of patients with MS. The method is based on the proposed conditional flexible self-attention GAN (CF-SAGAN) which is specifically adjusted for high-dimensional medical images and able to capture the relationships between the spatially separated lesional regions during the image synthesis process. Jointly applying the sketch-refinement process and the proposed attention regularization that focuses on the MS lesions, our approach is shown to outperform the state-of-the-art methods qualitatively and quantitatively. Specifically, our method demonstrated a superior performance for the prediction of myelin content at voxel-wise level. More important, our method for the prediction of myelin content changes in patients with MS shows similar clinical correlations to the PET-derived gold standard indicating the potential for clinical management of patients with MS.