Objective
The APOE4 allele is the strongest genetic risk factor for sporadic Alzheimer disease (AD). Case–control studies suggest the APOE4 link to AD is stronger in women. We examined the ...APOE4‐by‐sex interaction in conversion risk (from healthy aging to mild cognitive impairment (MCI)/AD or from MCI to AD) and cerebrospinal fluid (CSF) biomarker levels.
Methods
Cox proportional hazards analysis was used to compute hazard ratios (HRs) for an APOE‐by‐sex interaction on conversion in controls (n = 5,496) and MCI patients (n = 2,588). The interaction was also tested in CSF biomarker levels of 980 subjects from the Alzheimer's Disease Neuroimaging Initiative.
Results
Among controls, male and female carriers were more likely to convert to MCI/AD, but the effect was stronger in women (HR = 1.81 for women; HR = 1.27 for men; interaction: p = 0.011). The interaction remained significant in a predefined subanalysis restricted to APOE3/3 and APOE3/4 genotypes. Among MCI patients, both male and female APOE4 carriers were more likely to convert to AD (HR = 2.16 for women; HR = 1.64 for men); the interaction was not significant (p = 0.14). In the subanalysis restricted to APOE3/3 and APOE3/4 genotypes, the interaction was significant (p = 0.02; HR = 2.17 for women; HR = 1.51 for men). The APOE4‐by‐sex interaction on biomarker levels was significant for MCI patients for total tau and the tau‐to‐Aβ ratio (p = 0.009 and p = 0.02, respectively; more AD‐like in women).
Interpretation
APOE4 confers greater AD risk in women. Biomarker results suggest that increased APOE‐related risk in women may be associated with tau pathology. These findings have important clinical implications and suggest novel research approaches into AD pathogenesis. Ann Neurol 2014;75:563–573
Full text
Available for:
BFBNIB, FZAB, GIS, IJS, KILJ, NLZOH, NUK, OILJ, SBCE, SBMB, UL, UM, UPUK
At this moment, databanks worldwide contain brain images of previously unimaginable numbers. Combined with developments in data science, these massive data provide the potential to better understand ...the genetic underpinnings of brain diseases. However, different datasets, which are stored at different institutions, cannot always be shared directly due to privacy and legal concerns, thus limiting the full exploitation of big data in the study of brain disorders. Here we propose a federated learning framework for securely accessing and meta-analyzing any biomedical data without sharing individual information. We illustrate our framework by investigating brain structural relationships across diseases and clinical cohorts. The framework is first tested on synthetic data and then applied to multi-centric, multi-database studies including ADNI, PPMI, MIRIAD and UK Biobank, showing the potential of the approach for further applications in distributed analysis of multi-centric cohorts.
Microarray technologies are established approaches for high throughput gene expression, methylation and genotyping analysis. An accurate mapping of the array probes is essential to generate reliable ...biological findings. However, manufacturers of the microarray platforms typically provide incomplete and outdated annotation tables, which often rely on older genome and transcriptome versions that differ substantially from up-to-date sequence databases. Here, we present the Re-Annotator, a re-annotation pipeline for microarray probe sequences. It is primarily designed for gene expression microarrays but can also be adapted to other types of microarrays. The Re-Annotator uses a custom-built mRNA reference database to identify the positions of gene expression array probe sequences. We applied Re-Annotator to the Illumina Human-HT12 v4 microarray platform and found that about one quarter (25%) of the probes differed from the manufacturer's annotation. In further computational experiments on experimental gene expression data, we compared Re-Annotator to another probe re-annotation tool, ReMOAT, and found that Re-Annotator provided an improved re-annotation of microarray probes. A thorough re-annotation of probe information is crucial to any microarray analysis. The Re-Annotator pipeline is freely available at http://sourceforge.net/projects/reannotator along with re-annotated files for Illumina microarrays HumanHT-12 v3/v4 and MouseRef-8 v2.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
The brain's functional connectome is dynamic, constantly reconfiguring in an individual-specific manner. However, which characteristics of such reconfigurations are subject to genetic effects, and to ...what extent, is largely unknown. Here, we identified heritable dynamic features, quantified their heritability, and determined their association with cognitive phenotypes. In resting-state fMRI, we obtained multivariate features, each describing a temporal or spatial characteristic of connectome dynamics jointly over a set of connectome states. We found strong evidence for heritability of temporal features, particularly, Fractional Occupancy (FO) and Transition Probability (TP), representing the duration spent in each connectivity configuration and the frequency of shifting between configurations, respectively. These effects were robust against methodological choices of number of states and global signal regression. Genetic effects explained a substantial proportion of phenotypic variance of these features (h2=0.39, 95% CI= .24,.54 for FO; h2=0.43, 95% CI=.29,.57 for TP). Moreover, these temporal phenotypes were associated with cognitive performance. Contrarily, we found no robust evidence for heritability of spatial features of the dynamic states (i.e., states’ Modularity and connectivity pattern). Genetic effects may therefore primarily contribute to how the connectome transitions across states, rather than the precise spatial instantiation of the states in individuals. In sum, genetic effects impact the dynamic trajectory of state transitions (captured by FO and TP), and such temporal features may act as endophenotypes for cognitive abilities.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
Quantitative computed tomographic (CT) measures of baseline disease severity might identify patients with idiopathic pulmonary fibrosis (IPF) with an increased mortality risk. We evaluated whether ...quantitative CT variables could act as a cohort enrichment tool in future IPF drug trials.
To determine whether computer-derived CT measures, specifically measures of pulmonary vessel-related structures (VRSs), can better predict functional decline and survival in IPF and reduce requisite sample sizes in drug trial populations.
Patients with IPF undergoing volumetric noncontrast CT imaging at the Royal Brompton Hospital, London, and St. Antonius Hospital, Utrecht, were examined to identify pulmonary function measures (including FVC) and visual and computer-derived (CALIPER Computer-Aided Lung Informatics for Pathology Evaluation and Rating software) CT features predictive of mortality and FVC decline. The discovery cohort comprised 247 consecutive patients, with validation of results conducted in a separate cohort of 284 patients, all fulfilling drug trial entry criteria.
In the discovery and validation cohorts, CALIPER-derived features, particularly VRS scores, were among the strongest predictors of survival and FVC decline. CALIPER results were accentuated in patients with less extensive disease, outperforming pulmonary function measures. When used as a cohort enrichment tool, a CALIPER VRS score greater than 4.4% of the lung was able to reduce the requisite sample size of an IPF drug trial by 26%.
Our study has validated a new quantitative CT measure in patients with IPF fulfilling drug trial entry criteria-the VRS score-that outperformed current gold standard measures of outcome. When used for cohort enrichment in an IPF drug trial setting, VRS threshold scores can reduce a required IPF drug trial population size by 25%, thereby limiting prohibitive trial costs. Importantly, VRS scores identify patients in whom antifibrotic medication prolongs life and reduces FVC decline.
Genome-Wide Association Studies of imaging-derived biomarkers in Alzheimer's disease have focused on phenotypes derived from single imaging modalities. Scelsi et al. report the first use of a ...multimodal phenotype, derived through disease progression modelling, and identify a novel variant that is protective against conversion from healthy ageing to Alzheimer's disease.
Abstract
Identifying genetic risk factors underpinning different aspects of Alzheimer's disease has the potential to provide important insights into pathogenesis. Moving away from simple case-control definitions, there is considerable interest in using quantitative endophenotypes, such as those derived from imaging as outcome measures. Previous genome-wide association studies of imaging-derived biomarkers in sporadic late-onset Alzheimer's disease focused only on phenotypes derived from single imaging modalities. In contrast, we computed a novel multi-modal neuroimaging phenotype comprising cortical amyloid burden and bilateral hippocampal volume. Both imaging biomarkers were used as input to a disease progression modelling algorithm, which estimates the biomarkers' long-term evolution curves from population-based longitudinal data. Among other parameters, the algorithm computes the shift in time required to optimally align a subjects' biomarker trajectories with these population curves. This time shift serves as a disease progression score and it was used as a quantitative trait in a discovery genome-wide association study with n = 944 subjects from the Alzheimer's Disease Neuroimaging Initiative database diagnosed as Alzheimer's disease, mild cognitive impairment or healthy at the time of imaging. We identified a genome-wide significant locus implicating LCORL (rs6850306, chromosome 4; P = 1.03 × 10−8). The top variant rs6850306 was found to act as an expression quantitative trait locus for LCORL in brain tissue. The clinical role of rs6850306 in conversion from healthy ageing to mild cognitive impairment or Alzheimer's disease was further validated in an independent cohort comprising healthy, older subjects from the National Alzheimer's Coordinating Center database. Specifically, possession of a minor allele at rs6850306 was protective against conversion from mild cognitive impairment to Alzheimer's disease in the National Alzheimer's Coordinating Center cohort (hazard ratio = 0.593, 95% confidence interval = 0.387-0.907, n = 911, PBonf = 0.032), in keeping with the negative direction of effect reported in the genome-wide association study (βdisease progression score = −0.07 ± 0.01). The implicated locus is linked to genes with known connections to Alzheimer's disease pathophysiology and other neurodegenerative diseases. Using multimodal imaging phenotypes in association studies may assist in unveiling the genetic drivers of the onset and progression of complex diseases.
•Three consistent atrophy subtypes were found in both ADNI and UK biobank datasets.•High consistency in individuals’ subtype and stage assignment under different models.•Data harmonization is ...essential to achieve successful model transfer.•The typical subtype shows the most AD-like CSF biomarker profile.•Cholesterol and blood pressure medications are associated with subtypes.
In the past, methods to subtype or biotype patients using brain imaging data have been developed. However, it is unclear whether and how these trained machine learning models can be successfully applied to population cohorts to study the genetic and lifestyle factors underpinning these subtypes. This work, using the Subtype and Stage Inference (SuStaIn) algorithm, examines the generalisability of data-driven Alzheimer's disease (AD) progression models.
We first compared SuStaIn models trained separately on Alzheimer's disease neuroimaging initiative (ADNI) data and an AD-at-risk population constructed from the UK Biobank dataset. We further applied data harmonization techniques to remove cohort effects. Next, we built SuStaIn models on the harmonized datasets, which were then used to subtype and stage subjects in the other harmonized dataset.
The first key finding is that three consistent atrophy subtypes were found in both datasets, which match the previously identified subtype progression patterns in AD: ‘typical’, ‘cortical’ and ‘subcortical’. Next, the subtype agreement was further supported by high consistency in individuals’ subtypes and stage assignment based on the different models: more than 92% of the subjects, with reliable subtype assignment in both ADNI and UK Biobank dataset, were assigned to an identical subtype under the model built on the different datasets. The successful transferability of AD atrophy progression subtypes across cohorts capturing different phases of disease development enabled further investigations of associations between AD atrophy subtypes and risk factors. Our study showed that (1) the average age is highest in the typical subtype and lowest in the subcortical subtype; (2) the typical subtype is associated with statistically more-AD-like cerebrospinal fluid biomarkers values in comparison to the other two subtypes; and (3) in comparison to the subcortical subtype, the cortical subtype subjects are more likely to associate with prescription of cholesterol and high blood pressure medications.
In summary, we presented cross-cohort consistent recovery of AD atrophy subtypes, showing how the same subtypes arise even in cohorts capturing substantially different disease phases. Our study opened opportunities for future detailed investigations of atrophy subtypes with a broad range of early risk factors, which will potentially lead to a better understanding of the disease aetiology and the role of lifestyle and behaviour on AD.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
Motivation: In life sciences, interpretability of machine learning models is as important as their prediction accuracy. Linear models are probably the most frequently used methods for assessing ...feature relevance, despite their relative inflexibility. However, in the past years effective estimators of feature relevance have been derived for highly complex or non-parametric models such as support vector machines and RandomForest (RF) models. Recently, it has been observed that RF models are biased in such a way that categorical variables with a large number of categories are preferred. Results: In this work, we introduce a heuristic for normalizing feature importance measures that can correct the feature importance bias. The method is based on repeated permutations of the outcome vector for estimating the distribution of measured importance for each variable in a non-informative setting. The P-value of the observed importance provides a corrected measure of feature importance. We apply our method to simulated data and demonstrate that (i) non-informative predictors do not receive significant P-values, (ii) informative variables can successfully be recovered among non-informative variables and (iii) P-values computed with permutation importance (PIMP) are very helpful for deciding the significance of variables, and therefore improve model interpretability. Furthermore, PIMP was used to correct RF-based importance measures for two real-world case studies. We propose an improved RF model that uses the significant variables with respect to the PIMP measure and show that its prediction accuracy is superior to that of other existing models. Availability: R code for the method presented in this article is available at http://www.mpi-inf.mpg.de/∼altmann/download/PIMP.R Contact: altmann@mpi-inf.mpg.de, laura.tolosi@mpi-inf.mpg.de Supplementary information: Supplementary data are available at Bioinformatics online.
Nomograms are important clinical tools applied widely in both developing and aging populations. They are generally constructed as normative models identifying cases as outliers to a distribution of ...healthy controls. Currently used normative models do not account for genetic heterogeneity. Hippocampal volume (HV) is a key endophenotype for many brain disorders. Here, we examine the impact of genetic adjustment on HV nomograms and the translational ability to detect dementia patients. Using imaging data from 35,686 healthy subjects aged 44–82 from the UK Biobank (UKB), we built HV nomograms using Gaussian process regression (GPR), which – compared to a previous method – extended the application age by 20 years, including dementia critical age ranges. Using HV polygenic scores (HV-PGS), we built genetically adjusted nomograms from participants stratified into the top and bottom 30% of HV-PGS. This shifted the nomograms in the expected directions by ~100 mm
3
(2.3% of the average HV), which equates to 3 years of normal aging for a person aged ~65. Clinical impact of genetically adjusted nomograms was investigated by comparing 818 subjects from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database diagnosed as either cognitively normal (CN), having mild cognitive impairment (MCI) or Alzheimer’s disease (AD) patients. While no significant change in the survival analysis was found for MCI-to-AD conversion, an average of 68% relative decrease was found in intra-diagnostic-group variance, highlighting the importance of genetic adjustment in untangling phenotypic heterogeneity.