The genetic architecture of ALS Shatunov, Aleksey; Al-Chalabi, Ammar
Neurobiology of disease,
January 2021, 2021-01-00, 20210101, 2021-01-01, Letnik:
147
Journal Article
Identifying large expansions of short tandem repeats (STRs), such as those that cause amyotrophic lateral sclerosis (ALS) and fragile X syndrome, is challenging for short-read whole-genome sequencing ...(WGS) data. A solution to this problem is an important step toward integrating WGS into precision medicine. We developed a software tool called ExpansionHunter that, using PCR-free WGS short-read data, can genotype repeats at the locus of interest, even if the expanded repeat is larger than the read length. We applied our algorithm to WGS data from 3001 ALS patients who have been tested for the presence of the
repeat expansion with repeat-primed PCR (RP-PCR). Compared against this truth data, ExpansionHunter correctly classified all (212/212, 95% CI 0.98, 1.00) of the expanded samples as either expansions (208) or potential expansions (4). Additionally, 99.9% (2786/2789, 95% CI 0.997, 1.00) of the wild-type samples were correctly classified as wild type by this method with the remaining three samples identified as possible expansions. We further applied our algorithm to a set of 152 samples in which every sample had one of eight different pathogenic repeat expansions, including those associated with fragile X syndrome, Friedreich's ataxia, and Huntington's disease, and correctly flagged all but one of the known repeat expansions. Thus, ExpansionHunter can be used to accurately detect known pathogenic repeat expansions and provides researchers with a tool that can be used to identify new pathogenic repeat expansions.
Summary Background Amyotrophic lateral sclerosis (ALS) is a progressive neurodegenerative disease of upper and lower motor neurons, associated with frontotemporal dementia (FTD) in about 14% of ...incident cases. We assessed the frequency of the recently identified C9orf72 repeat expansion in familial and apparently sporadic cases of ALS and characterised the cognitive and clinical phenotype of patients with this expansion. Methods A population-based register of patients with ALS has been in operation in Ireland since 1995, and an associated DNA bank has been in place since 1999. 435 representative DNA samples from the bank were screened using repeat-primed PCR for the presence of a GGGGCC repeat expansion in C9orf72 . We assessed clinical, cognitive, behavioural, MRI, and survival data from 191 (44%) of these patients, who comprised a population-based incident group and had previously participated in a longitudinal study of cognitive and behavioural changes in ALS. Findings Samples from the DNA bank included 49 cases of known familial ALS and 386 apparently sporadic cases. Of these samples, 20 (41%) cases of familial ALS and 19 (5%) cases of apparently sporadic ALS had the C9orf72 repeat expansion. Of the 191 patients for whom phenotype data were available, 21 (11%) had the repeat expansion. Age at disease onset was lower in patients with the repeat expansion (mean 56·3 SD 8·3 years) than in those without (61·3 10·6 years; p=0·043). A family history of ALS or FTD was present in 18 (86%) of those with the repeat expansion. Patients with the repeat expansion had significantly more co-morbid FTD than patients without the repeat (50% vs 12%), and a distinct pattern of non-motor cortex changes on high-resolution 3 T magnetic resonance structural neuroimaging. Age-matched univariate analysis showed shorter survival (20 months vs 26 months) in patients with the repeat expansion. Multivariable analysis showed an increased hazard rate of 1·9 (95% 1·1–3·7; p=0·035) in those patients with the repeat expansion compared with patients without the expansion Interpretation Patients with ALS and the C9orf72 repeat expansion seem to present a recognisable phenotype characterised by earlier disease onset, the presence of cognitive and behavioural impairment, specific neuroimaging changes, a family history of neurodegeneration with autosomal dominant inheritance, and reduced survival. Recognition of patients with ALS who carry an expanded repeat is likely to be important in the context of appropriate disease management, stratification in clinical trials, and in recognition of other related phenotypes in family members. Funding Health Seventh Framework Programme, Health Research Board, Research Motor Neuron, Irish Motor Neuron Disease Association, The Motor Neurone Disease Association of Great Britain and Northern Ireland, ALS Association.
Amyotrophic lateral sclerosis is a progressive neurodegenerative disease of motor neurons. About 25 genes have been verified as relevant to the disease process, with rare and common variation ...implicated. We used next generation sequencing and repeat sizing to comprehensively assay genetic variation in a panel of known amyotrophic lateral sclerosis genes in 1126 patient samples and 613 controls. About 10% of patients were predicted to carry a pathological expansion of the C9orf72 gene. We found an increased burden of rare variants in patients within the untranslated regions of known disease-causing genes, driven by SOD1, TARDBP, FUS, VCP, OPTN and UBQLN2. We found 11 patients (1%) carried more than one pathogenic variant (P = 0.001) consistent with an oligogenic basis of amyotrophic lateral sclerosis. These findings show that the genetic architecture of amyotrophic lateral sclerosis is complex and that variation in the regulatory regions of associated genes may be important in disease pathogenesis.
There is increasing evidence that endogenous retroviruses (ERVs) play a significant role in central nervous system diseases, including amyotrophic lateral sclerosis (ALS). Studies of ALS have ...consistently identified retroviral enzyme reverse transcriptase activity in patients. Evidence indicates that ERVs are the cause of reverse transcriptase activity in ALS, but it is currently unclear whether this is due to a specific ERV locus or a family of ERVs. We employed a combination of bioinformatic methods to identify whether specific ERVs or ERV families are associated with ALS. Using the largest post-mortem RNA-sequence datasets available we selectively identified ERVs that closely resembled full-length proviruses. In the discovery dataset there was one ERV locus (HML6_3p21.31c) that showed significant increased expression in post-mortem motor cortex tissue after multiple-testing correction. Using six replication post-mortem datasets we found HML6_3p21.31c was consistently upregulated in ALS in motor cortex and cerebellum tissue. In addition, HML6_3p21.31c showed significant co-expression with cytokine binding and genes involved in EBV, HTLV-1 and HIV type-1 infections. There were no significant differences in ERV family expression between ALS and controls. Our results support the hypothesis that specific ERV loci are involved in ALS pathology.
Superoxide dismutase (SOD1) gene variants may cause amyotrophic lateral sclerosis, some of which are associated with a distinct phenotype. Most studies assess limited variants or sample sizes. In ...this international, retrospective observational study, we compare phenotypic and demographic characteristics between people with SOD1-ALS and people with ALS and no recorded SOD1 variant. We investigate which variants are associated with age at symptom onset and time from onset to death or censoring using Cox proportional-hazards regression. The SOD1-ALS dataset reports age of onset for 1122 and disease duration for 883 people; the comparator population includes 10,214 and 9010 people respectively. Eight variants are associated with younger age of onset and distinct survival trajectories; a further eight associated with younger onset only and one with distinct survival only. Here we show that onset and survival are decoupled in SOD1-ALS. Future research should characterise rarer variants and molecular mechanisms causing the observed variability.
The ALS Online Genetics Database (ALSoD) website holds mutation, geographical, and phenotype data on genes implicated in amyotrophic lateral sclerosis (ALS) and links to bioinformatics resources, ...publications, and tools for analysis. On average, there are 300 unique visits per day, suggesting a high demand from the research community. To enable wider access, we developed a mobile-friendly version of the website and a smartphone app.
We sought to compare data traffic before and after implementation of a mobile version of the website to assess utility.
We identified the most frequently viewed pages using Google Analytics and our in-house analytic monitoring. For these, we optimized the content layout of the screen, reduced image sizes, and summarized available information. We used the Microsoft .NET framework mobile detection property (HttpRequest.IsMobileDevice in the Request.Browser object in conjunction with HttpRequest.UserAgent), which returns a true value if the browser is a recognized mobile device. For app development, we used the Eclipse integrated development environment with Android plug-ins. We wrapped the mobile website version with the WebView object in Android. Simulators were downloaded to test and debug the applications.
The website automatically detects access from a mobile phone and redirects pages to fit the smaller screen. Because the amount of data stored on ALSoD is very large, the available information for display using smartphone access is deliberately restricted to improve usability. Visits to the website increased from 2231 to 2820, yielding a 26% increase from the pre-mobile to post-mobile period and an increase from 103 to 340 visits (230%) using mobile devices (including tablets). The smartphone app is currently available on BlackBerry and Android devices and will be available shortly on iOS as well.
Further development of the ALSoD website has allowed access through smartphones and tablets, either through the website or directly through a mobile app, making genetic data stored on the database readily accessible to researchers and patients across multiple devices.
The expansion of a hexanucleotide repeat GGGGCC in C9orf72 is the most common known cause of ALS accounting for ~ 40% familial cases and ~ 7% sporadic cases in the European population. In most ...people, the repeat length is 2, but in people with ALS, hundreds to thousands of repeats may be observed. A small proportion of people have an intermediate expansion, of the order of 20 to 30 repeats in size, and it remains unknown whether intermediate expansions confer risk of ALS in the same way that massive expansions do. We investigated the association of this intermediate repeat with ALS by performing a meta-analysis of four previously published studies and a new British/Alzheimer's Disease Neuroimaging Initiative dataset of 1295 cases and 613 controls. The final dataset comprised 5071 cases and 3747 controls. Our meta-analysis showed association between ALS and intermediate C9orf72 repeats of 24 to 30 repeats in size (random-effects model OR = 4.2, 95% CI = 1.23-14.35, p-value = 0.02). Furthermore, we showed a different frequency of the repeat between the northern and southern European populations (Fisher's exact test p-value = 5 × 10
). Our findings provide evidence for the association between intermediate repeats and ALS (p-value = 2 × 10
) with direct relevance for research and clinical practice by showing that an expansion of 24 or more repeats should be considered pathogenic.
A massive hexanucleotide repeat expansion mutation (HREM) in C9ORF72 has recently been linked to amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). Here we describe the frequency, ...origin and stability of this mutation in ALS+/-FTD from five European cohorts (total n=1347). Single-nucleotide polymorphisms defining the risk haplotype in linked kindreds were genotyped in cases (n=434) and controls (n=856). Haplotypes were analysed using PLINK and aged using DMLE+. In a London clinic cohort, the HREM was the most common mutation in familial ALS+/-FTD: C9ORF72 29/112 (26%), SOD1 27/112 (24%), TARDBP 1/112 (1%) and FUS 4/112 (4%) and detected in 13/216 (6%) of unselected sporadic ALS cases but was rare in controls (3/856, 0.3%). HREM prevalence was high for familial ALS+/-FTD throughout Europe: Belgium 19/22 (86%), Sweden 30/41 (73%), the Netherlands 10/27 (37%) and Italy 4/20 (20%). The HREM did not affect the age at onset or survival of ALS patients. Haplotype analysis identified a common founder in all 137 HREM carriers that arose around 6300 years ago. The haplotype from which the HREM arose is intrinsically unstable with an increased number of repeats (average 8, compared with 2 for controls, P<10(-8)). We conclude that the HREM has a single founder and is the most common mutation in familial and sporadic ALS in Europe.