Abstract
Summary
We describe a novel computational method for genotyping repeats using sequence graphs. This method addresses the long-standing need to accurately genotype medically important loci ...containing repeats adjacent to other variants or imperfect DNA repeats such as polyalanine repeats. Here we introduce a new version of our repeat genotyping software, ExpansionHunter, that uses this method to perform targeted genotyping of a broad class of such loci.
Availability and implementation
ExpansionHunter is implemented in C++ and is available under the Apache License Version 2.0. The source code, documentation, and Linux/macOS binaries are available at https://github.com/Illumina/ExpansionHunter/.
Supplementary information
Supplementary data are available at Bioinformatics online.
To identify further Mendelian causes of intellectual disability (ID), we screened a cohort of 996 individuals with ID for variants in 565 known or candidate genes by using a targeted next-generation ...sequencing approach. Seven loss-of-function (LoF) mutations—four nonsense (c.1195A>T p.Lys399∗, c.1333C>T p.Arg445∗, c.1866C>G p.Tyr622∗, and c.3001C>T p.Arg1001∗) and three frameshift (c.2177_2178del p.Thr726Asnfs∗39, c.3771dup p.Ser1258Glufs∗65, and c.3856del p.Ser1286Leufs∗84)—were identified in SETD5, a gene predicted to encode a methyltransferase. All mutations were compatible with de novo dominant inheritance. The affected individuals had moderate to severe ID with additional variable features of brachycephaly; a prominent high forehead with synophrys or striking full and broad eyebrows; a long, thin, and tubular nose; long, narrow upslanting palpebral fissures; and large, fleshy low-set ears. Skeletal anomalies, including significant leg-length discrepancy, were a frequent finding in two individuals. Congenital heart defects, inguinal hernia, or hypospadias were also reported. Behavioral problems, including obsessive-compulsive disorder, hand flapping with ritualized behavior, and autism, were prominent features. SETD5 lies within the critical interval for 3p25 microdeletion syndrome. The individuals with SETD5 mutations showed phenotypic similarity to those previously reported with a deletion in 3p25, and thus loss of SETD5 might be sufficient to account for many of the clinical features observed in this condition. Our findings add to the growing evidence that mutations in genes encoding methyltransferases regulating histone modification are important causes of ID. This analysis provides sufficient evidence that rare de novo LoF mutations in SETD5 are a relatively frequent (0.7%) cause of ID.
Spinal muscular atrophy (SMA), caused by loss of the SMN1 gene, is a leading cause of early childhood death. Due to the near identical sequences of SMN1 and SMN2, analysis of this region is ...challenging. Population-wide SMA screening to quantify the SMN1 copy number (CN) is recommended by the American College of Medical Genetics and Genomics.
We developed a method that accurately identifies the CN of SMN1 and SMN2 using genome sequencing (GS) data by analyzing read depth and eight informative reference genome differences between SMN1/2.
We characterized SMN1/2 in 12,747 genomes, identified 1568 samples with SMN1 gains or losses and 6615 samples with SMN2 gains or losses, and calculated a pan-ethnic carrier frequency of 2%, consistent with previous studies. Additionally, 99.8% of our SMN1 and 99.7% of SMN2 CN calls agreed with orthogonal methods, with a recall of 100% for SMA and 97.8% for carriers, and a precision of 100% for both SMA and carriers.
This SMN copy-number caller can be used to identify both carrier and affected status of SMA, enabling SMA testing to be offered as a comprehensive test in neonatal care and an accurate carrier screening tool in GS sequencing projects.
With a wealth of disease-associated DNA variants being recently reported, the challenges of providing their functional characterization are mounting. Previously, as part of a large systematic ...resequencing of the X chromosome in 208 unrelated families with nonsyndromic X-linked intellectual disability, we identified three unique variants (two missense and one protein truncating) in USP9X. To assess the functional significance of these variants, we took advantage of the Usp9x knockout mouse we generated. Loss of Usp9x causes reduction in both axonal growth and neuronal cell migration. Although overexpression of wild-type human USP9X rescued these defects, all three USP9X variants failed to rescue axonal growth, caused reduced USP9X protein localization in axonal growth cones, and (in 2/3 variants) failed to rescue neuronal cell migration. Interestingly, in one of these families, the proband was subsequently identified to have a microdeletion encompassing ARID1B, a known ID gene. Given our findings it is plausible that loss of function of both genes contributes to the individual's phenotype. This case highlights the complexity of the interpretations of genetic findings from genome-wide investigations. We also performed proteomics analysis of neurons from both the wild-type and Usp9x knockout embryos and identified disruption of the cytoskeleton as the main underlying consequence of the loss of Usp9x. Detailed clinical assessment of all three families with USP9X variants identified hypotonia and behavioral and morphological defects as common features in addition to ID. Together our data support involvement of all three USP9X variants in ID in these families and provide likely cellular and molecular mechanisms involved.
Patients and non-specialist healthcare professionals are increasingly expected to understand and interpret the results of genetic or genomic testing. These results are currently reported using a ...variety of templates, containing different amounts, levels, and layouts of information. We set out to establish a set of recommendations for communicating genetic test results to non-expert readers. We employed a qualitative-descriptive study design with user-centred design principles, including a mixture of in-person semi-structured interviews and online questionnaires with patients, healthcare professionals and the general public. The resulting recommendations and example template include providing at-a-glance comprehension of what the test results mean for the patient; suggested next steps; and details of further information and support. Separation and inclusion of technical methodological details enhances non-specialists' understanding, while retaining important information for specialists and the patients' records. The recommendations address the high-level needs of patients and their non-specialist clinicians when receiving genetic test results. These recommendations provide a solid foundation for the major content and structure of reports, and we recommend further engagement with patients and clinicians to tailor reports to specific types of test and results.
Chorea is a hyperkinetic movement disorder resulting from dysfunction of striatal medium spiny neurons (MSNs), which form the main output projections from the basal ganglia. Here, we used whole-exome ...sequencing to unravel the underlying genetic cause in three unrelated individuals with a very similar and unique clinical presentation of childhood-onset chorea and characteristic brain MRI showing symmetrical bilateral striatal lesions. All individuals were identified to carry a de novo heterozygous mutation in PDE10A (c.898T>C p.Phe300Leu in two individuals and c.1000T>C p.Phe334Leu in one individual), encoding a phosphodiesterase highly and selectively present in MSNs. PDE10A contributes to the regulation of the intracellular levels of cyclic adenosine monophosphate (cAMP) and cyclic guanosine monophosphate (cGMP). Both substitutions affect highly conserved amino acids located in the regulatory GAF-B domain, which, by binding to cAMP, stimulates the activity of the PDE10A catalytic domain. In silico modeling showed that the altered residues are located deep in the binding pocket, where they are likely to alter cAMP binding properties. In vitro functional studies showed that neither substitution affects the basal PDE10A activity, but they severely disrupt the stimulatory effect mediated by cAMP binding to the GAF-B domain. The identification of PDE10A mutations as a cause of chorea further motivates the study of cAMP signaling in MSNs and highlights the crucial role of striatal cAMP signaling in the regulation of basal ganglia circuitry. Pharmacological modulation of this pathway could offer promising etiologically targeted treatments for chorea and other hyperkinetic movement disorders.
IMPORTANCE: Mutations in genes traditionally associated with syndromic retinal disease are increasingly found to cause nonsyndromic inherited retinal degenerations. Mutations in CLN3 are classically ...associated with juvenile neuronal ceroid lipofuscinosis, a rare neurodegenerative disease with early retinal degeneration and progressive neurologic deterioration, but have recently also been identified in patients with nonsyndromic inherited retinal degenerations. To our knowledge, detailed clinical characterization of such cases has yet to be reported. OBJECTIVE: To provide detailed clinical, electrophysiologic, structural, and molecular genetic findings in nonsyndromic inherited retinal degenerations associated with CLN3 mutations. DESIGN, SETTING, AND PARTICIPANTS: A multi-institutional case series of 10 patients who presented with isolated nonsyndromic retinal disease and mutations in CLN3. Patient ages ranged from 16 to 70 years; duration of follow-up ranged from 3 to 29 years. MAIN OUTCOMES AND MEASURES: Longitudinal clinical evaluation, including full ophthalmic examination, multimodal retinal imaging, perimetry, and electrophysiology. Molecular analyses were performed using whole-genome sequencing or whole-exome sequencing. Electron microscopy studies of peripheral lymphocytes and CLN3 transcript analysis with polymerase chain reaction amplification were performed in a subset of patients. RESULTS: There were 7 females and 3 males in this case series, with a mean (range) age at last review of 37.1 (16-70) years. Of the 10 patients, 4 had a progressive late-onset rod-cone dystrophy, with a mean (range) age at onset of 29.7 (20-40) years, and 6 had an earlier onset rod-cone dystrophy, with a mean (range) age at onset of 12.1 (7-17) years. Ophthalmoscopic examination features included macular edema, mild intraretinal pigment migration, and widespread atrophy in advanced disease. Optical coherence tomography imaging demonstrated significant photoreceptor loss except in patients with late-onset disease who had a focal preservation of the ellipsoid zone and outer nuclear layer in the fovea. Electroretinography revealed a rod-cone pattern of dysfunction in 6 patients and were completely undetectable in 2 patients. Six novel CLN3 variants were identified in molecular analyses. CONCLUSIONS AND RELEVANCE: This report describes detailed clinical, imaging, and genetic features of CLN3-associated nonsyndromic retinal degeneration. The age at onset and natural progression of retinal disease differs greatly between syndromic and nonsyndromic CLN3 disease, which may be associated with genotypic differences.
Studies have shown that complex structural variants (cxSVs) contribute to human genomic variation and can cause Mendelian disease. We aimed to identify cxSVs relevant to Mendelian disease using ...short-read whole-genome sequencing (WGS), resolve the precise variant configuration and investigate possible mechanisms of cxSV formation.
We performed short-read WGS and analysis of breakpoint junctions to identify cxSVs in a cohort of 1324 undiagnosed rare disease patients. Long-read WGS and gene expression analysis were used to resolve one case.
We identified three pathogenic cxSVs: a de novo duplication-inversion-inversion-deletion affecting ARID1B, a de novo deletion-inversion-duplication affecting HNRNPU and a homozygous deletion-inversion-deletion affecting CEP78. Additionally, a de novo duplication-inversion-duplication overlapping CDKL5 was resolved by long-read WGS demonstrating the presence of both a disrupted and an intact copy of CDKL5 on the same allele, and gene expression analysis showed both parental alleles of CDKL5 were expressed. Breakpoint analysis in all the cxSVs revealed both microhomology and longer repetitive elements.
Our results corroborate that cxSVs cause Mendelian disease, and we recommend their consideration during clinical investigations. We show that resolution of breakpoints can be critical to interpret pathogenicity and present evidence of replication-based mechanisms in cxSV formation.