Severe intellectual disability (ID) occurs in 0.5% of newborns and is thought to be largely genetic in origin. The extensive genetic heterogeneity of this disorder requires a genome-wide detection of ...all types of genetic variation. Microarray studies and, more recently, exome sequencing have demonstrated the importance of de novo copy number variations (CNVs) and single-nucleotide variations (SNVs) in ID, but the majority of cases remain undiagnosed. Here we applied whole-genome sequencing to 50 patients with severe ID and their unaffected parents. All patients included had not received a molecular diagnosis after extensive genetic prescreening, including microarray-based CNV studies and exome sequencing. Notwithstanding this prescreening, 84 de novo SNVs affecting the coding region were identified, which showed a statistically significant enrichment of loss-of-function mutations as well as an enrichment for genes previously implicated in ID-related disorders. In addition, we identified eight de novo CNVs, including single-exon and intra-exonic deletions, as well as interchromosomal duplications. These CNVs affected known ID genes more frequently than expected. On the basis of diagnostic interpretation of all de novo variants, a conclusive genetic diagnosis was reached in 20 patients. Together with one compound heterozygous CNV causing disease in a recessive mode, this results in a diagnostic yield of 42% in this extensively studied cohort, and 62% as a cumulative estimate in an unselected cohort. These results suggest that de novo SNVs and CNVs affecting the coding region are a major cause of severe ID. Genome sequencing can be applied as a single genetic test to reliably identify and characterize the comprehensive spectrum of genetic variation, providing a genetic diagnosis in the majority of patients with severe ID.
A significant proportion of children have unexplained problems acquiring proficient linguistic skills despite adequate intelligence and opportunity. Developmental language disorders are highly ...heritable with substantial societal impact. Molecular studies have begun to identify candidate loci, but much of the underlying genetic architecture remains undetermined. We performed whole-exome sequencing of 43 unrelated probands affected by severe specific language impairment, followed by independent validations with Sanger sequencing, and analyses of segregation patterns in parents and siblings, to shed new light on aetiology. By first focusing on a pre-defined set of known candidates from the literature, we identified potentially pathogenic variants in genes already implicated in diverse language-related syndromes, including ERC1, GRIN2A, and SRPX2. Complementary analyses suggested novel putative candidates carrying validated variants which were predicted to have functional effects, such as OXR1, SCN9A and KMT2D. We also searched for potential "multiple-hit" cases; one proband carried a rare AUTS2 variant in combination with a rare inherited haplotype affecting STARD9, while another carried a novel nonsynonymous variant in SEMA6D together with a rare stop-gain in SYNPR. On broadening scope to all rare and novel variants throughout the exomes, we identified biological themes that were enriched for such variants, including microtubule transport and cytoskeletal regulation.
We recently reported the genetic cause of autosomal dominant chronic mucocutaneous candidiasis (AD-CMC) as a mutation in the STAT1 gene. In the present study we show that STAT1 Arg274Trp mutations in ...the coiled-coil (CC) domain is the genetic cause of AD-CMC in three families of patients. Cloning and transfection experiments demonstrate that mutated STAT1 inhibits IL12R/IL-23R signaling, with hyperphosphorylation of STAT1 as the likely underlying molecular mechanism. Inhibition of signaling through the receptors for IL-12 and IL-23 leads to strongly diminished Th1/Th17 responses and hence to increased susceptibility to fungal infections. The challenge for the future is to translate this knowledge into novel strategies for the treatment of this severe immunodeficiency.
New human mutations are thought to originate in germ cells, thus making a recurrence of the same mutation in a sibling exceedingly rare. However, increasing sensitivity of genomic technologies has ...anecdotally revealed mosaicism for mutations in somatic tissues of apparently healthy parents. Such somatically mosaic parents might also have germline mosaicism that can potentially cause unexpected intergenerational recurrences. Here, we show that somatic mosaicism for transmitted mutations among parents of children with simplex genetic disease is more common than currently appreciated. Using the sensitivity of individual-specific breakpoint PCR, we prospectively screened 100 families with children affected by genomic disorders due to rare deletion copy-number variants (CNVs) determined to be de novo by clinical analysis of parental DNA. Surprisingly, we identified four cases of low-level somatic mosaicism for the transmitted CNV in DNA isolated from parental blood. Integrated probabilistic modeling of gametogenesis developed in response to our observations predicts that mutations in parental blood increase recurrence risk substantially more than parental mutations confined to the germline. Moreover, despite the fact that maternally transmitted mutations are the minority of alleles, our model suggests that sexual dimorphisms in gametogenesis result in a greater proportion of somatically mosaic transmitting mothers who are thus at increased risk of recurrence. Therefore, somatic mosaicism together with sexual differences in gametogenesis might explain a considerable fraction of unexpected recurrences of X-linked recessive disease. Overall, our results underscore an important role for somatic mosaicism and mitotic replicative mutational mechanisms in transmission genetics.
Spinal muscular atrophy (SMA) is a heterogeneous group of neuromuscular disorders caused by degeneration of lower motor neurons. Although functional loss of SMN1 is associated with ...autosomal-recessive childhood SMA, the genetic cause for most families affected by dominantly inherited SMA is unknown. Here, we identified pathogenic variants in bicaudal D homolog 2 (Drosophila) (BICD2) in three families afflicted with autosomal-dominant SMA. Affected individuals displayed congenital slowly progressive muscle weakness mainly of the lower limbs and congenital contractures. In a large Dutch family, linkage analysis identified a 9q22.3 locus in which exome sequencing uncovered c.320C>T (p.Ser107Leu) in BICD2. Sequencing of 23 additional families affected by dominant SMA led to the identification of pathogenic variants in one family from Canada (c.2108C>T p.Thr703Met) and one from the Netherlands (c.563A>C p.Asn188Thr). BICD2 is a golgin and motor-adaptor protein involved in Golgi dynamics and vesicular and mRNA transport. Transient transfection of HeLa cells with all three mutant BICD2 cDNAs caused massive Golgi fragmentation. This observation was even more prominent in primary fibroblasts from an individual harboring c.2108C>T (p.Thr703Met) (affecting the C-terminal coiled-coil domain) and slightly less evident in individuals with c.563A>C (p.Asn188Thr) (affecting the N-terminal coiled-coil domain). Furthermore, BICD2 levels were reduced in affected individuals and trapped within the fragmented Golgi. Previous studies have shown that Drosophila mutant BicD causes reduced larvae locomotion by impaired clathrin-mediated synaptic endocytosis in neuromuscular junctions. These data emphasize the relevance of BICD2 in synaptic-vesicle recycling and support the conclusion that BICD2 mutations cause congenital slowly progressive dominant SMA.
Primary Familial Brain Calcification (PFBC) is a rare calcifying disorder of the brain with autosomal dominant inheritance, of unknown prevalence. Four causal genes have been identified so far: ...SLC20A2, PDGFB, PDGFRB, and XPR1, with pathogenic, probably pathogenic or missense variants of unknown significance found in 27.7% probands in the French PFBC series. Estimating PFBC prevalence from a clinical input is arduous due to a large diversity of symptoms and ages of onset and to incomplete clinical penetrance. Abnormal calcifications on CT scan can be used as a reliable diagnostic biomarker whatever the clinical status, but differential diagnoses should be ruled out including the challenging exclusion of common basal ganglia calcifications. Our primary aim was to estimate the minimal prevalence of PFBC due to a variant in one of the known genes. We extracted variants from the four known genes present in the gnomAD database gathering genomic data from 138,632 individuals. We interpreted all variants based on their predicted effect, their frequency, and previous studies on PFBC patients. Using the most conservative estimate, the minimal prevalence of PFBC related to a variant in one of the four known genes was 4.5 p. 10,000 (95%CI 3.4–5.5 p. 10,000). We then used variant detection rates in patients to extrapolate an overall minimal prevalence of PFBC to 2.1 p. 1,000 (95%CI 1.9–2.4 p. 1,000). The population‐based genomic analysis indicates that PFBC is not an exceptionally rare disorder, still underestimated and underdiagnosed.
Many laboratories now use genomic microarrays as their first-tier diagnostic test for copy number variation (CNV) detection. In addition, whole exome sequencing is increasingly being offered as a ...diagnostic test for heterogeneous disorders. Although mostly used for the detection of point mutations and small insertion-deletions, exome sequencing can also be used to call CNVs, allowing combined small and large variant analysis. Whole genome sequencing in addition to these advantages also offers the potential to characterize CNVs to unprecedented levels of accuracy, providing position and orientation information. In this review, we discuss the clinical potential of CNV identification in whole exome sequencing and whole genome sequencing data and the implications this has on diagnostic laboratories.
Whole exomes of patients with a genetic disorder are nowadays routinely sequenced but interpretation of the identified genetic variants remains a major challenge. The increased availability of ...population‐based human genetic variation has given rise to measures of genetic tolerance that have been used, for example, to predict disease‐causing genes in neurodevelopmental disorders. Here, we investigated whether combining variant information from homologous protein domains can improve variant interpretation. For this purpose, we developed a framework that maps population variation and known pathogenic mutations onto 2,750 “meta‐domains.” These meta‐domains consist of 30,853 homologous Pfam protein domain instances that cover 36% of all human protein coding sequences. We find that genetic tolerance is consistent across protein domain homologues, and that patterns of genetic tolerance faithfully mimic patterns of evolutionary conservation. Furthermore, for a significant fraction (68%) of the meta‐domains high‐frequency population variation re‐occurs at the same positions across domain homologues more often than expected. In addition, we observe that the presence of pathogenic missense variants at an aligned homologous domain position is often paired with the absence of population variation and vice versa. The use of these meta‐domains can improve the interpretation of genetic variation.
We developed a framework to map population variation and known pathogenic mutations onto 2,750 “meta‐domains.” These meta‐domains consist of 30,853 within‐human protein domain homologues. We find that population variation re‐occurs at the same positions across domain homologues more often than expected. Additionally, we observe that the presence of pathogenic variants at an aligned homologous domain position is often paired with the absence of population variation and vice versa. These meta‐domains aid in interpreting genetic variants in protein domains.
Familial exudative vitreoretinopathy (FEVR) is a genetically heterogeneous disorder characterized by abnormal vascularization of the peripheral retina, which can result in retinal detachment and ...severe visual impairment. In a large Dutch FEVR family, we performed linkage analysis, exome sequencing, and segregation analysis of DNA variants. We identified putative disease-causing DNA variants in proline-alanine-rich ste20-related kinase (c.791dup; p.Ser265ValfsX64) and zinc finger protein 408 (ZNF408) (c.1363C>T; p.His455Tyr), the latter of which was also present in an additional Dutch FEVR family that subsequently appeared to share a common ancestor with the original family. Sequence analysis of ZNF408 in 132 additional individuals with FEVR revealed another potentially pathogenic missense variant, p.Ser126Asn, in a Japanese family. Immunolocalization studies in COS-1 cells transfected with constructs encoding the WT and mutant ZNF408 proteins, revealed that the WT and the p.Ser126Asn mutant protein show complete nuclear localization, whereas the p.His455Tyr mutant protein was localized almost exclusively in the cytoplasm. Moreover, in a cotransfection assay, the p.His455Tyr mutant protein retains the WT ZNF408 protein in the cytoplasm, suggesting that this mutation acts in a dominant-negative fashion. Finally, morpholino-induced knockdown of znf408 in zebrafish revealed defects in developing retinal and trunk vasculature, that could be rescued by coinjection of RNA encoding human WT ZNF408 but not p.His455Tyr mutant ZNF408. Together, our data strongly suggest that mutant ZNF408 results in abnormal retinal vasculogenesis in humans and is associated with FEVR.
Schinzel-Giedion syndrome is characterized by severe mental retardation, distinctive facial features and multiple congenital malformations; most affected individuals die before the age of ten. We ...sequenced the exomes of four affected individuals (cases) and found heterozygous de novo variants in SETBP1 in all four. We also identified SETBP1 mutations in eight additional cases using Sanger sequencing. All mutations clustered to a highly conserved 11-bp exonic region, suggesting a dominant-negative or gain-of-function effect.