Various species of the intestinal microbiota have been associated with the development of colorectal cancer, but it has not been demonstrated that bacteria have a direct role in the occurrence of oncogenic mutations. Escherichia coli can carry the pathogenicity island pks, which encodes a set of enzymes that synthesize colibactin. This compound is believed to alkylate DNA on adenine residues and induces double-strand breaks in cultured cells. Here we expose human intestinal organoids to genotoxic pks+ E. coli by repeated luminal injection over five months. Whole-genome sequencing of clonal organoids before and after this exposure revealed a distinct mutational signature that was absent from organoids injected with isogenic pks-mutant bacteria. The same mutational signature was detected in a subset of 5,876 human cancer genomes from two independent cohorts, predominantly in colorectal cancer. Our study describes a distinct mutational signature in colorectal cancer and implies that the underlying mutational process results directly from past exposure to bacteria carrying the colibactin-producing pks pathogenicity island.
Cerebellar ataxia, neuropathy and vestibular areflexia syndrome (CANVAS) is an autosomal recessive neurodegenerative disease, usually caused by biallelic AAGGG repeat expansions in RFC1. In this study, we leveraged whole genome sequencing data from nearly 10 000 individuals recruited within the Genomics England sequencing project to investigate the normal and pathogenic variation of the RFC1 repeat. We identified three novel repeat motifs, AGGGC (n = 6 from five families), AAGGC (n = 2 from one family) and AGAGG (n = 1), associated with CANVAS in the homozygous or compound heterozygous state with the common pathogenic AAGGG expansion. While AAAAG, AAAGGG and AAGAG expansions appear to be benign, we revealed a pathogenic role for large AAAGG repeat configuration expansions (n = 5). Long-read sequencing was used to characterize the entire repeat sequence, and six patients exhibited a pure AGGGC expansion, while the other patients presented complex motifs with AAGGG or AAAGG interruptions. All pathogenic motifs appeared to have arisen from a common haplotype and were predicted to form highly stable G quadruplexes, which have previously been demonstrated to affect gene transcription in other conditions. The assessment of these novel configurations is warranted in CANVAS patients with negative or inconclusive genetic testing. Particular attention should be paid to carriers of compound AAGGG/AAAGG expansions when the AAAGG motif is very large (>500 repeats) or the AAGGG motif is interrupted. Accurate sizing and full sequencing of the satellite repeat with long-read sequencing is recommended in clinically selected cases to enable accurate molecular diagnosis and counsel patients and their families.
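The distinction drawn above between a pure expansion and one carrying interruptions can be illustrated with a minimal Python sketch. It is not the authors' analysis pipeline: the function names, the toy sequences and the simplifying assumption that the repeat is read in frame from its first base are all hypothetical, chosen only for illustration.

```python
from collections import Counter


def motif_composition(seq: str, motif_len: int = 5) -> Counter:
    """Count non-overlapping units of length motif_len, read in frame from the start.

    Assumption: the repeat tract starts exactly at position 0 of seq.
    """
    units = [
        seq[i:i + motif_len]
        for i in range(0, len(seq) - motif_len + 1, motif_len)
    ]
    return Counter(units)


def is_pure(seq: str, motif: str) -> bool:
    """Return True if every in-frame unit of seq equals motif."""
    counts = motif_composition(seq, len(motif))
    return len(counts) == 1 and motif in counts


# Toy sequences (hypothetical, far shorter than real expansions).
pure_tract = "AGGGC" * 6
interrupted_tract = "AGGGC" * 3 + "AAGGG" + "AGGGC" * 2

print(is_pure(pure_tract, "AGGGC"))
print(is_pure(interrupted_tract, "AGGGC"))
print(motif_composition(interrupted_tract))
```

Real long-read data would also require handling sequencing errors and an unknown reading frame, which this sketch deliberately ignores.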
Determining the role of DYNC2H1 variants in nonsyndromic inherited retinal disease (IRD).
Genome and exome sequencing were performed for five unrelated cases of IRD with no identified variant. In vitro assays were developed to validate the variants identified (a fibroblast assay, induced pluripotent stem cell (iPSC)-derived retinal organoids, and a dynein motility assay).
Four novel DYNC2H1 variants (V1, g.103327020_103327021dup; V2, g.103055779A>T; V3, g.103112272C>G; V4, g.103070104A>C) and one previously reported variant (V5, g.103339363T>G) were identified. In proband 1 (V1/V2), V1 was predicted to introduce a premature termination codon (PTC), whereas V2 disrupted the exon 41 splice donor site causing incomplete skipping of exon 41. V1 and V2 impaired dynein-2 motility in vitro and perturbed IFT88 distribution within cilia. V3, homozygous in probands 2-4, is predicted to cause a PTC in a retina-predominant transcript. Analysis of retinal organoids showed that this new transcript expression increased with organoid differentiation. V4, a novel missense variant, was in trans with V5, previously associated with Jeune asphyxiating thoracic dystrophy (JATD).
The DYNC2H1 variants discussed herein were either hypomorphic or affected a retina-predominant transcript, and caused nonsyndromic IRD. Dynein variants, specifically DYNC2H1 variants, are reported as a cause of nonsyndromic IRD.
Lamins are the major component of the nuclear lamina, maintaining structural integrity of the nucleus. Lamin A/C variants are well established to cause a spectrum of disorders ranging from myopathies to progeria, termed laminopathies. Phenotypes resulting from variants in LMNB1 and LMNB2 have been much less clearly defined.
We investigated exome and genome sequencing from the Deciphering Developmental Disorders Study and the 100,000 Genomes Project to identify novel microcephaly genes.
Starting from a cohort of patients with extreme microcephaly, 13 individuals with heterozygous variants in the two human B-type lamins were identified. Recurrent variants were established to be de novo in nine cases and shown to affect highly conserved residues within the lamin α-helical rod domain, likely disrupting interactions required for higher-order assembly of lamin filaments.
We identify dominant pathogenic variants in LMNB1 and LMNB2 as a genetic cause of primary microcephaly, implicating a major structural component of the nuclear envelope in its etiology and defining a new form of laminopathy. The distinct nature of this lamin B-associated phenotype highlights the strikingly different developmental requirements for lamin paralogs and suggests a novel mechanism for primary microcephaly warranting future investigation.
Alport syndrome is the commonest inherited kidney disease and nearly half the pathogenic variants in the COL4A3-COL4A5 genes that cause Alport syndrome result in Gly substitutions. This study examined the molecular characteristics of Gly substitutions that determine the severity of clinical features. Pathogenic COL4A5 variants affecting Gly in the Leiden Open Variation Database in males with X-linked Alport syndrome were correlated with age at kidney failure (n = 157) and hearing loss diagnosis (n = 80). Heterozygous pathogenic COL4A3 and COL4A4 variants affecting Gly (n = 304) in autosomal dominant Alport syndrome were correlated with the risk of haematuria in the UK 100,000 Genomes Project. Gly substitutions were stratified by exon location (1 to 20 or 21 to carboxyl terminus), being adjacent to a non-collagenous region (interruption or terminus), and the degree of instability caused by the replacement residue. Pathogenic COL4A5 variants that resulted in a Gly substitution with a highly destabilising residue reduced the median age at kidney failure by 7 years (p = 0.002), and age at hearing loss diagnosis by 21 years (p = 0.004). Substitutions adjacent to a non-collagenous region delayed kidney failure by 19 years (p = 0.014). Heterozygous pathogenic COL4A3 and COL4A4 variants that resulted in a Gly substitution with a highly destabilising residue (Arg, Val, Glu, Asp, Trp) were associated with an increased risk of haematuria (p = 0.018), and those adjacent to a non-collagenous region were associated with a reduced risk (p = 0.046). Exon location had no effect. In addition, COL4A5 variants adjacent to non-collagenous regions were over-represented in the normal population in gnomAD (p < 0.001). The nature of the substitution and of nearby residues determines the risk of haematuria, early onset kidney failure and hearing loss for Gly substitutions in X-linked and autosomal dominant Alport syndrome.
Multi-locus Inherited Neoplasia Allele Syndrome (MINAS) refers to individuals with germline pathogenic variants in two or more cancer susceptibility genes (CSGs). With increased use of exome/genome sequencing it would be predicted that detection of MINAS would become more frequent. Here we review recent progress in knowledge of MINAS. A systematic literature search for reports of individuals with germline pathogenic variants in 2 or more of 94 CSGs was performed. In addition, participants with multiple primary tumours who underwent genome sequencing as part of the Rare Disease arm of the UK 100,000 Genomes Project were interrogated to detect additional cases. We identified 385 MINAS cases (211 reported in the last 5 years, 6 from 100,000 Genomes participants). Most (287/385) cases contained at least one pathogenic variant in either BRCA1 or BRCA2. 108/385 MINAS cases had multiple primary tumours at presentation and a subset of cases presented unusual multiple tumour phenotypes. We conclude that, as predicted, increasing numbers of individuals with MINAS are being reported but, except for individuals with BRCA1/BRCA2 MINAS, individual CSG combinations are generally rare. In many cases it appears that the clinical phenotype is that which would be expected from the effects of the constituent CSG variants acting independently. However, in some instances the presence of unusual tumour phenotypes and/or multiple primary tumours suggests that there may be complex interactions between the relevant MINAS CSGs. Systematic reporting of MINAS cases in a MINAS database (e.g. https://databases.lovd.nl/shared/diseases/04296 ) will facilitate more accurate prognostic predictions for specific CSG combinations.
The development of computational methods to assess pathogenicity of pre-messenger RNA splicing variants is critical for diagnosis of human disease. We assessed the capability of eight algorithms, and a consensus approach, to prioritize 249 variants of uncertain significance (VUSs) that underwent splicing functional analyses. The capability of algorithms to differentiate VUSs away from the immediate splice site as being 'pathogenic' or 'benign' is likely to have substantial impact on diagnostic testing. We show that SpliceAI is the best single strategy in this regard, but that combined usage of tools using a weighted approach can increase accuracy further. We incorporated prioritization strategies alongside diagnostic testing for rare disorders. We show that 15% of 2783 referred individuals carry rare variants expected to impact splicing that were not initially identified as 'pathogenic' or 'likely pathogenic'; one in five of these cases could lead to new or refined diagnoses.
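The general idea of combining tools "using a weighted approach" can be sketched in Python as a weighted mean of per-tool scores. This is a generic illustration and not the weighting scheme used in the study: the tool labels other than SpliceAI, the scores, the weights and the 0.5 threshold are all hypothetical placeholders, and every score is assumed to have been rescaled to the range 0 to 1 beforehand.

```python
def weighted_consensus(scores: dict[str, float], weights: dict[str, float]) -> float:
    """Weighted mean of per-tool scores; both dicts are keyed by tool name.

    Assumption: scores are already normalised to [0, 1] and every tool in
    `scores` has a positive weight in `weights`.
    """
    total_weight = sum(weights[tool] for tool in scores)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(scores[tool] * weights[tool] for tool in scores) / total_weight


# Hypothetical scores for one variant and hypothetical weights.
variant_scores = {"SpliceAI": 1.0, "tool_b": 0.0}
tool_weights = {"SpliceAI": 3.0, "tool_b": 1.0}

consensus = weighted_consensus(variant_scores, tool_weights)
prioritised = consensus >= 0.5  # hypothetical decision threshold
print(consensus, prioritised)
```

In practice the weights would have to be fitted against functionally validated variants, such as the 249 VUSs described above, rather than chosen by hand.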
Pathogenic variants in Lysyl-tRNA synthetase 1 (KARS1) have increasingly been recognized as a cause of early-onset complex neurological phenotypes. To advance the timely diagnosis of KARS1-related disorders, we sought to delineate its phenotype and generate a disease model to understand its function in vivo.
Through international collaboration, we identified 22 affected individuals from 16 unrelated families harboring biallelic likely pathogenic or pathogenic variants in KARS1. Sequencing approaches ranged from disease-specific panels to genome sequencing. We generated loss-of-function alleles in zebrafish.
We identify ten new and four known biallelic missense variants in KARS1 presenting with a moderate-to-severe developmental delay, progressive neurological and neurosensory abnormalities, and variable white matter involvement. We describe novel KARS1-associated signs such as autism, hyperactive behavior, pontine hypoplasia, and cerebellar atrophy with prevalent vermian involvement. Loss of kars1 leads to upregulation of p53, tissue-specific apoptosis, and downregulation of neurodevelopmental related genes, recapitulating key tissue-specific disease phenotypes of patients. Inhibition of p53 rescued several defects of kars1−/− knockouts.
Our work delineates the clinical spectrum associated with KARS1 defects and provides a novel animal model for KARS1-related human diseases revealing p53 signaling components as potential therapeutic targets.
We aimed to define a novel autosomal recessive neurodevelopmental disorder, characterize its clinical features, and identify the underlying genetic cause for this condition.
We performed a detailed clinical characterization of 19 individuals from nine unrelated, consanguineous families with a neurodevelopmental disorder. We used genome/exome sequencing approaches, linkage and cosegregation analyses to identify disease-causing variants, and we performed three-dimensional molecular in silico analysis to predict causality of variants where applicable.
In all affected individuals who presented with a neurodevelopmental syndrome with progressive microcephaly, seizures, and intellectual disability, we identified biallelic disease-causing variants in Protocadherin-gamma-C4 (PCDHGC4). Five variants were predicted to induce premature protein truncation leading to a loss of PCDHGC4 function. The three detected missense variants were located in extracellular cadherin (EC) domains EC5 and EC6 of PCDHGC4, and in silico analysis of the affected residues showed that two of these substitutions were predicted to influence the Ca2+-binding affinity, which is essential for multimerization of the protein, whereas the third missense variant directly influenced the cis-dimerization interface of PCDHGC4.
We show that biallelic variants in PCDHGC4 cause a novel autosomal recessive neurodevelopmental disorder and link PCDHGC4, a member of the clustered PCDH family, to a Mendelian disorder in humans.
Several strands of evidence question the dogma that human mitochondrial DNA (mtDNA) is inherited exclusively down the maternal line, most recently in three families where several individuals harbored a 'heteroplasmic haplotype' consistent with biparental transmission. Here we report a similar genetic signature in 7 of 11,035 trios, with allelic fractions of 5-25%, implying biparental inheritance of mtDNA in 0.06% of offspring. However, analysing the nuclear whole genome sequence, we observe likely large rare or unique nuclear-mitochondrial DNA segments (mega-NUMTs) transmitted from the father in all 7 families. Independently detecting mega-NUMTs in 0.13% of fathers, we see autosomal transmission of the haplotype. Finally, we show the haplotype allele fraction can be explained by complex concatenated mtDNA-derived sequences rearranged within the nuclear genome. We conclude that rare cryptic mega-NUMTs can resemble paternally inherited mtDNA heteroplasmy, but find no evidence of paternal transmission of mtDNA in humans.
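The 0.06% figure quoted above follows directly from the reported counts, as this short Python check shows. The variable names are illustrative only; the two counts are the ones stated in the abstract.

```python
# Counts as stated in the abstract.
trios_with_signature = 7
total_trios = 11_035

# Apparent frequency of the 'heteroplasmic haplotype' signature, in percent.
apparent_rate_percent = trios_with_signature / total_trios * 100

print(f"{apparent_rate_percent:.2f}%")  # 0.06%
```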