Genetic mutations account for many devastating early onset immune deficiencies. In contrast, less severe and later onset immune diseases, including in patients with no prior family history, remain ...poorly understood. Whole exome sequencing in two cohorts of such patients identified a novel heterozygous de novo
missense mutation (c.607G>A) in two separate kindreds in whom probands presented with immune dysregulation, combined T and B cell deficiency, inflammation, and epithelial defects.
encodes IKK2, which activates NF-κB signaling. IKK2
results in enhanced NF-κB signaling, as well as T and B cell functional defects. IKK2
is a highly conserved residue, and to prove causation, we generated an accurate mouse model by introducing the precise orthologous codon change in
using CRISPR/Cas9. Mice and humans carrying this missense mutation exhibit remarkably similar cellular and biochemical phenotypes. Accurate mouse models engineered by CRISPR/Cas9 can help characterize novel syndromes arising from de novo germline mutations and yield insight into pathogenesis.
Each person’s genome sequence has thousands of missense variants. Practical interpretation of their functional significance must rely on computational inferences in the absence of exhaustive ...experimental measurements. Here we analyzed the efficacy of these inferences in 33 de novo missense mutations revealed by sequencing in first-generation progeny ofN-ethyl-N-nitrosourea–treated mice, involving 23 essential immune system genes. Poly-Phen2, SIFT, MutationAssessor, Panther, CADD, and Condel were used to predict each mutation’s functional importance, whereas the actual effect was measured by breeding and testing homozygotes for the expected in vivo loss-of-function phenotype. Only 20% of mutations predicted to be deleterious by PolyPhen2 (and 15% by CADD) showed a discernible phenotype in individual homozygotes. Half of all possible missense mutations in the same 23 immune genes were predicted to be deleterious, and most of these appear to become subject to purifying selection because few persist between separate mouse substrains, rodents, or primates. Because defects in immune genes could be phenotypically masked in vivo by compensation and environment, we compared inferences by the same tools with the in vitro phenotype of all 2,314 possible missense variants inTP53; 42% of mutations predicted by PolyPhen2 to be deleterious (and 45% by CADD) had little measurable consequence forTP53-promoted transcription. We conclude that for de novo or low-frequency missense mutations found by genome sequencing, half those inferred as deleterious correspond to nearly neutral mutations that have little impact on the clinical phenotype of individual cases but will nevertheless become subject to purifying selection.
Gastrointestinal (GI) parasites, hookworms in particular, have evolved to cause minimal harm to their hosts, allowing them to establish chronic infections. This is mediated by creating an ...immunoregulatory environment. Indeed, hookworms are such potent suppressors of inflammation that they have been used in clinical trials to treat inflammatory bowel diseases (IBD) and celiac disease. Since the recent description of helminths (worms) secreting extracellular vesicles (EVs), exosome-like EVs from different helminths have been characterized and their salient roles in parasite-host interactions have been highlighted. Here, we analyze EVs from the rodent parasite
, which has been used as a model for human hookworm infection.
EVs (
-EVs) are actively internalized by mouse gut organoids, indicating a role in driving parasitism. We used proteomics and RNA-Seq to profile the molecular composition of
-EVs. We identified 81 proteins, including proteins frequently present in exosomes (like tetraspanin, enolase, 14-3-3 protein, and heat shock proteins), and 27 sperm-coating protein-like extracellular proteins. RNA-Seq analysis revealed 52 miRNA species, many of which putatively map to mouse genes involved in regulation of inflammation. To determine whether GI nematode EVs had immunomodulatory properties, we assessed their potential to suppress GI inflammation in a mouse model of inducible chemical colitis. EVs from
but not those from the whipworm
or control vesicles from grapes protected against colitic inflammation in the gut of mice that received a single intraperitoneal injection of EVs. Key cytokines associated with colitic pathology (IL-6, IL-1β, IFNγ, and IL-17a) were significantly suppressed in colon tissues from EV-treated mice. By contrast, high levels of the anti-inflammatory cytokine IL-10 were detected in
-EV-treated mice. Proteins and miRNAs contained within helminth EVs hold great potential application in development of drugs to treat helminth infections as well as chronic non-infectious diseases resulting from a dysregulated immune system, such as IBD.
Whipworms are parasitic nematodes that live in the gut of more than 500 million people worldwide. Owing to the difficulty in obtaining parasite material, the mouse whipworm Trichuris muris has been ...extensively used as a model to study human whipworm infections. These nematodes secrete a multitude of compounds that interact with host tissues where they orchestrate a parasitic existence. Herein we provide the first comprehensive characterization of the excretory/secretory products of T. muris. We identify 148 proteins secreted by T. muris and show for the first time that the mouse whipworm secretes exosome-like extracellular vesicles (EVs) that can interact with host cells. We use an Optiprep® gradient to purify the EVs, highlighting the suitability of this method for purifying EVs secreted by a parasitic nematode. We also characterize the proteomic and genomic content of the EVs, identifying >350 proteins, 56 miRNAs (22 novel) and 475 full-length mRNA transcripts mapping to T. muris gene models. Many of the miRNAs putatively mapped to mouse genes are involved in regulation of inflammation, implying a role in parasite-driven immunomodulation. In addition, for the first time to our knowledge, colonic organoids have been used to demonstrate the internalization of parasite EVs by host cells. Understanding how parasites interact with their host is crucial to develop new control measures. This first characterization of the proteins and EVs secreted by T. muris provides important information on whipworm-host communication and forms the basis for future studies.
Mass extinctions have repeatedly shaped global biodiversity. The Cretaceous-Paleogene (K-Pg) mass extinction caused the demise of numerous vertebrate groups, and its aftermath saw the rapid ...diversification of surviving mammals, birds, frogs, and teleost fishes. However, the effects of the K-Pg extinction on the evolution of snakes-a major clade of predators comprising over 3,700 living species-remains poorly understood. Here, we combine an extensive molecular dataset with phylogenetically and stratigraphically constrained fossil calibrations to infer an evolutionary timescale for Serpentes. We reveal a potential diversification among crown snakes associated with the K-Pg mass extinction, led by the successful colonisation of Asia by the major extant clade Afrophidia. Vertebral morphometrics suggest increasing morphological specialisation among marine snakes through the Paleogene. The dispersal patterns of snakes following the K-Pg underscore the importance of this mass extinction event in shaping Earth's extant vertebrate faunas.
A diversity of tools is available for identification of variants from genome sequence data. Given the current complexity of incorporating external software into a genome analysis infrastructure, a ...tendency exists to rely on the results from a single tool alone. The quality of the output variant calls is highly variable however, depending on factors such as sequence library quality as well as the choice of short-read aligner, variant caller, and variant caller filtering strategy. Here we present a two-part study first using the high quality 'genome in a bottle' reference set to demonstrate the significant impact the choice of aligner, variant caller, and variant caller filtering strategy has on overall variant call quality and further how certain variant callers outperform others with increased sample contamination, an important consideration when analyzing sequenced cancer samples. This analysis confirms previous work showing that combining variant calls of multiple tools results in the best quality resultant variant set, for either specificity or sensitivity, depending on whether the intersection or union, of all variant calls is used respectively. Second, we analyze a melanoma cell line derived from a control lymphocyte sample to determine whether software choices affect the detection of clinically important melanoma risk-factor variants finding that only one of the three such variants is unanimously detected under all conditions. Finally, we describe a cogent strategy for implementing a clinical variant detection pipeline; a strategy that requires careful software selection, variant caller filtering optimizing, and combined variant calls in order to effectively minimize false negative variants. While implementing such features represents an increase in complexity and computation the results offer indisputable improvements in data quality.
Extensive evaluation of RNA-seq methods have demonstrated that no single algorithm consistently outperforms all others. Removal of unwanted variation (RUV) has also been proposed as a method for ...stabilizing differential expression (DE) results. Despite this, it remains a challenge to run multiple RNA-seq algorithms to identify significant differences common to multiple algorithms, whilst also integrating and assessing the impact of RUV into all algorithms. consensusDE was developed to automate the process of identifying significant DE by combining the results from multiple algorithms with minimal user input and with the option to automatically integrate RUV. consensusDE only requires a table describing the sample groups, a directory containing BAM files or preprocessed count tables and an optional transcript database for annotation. It supports merging of technical replicates, paired analyses and outputs a compendium of plots to guide the user in subsequent analyses. Herein, we assess the ability of RUV to improve DE stability when combined with multiple algorithms and between algorithms, through application to real and simulated data. We find that, although RUV increased fold change stability between algorithms, it demonstrated improved FDR in a setting of low replication for the intersect, the effect was algorithm specific and diminished with increased replication, reinforcing increased replication for recovery of true DE genes. We finish by offering some rules and considerations for the application of RUV in a consensus-based setting. consensusDE is freely available, implemented in R and available as a Bioconductor package, under the GPL-3 license, along with a comprehensive vignette describing functionality: http://bioconductor.org/packages/consensusDE/.
Short-read resequencing of genomes produces abundant information of the genetic variation of individuals. Due to their numerous nature, these variants are rarely exhaustively validated. Furthermore, ...low levels of undetected variant miscalling will have a systematic and disproportionate impact on the interpretation of individual genome sequence information, especially should these also be carried through into in reference databases of genomic variation.
We find that sequence variation from short-read sequence data is subject to recurrent-yet-intermittent miscalling that occurs in a sequence intrinsic manner and is very sensitive to sequence read length. The miscalls arise from difficulties aligning short reads to redundant genomic regions, where the rate of sequencing error approaches the sequence diversity between redundant regions. We find the resultant miscalled variants to be sensitive to small sequence variations between genomes, and thereby are often intrinsic to an individual, pedigree, strain or human ethnic group. In human exome sequences, we identify 2-300 recurrent false positive variants per individual, almost all of which are present in public databases of human genomic variation. From the exomes of non-reference strains of inbred mice, we identify 3-5000 recurrent false positive variants per mouse - the number of which increasing with greater distance between an individual mouse strain and the reference C57BL6 mouse genome. We show that recurrently miscalled variants may be reproduced for a given genome from repeated simulation rounds of read resampling, realignment and recalling. As such, it is possible to identify more than two-thirds of false positive variation from only ten rounds of simulation.
Identification and removal of recurrent false positive variants from specific individual variant sets will improve overall data quality. Variant miscalls arising are highly sequence intrinsic and are often specific to an individual, pedigree or ethnicity. Further, read length is a strong determinant of whether given false variants will be called for any given genome - which has profound significance for cohort studies that pool datasets collected and sequenced at different points in time.
Systemic lupus erythematosus (SLE) is the prototypic systemic autoimmune disease. It is thought that many common variant gene loci of weak effect act additively to predispose to common autoimmune ...diseases, while the contribution of rare variants remains unclear. Here we describe that rare coding variants in lupus-risk genes are present in most SLE patients and healthy controls. We demonstrate the functional consequences of rare and low frequency missense variants in the interacting proteins BLK and BANK1, which are present alone, or in combination, in a substantial proportion of lupus patients. The rare variants found in patients, but not those found exclusively in controls, impair suppression of IRF5 and type-I IFN in human B cell lines and increase pathogenic lymphocytes in lupus-prone mice. Thus, rare gene variants are common in SLE and likely contribute to genetic risk.
Objective
Systemic lupus erythematosus (SLE) is a chronic and heterogeneous autoimmune disease. Both twin and sibling studies indicate a strong genetic contribution to lupus, but in the majority of ...cases the pathogenic variant remains to be identified. The genetic contribution to disease is likely to be greatest in cases with early onset and severe phenotypes. Whole‐exome sequencing now offers the possibility of identifying rare alleles responsible for disease in such cases. This study was undertaken to identify genetic causes of SLE using whole‐exome sequencing.
Methods
We performed whole‐exome sequencing in a 4‐year‐old girl with early‐onset SLE and conducted biochemical analysis of the putative defect.
Results
Whole‐exome sequencing in a 4‐year‐old girl with cerebral lupus identified a rare, homozygous mutation in the three prime repair exonuclease 1 gene (TREX1) that was predicted to be highly deleterious. The TREX1 R97H mutant protein had a 20‐fold reduction in exonuclease activity and was associated with an elevated interferon‐α (IFNα) signature in the patient. The discovery and characterization of a pathogenic TREX1 variant in our proband has therapeutic implications. The patient is now a candidate for neutralizing anti‐IFNα therapy.
Conclusion
Our study is the first to demonstrate that whole‐exome sequencing can be used to identify rare or novel deleterious variants as genetic causes of SLE and, through a personalized approach, improve therapeutic options.