Genome editing has attracted wide interest for the generation of cellular models of disease using human pluripotent stem cells and other cell types. CRISPR-Cas systems and TALENs can target desired ...genomic sites with high efficiency in human cells, but recent publications have led to concern about the extent to which these tools may cause off-target mutagenic effects that could potentially confound disease-modeling studies. Using CRISPR-Cas9 and TALEN targeted human pluripotent stem cell clones, we performed whole-genome sequencing at high coverage in order to assess the degree of mutagenesis across the entire genome. In both types of clones, we found that off-target mutations attributable to the nucleases were very rare. From this analysis, we suggest that, although some cell types may be at risk for off-target mutations, the incidence of such effects in human pluripotent stem cells may be sufficiently low and thus not a significant concern for disease modeling and other applications.
Display omitted
•CRISPR-Cas9 and TALENs were used to genome edit human pluripotent stem cells•Whole-genome sequencing of individual clones found very few off-target mutations
Recent work suggested that off-target mutagenesis is a problem for genome-editing technology, but Veres et al. show that, at least for human pluripotent stem cells, the frequency of such mutations appears to be low.
Truncating mutations of chromodomain helicase DNA-binding protein 8 ( CHD8 ), and of many other genes with diverse functions, are strong-effect risk factors for autism spectrum disorder (ASD), ...suggesting multiple mechanisms of pathogenesis. We explored the transcriptional networks that CHD8 regulates in neural progenitor cells (NPCs) by reducing its expression and then integrating transcriptome sequencing (RNA sequencing) with genome-wide CHD8 binding (ChIP sequencing). Suppressing CHD8 to levels comparable with the loss of a single allele caused altered expression of 1,756 genes, 64.9% of which were up-regulated. CHD8 showed widespread binding to chromatin, with 7,324 replicated sites that marked 5,658 genes. Integration of these data suggests that a limited array of direct regulatory effects of CHD8 produced a much larger network of secondary expression changes. Genes indirectly down-regulated (i.e., without CHD8-binding sites) reflect pathways involved in brain development, including synapse formation, neuron differentiation, cell adhesion, and axon guidance, whereas CHD8-bound genes are strongly associated with chromatin modification and transcriptional regulation. Genes associated with ASD were strongly enriched among indirectly down-regulated loci ( P < 10 ⁻⁸) and CHD8-bound genes ( P = 0.0043), which align with previously identified coexpression modules during fetal development. We also find an intriguing enrichment of cancer-related gene sets among CHD8-bound genes ( P < 10 ⁻¹⁰). In vivo suppression of chd8 in zebrafish produced macrocephaly comparable to that of humans with inactivating mutations. These data indicate that heterozygous disruption of CHD8 precipitates a network of gene-expression changes involved in neurodevelopmental pathways in which many ASD-associated genes may converge on shared mechanisms of pathogenesis.
Significance Truncating mutation of chromodomain helicase DNA-binding protein 8 ( CHD8 ) represents one of the strongest known risk factors for autism spectrum disorder (ASD). We mimicked the effects of such heterozygous loss-of-function mutations in neural progenitor cells and integrated RNA sequencing with genome-wide delineation of CHD8 binding. Our results reveal that the molecular mechanism by which CHD8 alters neurodevelopmental pathways may involve both direct and indirect effects, the latter involving down-regulation following CHD8 suppression. We also find that chd8 suppression in zebrafish results in macrocephaly, consistent with observations in patients harboring loss-of-function mutations. We show that reduced expression of CHD8 impacts a variety of other functionally distinct ASD-associated genes, suggesting that the diverse functions of ASD risk factors may constitute multiple means of triggering a smaller number of final common pathways.
Abstract
Pre-mRNA splicing is a key controller of human gene expression. Disturbances in splicing due to mutation lead to dysregulated protein expression and contribute to a substantial fraction of ...human disease. Several classes of splicing modulator compounds (SMCs) have been recently identified and establish that pre-mRNA splicing represents a target for therapy. We describe herein the identification of BPN-15477, a SMC that restores correct splicing of
ELP1
exon 20. Using transcriptome sequencing from treated fibroblast cells and a machine learning approach, we identify BPN-15477 responsive sequence signatures. We then leverage this model to discover 155 human disease genes harboring ClinVar mutations predicted to alter pre-mRNA splicing as targets for BPN-15477. Splicing assays confirm successful correction of splicing defects caused by mutations in
CFTR
,
LIPA
,
MLH1
and
MAPT
. Subsequent validations in two disease-relevant cellular models demonstrate that BPN-15477 increases functional protein, confirming the clinical potential of our predictions.
Chromosome 16p11.2 reciprocal genomic disorder, resulting from recurrent copy-number variants (CNVs), involves intellectual disability, autism spectrum disorder (ASD), and schizophrenia, but the ...responsible mechanisms are not known. To systemically dissect molecular effects, we performed transcriptome profiling of 350 libraries from six tissues (cortex, cerebellum, striatum, liver, brown fat, and white fat) in mouse models harboring CNVs of the syntenic 7qF3 region, as well as cellular, transcriptional, and single-cell analyses in 54 isogenic neural stem cell, induced neuron, and cerebral organoid models of CRISPR-engineered 16p11.2 CNVs. Transcriptome-wide differentially expressed genes were largely tissue-, cell-type-, and dosage-specific, although more effects were shared between deletion and duplication and across tissue than expected by chance. The broadest effects were observed in the cerebellum (2,163 differentially expressed genes), and the greatest enrichments were associated with synaptic pathways in mouse cerebellum and human induced neurons. Pathway and co-expression analyses identified energy and RNA metabolism as shared processes and enrichment for ASD-associated, loss-of-function constraint, and fragile X messenger ribonucleoprotein target gene sets. Intriguingly, reciprocal 16p11.2 dosage changes resulted in consistent decrements in neurite and electrophysiological features, and single-cell profiling of organoids showed reciprocal alterations to the proportions of excitatory and inhibitory GABAergic neurons. Changes both in neuronal ratios and in gene expression in our organoid analyses point most directly to calretinin GABAergic inhibitory neurons and the excitatory/inhibitory balance as targets of disruption that might contribute to changes in neurodevelopmental and cognitive function in 16p11.2 carriers. Collectively, our data indicate the genomic disorder involves disruption of multiple contributing biological processes and that this disruption has relative impacts that are context specific.
Display omitted
The 16p11.2 genomic disorder involves deletion and duplication of a contiguous set of genes, resulting in neurodevelopmental and other abnormalities. On the basis of comparisons of mouse models and human neuronal cells, the effects of these lesions on genome-wide gene expression and cellular function are highly dependent on cellular context.
Preeclampsia is a hypertensive disorder of pregnancy, which complicates up to 15% of US deliveries. It is an idiopathic disorder associated with several different phenotypes. We sought to determine ...if the genetic architecture of preeclampsia can be described by clusters of patients with variants in genes in shared protein interaction networks. We performed a case-control study using whole exome sequencing on early onset preeclamptic mothers with severe clinical features and control mothers with uncomplicated pregnancies between 2016 and 2020. A total of 143 patients were enrolled, 61 women with early onset preeclampsia with severe features based on ACOG criteria, and 82 control women at term, matched for race and ethnicity. A network analysis and visualization tool, Proteinarium, was used to confirm there are clusters of patients with shared gene networks associated with severe preeclampsia. The majority of the sequenced patients appear in two significant clusters. We identified one case dominant and one control dominant cluster. Thirteen genes were unique to the case dominated cluster. Among these genes, LAMB2, PTK2, RAC1, QSOX1, FN1, and VCAM1 have known associations with the pathogenic mechanisms of preeclampsia. Using bioinformatic analysis, we were able to identify subsets of patients with shared protein interaction networks, thus confirming our hypothesis about the genetic architecture of preeclampsia.
MBD5, encoding the methyl-CpG-binding domain 5 protein, has been proposed as a necessary and sufficient driver of the 2q23.1 microdeletion syndrome. De novo missense and protein-truncating variants ...from exome sequencing studies have directly implicated MBD5 in the etiology of autism spectrum disorder (ASD) and related neurodevelopmental disorders (NDDs). However, little is known concerning the specific function(s) of MBD5.
To gain insight into the complex interactions associated with alteration of MBD5 in individuals with ASD and related NDDs, we explored the transcriptional landscape of MBD5 haploinsufficiency across multiple mouse brain regions of a heterozygous hypomorphic Mbd5
mouse model, and compared these results to CRISPR-mediated mutations of MBD5 in human iPSC-derived neuronal models.
Gene expression analyses across three brain regions from Mbd5
mice showed subtle transcriptional changes, with cortex displaying the most widespread changes following Mbd5 reduction, indicating context-dependent effects. Comparison with MBD5 reduction in human neuronal cells reinforced the context-dependence of gene expression changes due to MBD5 deficiency. Gene co-expression network analyses revealed gene clusters that were associated with reduced MBD5 expression and enriched for terms related to ciliary function.
These analyses included a limited number of mouse brain regions and neuronal models, and the effects of the gene knockdown are subtle. As such, these results will not reflect the full extent of MBD5 disruption across human brain regions during early neurodevelopment in ASD, or capture the diverse spectrum of cell-type-specific changes associated with MBD5 alterations.
Our study points to modest and context-dependent transcriptional consequences of Mbd5 disruption in the brain. It also suggests a possible link between MBD5 and perturbations in ciliary function, which is an established pathogenic mechanism in developmental disorders and syndromes.
Nearly 6,000 QTL have been reported for 588 different traits in pigs, more than in any other livestock species. However, this effort has translated into only a few confirmed causative variants. A ...powerful strategy for revealing candidate genes involves expression QTL (eQTL) mapping, where the mRNA abundance of a set of transcripts is used as the response variable for a QTL scan.
We utilized a whole genome expression microarray and an F(2) pig resource population to conduct a global eQTL analysis in loin muscle tissue, and compared results to previously inferred phenotypic QTL (pQTL) from the same experimental cross. We found 62 unique eQTL (FDR <10%) and identified 3 gene networks enriched with genes subject to genetic control involved in lipid metabolism, DNA replication, and cell cycle regulation. We observed strong evidence of local regulation (40 out of 59 eQTL with known genomic position) and compared these eQTL to pQTL to help identify potential candidate genes. Among the interesting associations, we found aldo-keto reductase 7A2 (AKR7A2) and thioredoxin domain containing 12 (TXNDC12) eQTL that are part of a network associated with lipid metabolism and in turn overlap with pQTL regions for marbling, % intramuscular fat (% fat) and loin muscle area on Sus scrofa (SSC) chromosome 6. Additionally, we report 13 genomic regions with overlapping eQTL and pQTL involving 14 local eQTL.
Results of this analysis provide novel candidate genes for important complex pig phenotypes.
We provide a program update on the COBRE Center for the Computational Biology of Human Disease (CBHD) at Brown University and affiliated hospitals. High throughput data from multiple 'omics-level' ...technologies are fundamental factors in identifying and treating human disease. The acquisition of these data is now straightforward, but the efficient and creative interpretation of these data remains a serious impediment to progress for faculty at all levels in both the basic and translational aspects of biomedical science. The CBHD COBRE seeks to build close collaboration between laboratory scientists working with model systems and data scientists working with computational and bioinformatics tools that can accelerate human disease research implementation. We describe the accomplishments of junior faculty Project Leaders (9) and Pilots Project leaders (8) and the objectives of the CBHD COBRE's core facility: The Computational Biology Core (CBC). To extend the CBHD COBRE's reach in the future, we encourage one and all to visit the CBHD COBRE and bring your data sets and questions. Only by engaging with new people and challenges can the program grow to serve the broader biomedical research community in the State of Rhode Island.
X-linked Dystonia-Parkinsonism (XDP) is a Mendelian neurodegenerative disease that is endemic to the Philippines and is associated with a founder haplotype. We integrated multiple genome and ...transcriptome assembly technologies to narrow the causal mutation to the TAF1 locus, which included a SINE-VNTR-Alu (SVA) retrotransposition into intron 32 of the gene. Transcriptome analyses identified decreased expression of the canonical cTAF1 transcript among XDP probands, and de novo assembly across multiple pluripotent stem-cell-derived neuronal lineages discovered aberrant TAF1 transcription that involved alternative splicing and intron retention (IR) in proximity to the SVA that was anti-correlated with overall TAF1 expression. CRISPR/Cas9 excision of the SVA rescued this XDP-specific transcriptional signature and normalized TAF1 expression in probands. These data suggest an SVA-mediated aberrant transcriptional mechanism associated with XDP and may provide a roadmap for layered technologies and integrated assembly-based analyses for other unsolved Mendelian disorders.
Display omitted
•Genome assembly narrows the causal XDP locus to TAF1•An XDP-specific SVA insertion causes intron retention and reduced expression of TAF1•CRISPR/Cas9 excision of SVA rescues aberrant splicing and TAF1 expression in XDP•Expression profiling implicates neurodevelopment and dystonia pathways in XDP
A Mendelian form of dystonia arises from altered splicing and intron retention within a general transcription factor.
Structural variation (SV) influences genome organization and contributes to human disease. However, the complete mutational spectrum of SV has not been routinely captured in disease association ...studies.
We sequenced 689 participants with autism spectrum disorder (ASD) and other developmental abnormalities to construct a genome-wide map of large SV. Using long-insert jumping libraries at 105X mean physical coverage and linked-read whole-genome sequencing from 10X Genomics, we document seven major SV classes at ~5 kb SV resolution. Our results encompass 11,735 distinct large SV sites, 38.1% of which are novel and 16.8% of which are balanced or complex. We characterize 16 recurrent subclasses of complex SV (cxSV), revealing that: (1) cxSV are larger and rarer than canonical SV; (2) each genome harbors 14 large cxSV on average; (3) 84.4% of large cxSVs involve inversion; and (4) most large cxSV (93.8%) have not been delineated in previous studies. Rare SVs are more likely to disrupt coding and regulatory non-coding loci, particularly when truncating constrained and disease-associated genes. We also identify multiple cases of catastrophic chromosomal rearrangements known as chromoanagenesis, including somatic chromoanasynthesis, and extreme balanced germline chromothripsis events involving up to 65 breakpoints and 60.6 Mb across four chromosomes, further defining rare categories of extreme cxSV.
These data provide a foundational map of large SV in the morbid human genome and demonstrate a previously underappreciated abundance and diversity of cxSV that should be considered in genomic studies of human disease.