Genomic insertions, duplications and insertion/deletions (indels), which account for ~14% of human pathogenic mutations, cannot be accurately or efficiently corrected by current gene-editing methods, ...especially those that involve larger alterations (>100 base pairs (bp)). Here, we optimize prime editing (PE) tools for creating precise genomic deletions and direct the replacement of a genomic fragment ranging from ~1 kilobases (kb) to ~10 kb with a desired sequence (up to 60 bp) in the absence of an exogenous DNA template. By conjugating Cas9 nuclease to reverse transcriptase (PE-Cas9) and combining it with two PE guide RNAs (pegRNAs) targeting complementary DNA strands, we achieve precise and specific deletion and repair of target sequences via using this PE-Cas9-based deletion and repair (PEDAR) method. PEDAR outperformed other genome-editing methods in a reporter system and at endogenous loci, efficiently creating large and precise genomic alterations. In a mouse model of tyrosinemia, PEDAR removed a 1.38-kb pathogenic insertion within the Fah gene and precisely repaired the deletion junction to restore FAH expression in liver.
Piwi-interacting RNAs (piRNAs) silence transposons to safeguard genome integrity in animals. However, the functions of the many piRNAs that do not map to transposons remain unknown. Here, we show ...that piRNA targeting in
can tolerate a few mismatches but prefer perfect pairing at the seed region. The broad targeting capacity of piRNAs underlies the germline silencing of transgenes in
Transgenes engineered to avoid piRNA recognition are stably expressed. Many endogenous germline-expressed genes also contain predicted piRNA targeting sites, and periodic An/Tn clusters (PATCs) are an intrinsic signal that provides resistance to piRNA silencing. Together, our study revealed the piRNA targeting rules and highlights a distinct strategy that
uses to distinguish endogenous from foreign nucleic acids.
Protein-protein interactions are essential to cellular and immune function, and in many cases, because of the absence of an experimentally determined structure of the complex, these interactions must ...be modeled to obtain an understanding of their molecular basis. We present a user-friendly protein docking server, based on the rigid-body docking programs ZDOCK and M-ZDOCK, to predict structures of protein-protein complexes and symmetric multimers. With a goal of providing an accessible and intuitive interface, we provide options for users to guide the scoring and the selection of output models, in addition to dynamic visualization of input structures and output docking models. This server enables the research community to easily and quickly produce structural models of protein-protein complexes and symmetric multimers for their own analysis.
The ZDOCK server is freely available to all academic and non-profit users at: http://zdock.umassmed.edu. No registration is required.
The YAP and TAZ paralogs are transcriptional co-activators recruited to target sites by TEAD proteins. Here, we show that YAP and TAZ are also recruited by JUNB (a member of the AP-1 family) and ...STAT3, key transcription factors that mediate an epigenetic switch linking inflammation to cellular transformation. YAP and TAZ directly interact with JUNB and STAT3 via a WW domain important for transformation, and they stimulate transcriptional activation by AP-1 proteins. JUNB, STAT3, and TEAD co-localize at virtually all YAP/TAZ target sites, yet many target sites only contain individual AP-1, TEAD, or STAT3 motifs. This observation and differences in relative crosslinking efficiencies of JUNB, TEAD, and STAT3 at YAP/TAZ target sites suggest that YAP/TAZ is recruited by different forms of an AP-1/STAT3/TEAD complex depending on the recruiting motif. The different classes of YAP/TAZ target sites are associated with largely non-overlapping genes with distinct functions. A small minority of target sites are YAP- or TAZ-specific, and they are associated with different sequence motifs and gene classes from shared YAP/TAZ target sites. Genes containing either the AP-1 or TEAD class of YAP/TAZ sites are associated with poor survival of breast cancer patients with the triple-negative form of the disease.
The growing list of mutations implicated in monogenic disorders of the developing brain includes at least seven genes (ARX, CUL4B, KDM5A, KDM5C, KMT2A, KMT2C, KMT2D) with loss-of-function mutations ...affecting proper regulation of histone H3 lysine 4 methylation, a chromatin mark which on a genome-wide scale is broadly associated with active gene expression, with its mono-, di- and trimethylated forms differentially enriched at promoter and enhancer and other regulatory sequences. In addition to these rare genetic syndromes, dysregulated H3K4 methylation could also play a role in the pathophysiology of some cases diagnosed with autism or schizophrenia, two conditions which on a genome-wide scale are associated with H3K4 methylation changes at hundreds of loci in a subject-specific manner. Importantly, the reported alterations for some of the diseased brain specimens included a widespread broadening of H3K4 methylation profiles at gene promoters, a process that could be regulated by the UpSET(KMT2E/MLL5)-histone deacetylase complex. Furthermore, preclinical studies identified maternal immune activation, parental care and monoaminergic drugs as environmental determinants for brain-specific H3K4 methylation. These novel insights into the epigenetic risk architectures of neurodevelopmental disease will be highly relevant for efforts aimed at improved prevention and treatment of autism and psychosis spectrum disorders.
Computational prediction of the 3D structures of molecular interactions is a challenging area, often requiring significant computational resources to produce structural predictions with atomic-level ...accuracy. This can be particularly burdensome when modeling large sets of interactions, macromolecular assemblies, or interactions between flexible proteins. We previously developed a protein docking program, ZDOCK, which uses a fast Fourier transform to perform a 3D search of the spatial degrees of freedom between two molecules. By utilizing a pairwise statistical potential in the ZDOCK scoring function, there were notable gains in docking accuracy over previous versions, but this improvement in accuracy came at a substantial computational cost. In this study, we incorporated a recently developed 3D convolution library into ZDOCK, and additionally modified ZDOCK to dynamically orient the input proteins for more efficient convolution. These modifications resulted in an average of over 8.5-fold improvement in running time when tested on 176 cases in a newly released protein docking benchmark, as well as substantially less memory usage, with no loss in docking accuracy. We also applied these improvements to a previous version of ZDOCK that uses a simpler non-pairwise atomic potential, yielding an average speed improvement of over 5-fold on the docking benchmark, while maintaining predictive success. This permits the utilization of ZDOCK for more intensive tasks such as docking flexible molecules and modeling of interactomes, and can be run more readily by those with limited computational resources.
Insertions and excisions of transposable elements (TEs) affect both the stability and variability of the genome. Studying the dynamics of transposition at the population level can provide crucial ...insights into the processes and mechanisms of genome evolution. Pooling genomic materials from multiple individuals followed by high-throughput sequencing is an efficient way of characterizing genomic polymorphisms in a population. Here we describe a novel method named TEMP, specifically designed to detect TE movements present with a wide range of frequencies in a population. By combining the information provided by pair-end reads and split reads, TEMP is able to identify both the presence and absence of TE insertions in genomic DNA sequences derived from heterogeneous samples; accurately estimate the frequencies of transposition events in the population and pinpoint junctions of high frequency transposition events at nucleotide resolution. Simulation data indicate that TEMP outperforms other algorithms such as PoPoolationTE, RetroSeq, VariationHunter and GASVPro. TEMP also performs well on whole-genome human data derived from the 1000 Genomes Project. We applied TEMP to characterize the TE frequencies in a wild Drosophila melanogaster population and study the inheritance patterns of TEs during hybrid dysgenesis. We also identified sequence signatures of TE insertion and possible molecular effects of TE movements, such as altered gene expression and piRNA production. TEMP is freely available at github: https://github.com/JialiUMassWengLab/TEMP.git.
PIWI-interacting RNAs (piRNAs) protect the animal germ line by silencing transposons. Primary piRNAs, generated from transcripts of genomic transposon "junkyards" (piRNA clusters), are amplified by ...the "ping-pong" pathway, yielding secondary piRNAs. We report that secondary piRNAs, bound to the PIWI protein Ago3, can initiate primary piRNA production from cleaved transposon RNAs. The first ∼26 nucleotides (nt) of each cleaved RNA becomes a secondary piRNA, but the subsequent ∼26 nt become the first in a series of phased primary piRNAs that bind Piwi, allowing piRNAs to spread beyond the site of RNA cleavage. The ping-pong pathway increases only the abundance of piRNAs, whereas production of phased primary piRNAs from cleaved transposon RNAs adds sequence diversity to the piRNA pool, allowing adaptation to changes in transposon sequence.