The Universal PBM Resource for Oligonucleotide Binding Evaluation (UniPROBE) serves as a convenient source of information on published data generated using universal protein-binding microarray (PBM) ...technology, which provides in vitro data about the relative DNA-binding preferences of transcription factors for all possible sequence variants of a length k ('k-mers'). The database displays important information about the proteins and displays their DNA-binding specificity data in terms of k-mers, position weight matrices and graphical sequence logos. This update to the database documents the growth of UniPROBE since the last update 4 years ago, and introduces a variety of new features and tools, including a new streamlined pipeline that facilitates data deposition by universal PBM data generators in the research community, a tool that generates putative nonbinding (i.e. negative control) DNA sequences for one or more proteins and novel motifs obtained by analyzing the PBM data using the BEEML-PBM algorithm for motif inference. The UniPROBE database is available at http://uniprobe.org.
A major challenge in biology is to understand how complex gene expression patterns are encoded in the genome. While transcriptional enhancers have been studied extensively, few transcriptional ...silencers have been identified, and they remain poorly understood. Here, we used a novel strategy to screen hundreds of sequences for tissue-specific silencer activity in whole Drosophila embryos. Almost all of the transcriptional silencers that we identified were also active enhancers in other cellular contexts. These elements are bound by more transcription factors than non-silencers. A subset of these silencers forms long-range contacts with promoters. Deletion of a silencer caused derepression of its target gene. Our results challenge the common practice of treating enhancers and silencers as separate classes of regulatory elements and suggest the possibility that thousands or more bifunctional CRMs remain to be discovered in Drosophila and 104–105 in humans.
Display omitted
•Silencers are bifunctional and can act as enhancers in other cellular contexts•A subset of silencers forms long-range contacts to promoters•Deletion of a silencer by genome editing caused derepression of its target gene•Results suggest that thousands of bifunctional elements in flies remain to be discovered
Gisselbrecht et al. performed a screen in developing Drosophila embryos for genomic sequences that can act as transcriptional silencers. They report that nearly all silencers are enhancers in other tissues or at other developmental stages. Their silencers fall into two classes, one of which forms physical chromosomal contacts with promoters.
The evolution of transcriptional regulatory networks entails the expansion and diversification of transcription factor (TF) families. The forkhead family of TFs, defined by a highly conserved winged ...helix DNA-binding domain (DBD), has diverged into dozens of subfamilies in animals, fungi, and related protists. We have used a combination of maximum-likelihood phylogenetic inference and independent, comprehensive functional assays of DNA-binding capacity to explore the evolution of DNA-binding specificity within the forkhead family. We present converging evidence that similar alternative sequence preferences have arisen repeatedly and independently in the course of forkhead evolution. The vast majority of DNA-binding specificity changes we observed are not explained by alterations in the known DNA-contacting amino acid residues conferring specificity for canonical forkhead binding sites. Intriguingly, we have found forkhead DBDs that retain the ability to bind very specifically to two completely distinct DNA sequence motifs. We propose an alternate specificity-determining mechanism whereby conformational rearrangements of the DBD broaden the spectrum of sequence motifs that a TF can recognize. DNA-binding bispecificity suggests a previously undescribed source of modularity and flexibility in gene regulation and may play an important role in the evolution of transcriptional regulatory networks.
Silencers are regulatory DNA elements that reduce transcription from their target promoters; they are the repressive counterparts of enhancers. Although discovered decades ago, and despite evidence ...of their importance in development and disease, silencers have been much less studied than enhancers. Recently, however, a series of papers have reported systematic studies of silencers in various model systems. Silencers are often bifunctional regulatory elements that can also act as enhancers, depending on cellular context, and are enriched for expression quantitative trait loci (eQTLs) and disease-associated variants. There is not yet evidence of a ‘silencer chromatin signature’, in the distribution of histone modifications or associated proteins, that is common to all silencers; instead, silencers may fall into various subclasses, acting by distinct (and possibly overlapping) mechanisms.
Silencers are less well-studied than enhancers, but a recent spate of papers has begun to systematically explore these repressive regulatory elements.Silencers are important for precise control of gene expression and are enriched for disease-associated regulatory variants.Most newly discovered silencers are bifunctional regulatory elements that also act as enhancers in other contexts. The traditional distinction between enhancers and silencers may be an oversimplification.Silencers appear to act by a variety of mechanisms, and may fall into various functionally distinct subclasses. This complicates the search for a predictive chromatin signature that would enable the rapid identification of silencers from high-throughput data.Understanding silencer function will be key to predicting the regulatory impact of genomic variation.
Sequencing of exomes and genomes has revealed abundant genetic variation affecting the coding sequences of human transcription factors (TFs), but the consequences of such variation remain largely ...unexplored. We developed a computational, structure-based approach to evaluate TF variants for their impact on DNA binding activity and used universal protein-binding microarrays to assay sequence-specific DNA binding activity across 41 reference and 117 variant alleles found in individuals of diverse ancestries and families with Mendelian diseases. We found 77 variants in 28 genes that affect DNA binding affinity or specificity and identified thousands of rare alleles likely to alter the DNA binding activity of human sequence-specific TFs. Our results suggest that most individuals have unique repertoires of TF DNA binding activities, which may contribute to phenotypic variation.
The lysine acetyltransferase KAT6A (MOZ, MYST3) belongs to the MYST family of chromatin regulators, facilitating histone acetylation. Dysregulation of KAT6A has been implicated in developmental ...syndromes and the onset of acute myeloid leukemia (AML). Previous work suggests that KAT6A is recruited to its genomic targets by a combinatorial function of histone binding PHD fingers, transcription factors and chromatin binding interaction partners. Here, we demonstrate that a winged helix (WH) domain at the very N-terminus of KAT6A specifically interacts with unmethylated CpG motifs. This DNA binding function leads to the association of KAT6A with unmethylated CpG islands (CGIs) genome-wide. Mutation of the essential amino acids for DNA binding completely abrogates the enrichment of KAT6A at CGIs. In contrast, deletion of a second WH domain or the histone tail binding PHD fingers only subtly influences the binding of KAT6A to CGIs. Overexpression of a KAT6A WH1 mutant has a dominant negative effect on H3K9 histone acetylation, which is comparable to the effects upon overexpression of a KAT6A HAT domain mutant. Taken together, our work revealed a previously unrecognized chromatin recruitment mechanism of KAT6A, offering a new perspective on the role of KAT6A in gene regulation and human diseases.
Combinatorial interactions among transcription factors (TFs) play essential roles in generating gene expression specificity and diversity in metazoans. Using yeast 2-hybrid (Y2H) assays on nearly all ...sequence-specific Drosophila TFs, we identified 1,983 protein-protein interactions (PPIs), more than doubling the number of currently known PPIs among Drosophila TFs. For quality assessment, we validated a subset of our interactions using MITOMI and bimolecular fluorescence complementation assays. We combined our interactome with prior PPI data to generate an integrated Drosophila TF-TF binary interaction network. Our analysis of ChIP-seq data, integrating PPI and gene expression information, uncovered different modes by which interacting TFs are recruited to DNA. We further demonstrate the utility of our Drosophila interactome in shedding light on human TF-TF interactions. This study reveals how TFs interact to bind regulatory elements in vivo and serves as a resource of Drosophila TF-TF binary PPIs for understanding tissue-specific gene regulation.
Display omitted
•Comprehensive yeast PPI screen among essentially all D. melanogaster TFs•A subset of Y2H results validated by in vitro and in vivo binary interaction assays•Compilation of an integrated fly TF-TF interactome•TF pairs can be recruited to DNA in different modes
Combinatorial regulation by transcription factors (TFs) is one mechanism for achieving condition and tissue-specific gene regulation. Shokri et al. mapped TF-TF interactions between most Drosophila TFs, reporting a comprehensive TF-TF network integrated with previously known interactions. They used this network to discern distinct TF-DNA binding modes.
Homeodomains (HDs) are the second largest class of DNA binding domains (DBDs) among eukaryotic sequence-specific transcription factors (TFs) and are the TF structural class with the largest number of ...disease-associated mutations in the Human Gene Mutation Database (HGMD). Despite numerous structural studies and large-scale analyses of HD DNA binding specificity, HD-DNA recognition is still not fully understood. Here, we analyze 92 human HD mutants, including disease-associated variants and variants of uncertain significance (VUS), for their effects on DNA binding activity. Many of the variants alter DNA binding affinity and/or specificity. Detailed biochemical analysis and structural modeling identifies 14 previously unknown specificity-determining positions, 5 of which do not contact DNA. The same missense substitution at analogous positions within different HDs often exhibits different effects on DNA binding activity. Variant effect prediction tools perform moderately well in distinguishing variants with altered DNA binding affinity, but poorly in identifying those with altered binding specificity. Our results highlight the need for biochemical assays of TF coding variants and prioritize dozens of variants for further investigations into their pathogenicity and the development of clinical diagnostics and precision therapies.
Deciphering the interplay between chromatin accessibility and transcription factor (TF) binding is fundamental to understanding transcriptional regulation, control of cellular states, and the ...establishment of new phenotypes. Recent genome-wide chromatin accessibility profiling studies have provided catalogs of putative open regions, where TFs can recognize their motifs and regulate gene expression programs. Here, we present motif enrichment in differential elements of accessibility (MEDEA), a computational tool that analyzes high-throughput chromatin accessibility genomic data to identify cell-type-specific accessible regions and lineage-specific motifs associated with TF binding therein. To benchmark MEDEA, we used a panel of reference cell lines profiled by ENCODE and curated by the ENCODE Project Consortium for the ENCODE-DREAM Challenge. By comparing results with RNA-seq data, ChIP-seq peaks, and DNase-seq footprints, we show that MEDEA improves the detection of motifs associated with known lineage specifiers. We then applied MEDEA to 610 ENCODE DNase-seq data sets, where it revealed significant motifs even when absolute enrichment was low and where it identified novel regulators, such as NRF1 in kidney development. Finally, we show that MEDEA performs well on both bulk and single-cell ATAC-seq data. MEDEA is publicly available as part of our Glossary-GENRE suite for motif enrichment analysis.
Genetic mutations responsible for oblique facial clefts (ObFC), a unique class of facial malformations, are largely unknown. We show that loss-of-function mutations in SPECC1L are pathogenic for this ...human developmental disorder and that SPECC1L is a critical organizer of vertebrate facial morphogenesis. During murine embryogenesis, Specc1l is expressed in cell populations of the developing facial primordial, which proliferate and fuse to form the face. In zebrafish, knockdown of a SPECC1L homolog produces a faceless phenotype with loss of jaw and facial structures, and knockdown in Drosophila phenocopies mutants in the integrin signaling pathway that exhibit cell-migration and -adhesion defects. Furthermore, in mammalian cells, SPECC1L colocalizes with both tubulin and actin, and its deficiency results in defective actin-cytoskeleton reorganization, as well as abnormal cell adhesion and migration. Collectively, these data demonstrate that SPECC1L functions in actin-cytoskeleton reorganization and is required for proper facial morphogenesis.