Functional genomics screens using multi-parametric assays are powerful approaches for identifying genes involved in particular cellular processes. However, they suffer from problems like noise, and ...often provide little insight into molecular mechanisms. A bottleneck for addressing these issues is the lack of computational methods for the systematic integration of multi-parametric phenotypic datasets with molecular interactions. Here, we present Integrative Multi Profile Analysis of Cellular Traits (IMPACT). The main goal of IMPACT is to identify the most consistent phenotypic profile among interacting genes. This approach utilizes two types of external information: sets of related genes (IMPACT-sets) and network information (IMPACT-modules). Based on the notion that interacting genes are more likely to be involved in similar functions than non-interacting genes, this data is used as a prior to inform the filtering of phenotypic profiles that are similar among interacting genes. IMPACT-sets selects the most frequent profile among a set of related genes. IMPACT-modules identifies sub-networks containing genes with similar phenotype profiles. The statistical significance of these selections is subsequently quantified via permutations of the data. IMPACT (1) handles multiple profiles per gene, (2) rescues genes with weak phenotypes and (3) accounts for multiple biases e.g. caused by the network topology. Application to a genome-wide RNAi screen on endocytosis showed that IMPACT improved the recovery of known endocytosis-related genes, decreased off-target effects, and detected consistent phenotypes. Those findings were confirmed by rescreening 468 genes. Additionally we validated an unexpected influence of the IGF-receptor on EGF-endocytosis. IMPACT facilitates the selection of high-quality phenotypic profiles using different types of independent information, thereby supporting the molecular interpretation of functional screens.
Deregulation of transcription factor (TF) networks is emerging as a major pathogenic event in many human cancers (Darnell, 2002 1; Libermann and Zerbini, 2006 2; Laoukili et al., 2007 3). Small ...molecule intervention is an attractive avenue to understand TF regulatory mechanisms in healthy and disease state, as well as for exploiting these targets therapeutically (Koehler et al., 2003 4; Berg, 2008 5; Koehler, 2010 6). However, because of their physico-chemical properties, TF targeting has been proven to be difficult (Verdine and Walensky, 2007 7). The TF FOXM1 is an important mitotic player (Wonsey and Follettie, 2005 8; Laoukili et al., 2005 9; McDonald, 2005 10) also implicated in cancer progression (Laoukili et al., 2007 3; Teh, 2011 11; Koo, 2012 12) and drug resistance development (Kwok et al., 2010 13; Carr et al., 14). Therefore, its inhibition is an attractive goal for cancer therapy. Here, we describe a computational biology approach, by giving detailed insights into methodologies and technical results, which was used to analyze the transcriptional RNA-Seq data presented in our previous work (Gormally et al., 2014 20). Our Bioinformatics analysis shed light on the cellular effect of a novel FOXM1 inhibitor (FDI-6) newly identified through a biophysical screen. The data for this report is available at the public GEO repository (accession numberhttp://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE58626).
We introduce RNA G-quadruplex sequencing (rG4-seq), a transcriptome-wide RNA G-quadruplex (rG4) profiling method that couples rG4-mediated reverse transcriptase stalling with next-generation ...sequencing. Using rG4-seq on polyadenylated-enriched HeLa RNA, we generated a global in vitro map of thousands of canonical and noncanonical rG4 structures. We characterize rG4 formation relative to cytosine content and alternative RNA structure stability, uncover rG4-dependent differences in RNA folding and show evolutionarily conserved enrichment in transcripts mediating RNA processing and stability.
G-rich DNA sequences can form four-stranded G-quadruplex (G4) secondary structures and are linked to fundamental biological processes such as transcription, replication and telomere maintenance. G4s ...are also implicated in promoting genome instability, cancer and other diseases. Here, we describe a detailed G4 ChIP-seq method that robustly enables the determination of G4 structure formation genome-wide in chromatin. This protocol adapts traditional ChIP-seq for the detection of DNA secondary structures through the use of a G4-structure-specific single-chain antibody with refinements in chromatin immunoprecipitation followed by high-throughput sequencing. This technology does not require expression of the G4 antibody in situ, enabling broad applicability to theoretically all chromatin sources. Beginning with chromatin isolation and antibody preparation, the entire protocol can be completed in <1 week, including basic computational analysis.
G-quadruplex (G4) structural motifs have been linked to transcription, replication and genome instability and are implicated in cancer and other diseases. However, it is crucial to demonstrate the ...bona fide formation of G4 structures within an endogenous chromatin context. Herein we address this through the development of G4 ChIP-seq, an antibody-based G4 chromatin immunoprecipitation and high-throughput sequencing approach. We find ∼10,000 G4 structures in human chromatin, predominantly in regulatory, nucleosome-depleted regions. G4 structures are enriched in the promoters and 5' UTRs of highly transcribed genes, particularly in genes related to cancer and in somatic copy number amplifications, such as MYC. Strikingly, de novo and enhanced G4 formation are associated with increased transcriptional activity, as shown by HDAC inhibitor-induced chromatin relaxation and observed in immortalized as compared to normal cellular states. Our findings show that regulatory, nucleosome-depleted chromatin and elevated transcription shape the endogenous human G4 DNA landscape.
Abstract
Genomic maps of DNA G-quadruplexes (G4s) can help elucidate the roles that these secondary structures play in various organisms. Herein, we employ an improved version of a G-quadruplex ...sequencing method (G4-seq) to generate whole genome G4 maps for 12 species that include widely studied model organisms and also pathogens of clinical relevance. We identify G4 structures that form under physiological K+ conditions and also G4s that are stabilized by the G4-targeting small molecule pyridostatin (PDS). We discuss the various structural features of the experimentally observed G-quadruplexes (OQs), highlighting differences in their prevalence and enrichment across species. Our study describes diversity in sequence composition and genomic location for the OQs in the different species and reveals that the enrichment of OQs in gene promoters is particular to mammals such as mouse and human, among the species studied. The multi-species maps have been made publicly available as a resource to the research community. The maps can serve as blueprints for biological experiments in those model organisms, where G4 structures may play a role.
Exploring the cell biology of hepatocytes in vitro could be a powerful strategy to dissect the molecular mechanisms underlying the structure and function of the liver in vivo. However, this approach ...relies on appropriate in vitro cell culture systems that can recapitulate the cell biological and metabolic features of the hepatocytes in the liver whilst being accessible to experimental manipulations. Here, we adapted protocols for high-resolution fluorescence microscopy and quantitative image analysis to compare two primary hepatocyte culture systems, monolayer and collagen sandwich, with respect to the distribution of two distinct populations of early endosomes (APPL1 and EEA1-positive), endocytic capacity, metabolic and signaling activities. In addition to the re-acquisition of hepatocellular polarity, primary hepatocytes grown in collagen sandwich but not in monolayer culture recapitulated the apico-basal distribution of EEA1 endosomes observed in liver tissue. We found that such distribution correlated with the organization of the actin cytoskeleton in vitro and, surprisingly, was dependent on the nutritional state in vivo. Hepatocytes in collagen sandwich also exhibited faster kinetics of low-density lipoprotein (LDL) and epidermal growth factor (EGF) internalization, showed improved insulin sensitivity and preserved their ability for glucose production, compared to hepatocytes in monolayer cultures. Although no in vitro culture system can reproduce the exquisite structural features of liver tissue, our data nevertheless highlight the ability of the collagen sandwich system to recapitulate key structural and functional properties of the hepatocytes in the liver and, therefore, support the usage of this system to study aspects of hepatocellular biology in vitro.
Double-strand DNA breaks (DSBs) continuously arise and cause mutations and chromosomal rearrangements. Here, we present DSBCapture, a sequencing-based method that captures DSBs in situ and directly ...maps these at single-nucleotide resolution, enabling the study of DSB origin. DSBCapture shows substantially increased sensitivity and data yield compared with other methods. Using DSBCapture, we uncovered a striking relationship between DSBs and elevated transcription within nucleosome-depleted chromatin.
Response and resistance to anticancer therapies vary due to intertumor and intratumor heterogeneity
. Here, we map differentially enriched G-quadruplex (G4) DNA structure-forming regions (∆G4Rs) in ...22 breast cancer patient-derived tumor xenograft (PDTX) models. ∆G4Rs are associated with the promoters of highly amplified genes showing high expression, and with somatic single-nucleotide variants. Differences in ΔG4R landscapes reveal seven transcription factor programs across PDTXs. ∆G4R abundance and locations stratify PDTXs into at least three G4-based subtypes. ∆G4Rs in most PDTXs (14 of 22) were found to associate with more than one breast cancer subtype, which we also call an integrative cluster (IC)
. This suggests the frequent coexistence of multiple breast cancer states within a PDTX model, the majority of which display aggressive triple-negative IC10 gene activity. Short-term cultures of PDTX models with increased ∆G4R levels are more sensitive to small molecules targeting G4 DNA. Thus, G4 landscapes reveal additional IC-related intratumor heterogeneity in PDTX biopsies, improving breast cancer stratification and potentially identifying new treatment strategies.
Control of DNA methylation level is critical for gene regulation, and the factors that govern hypomethylation at CpG islands (CGIs) are still being uncovered. Here, we provide evidence that ...G-quadruplex (G4) DNA secondary structures are genomic features that influence methylation at CGIs. We show that the presence of G4 structure is tightly associated with CGI hypomethylation in the human genome. Surprisingly, we find that these G4 sites are enriched for DNA methyltransferase 1 (DNMT1) occupancy, which is consistent with our biophysical observations that DNMT1 exhibits higher binding affinity for G4s as compared to duplex, hemi-methylated, or single-stranded DNA. The biochemical assays also show that the G4 structure itself, rather than sequence, inhibits DNMT1 enzymatic activity. Based on these data, we propose that G4 formation sequesters DNMT1 thereby protecting certain CGIs from methylation and inhibiting local methylation.