We present single-cell clustering using bifurcation analysis (SCUBA), a novel computational method for extracting lineage relationships from single-cell gene expression data and modeling the dynamic ...changes associated with cell differentiation. SCUBA draws techniques from nonlinear dynamics and stochastic differential equation theories, providing a systematic framework for modeling complex processes involving multilineage specifications. By applying SCUBA to analyze two complementary, publicly available datasets we successfully reconstructed the cellular hierarchy during early development of mouse embryos, modeled the dynamic changes in gene expression patterns, and predicted the effects of perturbing key transcriptional regulators on inducing lineage biases. The results were robust with respect to experimental platform differences between RT-PCR and RNA sequencing. We selectively tested our predictions in Nanog mutants and found good agreement between SCUBA predictions and the experimental data. We further extended the utility of SCUBA by developing a method to reconstruct missing temporal-order information from a typical single-cell dataset. Analysis of a hematopoietic dataset suggests that our method is effective for reconstructing gene expression dynamics during human B-cell development. In summary, SCUBA provides a useful single-cell data analysis tool that is well-suited for the investigation of developmental processes.
Significance Characterization of cellular heterogeneity and hierarchy are important tasks in developmental biology and may help overcome drug resistance in treatment of cancer and other diseases. Single-cell technologies provide a powerful tool for detecting rare cell types and cell-fate transition events, whereas traditional gene expression profiling methods can be used only to measure the average behavior of a cell population. However, the lack of suitable computational methods for single-cell data analysis has become a bottleneck. Here we present a method with the focuses on automatically detecting multilineage transitions and on modeling the associated changes in gene expression patterns. We show that our method is generally applicable and that its applications provide biological insights into developmental processes.
Genome-wide mapping of three dimensional chromatin organization is an important yet technically challenging task. To aid experimental effort and to understand the determinants of long-range chromatin ...interactions, we have developed a computational model integrating Hi-C and histone mark ChIP-seq data to predict two important features of chromatin organization: chromatin interaction hubs and topologically associated domain (TAD) boundaries. Our model accurately and robustly predicts these features across datasets and cell types. Cell-type specific histone mark information is required for prediction of chromatin interaction hubs but not for TAD boundaries. Our predictions provide a useful guide for the exploration of chromatin organization.
CRISPR-Cas systems have been broadly embraced as effective tools for genome engineering applications, with most studies to date utilizing the Streptococcus pyogenes Cas9. Here we characterize and ...manipulate the smaller, 1053 amino acid nuclease Staphylococcus aureus Cas9.
We find that the S. aureus Cas9 recognizes an NNGRRT protospacer adjacent motif (PAM) and cleaves target DNA at high efficiency with a variety of guide RNA (gRNA) spacer lengths. When directed against genomic targets with mutually permissive NGGRRT PAMs, the S. pyogenes Cas9 and S. aureus Cas9 yield indels at comparable rates. We additionally show D10A and N580A paired nickase activity with S. aureus Cas9, and we further package it with two gRNAs in a single functional adeno-associated virus (AAV) vector. Finally, we assess comparative S. pyogenes and S. aureus Cas9 specificity using GUIDE-seq.
Our results reveal an S. aureus Cas9 that is effective for a variety of genome engineering purposes, including paired nickase approaches and all-in-one delivery of Cas9 and multiple gRNA expression cassettes with AAV vectors.
The CCCTC-binding factor (CTCF), which anchors DNA loops that organize the genome into structural domains, has a central role in gene control by facilitating or constraining interactions between ...genes and their regulatory elements
. In cancer cells, the disruption of CTCF binding at specific loci by somatic mutation
or DNA hypermethylation
results in the loss of loop anchors and consequent activation of oncogenes. By contrast, the germ-cell-specific paralogue of CTCF, BORIS (brother of the regulator of imprinted sites, also known as CTCFL)
, is overexpressed in several cancers
, but its contributions to the malignant phenotype remain unclear. Here we show that aberrant upregulation of BORIS promotes chromatin interactions in ALK-mutated, MYCN-amplified neuroblastoma
cells that develop resistance to ALK inhibition. These cells are reprogrammed to a distinct phenotypic state during the acquisition of resistance, a process defined by the initial loss of MYCN expression followed by subsequent overexpression of BORIS and a concomitant switch in cellular dependence from MYCN to BORIS. The resultant BORIS-regulated alterations in chromatin looping lead to the formation of super-enhancers that drive the ectopic expression of a subset of proneural transcription factors that ultimately define the resistance phenotype. These results identify a previously unrecognized role of BORIS-to promote regulatory chromatin interactions that support specific cancer phenotypes.
Understanding the diversity of repair outcomes after introducing a genomic cut is essential for realizing the therapeutic potential of genomic editing technologies. Targeted PCR amplification ...combined with Next Generation Sequencing (NGS) or enzymatic digestion, while broadly used in the genome editing field, has critical limitations for detecting and quantifying structural variants such as large deletions (greater than approximately 100 base pairs), inversions, and translocations.
To overcome these limitations, we have developed a Uni-Directional Targeted Sequencing methodology, UDiTaS, that is quantitative, removes biases associated with variable-length PCR amplification, and can measure structural changes in addition to small insertion and deletion events (indels), all in a single reaction. We have applied UDiTaS to a variety of samples, including those treated with a clinically relevant pair of S. aureus Cas9 single guide RNAs (sgRNAs) targeting CEP290, and a pair of S. pyogenes Cas9 sgRNAs at T-cell relevant loci. In both cases, we have simultaneously measured small and large edits, including inversions and translocations, exemplifying UDiTaS as a valuable tool for the analysis of genome editing outcomes.
UDiTaS is a robust and streamlined sequencing method useful for measuring small indels as well as structural rearrangements, like translocations, in a single reaction. UDiTaS is especially useful for pre-clinical and clinical application of gene editing to measure on- and off-target editing, large and small.
Stem cell differentiation pathways are most often studied at the population level, whereas critical decisions are executed at the level of single cells. We have established a highly multiplexed, ...quantitative PCR assay to profile in an unbiased manner a panel of all commonly used cell surface markers (280 genes) from individual cells. With this method, we analyzed over 1,500 single cells throughout the mouse hematopoietic system and illustrate its utility for revealing important biological insights. The comprehensive single cell data set permits mapping of the mouse hematopoietic stem cell differentiation hierarchy by computational lineage progression analysis. Further profiling of 180 intracellular regulators enabled construction of a genetic network to assign the earliest differentiation event during hematopoietic lineage specification. Analysis of acute myeloid leukemia elicited by MLL-AF9 uncovered a distinct cellular hierarchy containing two independent self-renewing lineages with different clonal activities. The strategy has broad applicability in other cellular systems.
Display omitted
•Robust methodology for single-cell analysis of the cell surface repertoire•Comprehensive single-cell analysis for the mouse hematopoietic system•Lineage progression analysis and network construction using single-cell data•Characterization of the unique cellular hierarchy in a mouse AML model
Single-cell analysis of the cell surface repertoire enables mapping of cellular heterogeneity and differentiation hierarchy, applied here to the hematopoietic system but with broad applicability.
Chromatin-state analysis is widely applied in the studies of development and diseases. However, existing methods operate at a single length scale, and therefore cannot distinguish large domains from ...isolated elements of the same type. To overcome this limitation, we present a hierarchical hidden Markov model, diHMM, to systematically annotate chromatin states at multiple length scales. We apply diHMM to analyse a public ChIP-seq data set. diHMM not only accurately captures nucleosome-level information, but identifies domain-level states that vary in nucleosome-level state composition, spatial distribution and functionality. The domain-level states recapitulate known patterns such as super-enhancers, bivalent promoters and Polycomb repressed regions, and identify additional patterns whose biological functions are not yet characterized. By integrating chromatin-state information with gene expression and Hi-C data, we identify context-dependent functions of nucleosome-level states. Thus, diHMM provides a powerful tool for investigating the role of higher-order chromatin structure in gene regulation.
Brain metastases represent the greatest clinical challenge in treating HER2-positive breast cancer. We report the development of orthotopic patient-derived xenografts (PDXs) of HER2-expressing breast ...cancer brain metastases (BCBM), and their use for the identification of targeted combination therapies. Combined inhibition of PI3K and mTOR resulted in durable tumor regressions in three of five PDXs, and therapeutic response was correlated with a reduction in the phosphorylation of 4EBP1, an mTORC1 effector. The two nonresponding PDXs showed hypermutated genomes with enrichment of mutations in DNA-repair genes, which suggests an association of genomic instability with therapeutic resistance. These findings suggest that a biomarker-driven clinical trial of PI3K inhibitor in combination with an mTOR inhibitor should be conducted for patients with HER2-positive BCBM.
Kinetochores attach sister chromatids to microtubules of the mitotic spindle and orchestrate chromosome disjunction at anaphase. Although S. cerevisiae has the simplest known kinetochores, they ...nonetheless contain approximately 70 subunits that assemble on centromeric DNA in a hierarchical manner. Developing an accurate picture of the DNA-binding, linker and microtubule-binding layers of kinetochores, including the functions of individual proteins in these layers, is a key challenge in the field of yeast chromosome segregation. Moreover, comparison of orthologous proteins in yeast and humans promises to extend insight obtained from the study of simple fungal kinetochores to complex animal cell kinetochores.
We show that S. cerevisiae Spc105p forms a heterotrimeric complex with Kre28p, the likely orthologue of the metazoan kinetochore protein Zwint-1. Through systematic analysis of interdependencies among kinetochore complexes, focused on Spc105p/Kre28p, we develop a comprehensive picture of the assembly hierarchy of budding yeast kinetochores. We find Spc105p/Kre28p to comprise the third linker complex that, along with the Ndc80 and MIND linker complexes, is responsible for bridging between centromeric heterochromatin and kinetochore MAPs and motors. Like the Ndc80 complex, Spc105p/Kre28p is also essential for kinetochore binding by components of the spindle assembly checkpoint. Moreover, these functions are conserved in human cells.
Spc105p/Kre28p is the last of the core linker complexes to be analyzed in yeast and we show it to be required for kinetochore binding by a discrete subset of kMAPs (Bim1p, Bik1p, Slk19p) and motors (Cin8p, Kar3p), all of which are nonessential. Strikingly, dissociation of these proteins from kinetochores prevents bipolar attachment, even though the Ndc80 and DASH complexes, the two best-studied kMAPs, are still present. The failure of Spc105 deficient kinetochores to bind correctly to spindle microtubules and to recruit checkpoint proteins in yeast and human cells explains the observed severity of missegregation phenotypes.
E-type cyclins (cyclins E1 and E2) are components of the cell cycle machinery that has been conserved from yeast to humans. The major function of E-type cyclins is to drive cell division. It is ...unknown whether in addition to their 'core' cell cycle functions, E-type cyclins also perform unique tissue-specific roles. Here, we applied high-throughput mass spectrometric analyses of mouse organs to define the repertoire of cyclin E protein partners in vivo. We found that cyclin E interacts with distinct sets of proteins in different compartments. These cyclin E interactors are highly enriched for phosphorylation targets of cyclin E and its catalytic partner, the cyclin-dependent kinase 2 (Cdk2). Among cyclin E interactors we identified several novel tissue-specific substrates of cyclin E-Cdk2 kinase. In proliferating compartments, cyclin E-Cdk2 phosphorylates Lin proteins within the DREAM complex. In the testes, cyclin E-Cdk2 phosphorylates Mybl1 and Dmrtc2, two meiotic transcription factors that represent key regulators of spermatogenesis. In embryonic and adult brains cyclin E interacts with proteins involved in neurogenesis, while in adult brains also with proteins regulating microtubule-based processes and microtubule cytoskeleton. We also used quantitative proteomics to demonstrate re-wiring of the cyclin E interactome upon ablation of Cdk2. This approach can be used to study how protein interactome changes during development or in any pathological state such as aging or cancer.