The repair outcomes at site-specific DNA double-strand breaks (DSBs) generated by the RNA-guided DNA endonuclease Cas9 determine how gene function is altered. Despite the widespread adoption ...of CRISPR-Cas9 technology to induce DSBs for genome engineering, the resulting repair products have not been examined in depth. Here, the DNA repair profiles of 223 sites in the human genome demonstrate that the pattern of DNA repair following Cas9 cutting at each site is nonrandom and consistent across experimental replicates, cell lines, and reagent delivery methods. Furthermore, the repair outcomes are determined by the protospacer sequence rather than genomic context, indicating that DNA repair profiling in cell lines can be used to anticipate repair outcomes in primary cells. Chemical inhibition of DNA-PK enabled dissection of the DNA repair profiles into contributions from c-NHEJ and MMEJ. Finally, this work elucidates a strategy for using “error-prone” DNA-repair machinery to generate precise edits.
Display omitted
•DNA repair profiles of 223 sites in the human genome reveal nonrandom outcomes•The protospacer sequence, not genomic context, determines Cas9 DSB repair outcomes•Each DNA repair profile is composed of specific contributions from c-NHEJ and MMEJ•DNA repair profiling can be used to anticipate repair outcomes in primary cells
van Overbeek, Capurso et al. demonstrate that repair outcomes are nonrandom at S. pyogenes Cas9-mediated DSBs and are determined by the protospacer sequence rather than genomic context. DNA repair profiling reveals specific contributions of c-NHEJ and MMEJ at each site and an approach to generate precise edits without exogenous template.
RNA-guided CRISPR-Cas9 endonucleases are widely used for genome engineering, but our understanding of Cas9 specificity remains incomplete. Here, we developed a biochemical method (SITE-Seq), using ...Cas9 programmed with single-guide RNAs (sgRNAs), to identify the sequence of cut sites within genomic DNA. Cells edited with the same Cas9-sgRNA complexes are then assayed for mutations at each cut site using amplicon sequencing. We used SITE-Seq to examine Cas9 specificity with sgRNAs targeting the human genome. The number of sites identified depended on sgRNA sequence and nuclease concentration. Sites identified at lower concentrations showed a higher propensity for off-target mutations in cells. The list of off-target sites showing activity in cells was influenced by sgRNP delivery, cell type and duration of exposure to the nuclease. Collectively, our results underscore the utility of combining comprehensive biochemical identification of off-target sites with independent cell-based measurements of activity at those sites when assessing nuclease activity and specificity.
Recombinant expression of large, multiprotein complexes is essential and often rate limiting for determining structural, biophysical, and biochemical properties of DNA repair, replication, ...transcription, and other key cellular processes. Baculovirus-infected insect cell expression systems are especially well suited for producing large, human proteins recombinantly, and multigene baculovirus systems have facilitated studies of multiprotein complexes. In this chapter, we describe a multigene baculovirus system called MacroBac that uses a Biobricks-type assembly method based on restriction and ligation (Series 11) or ligation-independent cloning (Series 438). MacroBac cloning and assembly is efficient and equally well suited for either single subcloning reactions or high-throughput cloning using 96-well plates and liquid handling robotics. MacroBac vectors are polypromoter with each gene flanked by a strong polyhedrin promoter and an SV40 poly(A) termination signal that minimize gene order expression level effects seen in many polycistronic assemblies. Large assemblies are robustly achievable, and we have successfully assembled as many as 10 genes into a single MacroBac vector. Importantly, we have observed significant increases in expression levels and quality of large, multiprotein complexes using a single, multigene, polypromoter virus rather than coinfection with multiple, single-gene viruses. Given the importance of characterizing functional complexes, we believe that MacroBac provides a critical enabling technology that may change the way that structural, biophysical, and biochemical research is done.
Protein fate in higher eukaryotes is controlled by three complexes that share conserved architectural elements: the proteasome, COP9 signalosome, and eukaryotic translation initiation factor 3 ...(eIF3). Here we reconstitute the 13-subunit human eIF3 in Escherichia coli, revealing its structural core to be the eight subunits with conserved orthologues in the proteasome lid complex and COP9 signalosome. This structural core in eIF3 binds to the small (40S) ribosomal subunit, to translation initiation factors involved in mRNA cap-dependent initiation, and to the hepatitis C viral (HCV) internal ribosome entry site (IRES) RNA. Addition of the remaining eIF3 subunits enables reconstituted eIF3 to assemble intact initiation complexes with the HCV IRES. Negative-stain EM reconstructions of reconstituted eIF3 further reveal how the approximately 400 kDa molecular mass structural core organizes the highly flexible 800 kDa molecular mass eIF3 complex, and mediates translation initiation.
The mechanism of DNA mismatch repair has been modeled upon biochemical studies of the
E. coli DNA adenine methylation–instructed pathway where the initial recognition of mismatched nucleotides is ...performed by the MutS protein.
M
ut
S
h
omologs (MSH) have been identified based on a highly conserved region containing a Walker-A adenine nucleotide binding motif. Here we show that adenine nucleotide binding and hydrolysis by the human mismatch recognition complex hMSH2-hMSH6 functions as a novel molecular switch. The hMSH2-hMSH6 complex is ON (binds mismatched nucleotides) in the ADP-bound form and OFF in the ATP-bound form. These results suggest a new model for the function of MutS proteins during mismatch repair in which the switch determines the timing of downstream events.
The Escherichia coli MukB, MukE, and MukF proteins form a bacterial condensin (MukBEF) that contributes to chromosome management by compacting DNA. MukB is an ATPase and DNA‐binding protein of the ...SMC superfamily; however, the structure and function of non‐SMC components, such as MukF, have been less forthcoming. Here, we report the crystal structure of the N‐terminal 287 amino acids of MukF at 2.9 Å resolution. This region folds into a winged‐helix domain and an extended coiled‐coil domain that self‐associate to form a stable, doubly domain‐swapped dimer. Protein dissection and affinity purification data demonstrate that the region of MukF C‐terminal to this fragment binds to MukE and MukB. Our findings, together with sequence analyses, indicate that MukF is a kleisin subunit for E. coli condensin and suggest a means by which it may organize the MukBEF assembly.
Structural biology and structural genomics projects routinely rely on recombinantly expressed proteins, but many proteins and complexes are difficult to obtain by this approach. We investigated ...native source proteins for high-throughput protein crystallography applications. The Escherichia coli proteome was fractionated, purified, crystallized, and structurally characterized. Macro-scale fermentation and fractionation were used to subdivide the soluble proteome into 408 unique fractions of which 295 fractions yielded crystals in microfluidic crystallization chips. Of the 295 crystals, 152 were selected for optimization, diffraction screening, and data collection. Twenty-three structures were determined, four of which were novel. This study demonstrates the utility of native source proteins for high-throughput crystallography.
We have previously shown that hMSH2-hMSH6 contains an intrinsic ATPase which is activated by mismatch-provoked ADP→ATP exchange that coordinately induces the formation of a sliding clamp capable of ...hydrolysis-independent diffusion along the DNA backbone (1, 2). These studies suggested that mismatch repair could be propagated by a signaling event transduced via diffusion of ATP-bound hMSH2-hMSH6 molecular switches to the DNA repair machinery. The Molecular Switch model (Fishel, R. (1998) Genes Dev. 12, 2096–2101) is considerably different than the Hydrolysis-Driven Translocation model (Blackwell, L. J., Martik, D., Bjornson, K. P., Bjornson, E. S., and Modrich, P. (1998) J. Biol. Chem. 273, 32055–32062) and makes additional testable predictions beyond the demonstration of hydrolysis-independent diffusion (Gradia, S., Subramanian, D., Wilson, T., Acharya, S., Makhov, A., Griffith, J., and Fishel, R. (1999) Mol. Cell3, 255–261): (i) individual mismatch-provoked ADP→ATP exchange should be unique and rate-limiting, and (ii) thekcat·DNA for the DNA-stimulated ATPase activity should decrease with increasing chain length. Here we have examined hMSH2-hMSH6 affinity and ATPase stimulatory activity for several DNA substrates containing mispaired nucleotides as well as the chain length dependence of a defined mismatch under physiological conditions. We find that the results are most consistent with the predictions of the Molecular Switch model.
The genetic and biochemical properties of three human MutS homologues, hMSH2, hMSH3, and hMSH6, have been examined. The full-length hMSH6 cDNA and genomic locus were isolated and characterized, and ...it was demonstrated that the hMSH6 gene consisted of 10 exons and mapped to chromosome 2p15-16. The hMSH3 cDNA was in some cases found to contain a 27-bp deletion resulting in a loss of nine amino acids, depending on the individual from which the cDNA was isolated. hMSH2, hMSH3, and hMSH6 all showed similar tissue-specific expression patterns. hMSH2 protein formed a complex with both hMSH3 and hMSH6 proteins, similar to protein complexes demonstrated by studies of the Saccharomyces cerevisiae MSH2, MSH3, and MSH6. hMSH2 was also found to form a homomultimer complex, but neither hMSH3 nor hMSH6 appear to interact with themselves or each other. Analysis of the mismatched nucleotide-binding specificity of the hMSH2-hMSH3 and hMSH2-hMSH6 protein complexes showed that they have overlapping but not identical binding specificity. These results help to explain the distribution of mutations in different mismatch-repair genes seen in hereditary nonpolyposis colon cancer.