Highlights • Seed sequences are present in small RNA-guided gene regulatory and immune systems. • The seed enables a rapid target search by RNA-guided complexes. • Proteins such as Argonaute, ...Cascade, and Cas9 are essential for seed functioning. • Seed sequences contribute to the evolution of complex gene regulatory networks.
Prokaryotes encode adaptive immune systems, called CRISPR-Cas (clustered regularly interspaced short palindromic repeats-CRISPR associated), to provide resistance against mobile invaders, such as ...viruses and plasmids. Host immunity is based on incorporation of invader DNA sequences in a memory locus (CRISPR), the formation of guide RNAs from this locus, and the degradation of cognate invader DNA (protospacer). Invaders can escape type I-E CRISPR-Cas immunity in Escherichia coli K12 by making point mutations in the seed region of the protospacer or its adjacent motif (PAM), but hosts quickly restore immunity by integrating new spacers in a positive-feedback process termed "priming." Here, by using a randomized protospacer and PAM library and high-throughput plasmid loss assays, we provide a systematic analysis of the constraints of both direct interference and subsequent priming in E. coli. We have defined a high-resolution genetic map of direct interference by Cascade and Cas3, which includes five positions of the protospacer at 6-nt intervals that readily tolerate mutations. Importantly, we show that priming is an extremely robust process capable of using degenerate target regions, with up to 13 mutations throughout the PAM and protospacer region. Priming is influenced by the number of mismatches, their position, and is nucleotide dependent. Our findings imply that even outdated spacers containing many mismatches can induce a rapid primed CRISPR response against diversified or related invaders, giving microbes an advantage in the coevolutionary arms race with their invaders.
Small RNA-guided protein complexes play an essential role in CRISPR-mediated immunity in prokaryotes. While these complexes initiate interference by flagging cognate invader DNA for destruction, ...recent evidence has implicated their involvement in new CRISPR memory formation, called priming, against mutated invader sequences. The mechanism by which the target recognition complex mediates these disparate responses—interference and priming—remains poorly understood. Using single-molecule FRET, we visualize how bona fide and mutated targets are differentially probed by E. coli Cascade. We observe that the recognition of bona fide targets is an ordered process that is tightly controlled for high fidelity. Mutated targets are recognized with low fidelity, which is featured by short-lived and PAM- and seed-independent binding by any segment of the crRNA. These dual roles of Cascade in immunity with distinct fidelities underpin CRISPR-Cas robustness, allowing for efficient degradation of bona fide targets and priming of mutated DNA targets.
Display omitted
•Cascade exhibits two distinct binding modes that guide interference and priming•Target recognition for interference initiates from the PAM and seed sequence•Stepwise target recognition promotes high fidelity in the interference pathway•Priming is promoted by low-fidelity binding that is independent from PAM and seed
It has remained unknown how the CRISPR effector complex recognizes a mutated viral genome. Using single-molecule fluorescence, Blosser et al. show that E. coli Cascade uses a distinct DNA binding mode to flag mutated targets for updating CRISPR memory.
Prokaryotes use a mechanism called priming to update their CRISPR immunological memory to rapidly counter revisiting, mutated viruses, and plasmids. Here we have determined how new spacers are ...produced and selected for integration into the CRISPR array during priming. We show that Cas3 couples CRISPR interference to adaptation by producing DNA breakdown products that fuel the spacer integration process in a two-step, PAM-associated manner. The helicase-nuclease Cas3 pre-processes target DNA into fragments of about 30–100 nt enriched for thymine-stretches in their 3′ ends. The Cas1-2 complex further processes these fragments and integrates them sequence-specifically into CRISPR repeats by coupling of a 3′ cytosine of the fragment. Our results highlight that the selection of PAM-compliant spacers during priming is enhanced by the combined sequence specificities of Cas3 and the Cas1-2 complex, leading to an increased propensity of integrating functional CTT-containing spacers.
Display omitted
•CRISPR interference and adaptation are coupled processes•Moderate direct interference rates stimulate primed spacer acquisition•Cas1-2 recycle target DNA degradation fragments to form new spacers•The cleavage specificity of Cas3 contributes to functional PAM selection
Künne et al. demonstrate that CRISPR systems cleverly couple target interference to CRISPR memory update. The Cas3 nuclease fragments invader DNA into pieces of near-spacer length enriched for PAM sequences on 3′ ends to form ideal spacer precursors.
Type I CRISPR-Cas systems are the most abundant adaptive immune systems in bacteria and archaea
. Target interference relies on a multi-subunit, RNA-guided complex called Cascade
, which recruits a ...trans-acting helicase-nuclease, Cas3, for target degradation
. Type I systems have rarely been used for eukaryotic genome engineering applications owing to the relative difficulty of heterologous expression of the multicomponent Cascade complex. Here, we fuse Cascade to the dimerization-dependent, non-specific FokI nuclease domain
and achieve RNA-guided gene editing in multiple human cell lines with high specificity and efficiencies of up to ~50%. FokI-Cascade can be reconstituted via an optimized two-component expression system encoding the CRISPR-associated (Cas) proteins on a single polycistronic vector and the guide RNA (gRNA) on a separate plasmid. Expression of the full Cascade-Cas3 complex in human cells resulted in targeted deletions of up to ~200 kb in length. Our work demonstrates that highly abundant, previously untapped type I CRISPR-Cas systems can be harnessed for genome engineering applications in eukaryotic cells.
The prokaryotic CRISPR/Cas immune system is based on genomic loci that contain incorporated sequence tags from viruses and plasmids. Using small guide RNA molecules, these sequences act as a memory ...to reject returning invaders. Both the Cascade ribonucleoprotein complex and the Cas3 nuclease/helicase are required for CRISPR interference in Escherichia coli, but it is unknown how natural target DNA molecules are recognized and neutralized by their combined action. Here we show that Cascade efficiently locates target sequences in negatively supercoiled DNA, but only if these are flanked by a protospacer-adjacent motif (PAM). PAM recognition by Cascade exclusively involves the crRNA-complementary DNA strand. After Cascade-mediated R loop formation, the Cse1 subunit recruits Cas3, which catalyzes nicking of target DNA through its HD-nuclease domain. The target is then progressively unwound and cleaved by the joint ATP-dependent helicase activity and Mg2+-dependent HD-nuclease activity of Cas3, leading to complete target DNA degradation and invader neutralization.
► Cascade requires negative supercoiling energy for binding dsDNA targets ► PAM recognition takes place in the DNA strand that base pairs with the crRNA ► After specific binding of target DNA, Cascade recruits Cas3 via its Cse1 subunit ► Cas3 degrades target DNA by its joint helicase, endonuclease, and exonuclease activities
Abstract
Prokaryotes use primed CRISPR adaptation to update their memory bank of spacers against invading genetic elements that have escaped CRISPR interference through mutations in their protospacer ...target site. We previously observed a trend that nucleotide-dependent mismatches between crRNA and the protospacer strongly influence the efficiency of primed CRISPR adaptation. Here we show that guanine-substitutions in the target strand of the protospacer are highly detrimental to CRISPR interference and interference-dependent priming, while cytosine-substitutions are more readily tolerated. Furthermore, we show that this effect is based on strongly decreased binding affinity of the effector complex Cascade for guanine-mismatched targets, while cytosine-mismatched targets only minimally affect target DNA binding. Structural modeling of Cascade-bound targets with mismatches shows that steric clashes of mismatched guanines lead to unfavorable conformations of the RNA-DNA duplex. This effect has strong implications for the natural selection of target site mutations that lead to effective escape from type I CRISPR-Cas systems.
Clustered regularly interspaced short palindromic repeats and CRISPR-associated proteins (CRISPR-Cas) has revolutionized genome editing and has great potential for many applications, such as ...correcting human genetic disorders. To increase the safety of genome editing applications, CRISPR-Cas may benefit from strict control over Cas enzyme activity. Previously, anti-CRISPR proteins and designed oligonucleotides have been proposed to modulate CRISPR-Cas activity. In this study, we report on the potential of guide-complementary DNA oligonucleotides as controlled inhibitors of Cas9 ribonucleoprotein complexes. First, we show that DNA oligonucleotides inhibit Cas9 activity in human cells, reducing both on- and off-target cleavage. We then used
in vitro
assays to better understand how inhibition is achieved and under which conditions. Two factors were found to be important for robust inhibition: the length of the complementary region and the presence of a protospacer adjacent motif-loop on the inhibitor. We conclude that DNA oligonucleotides can be used to effectively inhibit Cas9 activity both
ex vivo
and
in vitro
.
The Electrophoretic Mobility Shift Assay is a straightforward and inexpensive method for the determination and quantification of protein-nucleic acid interactions. It relies on the different mobility ...of free and protein-bound nucleic acid in a gel matrix during electrophoresis. Nucleic acid affinities of crRNA-Cas complexes can be quantified by calculating the dissociation constant (Kd). Here, we describe how two types of EMSA assays are performed using the Cascade ribonucleoprotein complex from Escherichia coli as an example.
Host-pathogen interactions are among the most prevalent and evolutionary important interactions known today. The predation of prokaryotes by their viruses is happening on an especially large scale ...and had a major influence on the evolutionary history of prokaryotes. Since most viruses are lytic at some point in their life-cycle, there is a high selection pressure for prokaryotes to develop defense mechanisms. As described in Chapter 1, the CRISPR-Cas system is a relatively recently discovered defense system and is also the first adaptive defense system discovered in prokaryotes. CRISPR-Cas systems are widespread, occurring in the majority of archaea and also a considerable fraction of bacteria. This diversity is also reflected in the diversity of different types of CRISPR-Cas systems, currently being divided into 6 major types with a large number of subtypes. The type I-E system of Escherichia coli is a well-studied model system and of high relevance, since it is a major subtype of type I systems which make up around 50 % of all discovered CRISPR-Cas systems. CRISPR-Cas systems basically comprise the CRISPR array, made up of repeats and foreign derived spacers, and a set of cas genes. Immunity is commonly divided into three functional stages, adaptation, expression and interference. Adaptation is the acquisition of new spacers from the foreign nucleic acid and its incorporation into the CRISPR array. During expression, the CRISPR array is transcribed, processed and assembled with Cas proteins into CRISPR RNA (crRNA) guided ribonucleoprotein complexes (crRNP). Interference is the detection, binding and destruction of foreign nucleic acids by the crRNP and in type I systems the Cas3 nuclease. The type I-E system contains another function, called primed adaptation. Primed adaptation is a more rapid and efficient version of regular (naïve) adaptation. In addition to the adaptation machinery, primed adaptation also requires the interference machinery.Chapter 2 describes and compares a fundamental feature of most, if not all, CRISPR-Cas systems and also many other small RNA based systems. The mode of action of small RNAs relies on protein-assisted base pairing of the guide RNA with target mRNA or DNA to interfere with their transcription, translation or replication. Several unrelated classes of small non-coding RNAs have been identified including eukaryotic RNA silencing associated small RNAs, prokaryotic small regulatory RNAs and prokaryotic CRISPR (clustered regularly interspaced short palindromic repeats) RNAs. All three groups identify their target sequence by base pairing after finding it in a pool of millions of other nucleotide sequences in the cell. In this complicated target search process, a region of 6 to 12 nucleotides of the small RNA termed the ‘seed’ plays a critical role. The seed is often a structurally pre-ordered region that increases accessibility and lowers the energy barrier of RNA-DNA duplex formation. Furthermore, the length of the seed is optimally chosen to allow rapid probing and also rejection of potential target sites. The seed is a perfect example of parallel evolution, showing that nature comes up with the same strategy independently multiple times.Chapter 3 provides a description and protocol of the Electrophoretic Mobility Shift Assay (EMSA) and its use for studying crRNPs. EMSA is a straightforward and inexpensive method for the determination and quantification of protein–nucleic acid interactions. It relies on the different mobility of free and protein-bound nucleic acid in a gel matrix during electrophoresis. Nucleic acid affinities of crRNPs can be quantified by calculating the dissociation constant (Kd). Protocols for two types of EMSA assays are described using the Cascade ribonucleoprotein complex from Escherichia coli as an example. One protocol uses plasmid DNA as substrate, while the other uses short linear oligonucleotides. Plasmids can be easily visualized with traditional DNA staining, while oligos have to be radioactively labelled using the 32Phosphate isotope. The EMSA method and these protocols are applied throughout the other chapters of this thesis.Chapter 4 focusses on the processes of interference and primed adaptation, specifically on their tolerance of mutations. Invaders can escape Type I-E CRISPR-Cas immunity in E. coli by making point mutations in the protospacer (especially in the seed) or its adjacent motif (PAM), but hosts quickly restore immunity by integrating new spacers in a positive feedback process termed priming. Here, we provide a systematic analysis of the constraints of both direct interference and subsequent priming in E. coli. We have defined a high-resolution genetic map of direct interference by Cascade and Cas3, which includes five positions of the protospacer at 6 nt intervals that readily tolerate mutations. Importantly, we show that priming is an extremely robust process capable of utilizing degenerate target regions with up to at least eleven mutations throughout the PAM and protospacer region. Priming is influenced by the number of mismatches, their position and is nucleotide dependent. Our findings imply that even out-dated spacers containing many mismatches can induce a rapid primed CRISPR response against diversified or related invaders, giving microbes an advantage in the co- evolutionary arms race with their invaders.In Chapter 5 we elucidate the mechanism of priming. Specifically, we determine how new spacers are produced and selected for integration into the CRISPR array during priming. We show that priming is directly dependent on interference. Rapid priming occurs when the rate of interference is high, delayed priming occurs when the rate of interference is low. Using in vitro assays and next generation sequencing, we show that Cas3 couples CRISPR interference to adaptation by producing DNA breakdown products that fuel the spacer integration process in a two-step, PAM-associated manner. The helicase-nuclease Cas3 pre-processes target DNA into fragments of about 30–100 nt enriched for thymine-stretches in their 3’ ends. By reconstituting the spacer integration process in vitro, we show that the Cas1-2 complex further processes these fragments and integrates them sequence- specifically into CRISPR repeats by coupling of a 3’ cytosine of the fragment. Our results highlight that the selection of PAM-compliant spacers during priming is enhanced by the combined sequence specificities of Cas3 and the Cas1-2 complex, leading to an increased propensity of integrating functional CTT-containing spacers.In Chapter 6 we look deeper into a nucleotide specific effect on priming that was discovered in Chapter 4. Immunity is based on the complementarity of host encoded spacer sequences with protospacers on the foreign genetic element. The efficiency of both direct interference and primed acquisition depends on the degree of complementarity between spacer and protospacer. Previous studies focused on the amount and positions of mutations, not the identity of the substituted nucleotide. In Chapter 4, we describe a nucleotide bias, showing a positive effect on priming of C substitutions and a negative effect on priming of G substitutions in the basepairing strand of the protospacer. Here we show that these substitutions rather directly influence the efficiency of interference and therefore indirectly influence the efficiency of interference dependent priming. We show that G substitutions have a profoundly negative effect on interference, while C substitutions are readily tolerated when in the same positions. Furthermore, we show that this effect is based on strongly decreased binding of the effector complex Cascade to G mutants, while C mutants only minimally affect binding. In Chapter 5 we showed a connection between the rate of interference and the time of occurrence of priming. Here, we also quantify the extent of priming and show that priming is very prevalent in a population that shows intermediate levels of interference, while high or low levels of interference lead to a lower prevalence of priming.Chapter 7 describes an attempt to make use of our knowledge about the Cascade complex and develop it into a genome editing tool. The development of genome editing tools has made major leaps in the last decade. Recently, RNA guided endonucleases (RGENs) such as Cas9 or Cpf1 have revolutionized genome editing. These RGENs are the hallmark proteins of class II CRISPR-Cas systems. Here, we have explored the possibility to develop a new genome editing tool that makes use of the Cascade complex from E. coli. This RNA guided protein complex is fused to a FokI nuclease domain to sequence specifically cleave DNA. We validate the tool in vitro using purified protein and two sets of guide RNAs, showing specific cleavage activity. The tool requires two target sites of 32 nt each at a distance of 30-40 nt and inward facing three nucleotide flexible PAM sequences. Cleavage occurs in the middle between the two binding sites and primarily creates 4 nt overhangs. Furthermore, we show that an additional RFP can be fused to FokI-Cascade, allowing visualization of the complex in target cells. Unfortunately, we were not able to successfully apply the tool in vivo in eukaryotic cells.