Highlights • Two classes of CRISPR-Cas systems differ by the architectures of effector modules. • Effectors of Class 2 CRISPR-Cas systems are large, multidomain proteins. • Effector modules of Class ...2 CRISPR-Cas systems independently evolved from transposon genes. • Some Class 2 CRISPR-Cas effectors are also involved in pre-crRNA provessing. • Dedicated RNA-targeting Class 2 CRISPR-cas systems employ HEPN domains for target cleavage.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UL, UM, UPCLJ, UPUK, ZRSKP
Origins and evolution of CRISPR-Cas systems Koonin, Eugene V; Makarova, Kira S
Philosophical transactions - Royal Society. Biological sciences,
05/2019, Volume:
374, Issue:
1772
Journal Article
Peer reviewed
Open access
CRISPR-Cas, the bacterial and archaeal adaptive immunity systems, encompass a complex machinery that integrates fragments of foreign nucleic acids, mostly from mobile genetic elements (MGE), into ...CRISPR arrays embedded in microbial genomes. Transcripts of the inserted segments (spacers) are employed by CRISPR-Cas systems as guide (g)RNAs for recognition and inactivation of the cognate targets. The CRISPR-Cas systems consist of distinct adaptation and effector modules whose evolutionary trajectories appear to be at least partially independent. Comparative genome analysis reveals the origin of the adaptation module from casposons, a distinct type of transposons, which employ a homologue of Cas1 protein, the integrase responsible for the spacer incorporation into CRISPR arrays, as the transposase. The origin of the effector module(s) is far less clear. The CRISPR-Cas systems are partitioned into two classes, class 1 with multisubunit effectors, and class 2 in which the effector consists of a single, large protein. The class 2 effectors originate from nucleases encoded by different MGE, whereas the origin of the class 1 effector complexes remains murky. However, the recent discovery of a signalling pathway built into the type III systems of class 1 might offer a clue, suggesting that type III effector modules could have evolved from a signal transduction system involved in stress-induced programmed cell death. The subsequent evolution of the class 1 effector complexes through serial gene duplication and displacement, primarily of genes for proteins containing RNA recognition motif domains, can be hypothetically reconstructed. In addition to the multiple contributions of MGE to the evolution of CRISPR-Cas, the reverse flow of information is notable, namely, recruitment of minimalist variants of CRISPR-Cas systems by MGE for functions that remain to be elucidated. Here, we attempt a synthesis of the diverse threads that shed light on CRISPR-Cas origins and evolution. This article is part of a discussion meeting issue 'The ecology and evolution of prokaryotic CRISPR-Cas adaptive immune systems'.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK
Abstract
The Clusters of Orthologous Genes (COG) database, also referred to as the Clusters of Orthologous Groups of proteins, was created in 1997 and went through several rounds of updates, most ...recently, in 2014. The current update, available at https://www.ncbi.nlm.nih.gov/research/COG, substantially expands the scope of the database to include complete genomes of 1187 bacteria and 122 archaea, typically, with a single genome per genus. In addition, the current version of the COGs includes the following new features: (i) the recently deprecated NCBI’s gene index (gi) numbers for the encoded proteins are replaced with stable RefSeq or GenBank\ENA\DDBJ coding sequence (CDS) accession numbers; (ii) COG annotations are updated for >200 newly characterized protein families with corresponding references and PDB links, where available; (iii) lists of COGs grouped by pathways and functional systems are added; (iv) 266 new COGs for proteins involved in CRISPR-Cas immunity, sporulation in Firmicutes and photosynthesis in cyanobacteria are included; and (v) the database is made available as a web page, in addition to FTP. The current release includes 4877 COGs. Future plans include further expansion of the COG collection by adding archaeal COGs (arCOGs), splitting the COGs containing multiple paralogs, and continued refinement of COG annotations.
The principal biological function of bacterial and archaeal CRISPR systems is RNA-guided adaptive immunity against viruses and other mobile genetic elements (MGEs). These systems show remarkable ...evolutionary plasticity and functional versatility at multiple levels, including both the defense mechanisms that lead to direct, specific elimination of the target DNA or RNA and those that cause programmed cell death (PCD) or induction of dormancy. This flexibility is also evident in the recruitment of CRISPR systems for nondefense functions. Defective CRISPR systems or individual CRISPR components have been recruited by transposons for RNA-guided transposition, by plasmids for interplasmid competition, and by viruses for antidefense and interviral conflicts. Additionally, multiple highly derived CRISPR variants of yet unknown functions have been discovered. A major route of innovation in CRISPR evolution is the repurposing of diverged repeat variants encoded outside CRISPR arrays for various structural and regulatory functions. The evolutionary plasticity and functional versatility of CRISPR systems are striking manifestations of the ubiquitous interplay between defense and "normal" cellular functions.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Microbial CRISPR-Cas systems are divided into Class 1, with multisubunit effector complexes, and Class 2, with single protein effectors. Currently, only two Class 2 effectors, Cas9 and Cpf1, are ...known. We describe here three distinct Class 2 CRISPR-Cas systems. The effectors of two of the identified systems, C2c1 and C2c3, contain RuvC-like endonuclease domains distantly related to Cpf1. The third system, C2c2, contains an effector with two predicted HEPN RNase domains. Whereas production of mature CRISPR RNA (crRNA) by C2c1 depends on tracrRNA, C2c2 crRNA maturation is tracrRNA independent. We found that C2c1 systems can mediate DNA interference in a 5′-PAM-dependent fashion analogous to Cpf1. However, unlike Cpf1, which is a single-RNA-guided nuclease, C2c1 depends on both crRNA and tracrRNA for DNA cleavage. Finally, comparative analysis indicates that Class 2 CRISPR-Cas systems evolved on multiple occasions through recombination of Class 1 adaptation modules with effector proteins acquired from distinct mobile elements.
Display omitted
•A computational pipeline was developed to discover unknown CRISPR-Cas systems•Three distinct CRISPR-Cas subtypes were identified in microbial genomes•Two of the discovered CRISPR-Cas variants were experimentally characterized•The discovered CRISPR-Cas systems show unique properties
Class 2 CRISPR-Cas systems are a source of powerful genome editing tools. We describe the discovery of three distinct subtypes of such systems by genome mining and experimental validation for two of them. The properties of the discovered CRISPR-Cas variants substantially differ from those of previously characterized Cas9 and Cpf1.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
The CRISPR-Cas systems of archaeal and bacterial adaptive immunity are classified into three types that differ by the repertoires of CRISPR-associated (cas) genes, the organization of cas operons and ...the structure of repeats in the CRISPR arrays. The simplest among the CRISPR-Cas systems is type II in which the endonuclease activities required for the interference with foreign deoxyribonucleic acid (DNA) are concentrated in a single multidomain protein, Cas9, and are guided by a co-processed dual-tracrRNA:crRNA molecule. This compact enzymatic machinery and readily programmable site-specific DNA targeting make type II systems top candidates for a new generation of powerful tools for genomic engineering. Here we report an updated census of CRISPR-Cas systems in bacterial and archaeal genomes. Type II systems are the rarest, missing in archaea, and represented in ∼ 5% of bacterial genomes, with an over-representation among pathogens and commensals. Phylogenomic analysis suggests that at least three cas genes, cas1, cas2 and cas4, and the CRISPR repeats of the type II-B system were acquired via recombination with a type I CRISPR-Cas locus. Distant homologs of Cas9 were identified among proteins encoded by diverse transposons, suggesting that type II CRISPR-Cas evolved via recombination of mobile nuclease genes with type I loci.
Bacterial class 2 CRISPR-Cas systems utilize a single RNA-guided protein effector to mitigate viral infection. We aggregated genomic data from multiple sources and constructed an expanded database of ...predicted class 2 CRISPR-Cas systems. A search for novel RNA-targeting systems identified subtype VI-D, encoding dual HEPN domain-containing Cas13d effectors and putative WYL-domain-containing accessory proteins (WYL1 and WYL-b1 through WYL-b5). The median size of Cas13d proteins is 190 to 300 aa smaller than that of Cas13a–Cas13c. Despite their small size, Cas13d orthologs from Eubacterium siraeum (Es) and Ruminococcus sp. (Rsp) are active in both CRISPR RNA processing and targeting, as well as collateral RNA cleavage, with no target-flanking sequence requirements. The RspWYL1 protein stimulates RNA cleavage by both EsCas13d and RspCas13d, demonstrating a common regulatory mechanism for divergent Cas13d orthologs. The small size, minimal targeting constraints, and modular regulation of Cas13d effectors further expands the CRISPR toolkit for RNA manipulation and detection.
Display omitted
•Type VI-D is a CRISPR-Cas system with a Cas13d effector and a WYL domain accessory•Cas13d is an RNA-guided RNase approximately 20% smaller than Cas13a–Cas13c effectors•WYL1 positively modulates Cas13d target and collateral RNase activity•Cas13d has minimal sequence and secondary structure requirements for targeting
Compiling an expanded database of predicted class 2 CRISPR-Cas systems, Yan et al. identify and characterize subtype VI-D. Cas13d is an RNA-guided RNase effector with polyphyletic WYL-domain accessory proteins. One WYL1 ortholog enhances activity of divergent Cas13d orthologs. The small effector size and modular enhancement further expand RNA modification capabilities.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
The microbial adaptive immune system CRISPR mediates defense against foreign genetic elements through two classes of RNA-guided nuclease effectors. Class 1 effectors utilize multi-protein complexes, ...whereas class 2 effectors rely on single-component effector proteins such as the well-characterized Cas9. Here, we report characterization of Cpf1, a putative class 2 CRISPR effector. We demonstrate that Cpf1 mediates robust DNA interference with features distinct from Cas9. Cpf1 is a single RNA-guided endonuclease lacking tracrRNA, and it utilizes a T-rich protospacer-adjacent motif. Moreover, Cpf1 cleaves DNA via a staggered DNA double-stranded break. Out of 16 Cpf1-family proteins, we identified two candidate enzymes from Acidaminococcus and Lachnospiraceae, with efficient genome-editing activity in human cells. Identifying this mechanism of interference broadens our understanding of CRISPR-Cas systems and advances their genome editing applications.
Display omitted
•CRISPR-Cpf1 is a class 2 CRISPR system•Cpf1 is a CRISPR-associated two-component RNA-programmable DNA nuclease•Targeted DNA is cleaved as a 5-nt staggered cut distal to a 5′ T-rich PAM•Two Cpf1 orthologs exhibit robust nuclease activity in human cells
Cpf1 is a RNA-guided DNA nuclease that provides immunity in bacteria and can be adapted for genome editing in mammalian cells.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
Cpf1 is an RNA-guided endonuclease of a type V CRISPR-Cas system that has been recently harnessed for genome editing. Here, we report the crystal structure of Acidaminococcus sp. Cpf1 (AsCpf1) in ...complex with the guide RNA and its target DNA at 2.8 Å resolution. AsCpf1 adopts a bilobed architecture, with the RNA-DNA heteroduplex bound inside the central channel. The structural comparison of AsCpf1 with Cas9, a type II CRISPR-Cas nuclease, reveals both striking similarity and major differences, thereby explaining their distinct functionalities. AsCpf1 contains the RuvC domain and a putative novel nuclease domain, which are responsible for cleaving the non-target and target strands, respectively, and for jointly generating staggered DNA double-strand breaks. AsCpf1 recognizes the 5′-TTTN-3′ protospacer adjacent motif by base and shape readout mechanisms. Our findings provide mechanistic insights into RNA-guided DNA cleavage by Cpf1 and establish a framework for rational engineering of the CRISPR-Cpf1 toolbox.
Display omitted
•Crystal structure of Acidaminococcus sp. Cpf1 in complex with crRNA and target DNA•Mechanistic insights into Cpf1-induced, staggered DNA double-strand breaks•Recognition of the 5′-TTTN-3′ PAM via base and shape readout mechanisms•Striking similarity and major differences between the structures of Cpf1 and Cas9
The structure of Cpf1, a type V CRISPR-Cas effector nuclease, in complex with crRNA and its target DNA provides mechanistic insights into RNA-guided DNA cleavage by Cpf1 and establishes a framework for rational engineering of the CRISPR-Cpf1 toolbox.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
The type-V CRISPR effector Cas12b (formerly known as C2c1) has been challenging to develop for genome editing in human cells, at least in part due to the high temperature requirement of the ...characterized family members. Here we explore the diversity of the Cas12b family and identify a promising candidate for human gene editing from Bacillus hisashii, BhCas12b. However, at 37 °C, wild-type BhCas12b preferentially nicks the non-target DNA strand instead of forming a double strand break, leading to lower editing efficiency. Using a combination of approaches, we identify gain-of-function mutations for BhCas12b that overcome this limitation. Mutant BhCas12b facilitates robust genome editing in human cell lines and ex vivo in primary human T cells, and exhibits greater specificity compared to S. pyogenes Cas9. This work establishes a third RNA-guided nuclease platform, in addition to Cas9 and Cpf1/Cas12a, for genome editing in human cells.