Clustered regularly interspaced short palindromic repeats and CRISPR-associated protein (CRISPR-Cas) systems store the memory of past encounters with foreign DNA in unique spacers that are inserted ...between direct repeats in CRISPR arrays. For only a small fraction of the spacers, homologous sequences, called protospacers, are detectable in viral, plasmid, and microbial genomes. The rest of the spacers remain the CRISPR "dark matter." We performed a comprehensive analysis of the spacers from all CRISPR-
loci identified in bacterial and archaeal genomes, and we found that, depending on the CRISPR-Cas subtype and the prokaryotic phylum, protospacers were detectable for 1% to about 19% of the spacers (~7% global average). Among the detected protospacers, the majority, typically 80 to 90%, originated from viral genomes, including proviruses, and among the rest, the most common source was genes that are integrated into microbial chromosomes but are involved in plasmid conjugation or replication. Thus, almost all spacers with identifiable protospacers target mobile genetic elements (MGE). The GC content, as well as dinucleotide and tetranucleotide compositions, of microbial genomes, their spacer complements, and the cognate viral genomes showed a nearly perfect correlation and were almost identical. Given the near absence of self-targeting spacers, these findings are most compatible with the possibility that the spacers, including the dark matter, are derived almost completely from the species-specific microbial mobilomes.
The principal function of CRISPR-Cas systems is thought to be protection of bacteria and archaea against viruses and other parasitic genetic elements. The CRISPR defense function is mediated by sequences from parasitic elements, known as spacers, that are inserted into CRISPR arrays and then transcribed and employed as guides to identify and inactivate the cognate parasitic genomes. However, only a small fraction of the CRISPR spacers match any sequences in the current databases, and of these, only a minority correspond to known parasitic elements. We show that nearly all spacers with matches originate from viral or plasmid genomes that are either free or have been integrated into the host genome. We further demonstrate that spacers with no matches have the same properties as those of identifiable origins, strongly suggesting that all spacers originate from mobile elements.
CRISPR (clustered regularly interspaced short palindromic repeats)-Cas (CRISPR-associated) is an adaptive immunity system in bacteria and archaea that functions via a distinct self/non-self ...recognition mechanism that involves unique spacers homologous with viral or plasmid DNA and integrated into the CRISPR loci. Most of the Cas proteins evolve under relaxed purifying selection and some underwent dramatic structural rearrangements during evolution. In many cases, CRISPR-Cas system components are replaced either by homologous or by analogous proteins or domains in some bacterial and archaeal lineages. However, recent advances in comparative sequence analysis, structural studies and experimental data suggest that, despite this remarkable evolutionary plasticity, all CRISPR-Cas systems employ the same architectural and functional principles, and given the conservation of the principal building blocks, share a common ancestry. We review recent advances in the understanding of the evolution and organization of CRISPR-Cas systems. Among other developments, we describe for the first time a group of archaeal cas1 gene homologues that are not associated with CRISPR-Cas loci and are predicted to be involved in functions other than adaptive immunity.
The CRISPR-Cas are adaptive bacterial and archaeal immunity systems that have been harnessed for the development of powerful genome editing and engineering tools. In the incessant host-parasite arms ...race, viruses evolved multiple anti-defense mechanisms including diverse anti-CRISPR proteins (Acrs) that specifically inhibit CRISPR-Cas and therefore have enormous potential for application as modulators of genome editing tools. Most Acrs are small and highly variable proteins which makes their bioinformatic prediction a formidable task. We present a machine-learning approach for comprehensive Acr prediction. The model shows high predictive power when tested against an unseen test set and was employed to predict 2,500 candidate Acr families. Experimental validation of top candidates revealed two unknown Acrs (AcrIC9, IC10) and three other top candidates were coincidentally identified and found to possess anti-CRISPR activity. These results substantially expand the repertoire of predicted Acrs and provide a resource for experimental Acr discovery.
The waste and by-products of the soybean industry could be an economic source of nutrients to satisfy the high nutritional demands for the cultivation of lactic acid bacteria. The aims of this work ...were to maximize the biomass production of
Lacticaseibacillus paracasei
90 (L90) in three culture media formulated from an effluent derived from soy protein concentrate production and to assess the effects these media have on the enzymatic activity of L90, together with their influence on its fermentation profile in milk. The presence of essential minerals and fermentable carbohydrates (sucrose, raffinose, and stachyose) in the effluent was verified. L90 reached high levels of microbiological counts (∼ 9 log cfu mL
−1
) and dry weight (> 1 g L
−1
) on the three optimized media. Enzymatic activities (lactate dehydrogenase and β-galactosidase) of L90, and its metabolism of lactose and citric acid, as well as lactic acid and pyruvic acid production in milk, were modified depending on the growth media. The ability of the L90 to produce the key flavour compounds (diacetyl and acetoin) was maintained or improved by growing in the optimized media in comparison with MRS.
Metachromatic leukodystrophy (MLD) is an autosomal recessively inherited metabolic disease characterized by deficient activity of the lysosomal enzyme arylsulfatase A. Its deficiency results in ...accumulation of sulfatides in neural and visceral tissues, and causes demyelination of the central and peripheral nervous system. This leads to a broad range of neurological symptoms and eventually premature death. In asymptomatic patients with juvenile and adult MLD, treatment with allogeneic hematopoietic stem cell transplantation (HCT) provides a symptomatic and survival benefit. However, this treatment mainly impacts brain white matter, whereas the peripheral neuropathy shows no or only limited response. Data about the impact of peripheral neuropathy in MLD patients are currently lacking, although in our experience peripheral neuropathy causes significant morbidity due to neuropathic pain, foot deformities and neurogenic bladder disturbances. Besides, the reasons for residual and often progressive peripheral neuropathy after HCT are not fully understood. Preliminary studies suggest that peripheral neuropathy might respond better to gene therapy due to higher enzyme levels achieved than with HCT. However, histopathological and clinical findings also suggest a role of neuroinflammation in the pathology of peripheral neuropathy in MLD. In this literature review, we discuss clinical aspects, pathological findings, distribution of mutations, and treatment approaches in MLD with particular emphasis on peripheral neuropathy. We believe that future therapies need more emphasis on the management of peripheral neuropathy, and additional research is needed to optimize care strategies.
Multiple constraints variously affect different parts of the genomes of diverse life forms. The selective pressures that shape the evolution of viral, archaeal, bacterial and eukaryotic genomes ...differ markedly, even among relatively closely related animal and bacterial lineages; by contrast, constraints affecting protein evolution seem to be more universal. The constraints that shape the evolution of genomes and phenomes are complemented by the plasticity and robustness of genome architecture, expression and regulation. Taken together, these findings are starting to reveal complex networks of evolutionary processes that must be integrated to attain a new synthesis of evolutionary biology.
Phagocytosis, that is, engulfment of large particles by eukaryotic cells, is found in diverse organisms and is often thought to be central to the very origin of the eukaryotic cell, in particular, ...for the acquisition of bacterial endosymbionts including the ancestor of the mitochondrion.
Comparisons of the sets of proteins implicated in phagocytosis in different eukaryotes reveal extreme diversity, with very few highly conserved components that typically do not possess readily identifiable prokaryotic homologs. Nevertheless, phylogenetic analysis of those proteins for which such homologs do exist yields clues to the possible origin of phagocytosis. The central finding is that a subset of archaea encode actins that are not only monophyletic with eukaryotic actins but also share unique structural features with actin-related proteins (Arp) 2 and 3. All phagocytic processes are strictly dependent on remodeling of the actin cytoskeleton and the formation of branched filaments for which Arp2/3 are responsible. The presence of common structural features in Arp2/3 and the archaeal actins suggests that the common ancestors of the archaeal and eukaryotic actins were capable of forming branched filaments, like modern Arp2/3. The Rho family GTPases that are ubiquitous regulators of phagocytosis in eukaryotes appear to be of bacterial origin, so assuming that the host of the mitochondrial endosymbiont was an archaeon, the genes for these GTPases come via horizontal gene transfer from the endosymbiont or in an earlier event.
The present findings suggest a hypothetical scenario of eukaryogenesis under which the archaeal ancestor of eukaryotes had no cell wall (like modern Thermoplasma) but had an actin-based cytoskeleton including branched actin filaments that allowed this organism to produce actin-supported membrane protrusions. These protrusions would facilitate accidental, occasional engulfment of bacteria, one of which eventually became the mitochondrion. The acquisition of the endosymbiont triggered eukaryogenesis, in particular, the emergence of the endomembrane system that eventually led to the evolution of modern-type phagocytosis, independently in several eukaryotic lineages.
The CRISPR-Cas adaptive immunity systems that are present in most Archaea and many Bacteria function by incorporating fragments of alien genomes into specific genomic loci, transcribing the inserts ...and using the transcripts as guide RNAs to destroy the genome of the cognate virus or plasmid. This RNA interference-like immune response is mediated by numerous, diverse and rapidly evolving Cas (CRISPR-associated) proteins, several of which form the Cascade complex involved in the processing of CRISPR transcripts and cleavage of the target DNA. Comparative analysis of the Cas protein sequences and structures led to the classification of the CRISPR-Cas systems into three Types (I, II and III).
A detailed comparison of the available sequences and structures of Cas proteins revealed several unnoticed homologous relationships. The Repeat-Associated Mysterious Proteins (RAMPs) containing a distinct form of the RNA Recognition Motif (RRM) domain, which are major components of the CRISPR-Cas systems, were classified into three large groups, Cas5, Cas6 and Cas7. Each of these groups includes many previously uncharacterized proteins now shown to adopt the RAMP structure. Evidence is presented that large subunits contained in most of the CRISPR-Cas systems could be homologous to Cas10 proteins which contain a polymerase-like Palm domain and are predicted to be enzymatically active in Type III CRISPR-Cas systems but inactivated in Type I systems. These findings, the fact that the CRISPR polymerases, RAMPs and Cas2 all contain core RRM domains, and distinct gene arrangements in the three types of CRISPR-Cas systems together provide for a simple scenario for origin and evolution of the CRISPR-Cas machinery. Under this scenario, the CRISPR-Cas system originated in thermophilic Archaea and subsequently spread horizontally among prokaryotes.
Because of the extreme diversity of CRISPR-Cas systems, in-depth sequence and structure comparison continue to reveal unexpected homologous relationship among Cas proteins. Unification of Cas protein families previously considered unrelated provides for improvement in the classification of CRISPR-Cas systems and a reconstruction of their evolution.
This article was reviewed by Malcolm White (nominated by Purficacion Lopez-Garcia), Frank Eisenhaber and Igor Zhulin. For the full reviews, see the Reviewers' Comments section.
At the time of this writing, December 2021, potential emergence of vaccine escape variants of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a grave global concern. The interface ...between the receptor-binding domain (RBD) of SARS-CoV-2 spike (S) protein and the host receptor (ACE2) overlaps the binding site of principal neutralizing antibodies (NAb), limiting the repertoire of viable mutations. Nonetheless, variants with multiple RBD mutations have risen to dominance. Nonadditive, epistatic relationships among RBD mutations are apparent, and assessing the impact of such epistasis on the mutational landscape, particularly the risk of vaccine escape, is crucial. We employed protein structure modeling using Rosetta to compare the effects of all single mutants at the RBD-NAb and RBD-ACE2 interfaces for the wild type and Delta, Gamma, and Omicron variants. Overall, epistasis at the RBD interface appears to be limited, and the effects of most multiple mutations are additive. Epistasis at the Delta variant interface weakly stabilizes NAb interaction relative to ACE2 interaction, whereas in Gamma, epistasis more substantially destabilizes NAb interaction. Despite bearing many more RBD mutations, the epistatic landscape of Omicron closely resembles that of Gamma. Thus, although Omicron poses new risks not observed with Delta, structural constraints on the RBD appear to hamper continued evolution toward more complete vaccine escape. The modest ensemble of mutations relative to the wild type that are currently known to reduce vaccine efficacy is likely to contain the majority of all possible escape mutations for future variants, predicting the continued efficacy of the existing vaccines.
Emergence of vaccine escape variants of SARS-CoV-2 is arguably the most pressing problem during the COVID-19 pandemic as vaccines are distributed worldwide. We employed a computational approach to assess the risk of antibody escape resulting from mutations in the receptor-binding domain of the spike protein of the wild-type SARS-CoV-2 virus as well as the Delta, Gamma, and Omicron variants. The efficacy of the existing vaccines against Omicron could be substantially reduced relative to the wild type, and the potential for vaccine escape is of grave concern. Our results suggest that although Omicron poses new evolutionary risks not observed for Delta, structural constraints on the RBD make continued evolution toward more complete vaccine escape from either Delta or Omicron unlikely. The modest set of escape-enhancing mutations already identified for the wild type likely include the majority of all possible mutations with this effect.
Summary
Three classes of low‐G+C Gram‐positive bacteria (Firmicutes), Bacilli, Clostridia and Negativicutes, include numerous members that are capable of producing heat‐resistant endospores. ...Spore‐forming firmicutes include many environmentally important organisms, such as insect pathogens and cellulose‐degrading industrial strains, as well as human pathogens responsible for such diseases as anthrax, botulism, gas gangrene and tetanus. In the best‐studied model organism Bacillus subtilis, sporulation involves over 500 genes, many of which are conserved among other bacilli and clostridia. This work aimed to define the genomic requirements for sporulation through an analysis of the presence of sporulation genes in various firmicutes, including those with smaller genomes than B. subtilis. Cultivable spore‐formers were found to have genomes larger than 2300 kb and encompass over 2150 protein‐coding genes of which 60 are orthologues of genes that are apparently essential for sporulation in B. subtilis. Clostridial spore‐formers lack, among others, spoIIB, sda, spoVID and safA genes and have non‐orthologous displacements of spoIIQ and spoIVFA, suggesting substantial differences between bacilli and clostridia in the engulfment and spore coat formation steps. Many B. subtilis sporulation genes, particularly those encoding small acid‐soluble spore proteins and spore coat proteins, were found only in the family Bacillaceae, or even in a subset of Bacillus spp. Phylogenetic profiles of sporulation genes, compiled in this work, confirm the presence of a common sporulation gene core, but also illuminate the diversity of the sporulation processes within various lineages. These profiles should help further experimental studies of uncharacterized widespread sporulation genes, which would ultimately allow delineation of the minimal set(s) of sporulation‐specific genes in Bacilli and Clostridia.