The bacterial cellulosome is an extracellular, multi-enzyme machinery, which efficiently depolymerizes plant biomass by degrading plant cell wall polysaccharides. Several cellulolytic bacteria have ...evolved various elaborate modular architectures of active cellulosomes. We present here a genome-wide analysis of a dozen mesophilic clostridia species, including both well-studied and yet-undescribed cellulosome-producing bacteria. We first report here, the presence of cellulosomal elements, thus expanding our knowledge regarding the prevalence of the cellulosomal paradigm in nature. We explored the genomic organization of key cellulosome components by comparing the cellulosomal gene clusters in each bacterial species, and the conserved sequence features of the specific cellulosomal modules (cohesins and dockerins), on the background of their phylogenetic relationship. Additionally, we performed comparative analyses of the species-specific repertoire of carbohydrate-degrading enzymes for each of the clostridial species, and classified each cellulosomal enzyme into a specific CAZy family, thus indicating their putative enzymatic activity (e.g., cellulases, hemicellulases, and pectinases). Our work provides, for this large group of bacteria, a broad overview of the blueprints of their multi-component cellulosomal complexes. The high similarity of their scaffoldin clusters and dockerin-based recognition residues suggests a common ancestor, and/or extensive horizontal gene transfer, and potential cross-species recognition. In addition, the sporadic spatial organization of the numerous dockerin-containing genes in several of the genomes, suggests the importance of the cellulosome paradigm in the given bacterial species. The information gained in this work may be utilized directly or developed further by genetically engineering and optimizing designer cellulosome systems for enhanced biotechnological biomass deconstruction and biofuel production.
Non-cellulosomal processive endoglucanase 9I (Cel9I) from Clostridium thermocellum is a modular protein, consisting of a family-9 glycoside hydrolase (GH9) catalytic module and two family-3 ...carbohydrate-binding modules (CBM3c and CBM3b), separated by linker regions. GH9 does not show cellulase activity when expressed without CBM3c and CBM3b and the presence of the CBM3c was previously shown to be essential for endoglucanase activity. Physical reassociation of independently expressed GH9 and CBM3c modules (containing linker sequences) restored 60-70% of the intact Cel9I endocellulase activity. However, the mechanism responsible for recovery of activity remained unclear. In this work we independently expressed recombinant GH9 and CBM3c with and without their interconnecting linker in Escherichia coli. We crystallized and determined the molecular structure of the GH9/linker-CBM3c heterodimer at a resolution of 1.68 Å to understand the functional and structural importance of the mutual spatial orientation of the modules and the role of the interconnecting linker during their re-association. Enzyme activity assays and isothermal titration calorimetry were performed to study and compare the effect of the linker on the re-association. The results indicated that reassembly of the modules could also occur without the linker, albeit with only very low recovery of endoglucanase activity. We propose that the linker regions in the GH9/CBM3c endoglucanases are important for spatial organization and fixation of the modules into functional enzymes.
Family 3 carbohydrate-binding modules (CBM3s) are among the most distinctive, diverse, and robust. CBM3s, which are numerous components of both free cellulases and cellulosomes, bind tightly to ...crystalline cellulose, and thus play a key role in cellulose degradation through their substrate targeting capacity. In addition to the accepted cellulose binding surface of the CBM3 molecule, a second type of conserved face (the "shallow groove") is retained on the opposite side of the molecule in all CBM3 subfamilies, irrespective of the loss or modification of the cellulose-binding function. The exact function of this highly conserved shallow groove is currently unknown. The cellulosomal system contains many linker segments that interconnect the various modules in long polypeptides chains. These linkers are varied in length (5-700 residues). The long linkers are commonly composed of repeated sequences that are often rich in Ser, Pro, and Thr residues. The exact function of the linker segments in the cellulosomal system is currently unknown, although they likely play several roles. In this chapter, we document the binding interaction between the conserved shallow-groove region of the CBM3s with selected cellulosomal linker segments, which may thus induce conformational changes in the quaternary structure of the cellulosome. These conformational changes would presumably promote changes in the overall arrangement of the cellulosomal enzymes, which would in turn serve to enhance cellulosome efficiency and degradation of recalcitrant polysaccharide substrates. Here, we describe two different methods for determining the interactions between a model CBM3 and cellulosomal linker peptides.
The innate binding specificity of different carbohydrate-binding modules (CBMs) offers a versatile approach for mapping the chemistry and structure of surfaces that contain complex carbohydrates. We ...have employed the distinct recognition properties of a double His-tagged recombinant CBM tagged with semiconductor quantum dots for direct imaging of crystalline cellulose at the molecular level of resolution, using transmission and scanning transmission electron microscopy. In addition, three different types of CBMs from families 3, 6, and 20 that exhibit different carbohydrate specificities were each fused with either green fluorescent protein (GFP) or red fluorescent protein (RFP) and employed for double-labeling fluorescence microscopy studies of primary cell walls and various mixtures of complex carbohydrate target molecules. CBM probes can be used for characterizing both native complex carbohydrates and engineered biomaterials.
Ruminococcus flavefaciens is one of the predominant fiber-degrading bacteria found in the rumen of herbivores. Bioinformatic analysis of the recently sequenced genome indicated that this bacterium ...produces one of the most intricate cellulosome systems known to date. A distinct ORF, encoding for a multi-modular protein, RflaF_05439, was discovered during mining of the genome sequence. It is composed of two tandem modules of currently undefined function that share 45% identity and a C-terminal X-dockerin modular dyad. Gaining insight into the diversity, architecture and organization of different types of proteins in the cellulosome system is essential for broadening our understanding of a multi-enzyme complex, considered to be one of the most efficient systems for plant cell wall polysaccharide degradation in nature.
Following bioinformatic analysis, the second tandem module of RflaF_05439 was cloned and its selenium-labeled derivative was expressed and crystallized. The crystals belong to space group P21 with unit-cell parameters of a = 65.81, b = 60.61, c = 66.13 Å, β = 107.66° and contain two protein molecules in the asymmetric unit. The crystal structure was determined at 1.38-Å resolution by X-ray diffraction using the single-wavelength anomalous dispersion (SAD) method and was refined to Rfactor and Rfree of 0.127 and 0.152 respectively. The protein molecule mainly comprises a β-sheet flanked by short α-helixes, and a globular α-helical domain. The structure was found to be structurally similar to members of the NlpC/P60 superfamily of cysteine peptidases.
The 3D structure of the second repeat of the RflaF_05439 enabled us to propose a role for the currently undefined function of this protein. Its putative function as a cysteine peptidase is inferred from in silico structural homology studies. It is therefore apparent that cellulosomes integrate proteins with other functions in addition to the classic well-defined carbohydrate active enzymes.
Summary
Many of the Firmicutes bacteria responsible for plant polysaccharide degradation in Nature produce a multiprotein complex called a cellulosome, which co‐ordinates glycoside hydrolase ...assembly, bacterial adhesion to substrate and polysaccharide hydrolysis. Cellulosomal proteins possess a dockerin module, which mediates their attachment to the scaffoldin protein via its interaction with cohesin modules, and only glycoside hydrolases and other carbohydrate active enzymes were known to reside within the cellulosome. We show here with Clostridium thermocellum ATCC 27405 that members of the serpin superfamily of serine proteinase inhibitors, which are best recognized for their conformational flexibility and co‐ordination of key regulatory functions in multicellular eukaryotes, also reside within the cellulosome. These studies are the first to expand the cellulosome paradigm of protein complex assembly beyond glycoside hydrolase and carbohydrate active enzymes, and to include a newly identified functionality in the Firmicutes.
The cellulosome complex is composed of a conglomerate of subunits, each of which comprises a set of interacting functional modules. Scaffoldin (Sca), a major cellulosomal subunit, is responsible for ...organizing the cellulolytic subunits into the complex. This is accomplished by the interaction of two complementary classes of modules—a cohesin (Coh) module on the Sca subunit and a dockerin module on each of the enzymatic subunits. Although individual Coh modules from different cellulosomal scaffoldins have been subjected to intensive structural investigation, the Sca subunit in its entirety has not, and there remains a paucity of information on the arrangement and interactions of Cohs within the Sca subunit. In the present work, we describe the crystal structure of a type II Coh dyad from the ScaB “adaptor” Sca of Acetivibrio cellulolyticus. The ScaB Cohs are oriented in an “antiparallel” manner relative to one another, with their dockerin-interacting surfaces (β-strands 8–3–6–5) facing the same direction—aligned on the same plane. A set of extensive hydrophobic and hydrogen-bond contacts between the Cohs and the short interconnecting linker segment between them stabilizes the modular orientation. This Coh dyad structure provides novel information about Coh–Coh association and arrangement in the Sca and further insight into intermodular linker interactions. Putative structural arrangements of a hexamodular complex, composed of the Coh dyad bound to two X-dockerin modules, were suggested.
The bovine rumen maintains a diverse microbial community that serves to break down indigestible plant substrates. However, those bacteria specifically adapted to degrade cellulose, the major ...structural component of plant biomass, represent a fraction of the rumen microbiome. Previously, we proposed scaC as a candidate for phylotyping Ruminococcus flavefaciens, one of three major cellulolytic bacterial species isolated from the rumen. In the present report we examine the dynamics and diversity of scaC-types both within and between cattle temporally, following a dietary switch from corn-silage to grass-legume hay. These results were placed in the context of the overall bacterial population dynamics measured using the 16S rRNA.
As many as 117 scaC-types were estimated, although just nineteen were detected in each of three rumens tested, and these collectively accounted for the majority of all types present. Variation in scaC populations was observed between cattle, between planktonic and fiber-associated fractions and temporally over the six-week survey, and appeared related to scaC phylogeny. However, by the sixth week no significant separation of scaC populations was seen between animals, suggesting enrichment of a constrained set of scaC-types. Comparing the amino-acid translation of each scaC-type revealed sequence variation within part of the predicted dockerin module but strong conservation in the N-terminus, where the cohesin module is located.
The R. flavefaciens species comprises a multiplicity of scaC-types in-vivo. Enrichment of particular scaC-types temporally, following a dietary switch, and between fractions along with the phylogenetic congruence suggests that functional differences exist between types. Observed differences in dockerin modules suggest at least part of the functional heterogeneity may be conferred by scaC. The polymorphic nature of scaC enables the relative distribution of R. flavefaciens strains to be examined and represents a gene-centric approach to investigating the intraspecific adaptation of an important specialist population.