Unlike proteins, glycan chains are not directly encoded by DNA, but by the specificity of the enzymes that assemble them. Theoretical calculations have proposed an astronomical number of possible isomers (> 10
hexasaccharides) but the actual diversity of glycan structures in nature is not known. Bacteria of the Bacteroidetes phylum are considered primary degraders of polysaccharides and they are found in all ecosystems investigated. In Bacteroidetes genomes, carbohydrate-degrading enzymes (CAZymes) are arranged in gene clusters termed polysaccharide utilization loci (PULs). The depolymerization of a given complex glycan by Bacteroidetes PULs requires bespoke enzymes; conversely, the enzyme composition in PULs can provide information on the structure of the targeted glycans. Here we group the 13,537 PULs encoded by 964 Bacteroidetes genomes according to their CAZyme composition. We find that collectively Bacteroidetes have elaborated a few thousand enzyme combinations for glycan breakdown, suggesting a global estimate of diversity of glycan structures much smaller than the theoretical one.
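The grouping step described above can be illustrated with a minimal sketch. It assumes that two PULs fall in the same group when they contain exactly the same CAZyme families with the same copy numbers, regardless of gene order; the abstract does not state the actual grouping criterion. The function name `group_puls_by_composition`, the PUL identifiers and the family lists are hypothetical.

```python
from collections import defaultdict


def group_puls_by_composition(puls):
    """Group PULs that share the same CAZyme family composition.

    `puls` maps a PUL identifier to the list of CAZyme family names it
    encodes. Assumption: composition is compared as an order-independent
    multiset (same families, same copy numbers).
    """
    groups = defaultdict(list)
    for pul_id, families in puls.items():
        # Sorting makes the key independent of gene order in the locus.
        key = tuple(sorted(families))
        groups[key].append(pul_id)
    return dict(groups)


# Hypothetical input: three PULs, two of which share a composition.
example_puls = {
    "PUL_a": ["GH13", "GH97", "GH31"],
    "PUL_b": ["GH31", "GH13", "GH97"],
    "PUL_c": ["GH16", "GH3"],
}

groups = group_puls_by_composition(example_puls)
for composition, members in groups.items():
    print(composition, members)
```

Under this assumption, the number of keys in `groups` is the number of distinct enzyme combinations, which is the quantity the abstract uses as a proxy for glycan structural diversity.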
Abstract
Thirty years have elapsed since the emergence of the classification of carbohydrate-active enzymes in sequence-based families that became the CAZy database over 20 years ago, freely available for browsing and download at www.cazy.org. In the era of large-scale sequencing and high-throughput biology, it is important to examine the position of this specialist database that is deeply rooted in human curation. The three primary tasks of the CAZy curators are (i) to maintain and update the family classification of this class of enzymes, (ii) to classify sequences newly released by GenBank and the Protein Data Bank and (iii) to capture and present functional information for each family. The CAZy website is updated once a month. Here we briefly summarize the increase in novel families and the annotations conducted during the last 8 years. We present several important changes that facilitate taxonomic navigation and allow the entirety of the annotations to be downloaded. Most importantly, we highlight the considerable amount of work that accompanies the analysis and reporting of biochemical data from the literature.
Abstract
The Polysaccharide Utilization Loci (PUL) database was launched in 2015 to present PUL predictions in ∼70 Bacteroidetes species isolated from the human gastrointestinal tract, as well as PULs derived from the experimental data reported in the literature. In 2018, PULDB offers access to 820 genomes, sampled from various environments and covering a much wider taxonomical range. A Krona dynamic chart was set up to facilitate browsing through taxonomy. Literature surveys now allow the presentation of the most recent (i) PUL repertoires deduced from RNAseq large-scale experiments, (ii) PULs that have been subjected to in-depth biochemical analysis and (iii) new Carbohydrate-Active enzyme (CAZyme) families that contributed to the refinement of PUL predictions. To improve PUL visualization and genome browsing, the previous annotation of genes encoding CAZymes, regulators, integrases and SusCD has now been expanded to include functionally relevant protein families whose genes are significantly found in the vicinity of PULs: sulfatases, proteases, ROK repressors, epimerases and ATP-Binding Cassette and Major Facilitator Superfamily transporters. To cope with cases where susCD may be absent due to incomplete assemblies/split PULs, we present 'CAZyme cluster' predictions. Finally, a PUL alignment tool, operating on the tagged families instead of amino-acid sequences, was integrated to retrieve PULs similar to a query of interest. The updated PULDB website is accessible at www.cazy.org/PULDB_new/
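The idea of aligning PULs on tagged families rather than on amino-acid sequences can be sketched as follows. This is not the PULDB implementation, which the abstract does not describe; it is a plain Needleman-Wunsch global alignment score in which each gene is reduced to its family tag. The function name `align_score`, the scoring values and the example loci are assumptions chosen for illustration.

```python
def align_score(query, target, match=1, mismatch=-1, gap=-1):
    """Global alignment score between two PULs given as lists of tags.

    Each element is a family tag (for example "susC" or "GH13"), so two
    genes match when they carry the same tag, whatever their sequences.
    The scoring scheme is an illustrative assumption.
    """
    rows, cols = len(query) + 1, len(target) + 1
    score = [[0] * cols for _ in range(rows)]
    # Aligning a prefix against nothing costs one gap per gene.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap
    for i in range(1, rows):
        for j in range(1, cols):
            same = query[i - 1] == target[j - 1]
            diagonal = score[i - 1][j - 1] + (match if same else mismatch)
            score[i][j] = max(
                diagonal,                 # align the two genes
                score[i - 1][j] + gap,    # gene present only in query
                score[i][j - 1] + gap,    # gene present only in target
            )
    return score[-1][-1]


# Hypothetical loci: the candidate lacks the GH13 gene of the query.
query_pul = ["susC", "susD", "GH13", "GH97"]
candidate_pul = ["susC", "susD", "GH97"]
print(align_score(query_pul, candidate_pul))
```

Ranking every PUL in a collection by such a score against a query locus would return the loci with the most similar family organisation, which is the kind of retrieval the abstract attributes to the tool.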
The roots of Arabidopsis thaliana host diverse fungal communities that affect plant health and disease states. Here, we sequence the genomes of 41 fungal isolates representative of the A. thaliana root mycobiota for comparative analysis with 79 other plant-associated fungi. Our analyses indicate that root mycobiota members evolved from ancestors with diverse lifestyles and retain large repertoires of plant cell wall-degrading enzymes (PCWDEs) and effector-like small secreted proteins. We identify a set of 84 gene families associated with endophytism, including genes encoding PCWDEs acting on xylan (family GH10) and cellulose (family AA9). Transcripts encoding these enzymes are also part of a conserved transcriptional program activated by phylogenetically distant mycobiota members upon host contact. Recolonization experiments with individual fungi indicate that strains with detrimental effects in mono-association with the host colonize roots more aggressively than those with beneficial activities, and dominate in natural root samples. Furthermore, we show that the pectin-degrading enzyme family PL1_7 links aggressiveness of endophytic colonization to plant health.
Glomeromycotina is a lineage of early diverging fungi that establish arbuscular mycorrhizal (AM) symbiosis with land plants. Despite their major ecological role, the genetic basis of their obligate mutualism remains largely unknown, hindering our understanding of their evolution and biology.
We compared the genomes of Glomerales (Rhizophagus irregularis, Rhizophagus diaphanus, Rhizophagus cerebriforme) and Diversisporales (Gigaspora rosea) species, together with those of saprotrophic Mucoromycota, to identify gene families and processes associated with these lineages and to understand the molecular underpinning of their symbiotic lifestyle.
Genomic features in Glomeromycotina appear to be very similar, with a very high content of transposons and protein-coding genes, extensive duplications of protein kinase genes, and loss of genes coding for lignocellulose degradation, thiamin biosynthesis and cytosolic fatty acid synthase. Most symbiosis-related genes in R. irregularis and G. rosea are specific to Glomeromycotina. We also confirmed that the present species have a homokaryotic genome organisation.
The high interspecific diversity of Glomeromycotina gene repertoires, affecting all known protein domains, as well as symbiosis-related orphan genes, may explain the known adaptation of Glomeromycotina to a wide range of environmental settings. Our findings contribute to an increasingly detailed portrait of genomic features defining the biology of AM fungi.
Aspergillus section Nigri comprises filamentous fungi relevant to biomedicine, bioenergy, health, and biotechnology. To learn more about what genetically sets these species apart, as well as about potential applications in biotechnology and biomedicine, we sequenced 23 genomes de novo, forming a full genome compendium for the section (26 species), as well as 6 Aspergillus niger isolates. This allowed us to quantify both inter- and intraspecies genomic variation. We further predicted 17,903 carbohydrate-active enzymes and 2,717 secondary metabolite gene clusters, which we condensed into 455 distinct families corresponding to compound classes, 49% of which are found in only a single species. We performed metabolomics and genetic engineering to correlate genotypes to phenotypes, as demonstrated for the metabolite aurasperone, and by heterologous transfer of citrate production to Aspergillus nidulans. Experimental and computational analyses showed that both secondary metabolism and regulation are key factors in the delineation of Aspergillus species.
Abstract
As actors in the global carbon cycle, Agaricomycetes (Basidiomycota) have developed complex enzymatic machineries that allow them to decompose all plant polymers, including lignin. Among them, saprotrophic Agaricales are characterized by an unparalleled diversity of habitats and lifestyles. Comparative analysis of 52 Agaricomycetes genomes (14 of them sequenced de novo) reveals that Agaricales possess a large diversity of hydrolytic and oxidative enzymes for lignocellulose decay. Based on the gene families with the predicted highest evolutionary rates (namely cellulose-binding CBM1, glycoside hydrolase GH43, lytic polysaccharide monooxygenase AA9, class-II peroxidases, glucose–methanol–choline oxidase/dehydrogenases, laccases, and unspecific peroxygenases), we reconstructed the lifestyles of the ancestors that led to the extant lignocellulose-decomposing Agaricomycetes. The changes in the enzymatic toolkit of ancestral Agaricales are correlated with the evolution of their ability to grow not only on wood but also on leaf litter and decayed wood, with grass-litter decomposers as the most recent eco-physiological group. In this context, the above families were analyzed in detail in connection with lifestyle diversity. Peroxidases appear as a central component of the enzymatic toolkit of saprotrophic Agaricomycetes, consistent with their essential role in lignin degradation and high evolutionary rates. This includes not only expansions/losses in peroxidase genes common to other basidiomycetes but also the widespread presence in Agaricales (and Russulales) of new peroxidase types not found in wood-rotting Polyporales and other Agaricomycetes orders. Therefore, we analyzed peroxidase evolution in Agaricomycetes by ancestral-sequence reconstruction, revealing several major evolutionary pathways, and mapped the appearance of the different enzyme types in a time-calibrated species tree.
β-Mannans are a heterogeneous group of polysaccharides with a common main chain of β-1,4-linked mannopyranoside residues. The cleavage of β-mannan chains is catalyzed by glycoside hydrolases called β-mannanases. In the CAZy database, β-mannanases are grouped by sequence similarity in families GH5, GH26, GH113 and GH134. Family GH113 has been under-explored so far with six enzymes characterized, all from the Firmicutes phylum. We undertook the functional characterization of 14 enzymes from a selection of 31 covering the diversity of the family GH113. Our observations suggest that GH113 is a family with specificity towards mannans, with variations in the product profiles and modes of action. We were able to assign mannanase and mannosidase activities to four out of the five clades of the family, increasing by 200% the number of characterized GH113 members, and expanding the toolbox for fine-tuning of mannooligosaccharides.
Ecological niche breadth and the mechanisms facilitating its evolution are fundamental to understanding adaptation to changing environments, persistence of generalist and specialist lineages and the formation of new species. Woody substrates are structurally complex resources utilized by organisms with specialized decay machinery. Wood-decaying fungi represent ideal model systems to study evolution of niche breadth, as they vary greatly in their host range and preferred decay stage of the substrate. In order to dissect the genetic basis for niche specialization in the invasive brown rot fungus Serpula lacrymans, we used phenotyping and integrative analysis of phylogenomic and transcriptomic data to compare this species to wild relatives in the Serpulaceae with a range of specialist to generalist decay strategies. Our results indicate specialist species have rewired regulatory networks active during wood decay towards decreased reliance on enzymatic machinery, and therefore nitrogen-intensive decay components. This shift was likely accompanied by adaptation to a narrow tree line habitat and a switch to a pioneer decomposer strategy, both requiring rapid colonization of a nitrogen-limited substrate. Among substrate specialists with narrow niches, we also found evidence for pathways facilitating reversal to generalism, highlighting how evolution may move along different axes of niche space.
Summary
Ectomycorrhizal fungi play a key role in forests by establishing mutualistic symbioses with woody plants. Genome analyses have identified conserved symbiosis‐related traits among ectomycorrhizal fungal species, but the molecular mechanisms underlying host specificity remain poorly known.
We sequenced and compared the genomes of seven species of milk‐cap fungi (Lactarius, Russulales) with contrasting host specificity. We also compared these genomes with those of symbiotic and saprotrophic Russulales species, aiming to identify genes involved in their ecology and host specificity.
The size of Lactarius genomes is significantly larger than that of other Russulales species, owing to a massive accumulation of transposable elements and duplication of dispensable genes. As expected, their repertoire of genes coding for plant cell wall‐degrading enzymes is restricted, but they retained a substantial set of genes involved in microbial cell wall degradation. Notably, Lactarius species showed a striking expansion of genes encoding proteases, such as secreted ectomycorrhiza‐induced sedolisins. A high copy number of genes coding for small secreted LysM proteins and Lactarius‐specific lectins was detected, which may be linked to host specificity.
This study revealed a large diversity in the genome landscapes and gene repertoires within Russulaceae. The known host specificity of Lactarius symbionts may be related to mycorrhiza‐induced species‐specific genes, including secreted sedolisins.