With the delivery of millions of sequence reads in a single experiment, next-generation sequencing (NGS) is currently revolutionizing surveys of microorganism diversity. In particular, when applied ...to Eukaryotes, we are still lacking a rigorous comparison of morphological and NGS-based diversity estimates. In this report, we studied the diversity and the seasonal community turnover of alveolates (Ciliophora and Dinophyceae) in an oligotrophic freshwater lake by SSU amplicon sequencing with NGS as well as by classical morphological analysis. We complemented the morphological analysis by single-cell PCR followed by Sanger sequencing to provide an unambiguous link to the NGS data. We show that NGS and morphological analyses generally capture frequency shifts of abundant taxa over our seasonal samples. The observed incongruencies are probably largely due to rDNA copy number variation among taxa and heterogeneity in the efficiency of cell lysis. Overall, NGS-based amplicon sequencing was superior in detecting rare species. We propose that in the absence of other nuclear markers less susceptible to copy number variation, rDNA-based diversity studies need to be adjusted for confounding effects of copy number variation.
Phototrophic eukaryotes have evolved mainly by the primary or secondary uptake of photosynthetic organisms. A return to heterotrophy occurred multiple times in various protistan groups such as ...Chrysophyceae, despite the expected advantage of autotrophy. It is assumed that the evolutionary shift to mixotrophy and further to heterotrophy is triggered by a differential importance of nutrient and carbon limitation. We sequenced the genomes of 16 chrysophyte strains and compared them in terms of size, function, and sequence characteristics in relation to photo-, mixo- and heterotrophic nutrition. All strains were sequenced with Illumina and partly with PacBio. Heterotrophic taxa have reduced genomes and a higher GC content of up to 59% as compared to phototrophic taxa. Heterotrophs have a large pan genome, but a small core genome, indicating a differential specialization of the distinct lineages. The pan genome of mixotrophs and heterotrophs taken together but not the pan genome of the mixotrophs alone covers the complete functionality of the phototrophic strains indicating a random reduction of genes. The observed ploidy ranges from di- to tetraploidy and was found to be independent of taxonomy or trophic mode. Our results substantiate an evolution driven by nutrient and carbon limitation.
Protists are perhaps the most lineage-rich of microbial lifeforms, but remain largely unknown. High-throughput sequencing technologies provide opportunities to screen whole habitats in depth and ...enable detailed comparisons of different habitats to measure, compare and map protistan diversity. Such comparisons are often limited by low sample numbers within single studies and a lack of standardisation between studies. Here, we analysed 232 samples from 10 sampling campaigns using a standardised PCR protocol and bioinformatics pipeline. We show that protistan community patterns are highly consistent within habitat types and geographic regions, provided that sample processing is standardised. Community profiles are only weakly affected by fluctuations of the abundances of the most abundant taxa and, therefore, provide a sound basis for habitat comparison beyond random short-term fluctuations in the community composition. Further, we provide evidence that distribution patterns are not solely resulting from random processes. Distinct habitat types and distinct taxonomic groups are dominated by taxa with distinct distribution patterns that reflect their ecology with respect to dispersal and habitat colonisation. However, there is no systematic shift of the distribution pattern with taxon abundance.
Sequencing of marker genes amplified from environmental samples, known as amplicon sequencing, allows us to resolve some of the hidden diversity and elucidate evolutionary relationships and ...ecological processes among complex microbial communities. The analysis of large numbers of samples at high sequencing depths generated by high throughput sequencing technologies requires efficient, flexible, and reproducible bioinformatics pipelines. Only a few existing workflows can be run in a user-friendly, scalable, and reproducible manner on different computing devices using an efficient workflow management system.
We present Natrix, an open-source bioinformatics workflow for preprocessing raw amplicon sequencing data. The workflow contains all analysis steps from quality assessment, read assembly, dereplication, chimera detection, split-sample merging, sequence representative assignment (OTUs or ASVs) to the taxonomic assignment of sequence representatives. The workflow is written using Snakemake, a workflow management engine for developing data analysis workflows. In addition, Conda is used for version control. Thus, Snakemake ensures reproducibility and Conda offers version control of the utilized programs. The encapsulation of rules and their dependencies support hassle-free sharing of rules between workflows and easy adaptation and extension of existing workflows. Natrix is freely available on GitHub ( https://github.com/MW55/Natrix ) or as a Docker container on DockerHub ( https://hub.docker.com/r/mw55/natrix ).
Natrix is a user-friendly and highly extensible workflow for processing Illumina amplicon data.
Here, we analyzed patterns of taxon richness and endemism of freshwater protists in Europe. Even though the significance of physicochemical parameters but also of geographic constraints for protist ...distribution is documented, it remains unclear where regional areas of high protist diversity are located and whether areas of high taxon richness harbor a high proportion of endemics. Further, patterns may be universal for protists or deviate between taxonomic groups. Based on amplicon sequencing campaigns targeting the SSU and ITS region of the rDNA we address these patterns at two different levels of phylogenetic resolution. Our analyses demonstrate that protists have restricted geographical distribution areas. For many taxonomic groups the regions of high taxon richness deviate from those having a high proportion of putative endemics. In particular, the diversity of high mountain lakes as azonal habitats deviated from surrounding lowlands, i.e. many taxa were found exclusively in high mountain lakes and several putatively endemic taxa occurred in mountain regions like the Alps, the Pyrenees or the Massif Central. Beyond that, taxonomic groups showed a pronounced accumulation of putative endemics in distinct regions, e.g. Dinophyceae along the Baltic Sea coastline, and Chrysophyceae in Scandinavia. Many other groups did not have pronounced areas of increased endemism but geographically restricted taxa were found across Europe.
With the advent of molecular methods, it became clear that microbial biodiversity had been vastly underestimated. Since then, species abundance patterns were determined for several environments, but ...temporal changes in species composition were not studied to the same level of resolution. Using massively parallel sequencing on the 454 GS FLX platform we identified a highly dynamic turnover of the seasonal abundance of protists in the Austrian lake Fuschlsee. We show that seasonal abundance patterns of protists closely match their biogeographic distribution. The stable predominance of few highly abundant taxa, which previously led to the suggestion of a low global protist species richness, is contrasted by a highly dynamic turnover of rare species. We suggest that differential seasonality of rare and abundant protist taxa explains the—so far—conflicting evidence in the ‘everything is everywhere’ dispute. Consequently temporal sampling is basic for adequate diversity and species richness estimates.
Display omitted
•Quantifying antimicrobial resistance using standardized samples from European lakes.•Multi-omics approach including 16 S rRNA amplicon sequencing and metagenome analysis.•Reference ...to monitor temporal development of AMR in Europe and other countries.•Resistance levels emphasizing AMR as a significant challenge for future healthcare.
The surveillance of wastewater for the Covid-19 virus during this unprecedented pandemic and mapped to the distribution and magnitude of the infected in the population near real-time exemplifies the importance of tracking rapidly changing trends of pathogens or public health problems at a large scale. The rising trends of antimicrobial resistance (AMR) with multidrug-resistant pathogens from the environmental water have similarly gained much attention in recent years. Wastewater-based epidemiology from water samples has shown that a wide range of AMR-related genes is frequently detected. Albeit sewage is treated before release and thus, the abundance of pathogens should be significantly reduced or even pathogen-free, several studies indicated the contrary. Pathogens are still measurable in the released water, ultimately entering freshwaters, such as rivers and lakes. Furthermore, socio-economic and environmental factors, such as chemical industries and animal farming nearby, impact the presence of AMR. Many bacterial species from the environment are intrinsically resistant and also contribute to the resistome of freshwater lakes. This study collected the most extensive standardized freshwater data set from hundreds of European lakes and conducted a comprehensive multi-omics analysis on antimicrobial resistance from these freshwater lakes. Our research shows that genes encoding for AMR against tetracyclines, cephalosporins, and quinolones were commonly identified, while for some, such as sulfonamides, resistance was less frequently present. We provide an estimation of the characteristic resistance of AMR in European lakes, which can be used as a comprehensive resistome dataset to facilitate and monitor temporal changes in the development of AMR in European freshwater lakes.
Shifts in the nutritional mode between phototrophy, mixotrophy and heterotrophy are a widespread phenomenon in the evolution of eukaryotic diversity. The transition between nutritional modes is ...particularly pronounced in chrysophytes and occurred independently several times through parallel evolution. Thus, chrysophytes provide a unique opportunity for studying the molecular basis of nutritional diversification and of the accompanying pathway reduction and degradation of plastid structures. In order to analyze the succession in switching the nutritional mode from mixotrophy to heterotrophy, we compared the transcriptome of the mixotrophic Poterioochromonas malhamensis with the transcriptomes of three obligate heterotrophic species of Ochromonadales. We used the transcriptome of P. malhamensis as a reference for plastid reduction in the heterotrophic taxa. The analyzed heterotrophic taxa were in different stages of plastid reduction. We investigated the reduction of several photosynthesis related pathways e.g. the xanthophyll cycle, the mevalonate pathway, the shikimate pathway and the tryptophan biosynthesis as well as the reduction of plastid structures and postulate a presumable succession of pathway reduction and degradation of accompanying structures.
Chrysophyceae are a diverse group of planktonic protists widely distributed in freshwater. They encompass a variety of orders, whereby heterotrophy has evolved independently in several phylogenetic ...lineages. Therefore, closely related taxa evolved that developed different feeding strategies (photo-, mixo-, heterotrophy). In high-throughput sequencing studies, the Chrysophyceae were usually addressed as a group rather than split in individual phylogenetic orders. Also because of the close relationship of the distinct nutritional strategies, no functional statements were made about nutritional patterns. Based on an extensive phylogenetic tree and phylogenetic placement, we link OTU diversity of Chrysophyceae from 218 freshwater lakes with phylogenetic affiliations. This provides information on the relative importance of lineages affiliated with different nutrition modes. Our study demonstrates that Chrysophyceae are one of the most common groups in freshwaters. We found Chrysophyceae in 213 out of 218 sample sites across Europe and in several sites they belong to the most commonly retrieved taxa. Ochromonadales and a Chrysosacca-Apoikiida clade (including Apoikiida, Chrysosaccales,
Chrysastrella
) are the most widespread Chrysophyceae groups and show a high degree of OTU diversity. Most detected and assignable OTUs were affiliated with mixotrophic Chrysophyceae. Niche width differs only slightly between members of different clades and between the different trophic modes. We found several OTUs within the Ochromonadales, Synurales, and Chrysosacca-Apoikiida clade, that show a wide distribution and large tolerance ranges concerning ecophysiological factors.
Microbes are essentail components of all ecosystems because they drive many biochemical processes and act as primary producers. In freshwater ecosystems, the biodiversity in and the composition of ...microbial communities can be used as indicators for environmental quality. Recently, some environmental features have been identified that influence microbial ecosystems. However, the impact of human action on lake microbiomes is not well understood. This is, in part, due to the fact that environmental data is, albeit theoretically accessible, not easily available.
In this work, we present SEDE-GPS, a tool that gathers data that are relevant to the environment of an user-provided GPS coordinate. To this end, it accesses a list of public and corporate databases and aggregates the information in a single file, which can be used for further analysis. To showcase the use of SEDE-GPS, we enriched a lake microbial ecology sequencing dataset with around 18,000 socio-economic, climate, and geographic features. The sources of SEDE-GPS are public databases such as Eurostat, the Climate Data Center, and OpenStreetMap, as well as corporate sources such as Twitter. Using machine learning and feature selection methods, we were able to identify features in the data provided by SEDE-GPS that can be used to predict lake microbiome alpha diversity.
The results presented in this study show that SEDE-GPS is a handy and easy-to-use tool for comprehensive data enrichment for studies of ecology and other processes that are affected by environmental features. Furthermore, we present lists of environmental, socio-economic, and climate features that are predictive for microbial biodiversity in lake ecosystems. These lists indicate that human action has a major impact on lake microbiomes. SEDE-GPS and its source code is available for download at http://SEDE-GPS.heiderlab.de.