Sequencing-based analyses of microbiomes have traditionally focused on addressing the question of community membership and profiling taxonomic abundance through amplicon sequencing of 16 rRNA genes. ...More recently, shotgun metagenomics, which involves the random sequencing of all genomic content of a microbiome, has dominated this arena due to advancements in sequencing technology throughput and capability to profile genes as well as microbiome membership. While these methods have revealed a great number of insights into a wide variety of microbiomes, both of these approaches only describe the presence of organisms or genes, and not whether they are active members of the microbiome. To obtain deeper insights into how a microbial community responds over time to their changing environmental conditions, microbiome scientists are beginning to employ large-scale metatranscriptomics approaches. Here, we present a comprehensive review on computational metatranscriptomics approaches to study microbial community transcriptomes. We review the major advancements in this burgeoning field, compare strengths and weaknesses to other microbiome analysis methods, list available tools and workflows, and describe use cases and limitations of this method. We envision that this field will continue to grow exponentially, as will the scope of projects (e.g. longitudinal studies of community transcriptional responses to perturbations over time) and the resulting data. This review will provide a list of options for computational analysis of these data and will highlight areas in need of development.
There is growing interest in reconstructing phylogenies from the copious amounts of genome sequencing projects that target related viral, bacterial or eukaryotic organisms. To facilitate the ...construction of standardized and robust phylogenies for disparate types of projects, we have developed a complete bioinformatic workflow, with a web-based component to perform phylogenetic and molecular evolutionary (PhaME) analysis from sequencing reads, draft assemblies or completed genomes of closely related organisms. Furthermore, the ability to incorporate raw data, including some metagenomic samples containing a target organism (e.g. from clinical samples with suspected infectious agents), shows promise for the rapid phylogenetic characterization of organisms within complex samples without the need for prior assembly.
In the last 5 years, the rapid pace of innovations and improvements in sequencing technologies has completely changed the landscape of metagenomic and metagenetic experiments. Therefore, it is ...critical to benchmark the various methodologies for interrogating the composition of microbial communities, so that we can assess their strengths and limitations. The most common phylogenetic marker for microbial community diversity studies is the 16S ribosomal RNA gene and in the last 10 years the field has moved from sequencing a small number of amplicons and samples to more complex studies where thousands of samples and multiple different gene regions are interrogated.
We assembled 2 synthetic communities with an even (EM) and uneven (UM) distribution of archaeal and bacterial strains and species, as metagenomic control material, to assess performance of different experimental strategies. The 2 synthetic communities were used in this study, to highlight the limitations and the advantages of the leading sequencing platforms: MiSeq (Illumina), The Pacific Biosciences RSII, 454 GS-FLX/+ (Roche), and IonTorrent (Life Technologies). We describe an extensive survey based on synthetic communities using 3 experimental designs (fusion primers, universal tailed tag, ligated adaptors) across the 9 hypervariable 16S rDNA regions. We demonstrate that library preparation methodology can affect data interpretation due to different error and chimera rates generated during the procedure. The observed community composition was always biased, to a degree that depended on the platform, sequenced region and primer choice. However, crucially, our analysis suggests that 16S rRNA sequencing is still quantitative, in that relative changes in abundance of taxa between samples can be recovered, despite these biases.
We have assessed a range of experimental conditions across several next generation sequencing platforms using the most up-to-date configurations. We propose that the choice of sequencing platform and experimental design needs to be taken into consideration in the early stage of a project by running a small trial consisting of several hypervariable regions to quantify the discriminatory power of each region. We also suggest that the use of a synthetic community as a positive control would be beneficial to identify the potential biases and procedural drawbacks that may lead to data misinterpretation. The results of this study will serve as a guideline for making decisions on which experimental condition and sequencing platform to consider to achieve the best microbial profiling.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Summary
Next‐generation sequencing has dramatically changed the landscape of microbial ecology, large‐scale and in‐depth diversity studies being now widely accessible. However, determining the ...accuracy of taxonomic and quantitative inferences and comparing results obtained with different approaches are complicated by incongruence of experimental and computational data types and also by lack of knowledge of the true ecological diversity. Here we used highly diverse bacterial and archaeal synthetic communities assembled from pure genomic DNAs to compare inferences from metagenomic and SSU rRNA amplicon sequencing. Both Illumina and 454 metagenomic data outperformed amplicon sequencing in quantifying the community composition, but the outcome was dependent on analysis parameters and platform. New approaches in processing and classifying amplicons can reconstruct the taxonomic composition of the community with high reproducibility within primer sets, but all tested primers sets lead to significant taxon‐specific biases. Controlled synthetic communities assembled to broadly mimic the phylogenetic richness in target environments can provide important validation for fine‐tuning experimental and computational parameters used to characterize natural communities.
Bacterial and fungal communities associated with plant roots are central to the host health, survival and growth. However, a robust understanding of the root-microbiome and the factors that drive ...host associated microbial community structure have remained elusive, especially in mature perennial plants from natural settings. Here, we investigated relationships of bacterial and fungal communities in the rhizosphere and root endosphere of the riparian tree species Populus deltoides, and the influence of soil parameters, environmental properties (host phenotype and aboveground environmental settings), host plant genotype (Simple Sequence Repeat (SSR) markers), season (Spring vs. Fall) and geographic setting (at scales from regional watersheds to local riparian zones) on microbial community structure. Each of the trees sampled displayed unique aspects to its associated community structure with high numbers of Operational Taxonomic Units (OTUs) specific to an individual trees (bacteria >90%, fungi >60%). Over the diverse conditions surveyed only a small number of OTUs were common to all samples within rhizosphere (35 bacterial and 4 fungal) and endosphere (1 bacterial and 1 fungal) microbiomes. As expected, Proteobacteria and Ascomycota were dominant in root communities (>50%) while other higher-level phylogenetic groups (Chytridiomycota, Acidobacteria) displayed greatly reduced abundance in endosphere compared to the rhizosphere. Variance partitioning partially explained differences in microbiome composition between all sampled roots on the basis of seasonal and soil properties (4% to 23%). While most variation remains unattributed, we observed significant differences in the microbiota between watersheds (Tennessee vs. North Carolina) and seasons (Spring vs. Fall). SSR markers clearly delineated two host populations associated with the samples taken in TN vs. NC, but overall host genotypic distances did not have a significant effect on corresponding communities that could be separated from other measured effects.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Despite unprecedented global sequencing and surveillance of SARS-CoV-2, timely identification of the emergence and spread of novel variants of concern (VoCs) remains a challenge. Several million raw ...genome sequencing runs are now publicly available. We sought to survey these datasets for intrahost variation to study emerging mutations of concern. We developed iSKIM (“intrahost SARS-CoV-2 k-mer identification method”) to relatively quickly and efficiently screen the many SARS-CoV-2 datasets to identify intrahost mutations belonging to lineages of concern. Certain mutations surged in frequency as intrahost minor variants just prior to, or while lineages of concern arose. The Spike N501Y change common to several VoCs was found as a minor variant in 834 samples as early as October 2020. This coincides with the timing of the first detected samples with this mutation in the Alpha/B.1.1.7 and Beta/B.1.351 lineages. Using iSKIM, we also found that Spike L452R was detected as an intrahost minor variant as early as September 2020, prior to the observed rise of the Epsilon/B.1.429/B.1.427 lineages in late 2020. iSKIM rapidly screens for mutations of interest in raw data, prior to genome assembly, and can be used to detect increases in intrahost variants, potentially providing an early indication of novel variant spread.
Genomic sequencing of clinical samples to identify emerging variants of SARS-CoV-2 has been a key public health tool for curbing the spread of the virus. As a result, an unprecedented number of ...SARS-CoV-2 genomes were sequenced during the COVID-19 pandemic, which allowed for rapid identification of genetic variants, enabling the timely design and testing of therapies and deployment of new vaccine formulations to combat the new variants. However, despite the technological advances of deep sequencing, the analysis of the raw sequence data generated globally is neither standardized nor consistent, leading to vastly disparate sequences that may impact identification of variants. Here, we show that for both Illumina and Oxford Nanopore sequencing platforms, downstream bioinformatic protocols used by industry, government, and academic groups resulted in different virus sequences from same sample. These bioinformatic workflows produced consensus genomes with differences in single nucleotide polymorphisms, inclusion and exclusion of insertions, and/or deletions, despite using the same raw sequence as input datasets. Here, we compared and characterized such discrepancies and propose a specific suite of parameters and protocols that should be adopted across the field. Consistent results from bioinformatic workflows are fundamental to SARS-CoV-2 and future pathogen surveillance efforts, including pandemic preparation, to allow for a data-driven and timely public health response.
Non-typhoidal Salmonella (NTS) is a major global health concern that often causes bloodstream infections in areas of the world affected by malnutrition and comorbidities such as HIV and malaria. ...Developing a strategy to control the emergence and spread of highly invasive and antimicrobial resistant NTS isolates requires a comprehensive analysis of epidemiological factors and molecular pathogenesis. Here, we characterize 11 NTS isolates that caused bloodstream infections in pediatric patients in Siaya, Kenya from 2003-2010. Nine isolates were identified as S. Typhimurium sequence type 313 while the other two were S. Enteritidis. Comprehensive genotypic and phenotypic analyses were performed to compare these isolates to those previously identified in sub-Saharan Africa. We identified a S. Typhimurium isolate referred to as UGA14 that displayed novel plasmid, pseudogene and resistance features as compared to other isolates reported from Africa. Notably, UGA14 is able to ferment both lactose and sucrose due to the acquisition of insertion elements on the pKST313 plasmid. These findings show for the first time the co-evolution of plasmid-mediated lactose and sucrose metabolism along with cephalosporin resistance in NTS further elucidating the evolutionary mechanisms of invasive NTS phenotypes. These results further support the use of combined genomic and phenotypic approaches to detect and characterize atypical NTS isolates in order to advance biosurveillance efforts that inform countermeasures aimed at controlling invasive and antimicrobial resistant NTS.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Despite over three decades of progress, extraction of high molecular weight (HMW) DNA from high clay soils or iron oxide cemented clay has remained challenging. HMW DNA is desirable for next ...generation sequencing as it yields the most comprehensive coverage. Several DNA extraction procedures were compared from samples that exhibit strong nucleic acid adsorption. pH manipulation or use of alternative ion solutions offered no improvement in nucleic acid recovery. Lysis by liquid N2 grinding in concentrated guanidine followed by concentrated sodium phosphate extraction supported HMW DNA recovery from clays high in iron oxides. DNA recovered using 1 M sodium phosphate buffer (PB) as a competitive desorptive wash was 15.22±2.33 µg DNA/g clay, with most DNA consisting of >20 Kb fragments, compared to 2.46±0.25 µg DNA/g clay with the Powerlyzer system (MoBio). Increasing PB concentration in the lysis reagent coincided with increasing DNA fragment length during initial extraction. Rarefaction plots of 16S rRNA (V1-V3 region) pyrosequencing from A-horizon and clay soils showed an ∼80% and ∼400% larger accessed diversity compared to the Powerlyzer soil DNA system, respectively. The observed diversity from the Firmicutes showed the strongest increase with >3-fold more operational taxonomic units (OTU) recovered.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK