There is growing interest in reconstructing phylogenies from the copious amounts of genome sequencing projects that target related viral, bacterial or eukaryotic organisms. To facilitate the ...construction of standardized and robust phylogenies for disparate types of projects, we have developed a complete bioinformatic workflow, with a web-based component to perform phylogenetic and molecular evolutionary (PhaME) analysis from sequencing reads, draft assemblies or completed genomes of closely related organisms. Furthermore, the ability to incorporate raw data, including some metagenomic samples containing a target organism (e.g. from clinical samples with suspected infectious agents), shows promise for the rapid phylogenetic characterization of organisms within complex samples without the need for prior assembly.
Sequencing-based analyses of microbiomes have traditionally focused on addressing the question of community membership and profiling taxonomic abundance through amplicon sequencing of 16 rRNA genes. ...More recently, shotgun metagenomics, which involves the random sequencing of all genomic content of a microbiome, has dominated this arena due to advancements in sequencing technology throughput and capability to profile genes as well as microbiome membership. While these methods have revealed a great number of insights into a wide variety of microbiomes, both of these approaches only describe the presence of organisms or genes, and not whether they are active members of the microbiome. To obtain deeper insights into how a microbial community responds over time to their changing environmental conditions, microbiome scientists are beginning to employ large-scale metatranscriptomics approaches. Here, we present a comprehensive review on computational metatranscriptomics approaches to study microbial community transcriptomes. We review the major advancements in this burgeoning field, compare strengths and weaknesses to other microbiome analysis methods, list available tools and workflows, and describe use cases and limitations of this method. We envision that this field will continue to grow exponentially, as will the scope of projects (e.g. longitudinal studies of community transcriptional responses to perturbations over time) and the resulting data. This review will provide a list of options for computational analysis of these data and will highlight areas in need of development.
The human gastrointestinal tract contains a complex community of microbes, fulfilling important health-promoting functions. However, this vast complexity of species hampers the assignment of ...responsible organisms to these functions. Recently, Akkermansia muciniphila, a new species from the deeply branched phylum Verrucomicrobia, was isolated from the human intestinal tract based on its capacity to efficiently use mucus as a carbon and nitrogen source. This anaerobic resident is associated with the protective mucus lining of the intestines.
In order to uncover the functional potential of A. muciniphila, its genome was sequenced and annotated. It was found to contain numerous candidate mucinase-encoding genes, but lacking genes encoding canonical mucus-binding domains. Numerous phage-associated sequences found throughout the genome indicate that viruses have played an important part in the evolution of this species. Furthermore, we mined 37 GI tract metagenomes for the presence, and genetic diversity of Akkermansia sequences. Out of 37, eleven contained 16S ribosomal RNA gene sequences that are >95% identical to that of A. muciniphila. In addition, these libraries were found to contain large amounts of Akkermansia DNA based on average nucleotide identity scores, which indicated in one subject co-colonization by different Akkermansia phylotypes. An additional 12 libraries also contained Akkermansia sequences, making a total of ∼16 Mbp of new Akkermansia pangenomic DNA. The relative abundance of Akkermansia DNA varied between <0.01% to nearly 4% of the assembled metagenomic reads. Finally, by testing a large collection of full length 16S sequences, we find at least eight different representative species in the genus Akkermansia.
These large repositories allow us to further mine for genetic heterogeneity and species diversity in the genus Akkermansia, providing novel insight towards the functionality of this abundant inhabitant of the human intestinal tract.
Next generation sequencing (NGS) technologies that parallelize the sequencing process and produce thousands to millions, or even hundreds of millions of sequences in a single sequencing run, have ...revolutionized genomic and genetic research. Because of the vagaries of any platform's sequencing chemistry, the experimental processing, machine failure, and so on, the quality of sequencing reads is never perfect, and often declines as the read is extended. These errors invariably affect downstream analysis/application and should therefore be identified early on to mitigate any unforeseen effects.
Here we present a novel FastQ Quality Control Software (FaQCs) that can rapidly process large volumes of data, and which improves upon previous solutions to monitor the quality and remove poor quality data from sequencing runs. Both the speed of processing and the memory footprint of storing all required information have been optimized via algorithmic and parallel processing solutions. The trimmed output compared side-by-side with the original data is part of the automated PDF output. We show how this tool can help data analysis by providing a few examples, including an increased percentage of reads recruited to references, improved single nucleotide polymorphism identification as well as de novo sequence assembly metrics.
FaQCs combines several features of currently available applications into a single, user-friendly process, and includes additional unique capabilities such as filtering the PhiX control sequences, conversion of FASTQ formats, and multi-threading. The original data and trimmed summaries are reported within a variety of graphics and reports, providing a simple way to do data quality control and assurance.
Highlights ► NGS has broadly opened the door to shotgun metagenomics. ► Read-based ‘and’ assembly-based approaches should be used for the most detailed analysis. ► Online resources are available for ...some analysis techniques. ► Computational resources are current bottlenecks for in-depth metagenome analysis. ► Current tools will not scale and novel metagenomic analysis algorithms are needed.
Pancreatic cancer is one of the most aggressive malignancies, accounting for more than 45,750 deaths annually in the U.S. alone. The aggressive nature and late diagnosis of pancreatic cancer, coupled ...with the limitations of existing chemotherapy, present the pressing need for the development of novel therapeutic strategies. Recent reports have demonstrated a critical role of microRNAs (miRNAs) in the initiation, progression, and metastasis of cancer. Furthermore, aberrant expressions of miRNAs have often been associated with the cause and consequence of pancreatic cancer, emphasizing the possible use of miRNAs in the effective management of pancreatic cancer patients. In this review, we provide a brief overview of miRNA biogenesis and its role in fundamental cellular process and miRNA studies in pancreatic cancer patients and animal models. Subsequent sections narrate the role of miRNA in, (i) cell cycle and proliferation; (ii) apoptosis; (iii) invasions and metastasis; and (iv) various cellular signaling pathways. We also describe the role of miRNA's in pancreatic cancer; (i) diagnosis; (ii) prognosis and (iii) therapeutic intervention. Conclusion section describes the gist of review with future directions.
Summary
Quaternary ammonium compounds (QACs) represent widely used cationic biocides that persist in natural environments. Although microbial degradation, sensitivity and resistance to QACs have been ...extensively documented, a quantitative understanding of how whole communities adapt to QAC exposure remain elusive. To gain insights into these issues, we exposed a microbial community from a contaminated river sediment to varied levels of benzalkonium chlorides (BACs, a family of QACs) for 3 years. Comparative metagenomic analysis showed that the BAC‐fed communities were dramatically decreased in phylogenetic diversity compared with the control (no BAC exposure), resulting presumably from BAC toxicity, and dominated by Pseudomonas species (> 50% of the total). Time‐course metagenomics revealed that community adaptation occurred primarily via selective enrichment of BAC‐degrading Pseudomonas populations, particularly P. nitroreducens, and secondarily via amino acid substitutions and horizontal transfer of a few selected genes in the Pseudomonas populations, including a gene encoding a PAS/PAC sensor protein and ring‐hydroxylating dioxygenase genes. P. nitroreducens isolates were reproducibly recoverable from communities after prolonged periods of no‐BAC exposure, suggesting that they are robust BAC‐degraders. Our study provides new insights into the mechanisms and tempo of microbial community adaptation to QAC exposure and has implications for treating QACs in biological engineered systems.
A major challenge in the field of shotgun metagenomics is the accurate identification of organisms present within a microbial community, based on classification of short sequence reads. Though ...existing microbial community profiling methods have attempted to rapidly classify the millions of reads output from modern sequencers, the combination of incomplete databases, similarity among otherwise divergent genomes, errors and biases in sequencing technologies, and the large volumes of sequencing data required for metagenome sequencing has led to unacceptably high false discovery rates (FDR). Here, we present the application of a novel, gene-independent and signature-based metagenomic taxonomic profiling method with significantly and consistently smaller FDR than any other available method. Our algorithm circumvents false positives using a series of non-redundant signature databases and examines Genomic Origins Through Taxonomic CHAllenge (GOTTCHA). GOTTCHA was tested and validated on 20 synthetic and mock datasets ranging in community composition and complexity, was applied successfully to data generated from spiked environmental and clinical samples, and robustly demonstrates superior performance compared with other available tools.
The complete genome of the ammonia-oxidizing bacterium Nitrosospira multiformis (ATCC 25196T) consists of a circular chromosome and three small plasmids totaling 3,234,309 bp and encoding 2,827 ...putative proteins. Of the 2,827 putative proteins, 2,026 proteins have predicted functions and 801 are without conserved functional domains, yet 747 of these have similarity to other predicted proteins in databases. Gene homologs from Nitrosomonas europaea and Nitrosomonas eutropha were the best match for 42% of the predicted genes in N. multiformis. The N. multiformis genome contains three nearly identical copies of amo and hao gene clusters as large repeats. The features of N. multiformis that distinguish it from N. europaea include the presence of gene clusters encoding urease and hydrogenase, a ribulose-bisphosphate carboxylase/oxygenase-encoding operon of distinctive structure and phylogeny, and a relatively small complement of genes related to Fe acquisition. Systems for synthesis of a pyoverdine-like siderophore and for acyl-homoserine lactone were unique to N. multiformis among the sequenced genomes of ammonia-oxidizing bacteria. Gene clusters encoding proteins associated with outer membrane and cell envelope functions, including transporters, porins, exopolysaccharide synthesis, capsule formation, and protein sorting/export, were abundant. Numerous sensory transduction and response regulator gene systems directed toward sensing of the extracellular environment are described. Gene clusters for glycogen, polyphosphate, and cyanophycin storage and utilization were identified, providing mechanisms for meeting energy requirements under substrate-limited conditions. The genome of N. multiformis encodes the core pathways for chemolithoautotrophy along with adaptations for surface growth and survival in soil environments.