WIsH predicts prokaryotic hosts of phages from their genomic sequences. It achieves 63% mean accuracy when predicting the host genus among 20 genera for 3 kbp-long phage contigs. Over the best ...current tool, WisH shows much improved accuracy on phage sequences of a few kbp length and runs hundreds of times faster, making it suited for metagenomics studies.
OpenMP-parallelized GPL-licensed C ++ code available at https://github.com/soedinglab/wish.
clovis.galiez@mpibpc.mpg.de or soeding@mpibpc.mpg.de.
Supplementary data are available at Bioinformatics online.
Metagenomics, based on culture-independent sequencing, is a well-fitted approach to provide insights into the composition, structure and dynamics of environmental viral communities. Following recent ...advances in sequencing technologies, new challenges arise for existing bioinformatic tools dedicated to viral metagenome (i.e. virome) analysis as (i) the number of viromes is rapidly growing and (ii) large genomic fragments can now be obtained by assembling the huge amount of sequence data generated for each metagenome.
To face these challenges, a new version of Metavir was developed. First, all Metavir tools have been adapted to support comparative analysis of viromes in order to improve the analysis of multiple datasets. In addition to the sequence comparison previously provided, viromes can now be compared through their k-mer frequencies, their taxonomic compositions, recruitment plots and phylogenetic trees containing sequences from different datasets. Second, a new section has been specifically designed to handle assembled viromes made of thousands of large genomic fragments (i.e. contigs). This section includes an annotation pipeline for uploaded viral contigs (gene prediction, similarity search against reference viral genomes and protein domains) and an extensive comparison between contigs and reference genomes. Contigs and their annotations can be explored on the website through specifically developed dynamic genomic maps and interactive networks.
The new features of Metavir 2 allow users to explore and analyze viromes composed of raw reads or assembled fragments through a set of adapted tools and a user-friendly interface.
Recent studies suggest that members of the Microviridae (a family of ssDNA bacteriophages) might play an important role in a broad spectrum of environments, as they were found in great number among ...the viral fraction from seawater and human gut samples. 24 completely sequenced Microviridae have been described so far, divided into three distinct groups named Microvirus, Gokushovirinae and Alpavirinae, this last group being only composed of prophages. In this study, we present the analysis of 81 new complete Microviridae genomes, assembled from viral metagenomes originating from various ecosystems. The phylogenetic analysis of the core genes highlights the existence of four groups, confirming the three sub-families described so far and exhibiting a new group, named Pichovirinae. The genomic organizations of these viruses are strikingly coherent with their phylogeny, the Pichovirinae being the only group of this family with a different organization of the three core genes. Analysis of the structure of the major capsid protein reveals the presence of mushroom-like insertions conserved within all the groups except for the microviruses. In addition, a peptidase gene was found in 10 Microviridae and its analysis indicates a horizontal gene transfer that occurred several times between these viruses and their bacterial hosts. This is the first report of such gene transfer in Microviridae. Finally, searches against viral metagenomes revealed the presence of highly similar sequences in a variety of biomes indicating that Microviridae probably have both an important role in these ecosystems and an ancient origin.
Tailed bacteriophages are the most abundant and diverse viruses in the world, with genome sizes ranging from 10 kbp to over 500 kbp. Yet, due to historical reasons, all this diversity is confined to ...a single virus order—Caudovirales, composed of just four families: Myoviridae, Siphoviridae, Podoviridae, and the newly created Ackermannviridae family. In recent years, this morphology-based classification scheme has started to crumble under the constant flood of phage sequences, revealing that tailed phages are even more genetically diverse than once thought. This prompted us, the Bacterial and Archaeal Viruses Subcommittee of the International Committee on Taxonomy of Viruses (ICTV), to consider overall reorganization of phage taxonomy. In this study, we used a wide range of complementary methods—including comparative genomics, core genome analysis, and marker gene phylogenetics—to show that the group of Bacillus phage SPO1-related viruses previously classified into the Spounavirinae subfamily, is clearly distinct from other members of the family Myoviridae and its diversity deserves the rank of an autonomous family. Thus, we removed this group from the Myoviridae family and created the family Herelleviridae—a new taxon of the same rank. In the process of the taxon evaluation, we explored the feasibility of different demarcation criteria and critically evaluated the usefulness of our methods for phage classification. The convergence of results, drawing a consistent and comprehensive picture of a new family with associated subfamilies, regardless of method, demonstrates that the tools applied here are particularly useful in phage taxonomy. We are convinced that creation of this novel family is a crucial milestone toward much-needed reclassification in the Caudovirales order.
Metavir is a web server dedicated to the analysis of viral metagenomes (viromes). In addition to classical approaches for analyzing metagenomes (general sequence characteristics, taxonomic ...composition), new tools developed specifically for viral sequence analysis make it possible to: (i) explore viral diversity through automatically constructed phylogenies for selected marker genes, (ii) estimate gene richness through rarefaction curves and (iii) perform cross-comparison against other viromes using sequence similarities. Metavir is thus unique as a platform that allows a comprehensive virome analysis. AVAILABILITY: Metavir is freely available online at: http://metavir-meb.univ-bpclermont.fr CONTACT: simon.roux@univ-bpclermont.fr Supplementary Information: Supplementary data are available at Bioinformatics online.
Metagenomic studies have uncovered an astonishing diversity of ssDNA viruses encoding replication proteins (Reps) related to those of eukaryotic Circoviridae, Geminiviridae or Nanoviridae; however, ...exact evolutionary relationships among these viruses remain obscure. Recently, a unique chimeric virus (CHIV) genome, which has apparently emerged via recombination between ssRNA and ssDNA viruses, has been discovered. Here we report on the assembly of 13 new CHIV genomes recovered from various environments. Our results indicate a single event of capsid protein (CP) gene capture from an RNA virus in the history of this virus group. The domestication of the CP gene was followed by an unprecedented recurrent replacement of the Rep genes in CHIVs with distant counterparts from diverse ssDNA viruses. We suggest that parasitic and symbiotic interactions between unicellular eukaryotes were central for the emergence of CHIVs and that such turbulent evolution was primarily dictated by incongruence between the CP and Rep proteins.
Although the importance of viruses in natural ecosystems is widely acknowledged, the functional potential of viral communities is yet to be determined. Viral genomes are traditionally believed to ...carry only those genes that are directly pertinent to the viral life cycle, though this view was challenged by the discovery of metabolism genes in several phage genomes. Metagenomic approaches extended these analyses to a community scale, and several studies concluded that microbial and viral communities encompass similar functional potentials. However, these conclusions could originate from the presence of cellular DNA within viral metagenomes. We developed a computational method to estimate the proportion and origin of cellular sequences in a set of 67 published viromes. A quarter of the datasets were found to contain a substantial amount of sequences originating from cellular genomes. When considering only viromes with no cellular DNA detected, the functional potential of viral and microbial communities was found to be fundamentally different—a conclusion more consistent with the actual picture drawn from known viruses. Yet a significant number of cellular metabolism genes was still retrieved in these viromes, suggesting that the presence of auxiliary genes involved in various metabolic pathways within viral genomes is a general trend in the virosphere.
In this article, we – the Bacterial Viruses Subcommittee and the Archaeal Viruses Subcommittee of the International Committee on Taxonomy of Viruses (ICTV) – summarise the results of our activities ...for the period March 2020 – March 2021. We report the division of the former Bacterial and Archaeal Viruses Subcommittee in two separate Subcommittees, welcome new members, a new Subcommittee Chair and Vice Chair, and give an overview of the new taxa that were proposed in 2020, approved by the Executive Committee and ratified by vote in 2021. In particular, a new realm, three orders, 15 families, 31 subfamilies, 734 genera and 1845 species were newly created or redefined (moved/promoted).
Abstract
Summary
Viral genes, that are frequently small genes and/or with large overlaps, are still difficult to predict accurately. To help predict all genes in viral genomes, we provide CodingDiv ...that detects SNP-level microdiversity of all potential coding regions, using metagenomic reads and/or similar sequences from external databases. Protein coding regions can then be identified as the ones containing more synonymous SNPs than unfavorable nonsynonymous substitutions SNPs.
Availability and implementation
CodingDiv is released under the GPL license. Source code is available at https://github.com/ericolo/codingDiv. The software can be installed and used through a docker container.
Viral metagenomics (viromics) is a tremendous tool to reveal viral taxonomic and functional diversity across ecosystems ranging from the human gut to the world's oceans. As with microbes however, ...there appear vast swaths of "dark matter" yet to be documented for viruses, even among relatively well-studied viral types. Here, we use viromics to explore the "Far-T4 phages" sequence space, a neighbor clade from the well-studied T4-like phages that was first detected through PCR study in seawater and subsequently identified in freshwater lakes through 454-sequenced viromes. To advance the description of these viruses beyond this single marker gene, we explore Far-T4 genome fragments assembled from two deeply-sequenced freshwater viromes. Single gene phylogenetic trees confirm that the Far-T4 phages are divergent from the T4-like phages, genome fragments reveal largely collinear genome organizations, and both data led to the delineation of five Far-T4 clades. Three-dimensional models of major capsid proteins are consistent with a T4-like structure, and highlight a highly conserved core flanked by variable insertions. Finally, we contextualize these now better characterized Far-T4 phages by re-analyzing 196 previously published viromes. These suggest that Far-T4 are common in freshwater and seawater as only four of 82 aquatic viromes lacked Far-T4-like sequences. Variability in representation across the five newly identified clades suggests clade-specific niche differentiation may be occurring across the different biomes, though the underlying mechanism remains unidentified. While complete genome assembly from complex communities and the lack of host linkage information still bottleneck virus discovery through viromes, these findings exemplify the power of metagenomics approaches to assess the diversity, evolutionary history, and genomic characteristics of novel uncultivated phages.