The extension of virology beyond its traditional medical, veterinary, or agricultural applications, now called environmental virology, has shown that viruses are both the most numerous and diverse ...biological entities on Earth. In particular, virus isolations from unicellular eukaryotic hosts (heterotrophic and photosynthetic protozoans) revealed numerous viral types previously unexpected in terms of virion structure, gene content, or mode of replication. Complemented by large-scale metagenomic analyses, these discoveries have rekindled interest in the enigma of the origin of viruses, for which a description encompassing all their diversity remains not available. Several laboratories have repeatedly tackled the deep reconstruction of the evolutionary history of viruses, using various methods of molecular phylogeny applied to the few shared "core" genes detected in certain virus groups (e.g., the
). Beyond the practical difficulties of establishing reliable homology relationships from extremely divergent sequences, I present here conceptual arguments highlighting several fundamental limitations plaguing the reconstruction of the deep evolutionary history of viruses, and even more the identification of their unique or multiple origin(s). These arguments also underline the risk of establishing premature high level viral taxonomic classifications. Those limitations are direct consequences of the random mechanisms governing the reductive/retrogressive evolution of all obligate intracellular parasites.
With DNA genomes reaching 2.5 Mb packed in particles of bacterium-like shape and dimension, the first two Acanthamoeba-infecting pandoraviruses remained up to now the most complex viruses since their ...discovery in 2013. Our isolation of three new strains from distant locations and environments is now used to perform the first comparative genomics analysis of the emerging worldwide-distributed Pandoraviridae family. Thorough annotation of the genomes combining transcriptomic, proteomic, and bioinformatic analyses reveals many non-coding transcripts and significantly reduces the former set of predicted protein-coding genes. Here we show that the pandoraviruses exhibit an open pan-genome, the enormous size of which is not adequately explained by gene duplications or horizontal transfers. As most of the strain-specific genes have no extant homolog and exhibit statistical features comparable to intergenic regions, we suggest that de novo gene creation could contribute to the evolution of the giant pandoravirus genomes.
Chrysochromulina ericina virus CeV-01B (CeV) was isolated from Norwegian coastal waters in 1998. Its icosahedral particle is 160 nm in diameter and encloses a 474-kb double-stranded DNA (dsDNA) ...genome. This virus, although infecting a microalga (the haptophyceae
, formerly
), is phylogenetically related to members of the
family, initially established with the acanthamoeba-infecting mimivirus and megavirus as prototypes. This family was later split into two genera (
and
) following the characterization of a virus infecting the heterotrophic stramenopile
(CroV). CeV, as well as two of its close relatives, which infect the unicellular photosynthetic eukaryotes
(Phaeocystis globosa virus PgV) and
(Aureococcus anophagefferens virus AaV), are currently unclassified by the International Committee on Viral Taxonomy (ICTV). The detailed comparative analysis of the CeV genome presented here confirms the phylogenetic affinity of this emerging group of microalga-infecting viruses with the
but argues in favor of their classification inside a distinct clade within the family. Although CeV, PgV, and AaV share more common features among them than with the larger
, they also exhibit a large complement of unique genes, attesting to their complex evolutionary history. We identified several gene fusion events and cases of convergent evolution involving independent lateral gene acquisitions. Finally, CeV possesses an unusual number of inteins, some of which are closely related despite being inserted in nonhomologous genes. This appears to contradict the paradigm of allele-specific inteins and suggests that the
are especially efficient in spreading inteins while enlarging their repertoire of homing genes.
Although it infects the microalga
, CeV is more closely related to acanthamoeba-infecting viruses of the
family than to any member of the
, the ICTV-approved family historically including all alga-infecting large dsDNA viruses. CeV, as well as its relatives that infect the microalgae
(PgV) and
(AaV), remains officially unclassified and a source of confusion in the literature. Our comparative analysis of the CeV genome in the context of this emerging group of alga-infecting viruses suggests that they belong to a distinct clade within the established
family. The presence of a large number of unique genes as well as specific gene fusion events, evolutionary convergences, and inteins integrated at unusual locations document the complex evolutionary history of the CeV lineage.
Ten years ago, the discovery of Mimivirus, a virus infecting Acanthamoeba, initiated a reappraisal of the upper limits of the viral world, both in terms of particle size (>0.7 micrometers) and genome ...complexity (>1000 genes), dimensions typical of parasitic bacteria. The diversity of these giant viruses (the Megaviridae) was assessed by sampling a variety of aquatic environments and their associated sediments worldwide. We report the isolation of two giant viruses, one off the coast of central Chile, the other from a freshwater pond near Melbourne (Australia), without morphological or genomic resemblance to any previously defined virus families. Their micrometer-sized ovoid particles contain DNA genomes of at least 2.5 and 1.9 megabases, respectively. These viruses are the first members of the proposed "Pandoravirus" genus, a term reflecting their lack of similarity with previously described microorganisms and the surprises expected from their future study.
Mimivirus and its virophage Claverie, Jean-Michel; Abergel, Chantal
Annual review of genetics,
12/2009, Letnik:
43
Journal Article
Recenzirano
Mimivirus, a virus infecting amoebae of the acanthamoeba genus, is the prototype member of the Mimiviridae, the latest addition to the family of the nucleocytoplasmic large DNA viruses, already ...including the Poxviridae, the Iridoviridae, the Asfarviridae, and the Phycodnaviridae. Because of the size of its particle-a fiber-covered icosahedral protein capsid 0.75 microm in diameter-Mimivirus was initially mistaken for a parasitic bacterium. Its 1.2-Mb genome sequence encodes more than 900 proteins, many of them associated with functions never before encountered in a virus, such as four aminoacyl-tRNA synthetases. These findings revived the debate about the origin of DNA viruses and their possible role in the emergence of the eukaryotic nucleus. The recent isolation of a new type of satellite virus, called a virophage, associated with a second strain of Mimivirus, confirmed its unique position within the virus world. Post-genomic studies are now in progress, slowly shedding some light on the physiology of the most complex virus isolated to date.
The protein-coding exome of a patient with a monogenic disease contains about 20,000 variants, only one or two of which are disease causing. We found that 58% of rare variants in the protein-coding ...exome of the general population are located in only 2% of the genes. Prompted by this observation, we aimed to develop a gene-level approach for predicting whether a given human protein-coding gene is likely to harbor disease-causing mutations. To this end, we derived the gene damage index (GDI): a genome-wide, gene-level metric of the mutational damage that has accumulated in the general population. We found that the GDI was correlated with selective evolutionary pressure, protein complexity, coding sequence length, and the number of paralogs. We compared GDI with the leading gene-level approaches, genic intolerance, and de novo excess, and demonstrated that GDI performed best for the detection of false positives (i.e., removing exome variants in genes irrelevant to disease), whereas genic intolerance and de novo excess performed better for the detection of true positives (i.e., assessing de novo mutations in genes likely to be disease causing). The GDI server, data, and software are freely available to noncommercial users from lab.rockefeller.edu/casanova/GDI.
Acanthamoeba are infected by a remarkable diversity of large dsDNA viruses, the infectious cycles of which have been characterized using genomics, transcriptomics and electron microscopy. Given their ...gene content and the persistence of the host nucleus throughout their infectious cycle, the Marseilleviridae were initially assumed to fully replicate in the cytoplasm. Unexpectedly, we find that their virions do not incorporate the virus-encoded transcription machinery, making their replication nucleus-dependent. However, instead of delivering their DNA to the nucleus, the Marseilleviridae initiate their replication by transiently recruiting the nuclear transcription machinery to their cytoplasmic viral factory. The nucleus recovers its integrity after becoming leaky at an early stage. This work highlights the importance of virion proteomic analyses to complement genome sequencing in the elucidation of the replication scheme and evolution of large dsDNA viruses.
Microbes trapped in permanently frozen paleosoils (permafrost) are the focus of increasing research in the context of global warming. Our previous investigations led to the discovery and reactivation ...of two
-infecting giant viruses,
and
, from a 30,000-year old permafrost layer. While several modern pithovirus strains have since been isolated, no contemporary mollivirus relative was found. We now describe
, a close relative to
, isolated from surface soil sampled on the bank of the Kronotsky River in Kamchatka, Russian Federation. This discovery confirms that molliviruses have not gone extinct and are at least present in a distant subarctic continental location. This modern isolate exhibits a nucleocytoplasmic replication cycle identical to that of
Its spherical particle (0.6 μm in diameter) encloses a 648-kb GC-rich double-stranded DNA genome coding for 480 proteins, of which 61% are unique to these two molliviruses. The 461 homologous proteins are highly conserved (92% identical residues, on average), despite the presumed stasis of
for the last 30,000 years. Selection pressure analyses show that most of these proteins contribute to virus fitness. The comparison of these first two molliviruses clarify their evolutionary relationship with the pandoraviruses, supporting their provisional classification in a distinct family, the
, pending the eventual discovery of intermediary missing links better demonstrating their common ancestry.
Virology has long been viewed through the prism of human, cattle, or plant diseases, leading to a largely incomplete picture of the viral world. The serendipitous discovery of the first giant virus visible under a light microscope (i.e., >0.3 μm in diameter), mimivirus, opened a new era of environmental virology, now incorporating protozoan-infecting viruses. Planet-wide isolation studies and metagenome analyses have shown the presence of giant viruses in most terrestrial and aquatic environments, including upper Pleistocene frozen soils. Those systematic surveys have led authors to propose several new distinct families, including the
,
,
,
, and
We now propose to introduce one additional family, the
, following the description of
, the first modern relative of
, previously isolated from 30,000-year-old arctic permafrost.
Large dsDNA viruses are involved in the population control of many globally distributed species of eukaryotic phytoplankton and have a prominent role in bloom termination. The genus Phaeocystis ...(Haptophyta , Prymnesiophyceae) includes several high-biomass-forming phytoplankton species, such as Phaeocystis globosa , the blooms of which occur mostly in the coastal zone of the North Atlantic and the North Sea. Here, we report the 459,984-bp-long genome sequence of P. globosa virus strain PgV-16T, encoding 434 proteins and eight tRNAs and, thus, the largest fully sequenced genome to date among viruses infecting algae. Surprisingly, PgV-16T exhibits no phylogenetic affinity with other viruses infecting microalgae (e.g., phycodnaviruses), including those infecting Emiliania huxleyi , another ubiquitous bloom-forming haptophyte. Rather, PgV-16T belongs to an emerging clade (the Megaviridae) clustering the viruses endowed with the largest known genomes, including Megavirus, Mimivirus (both infecting acanthamoeba), and a virus infecting the marine microflagellate grazer Cafeteria roenbergensis . Seventy-five percent of the best matches of PgV-16T–predicted proteins correspond to two viruses Organic Lake phycodnavirus (OLPV)1 and OLPV2 from a hypersaline lake in Antarctica (Organic Lake), the hosts of which are unknown. As for OLPVs and other Megaviridae, the PgV-16T sequence data revealed the presence of a virophage-like genome. However, no virophage particle was detected in infected P. globosa cultures. The presence of many genes found only in Megaviridae in its genome and the presence of an associated virophage strongly suggest that PgV-16T shares a common ancestry with the largest known dsDNA viruses, the host range of which already encompasses the earliest diverging branches of domain Eukarya.
Fewer Genes, More Noncoding RNA Claverie, Jean-Michel
Science (American Association for the Advancement of Science),
09/2005, Letnik:
309, Številka:
5740
Journal Article
Recenzirano
Recent studies showing that most "messenger" RNAs do not encode proteins finally explain the long-standing discrepancy between the small number of protein-coding genes found in vertebrate genomes and ...the much larger and ever-increasing number of polyadenylated transcripts identified by tag-sampling or microarray-based methods. Exploring the role and diversity of these numerous noncoding RNAs now constitutes a main challenge in transcription research.