The identification of relevant virus-host pairs that globally account for a large pool of carbon and nutrients in the ocean is paramount to build accurate ecological models. A previous work using ...single-virus genomics led to the discovery of the uncultured single-virus vSAG 37-F6, originally sorted from the Mediterranean Sea (Blanes Bay Microbial Observatory), that represents one of the most abundant dsDNA viral population in the marine surface virosphere. Here, from same sampling site, we report that a Pelagibacter single-cell contained a viral member of vSAG 37-F6 population, by means of PCR screening of sorted, genome-amplified single cells with vSAG 37-F6-specific primers and whole-genome sequencing. Furthermore, viruses from this population were also found in three other Pelagibacter single cells from the South Pacific and Atlantic oceans. These new uncultured pelagiphages were genetically different from the previously characterized pelagiphage isolates. Data showed that the uncultured vSAG 37-F6 population represents the Pelagibacter phages that inhabit the sunlit ocean better, and contains a vast unrecognized microdiversity.
Single-cell genomics has unveiled the metabolic potential of dominant microbes inhabiting different environments, including the human body. The lack of genomic information for predominant microbes of ...the human body, such as bacteriophages, hinders our ability to answer fundamental questions about our viral communities. Here, we applied single-virus genomics (SVGs) to natural human salivary samples in combination with viral metagenomics to gain some insights into the viral community structure of the oral cavity. Saliva samples were processed for viral metagenomics (
= 15) and SVGs (
= 3). A total of 1328 uncultured single viruses were sorted by fluorescence-activated virus sorting followed by whole genome amplification. Sequencing of 24 viral single amplified genomes (vSAGs) showed that half of the vSAGs contained viral hallmark genes. Among those bona fide viruses, the uncultured single virus 92-C13 putatively infecting oral Streptococcus-like species was within the top ≈10 most abundant viruses in the oral virome. Viral gene network and viral metagenomics analyses of 439 oral viruses from cultures, metagenomics, and SVGs revealed that salivary viruses were tentatively structured into ≈200 major viral clusters, corresponding to approximately genus-level groupings. Data showed that none of the publicly available viral isolates, excepting an Actinomyces phage, were significantly abundant in the oral viromes. In addition, none of the obtained viral contigs and vSAGs from this study were present in all viromes. Overall, the data demonstrates that most viral isolates are not naturally abundant in saliva, and furthermore, the predominant viruses in the oral cavity are yet uncharacterized. Results suggest a variable, complex, and interpersonal viral profile. Finally, we demonstrated the power of SVGs in combination with viral metagenomics to unveil the genetic information of the uncultured viruses of the human virome.
Transcriptomic diversity greatly contributes to the fundamentals of disease, lineage-specific biology, and environmental adaptation. However, much of the actual isoform repertoire contributing to ...shaping primate evolution remains unknown. Here, we combined deep long- and short-read sequencing complemented with mass spectrometry proteomics in a panel of lymphoblastoid cell lines (LCLs) from human, three other great apes, and rhesus macaque, producing the largest full-length isoform catalog in primates to date. Around half of the captured isoforms are not annotated in their reference genomes, significantly expanding the gene models in primates. Furthermore, our comparative analyses unveil hundreds of transcriptomic innovations and isoform usage changes related to immune function and immunological disorders. The confluence of these evolutionary innovations with signals of positive selection and their limited impact in the proteome points to changes in alternative splicing in genes involved in immune response as an important target of recent regulatory divergence in primates.
Sorting of individual chromosomes by Flow Cytometry (flow-sorting) is an enrichment method to potentially simplify genome assembly by isolating chromosomes from the context of the genome. We have ...recently developed a workflow to sequence native, unamplified DNA and applied it to the smallest human chromosome, the Y chromosome. Here, we modify improve upon that workflow to increase DNA recovery from chromosome sorting as well as sequencing yield. We apply it to sequence and assemble the largest human chromosome - chromosome 1 - of a Chinese individual using a single Oxford Nanopore MinION flow cell. We generate a selective and highly continuous assembly whose continuity reaches into the order of magnitude of the human reference GRCh38. We then use this assembly to call candidate structural variants against the reference and find 685 putative novel SV candidates. We propose this workflow as a potential solution to assemble structurally complex chromosomes, or the study of very large plant or animal genomes that might challenge traditional assembly strategies.
Microbes drive ecosystems under constraints imposed by viruses. However, a lack of virus genome information hinders our ability to answer fundamental, biological questions concerning microbial ...communities. Here we apply single-virus genomics (SVGs) to assess whether portions of marine viral communities are missed by current techniques. The majority of the here-identified 44 viral single-amplified genomes (vSAGs) are more abundant in global ocean virome data sets than published metagenome-assembled viral genomes or isolates. This indicates that vSAGs likely best represent the dsDNA viral populations dominating the oceans. Species-specific recruitment patterns and virome simulation data suggest that vSAGs are highly microdiverse and that microdiversity hinders the metagenomic assembly, which could explain why their genomes have not been identified before. Altogether, SVGs enable the discovery of some of the likely most abundant and ecologically relevant marine viral species, such as vSAG 37-F6, which were overlooked by other methodologies.
Mammalian Y chromosomes are often neglected from genomic analysis. Due to their inherent assembly difficulties, high repeat content, and large ampliconic regions, only a handful of species have their ...Y chromosome properly characterized. To date, just a single human reference quality Y chromosome, of European ancestry, is available due to a lack of accessible methodology. To facilitate the assembly of such complicated genomic territory, we developed a novel strategy to sequence native, unamplified flow sorted DNA on a MinION nanopore sequencing device. Our approach yields a highly continuous assembly of the first human Y chromosome of African origin. It constitutes a significant improvement over comparable previous methods, increasing continuity by more than 800%. Sequencing native DNA also allows to take advantage of the nanopore signal data to detect epigenetic modifications in situ. This approach is in theory generalizable to any species simplifying the assembly of extremely large and repetitive genomes.
Single-virus genomics (SVGs) has been successfully applied to ocean surface samples allowing the discovery of widespread dominant viruses overlooked for years by metagenomics, such as the uncultured ...virus vSAG 37-F6 infecting the ubiquitous Pelagibacter spp. In SVGs, one uncultured virus at a time is sorted from the environmental sample, whole-genome amplified, and sequenced. Here, we have applied SVGs to deep-ocean samples (200–4000 m depth) from global Malaspina and MEDIMAX expeditions, demonstrating the feasibility of this method in deep-ocean samples. A total of 1328 virus-like particles were sorted from the North Atlantic Ocean, the deep Mediterranean Sea, and the Pacific Ocean oxygen minimum zone (OMZ). For this proof of concept, sixty single viruses were selected at random for sequencing. Genome annotation identified 27 of these genomes as bona fide viruses, and detected three auxiliary metabolic genes involved in nucleotide biosynthesis and sugar metabolism. Massive protein profile analysis confirmed that these viruses represented novel viral groups not present in databases. Although they were not previously assembled by viromics, global fragment recruitment analysis showed a conserved profile of relative abundance of these viruses in all analyzed samples spanning different oceans. Altogether, these results reveal the feasibility in using SVGs in this vast environment to unveil the genomes of relevant viruses.
Ecogenomics of the SAR11 clade Haro‐Moreno, Jose M.; Rodriguez‐Valera, Francisco; Rosselli, Riccardo ...
Environmental microbiology,
20/May , Letnik:
22, Številka:
5
Journal Article
Recenzirano
Odprti dostop
Summary
Members of the SAR11 clade, despite their high abundance, are often poorly represented by metagenome‐assembled genomes. This fact has hampered our knowledge about their ecology and genetic ...diversity. Here we examined 175 SAR11 genomes, including 47 new single‐amplified genomes. The presence of the first genomes associated with subclade IV suggests that, in the same way as subclade V, they might be outside the proposed Pelagibacterales order. An expanded phylogenomic classification together with patterns of metagenomic recruitment at a global scale have allowed us to define new ecogenomic units of classification (genomospecies), appearing at different, and sometimes restricted, metagenomic data sets. We detected greater microdiversity across the water column at a single location than in samples collected from similar depth across the global ocean, suggesting little influence of biogeography. In addition, pangenome analysis revealed that the flexible genome was essential to shape genomospecies distribution. In one genomospecies preferentially found within the Mediterranean, a set of genes involved in phosphonate utilization was detected. While another, with a more cosmopolitan distribution, was unique in having an aerobic purine degradation pathway. Together, these results provide a glimpse of the enormous genomic diversity within this clade at a finer resolution than the currently defined clades.
The emergence of new structures can often be linked to the evolution of novel cell types that follows the rewiring of developmental gene regulatory subnetworks. Vertebrates are characterized by a ...complex body plan compared to the other chordate clades and the question remains of whether and how the emergence of vertebrate morphological innovations can be related to the appearance of new embryonic cell populations. We previously proposed, by studying mesoderm development in the cephalochordate amphioxus, a scenario for the evolution of the vertebrate head mesoderm. To further test this scenario at the cell population level, we used scRNA-seq to construct a cell atlas of the amphioxus neurula, stage at which the main mesodermal compartments are specified. Our data allowed us to validate the presence of a prechordal-plate like territory in amphioxus. Additionally, the transcriptomic profile of somite cell populations supports the homology between specific territories of amphioxus somites and vertebrate cranial/pharyngeal and lateral plate mesoderm. Finally, our work provides evidence that the appearance of the specific mesodermal structures of the vertebrate head was associated to both segregation of pre-existing cell populations, and co-option of new genes for the control of myogenesis.
Recent advances in long-read sequencing technologies have allowed the generation and curation of more complete genome assemblies, enabling the analysis of traditionally neglected chromosomes, such as ...the human Y chromosome (chrY). Native DNA was sequenced on a MinION Oxford Nanopore Technologies sequencing device to generate genome assemblies for seven major chrY human haplogroups. We analyzed and compared the chrY enrichment of sequencing data obtained using two different selective sequencing approaches: adaptive sampling and flow cytometry chromosome sorting. We show that adaptive sampling can produce data to create assemblies comparable to chromosome sorting while being a less expensive and time-consuming technique. We also assessed haplogroup-specific structural variants, which would be otherwise difficult to study using short-read sequencing data only. Finally, we took advantage of this technology to detect and profile epigenetic modifications among the considered haplogroups. Altogether, we provide a framework to study complex genomic regions with a simple, fast, and affordable methodology that could be applied to larger population genomics datasets.