DNA methylation acts in concert with restriction enzymes to protect the integrity of prokaryotic genomes. Studies in a limited number of organisms suggest that methylation also contributes to ...prokaryotic genome regulation, but the prevalence and properties of such non-restriction-associated methylation systems remain poorly understood. Here, we used single molecule, real-time sequencing to map DNA modifications including m6A, m4C, and m5C across the genomes of 230 diverse bacterial and archaeal species. We observed DNA methylation in nearly all (93%) organisms examined, and identified a total of 834 distinct reproducibly methylated motifs. This data enabled annotation of the DNA binding specificities of 620 DNA Methyltransferases (MTases), doubling known specificities for previously hard to study Type I, IIG and III MTases, and revealing their extraordinary diversity. Strikingly, 48% of organisms harbor active Type II MTases with no apparent cognate restriction enzyme. These active 'orphan' MTases are present in diverse bacterial and archaeal phyla and show motif specificities and methylation patterns consistent with functions in gene regulation and DNA replication. Our results reveal the pervasive presence of DNA methylation throughout the prokaryotic kingdoms, as well as the diversity of sequence specificities and potential functions of DNA methylation systems.
Section Flavi encompasses both harmful and beneficial Aspergillus species, such as Aspergillus oryzae, used in food fermentation and enzyme production, and Aspergillus flavus, food spoiler and ...mycotoxin producer. Here, we sequence 19 genomes spanning section Flavi and compare 31 fungal genomes including 23 Flavi species. We reassess their phylogenetic relationships and show that the closest relative of A. oryzae is not A. flavus, but A. minisclerotigenes or A. aflatoxiformans and identify high genome diversity, especially in sub-telomeric regions. We predict abundant CAZymes (598 per species) and prolific secondary metabolite gene clusters (73 per species) in section Flavi. However, the observed phenotypes (growth characteristics, polysaccharide degradation) do not necessarily correlate with inferences made from the predicted CAZyme content. Our work, including genomic analyses, phenotypic assays, and identification of secondary metabolites, highlights the genetic and metabolic diversity within section Flavi.
Organisms orchestrate cellular functions through transcription factor (TF) interactions with their target genes, although these regulatory relationships are largely unknown in most species. Here we ...report a high-throughput approach for characterizing TF-target gene interactions across species and its application to 354 TFs across 48 bacteria, generating 17,000 genome-wide binding maps. This dataset revealed themes of ancient conservation and rapid evolution of regulatory modules. We observed rewiring, where the TF sensing and regulatory role is maintained while the arrangement and identity of target genes diverges, in some cases encoding entirely new functions. We further integrated phenotypic information to define new functional regulatory modules and pathways. Finally, we identified 242 new TF DNA binding motifs, including a 70% increase of known Escherichia coli motifs and the first annotation in Pseudomonas simiae, revealing deep conservation in bacterial promoter architecture. Our method provides a versatile tool for functional characterization of genetic pathways in prokaryotes and eukaryotes.
The process of crop domestication often consists of two stages: initial domestication, where the wild species is first cultivated by humans, followed by diversification, when the domesticated species ...are subsequently adapted to more environments and specialized uses. Selective pressure to increase sugar accumulation in certain varieties of the cereal crop Sorghum bicolor is an excellent example of the latter; this has resulted in pronounced phenotypic divergence between sweet and grain-type sorghums, but the genetic mechanisms underlying these differences remain poorly understood.
Here we present a new reference genome based on an archetypal sweet sorghum line and compare it to the current grain sorghum reference, revealing a high rate of nonsynonymous and potential loss of function mutations, but few changes in gene content or overall genome structure. We also use comparative transcriptomics to highlight changes in gene expression correlated with high stalk sugar content and show that changes in the activity and possibly localization of transporters, along with the timing of sugar metabolism play a critical role in the sweet phenotype.
The high level of genomic similarity between sweet and grain sorghum reflects their historical relatedness, rather than their current phenotypic differences, but we find key changes in signaling molecules and transcriptional regulators that represent new candidates for understanding and improving sugar metabolism in this important crop.
Environmental stress is a major driver of ecological community dynamics and agricultural productivity. This is especially true for soil water availability, because drought is the greatest abiotic ...inhibitor of worldwide crop yields. Here, we test the genetic basis of drought responses in the genetic model for C
perennial grasses, Panicum hallii, through population genomics, field-scale gene-expression (eQTL) analysis, and comparison of two complete genomes. While gene expression networks are dominated by local cis-regulatory elements, we observe three genomic hotspots of unlinked trans-regulatory loci. These regulatory hubs are four times more drought responsive than the genome-wide average. Additionally, cis- and trans-regulatory networks are more likely to have opposing effects than expected under neutral evolution, supporting a strong influence of compensatory evolution and stabilizing selection. These results implicate trans-regulatory evolution as a driver of drought responses and demonstrate the potential for crop improvement in drought-prone regions through modification of gene regulatory networks.
Sodiomyces alkalinus is one of the very few alkalophilic fungi, adapted to grow optimally at high pH. It is widely distributed at the plant‐deprived edges of extremely alkaline lakes and locally ...abundant. We sequenced the genome of S. alkalinus and reconstructed evolution of catabolic enzymes, using a phylogenomic comparison. We found that the genome of S. alkalinus is larger, but its predicted proteome is smaller and heavily depleted of both plant‐degrading enzymes and proteinases, when compared to its closest plant‐pathogenic relatives. Interestingly, despite overall losses, S. alkalinus has retained many proteinases families and acquired bacterial cell wall‐degrading enzymes, some of them via horizontal gene transfer from bacteria. This fungus has very potent proteolytic activity at high pH values, but slowly induced low activity of cellulases and hemicellulases. Our experimental and in silico data suggest that plant biomass, a common food source for most fungi, is not a preferred substrate for S. alkalinus in its natural environment. We conclude that the fungus has abandoned the ancestral plant‐based diet and has become specialized in a more protein‐rich food, abundantly available in soda lakes in the form of prokaryotes and small crustaceans.
Different high temperatures adversely affect crop and algal yields with various responses in photosynthetic cells. The list of genes required for thermotolerance remains elusive. Additionally, it is ...unclear how carbon source availability affects heat responses in plants and algae. We utilized the insertional, indexed, genome‐saturating mutant library of the unicellular, eukaryotic green alga Chlamydomonas reinhardtii to perform genome‐wide, quantitative, pooled screens under moderate (35°C) or acute (40°C) high temperatures with or without organic carbon sources. We identified heat‐sensitive mutants based on quantitative growth rates and identified putative heat tolerance genes (HTGs). By triangulating HTGs with heat‐induced transcripts or proteins in wildtype cultures and MapMan functional annotations, we presented a high/medium‐confidence list of 933 Chlamydomonas genes with putative roles in heat tolerance. Triangulated HTGs include those with known thermotolerance roles and novel genes with little or no functional annotation. About 50% of these high‐confidence HTGs in Chlamydomonas have orthologs in green lineage organisms, including crop species. Arabidopsis thaliana mutants deficient in the ortholog of a high‐confidence Chlamydomonas HTG were also heat sensitive. This work expands our knowledge of heat responses in photosynthetic cells and provides engineering targets to improve thermotolerance in algae and crops.
Summary Statement
We performed genome‐wide, pooled, quantitative, mutant screens in the unicellular green alga Chlamydomonas reinhardtii, combined with transcriptomics/proteomics and function annotation, and provided a list of 933 high/medium‐confidence genes with putative roles in heat tolerance in photosynthetic cells.
Co-infections or secondary infections with SARS-CoV-2 have the potential to affect disease severity and morbidity. Additionally, the potential influence of the nasal microbiome on COVID-19 illness is ...not well understood. In this study, we analyzed 203 residual samples, originally submitted for SARS-CoV-2 testing, for the presence of viral, bacterial, and fungal pathogens and non-pathogens using a comprehensive microarray technology, the Lawrence Livermore Microbial Detection Array (LLMDA). Eighty-seven percent of the samples were nasopharyngeal samples, and 23% of the samples were oral, nasal and oral pharyngeal swabs. We conducted bioinformatics analyses to examine differences in microbial populations of these samples, as a proxy for the nasal and oral microbiome, from SARS-CoV-2 positive and negative specimens. We found 91% concordance with the LLMDA relative to a diagnostic RT-qPCR assay for detection of SARS-CoV-2. Sixteen percent of all the samples (32/203) revealed the presence of an opportunistic bacterial or frank viral pathogen with the potential to cause co-infections. The two most detected bacteria, Streptococcus pyogenes and Streptococcus pneumoniae, were present in both SARS-CoV-2 positive and negative samples. Human metapneumovirus was the most prevalent viral pathogen in the SARS-CoV-2 negative samples. Sequence analysis of 16S rRNA was also conducted to evaluate bacterial diversity and confirm LLMDA results.
Co-infections or secondary infections with SARS-CoV-2 have the potential to affect disease severity and morbidity. Additionally, the potential influence of the nasal microbiome on COVID-19 illness is ...not well understood. In this study, we analyzed 203 residual samples, originally submitted for SARS-CoV-2 testing, for the presence of viral, bacterial, and fungal pathogens and non-pathogens using a comprehensive microarray technology, the Lawrence Livermore Microbial Detection Array (LLMDA). Eighty-seven percent of the samples were nasopharyngeal samples, and 23% of the samples were oral, nasal and oral pharyngeal swabs. We conducted bioinformatics analyses to examine differences in microbial populations of these samples, as a proxy for the nasal and oral microbiome, from SARS-CoV-2 positive and negative specimens. We found 91% concordance with the LLMDA relative to a diagnostic RT-qPCR assay for detection of SARS-CoV-2. Sixteen percent of all the samples (32/203) revealed the presence of an opportunistic bacterial or frank viral pathogen with the potential to cause co-infections. The two most detected bacteria,
Streptococcus pyogenes
and
Streptococcus pneumoniae
, were present in both SARS-CoV-2 positive and negative samples. Human metapneumovirus was the most prevalent viral pathogen in the SARS-CoV-2 negative samples. Sequence analysis of 16S rRNA was also conducted to evaluate bacterial diversity and confirm LLMDA results.
Flowering time is a major determinant of biomass yield in switchgrass (Panicum virgatum), a perennial bioenergy crop, because later flowering allows for an extended period of vegetative growth and ...increased biomass production. A better understanding of the genetic regulation of flowering time in switchgrass will aid the development of switchgrass varieties with increased biomass yields, particularly at northern latitudes, where late-flowering but southern-adapted varieties have high winter mortality.
We use genotypes derived from recently published exome-capture sequencing, which mitigates challenges related to the large, highly repetitive and polyploid switchgrass genome, to perform genome-wide association studies (GWAS) using flowering time data from a switchgrass association panel in an effort to characterize the genetic architecture and genes underlying flowering time regulation in switchgrass.
We identify associations with flowering time at multiple loci, including in a homolog of FLOWERING LOCUS T and in a locus containing TIMELESS, a homolog of a key circadian regulator in animals.
Our results suggest that flowering time variation in switchgrass is due to variation at many positions across the genome. The relationship of flowering time and geographic origin indicates likely roles for genes in the photoperiod and autonomous pathways in generating switchgrass flowering time variation.