Small genes (<150 nucleotides) have been systematically overlooked in phage genomes. We employ a large-scale comparative genomics approach to predict >40,000 small-gene families in ∼2.3 million phage ...genome contigs. We find that small genes in phage genomes are approximately 3-fold more prevalent than in host prokaryotic genomes. Our approach enriches for small genes that are translated in microbiomes, suggesting the small genes identified are coding. More than 9,000 families encode potentially secreted or transmembrane proteins, more than 5,000 families encode predicted anti-CRISPR proteins, and more than 500 families encode predicted antimicrobial proteins. By combining homology and genomic-neighborhood analyses, we reveal substantial novelty and diversity within phage biology, including small phage genes found in multiple host phyla, small genes encoding proteins that play essential roles in host infection, and small genes that share genomic neighborhoods and whose encoded proteins may share related functions.
Display omitted
•More than 40,000 small gene families predicted in phages from diverse environments•More than 5,000 small gene families predicted to encode anti-CRISPR proteins•More than 9,000 small gene families predicted to encode secreted or transmembrane proteins•Identified novel core phage proteins like baseplate proteins and phage tail proteins
Fremin et al. use comparative genomics to predict more than 40,000 small-gene families in phage from diverse environments. Small genes are approximately 3-fold more prevalent in phage than prokaryotic genomes. This resource includes more than 5,000 anti-CRISPR small-gene families and more than 9,000 secreted or transmembrane small-gene families.
Microbial production of fuels and commodity chemicals has been performed primarily using natural or slightly modified enzymes, which inherently limits the types of molecules that can be produced. ...Type I modular polyketide synthases (PKSs) are multi-domain enzymes that can produce unique and diverse molecular structures by combining particular types of catalytic domains in a specific order. This catalytic mechanism offers a wealth of engineering opportunities. Here we report engineered microbes that produce various short-chain (C5-C7) ketones using hybrid PKSs. Introduction of the genes into the chromosome of Streptomyces albus enables it to produce >1 g · l
of C6 and C7 ethyl ketones and several hundred mg · l
of C5 and C6 methyl ketones from plant biomass hydrolysates. Engine tests indicate these short-chain ketones can be added to gasoline as oxygenates to increase the octane of gasoline. Together, it demonstrates the efficient and renewable microbial production of biogasolines by hybrid enzymes.
Lignin valorization offers significant potential to enhance the economic viability of lignocellulosic biorefineries. However, because of its heterogeneous and recalcitrant nature, conversion of ...lignin to value-added coproducts remains a considerable technical challenge. In this study, we employ base-catalyzed depolymerization (BCD) using a process-relevant solid lignin stream produced via deacetylation, mechanical refining, and enzymatic hydrolysis to enable biological lignin conversion. BCD was conducted with the solid lignin substrate over a range of temperatures at two NaOH concentrations, and the results demonstrate that the lignin can be partially extracted and saponified at temperatures as low as 60 °C. At 120 °C and 2% NaOH, the high extent of lignin solubility was accompanied by a considerable decrease in the lignin average molecular weight and the release of lignin-derived monomers including hydroxycinnamic acids. BCD liquors were tested for microbial growth using seven aromatic-catabolizing bacteria and two yeasts. Three organisms (Pseudomonas putida KT2440, Rhodotorula mucilaginosa, and Corynebacterium glutamicum) tolerate high BCD liquor concentrations (up to 90% v/v) and rapidly consume the main lignin-derived monomers, resulting in lignin conversion of up to 15%. Furthermore, as a proof of concept, muconic acid production from a representative lignin BCD liquor was demonstrated with an engineered P. putida KT2440 strain. These results highlight the potential for a mild lignin depolymerization process to enhance the microbial conversion of solid lignin-rich biorefinery streams.
Multidimensional measurements using state-of-the-art separations and mass spectrometry provide advantages in untargeted metabolomics analyses for studying biological and environmental bio-chemical ...processes. However, the lack of rapid analytical methods and robust algorithms for these heterogeneous data has limited its application. Here, we develop and evaluate a sensitive and high-throughput analytical and computational workflow to enable accurate metabolite profiling. Our workflow combines liquid chromatography, ion mobility spectrometry and data-independent acquisition mass spectrometry with PeakDecoder, a machine learning-based algorithm that learns to distinguish true co-elution and co-mobility from raw data and calculates metabolite identification error rates. We apply PeakDecoder for metabolite profiling of various engineered strains of Aspergillus pseudoterreus, Aspergillus niger, Pseudomonas putida and Rhodosporidium toruloides. Results, validated manually and against selected reaction monitoring and gas-chromatography platforms, show that 2683 features could be confidently annotated and quantified across 116 microbial sample runs using a library built from 64 standards.
Antibodies with ontogenies from VH1-2 or VH1-46-germline genes dominate the broadly neutralizing response against the CD4-binding site (CD4bs) on HIV-1. Here, we define with longitudinal sampling ...from time-of-infection the development of a VH1-46-derived antibody lineage that matured to neutralize 90% of HIV-1 isolates. Structures of lineage antibodies CH235 (week 41 from time-of-infection, 18% breadth), CH235.9 (week 152, 77%), and CH235.12 (week 323, 90%) demonstrated the maturing epitope to focus on the conformationally invariant portion of the CD4bs. Similarities between CH235 lineage and five unrelated CD4bs lineages in epitope focusing, length-of-time to develop breadth, and extraordinary level of somatic hypermutation suggested commonalities in maturation among all CD4bs antibodies. Fortunately, the required CH235-lineage hypermutation appeared substantially guided by the intrinsic mutability of the VH1-46 gene, which closely resembled VH1-2. We integrated our CH235-lineage findings with a second broadly neutralizing lineage and HIV-1 co-evolution to suggest a vaccination strategy for inducing both lineages.
Display omitted
•Developmental pathway of a CD4-mimicking antibody lineage from time of infection•Maturation involves focused recognition of CD4-binding site of vulnerability•Intrinsic VH1-46 gene mutability guides somatic mutation of CD4-binding site bnAbs•Study provided a strategy for inducing both CD4-mimicking and CDR H3-using bnAbs
The complete life history of an HIV-1 broadly neutralizing antibody lineage, from the time of infection through development of its full potency and breadth, reveals key features that are common between HIV-1 antibodies from unrelated donors and that can be exploited to induce similar antibodies with vaccines.
Thermophilic bacteria are a potential source of enzymes for the deconstruction of lignocellulosic biomass. However, the complement of proteins used to deconstruct biomass and the specific roles of ...different microbial groups in thermophilic biomass deconstruction are not well-explored. Here we report on the metagenomic and proteogenomic analyses of a compost-derived bacterial consortium adapted to switchgrass at elevated temperature with high levels of glycoside hydrolase activities. Near-complete genomes were reconstructed for the most abundant populations, which included composite genomes for populations closely related to sequenced strains of Thermus thermophilus and Rhodothermus marinus, and for novel populations that are related to thermophilic Paenibacilli and an uncultivated subdivision of the little-studied Gemmatimonadetes phylum. Partial genomes were also reconstructed for a number of lower abundance thermophilic Chloroflexi populations. Identification of genes for lignocellulose processing and metabolic reconstructions suggested Rhodothermus, Paenibacillus and Gemmatimonadetes as key groups for deconstructing biomass, and Thermus as a group that may primarily metabolize low molecular weight compounds. Mass spectrometry-based proteomic analysis of the consortium was used to identify >3000 proteins in fractionated samples from the cultures, and confirmed the importance of Paenibacillus and Gemmatimonadetes to biomass deconstruction. These studies also indicate that there are unexplored proteins with important roles in bacterial lignocellulose deconstruction.
Plants and microbes share common metabolic pathways for producing a range of bioproducts that are potentially foundational to the future bioeconomy. However,
in planta
accumulation and microbial ...production of bioproducts have never been systematically compared on an economic basis to identify optimal routes of production. A detailed technoeconomic analysis of four exemplar compounds (4-hydroxybenzoic acid 4-HBA, catechol, muconic acid, and 2-pyrone-4,6-dicarboxylic acid PDC) is conducted with the highest reported yields and accumulation rates to identify economically advantaged platforms and breakeven targets for plants and microbes. The results indicate that
in planta
mass accumulation ranging from 0.1 to 0.3 dry weight % (dwt%) can achieve costs comparable to microbial routes operating at 40 to 55% of maximum theoretical yields. These yields and accumulation rates are sufficient to be cost competitive if the products are sold at market prices consistent with specialty chemicals ($20 to $50/kg). Prices consistent with commodity chemicals will require an order-of-magnitude-greater accumulation rate for plants and/or yields nearing theoretical maxima for microbial production platforms. This comparative analysis revealed that the demonstrated accumulation rates of 4-HBA (3.2 dwt%) and PDC (3.0 dwt%) in engineered plants vastly outperform microbial routes, even if microbial platforms were to reach theoretical maximum yields. Their recovery and sale as part of a lignocellulosic biorefinery could enable biofuel prices to be competitive with petroleum. Muconic acid and catechol, in contrast, are currently more attractive when produced microbially using a sugar feedstock. Ultimately, both platforms can play an important role in replacing fossil-derived products.