Integrating the governing chemistry with the genomics and phenotypes of microbial colonies has been a “holy grail” in microbiology. This work describes a highly sensitive, broadly applicable, and ...cost-effective approach that allows metabolic profiling of live microbial colonies directly from a Petri dish without any sample preparation. Nanospray desorption electrospray ionization mass spectrometry (MS), combined with alignment of MS data and molecular networking, enabled monitoring of metabolite production from live microbial colonies from diverse bacterial genera, including Bacillus subtilis, Streptomyces coelicolor, Mycobacterium smegmatis , and Pseudomonas aeruginosa . This work demonstrates that, by using these tools to visualize small molecular changes within bacterial interactions, insights can be gained into bacterial developmental processes as a result of the improved organization of MS/MS data. To validate this experimental platform, metabolic profiling was performed on Pseudomonas sp. SH-C52, which protects sugar beet plants from infections by specific soil-borne fungi R. Mendes et al. (2011) Science 332:1097–1100. The antifungal effect of strain SH-C52 was attributed to thanamycin, a predicted lipopeptide encoded by a nonribosomal peptide synthetase gene cluster. Our technology, in combination with our recently developed peptidogenomics strategy, enabled the detection and partial characterization of thanamycin and showed that it is a monochlorinated lipopeptide that belongs to the syringomycin family of antifungal agents. In conclusion, the platform presented here provides a significant advancement in our ability to understand the spatiotemporal dynamics of metabolite production in live microbial colonies and communities.
The plant kingdom contains vastly untapped natural product chemistry, which has been traditionally explored through the activity-guided approach. Here, we describe a gene-guided approach to discover ...and engineer a class of plant ribosomal peptides, the branched cyclic lyciumins. Initially isolated from the Chinese wolfberry Lycium barbarum, lyciumins are protease-inhibiting peptides featuring an N-terminal pyroglutamate and a macrocyclic bond between a tryptophan-indole nitrogen and a glycine α-carbon. We report the identification of a lyciumin precursor gene from L. barbarum, which encodes a BURP domain and repetitive lyciumin precursor peptide motifs. Genome mining enabled by this initial finding revealed rich lyciumin genotypes and chemotypes widespread in flowering plants. We establish a biosynthetic framework of lyciumins and demonstrate the feasibility of producing diverse natural and unnatural lyciumins in transgenic tobacco. With rapidly expanding plant genome resources, our approach will complement bioactivity-guided approaches to unlock and engineer hidden plant peptide chemistry for pharmaceutical and agrochemical applications.
Recent developments in next-generation sequencing technologies have brought recognition of microbial genomes as a rich resource for novel natural product discovery. However, owing to the scarcity of ...efficient procedures to connect genes to molecules, only a small fraction of secondary metabolomes have been investigated to date. Transformation-associated recombination (TAR) cloning takes advantage of the natural in vivo homologous recombination of Saccharomyces cerevisiae to directly capture large genomic loci. Here we report a TAR-based genetic platform that allows us to directly clone, refactor, and heterologously express a silent biosynthetic pathway to yield a new antibiotic. With this method, which involves regulatory gene remodeling, we successfully expressed a 67-kb nonribosomal peptide synthetase biosynthetic gene cluster from the marine actinomycete Saccharomonospora sp. CNQ-490 and produced the dichlorinated lipopeptide antibiotic taromycin A in the model expression host Streptomyces coelicolor. The taromycin gene cluster (tar) is highly similar to the clinically approved antibiotic daptomycin from Streptomyces roseosporus, but has notable structural differences in three amino acid residues and the lipid side chain. With the activation of the tar gene cluster and production of taromycin A, this study highlights a unique "plug-and-play" approach to efficiently gaining access to orphan pathways that may open avenues for novel natural product discoveries and drug development.
Polybrominated diphenyl ethers (PBDEs) and polybrominated bipyrroles are natural products that bioaccumulate in the marine food chain. PBDEs have attracted widespread attention because of their ...persistence in the environment and potential toxicity to humans. However, the natural origins of PBDE biosynthesis are not known. Here we report marine bacteria as producers of PBDEs and establish a genetic and molecular foundation for their production that unifies paradigms for the elaboration of bromophenols and bromopyrroles abundant in marine biota. We provide biochemical evidence of marine brominases revealing decarboxylative-halogenation enzymology previously unknown among halogenating enzymes. Biosynthetic motifs discovered in our study were used to mine sequence databases to discover unrealized marine bacterial producers of organobromine compounds.
Peptide natural products show broad biological properties and are commonly produced by orthogonal ribosomal and nonribosomal pathways in prokaryotes and eukaryotes. To harvest this large and diverse ...resource of bioactive molecules, we introduce here natural product peptidogenomics (NPP), a new MS-guided genome-mining method that connects the chemotypes of peptide natural products to their biosynthetic gene clusters by iteratively matching de novo tandem MS (MS(n)) structures to genomics-based structures following biosynthetic logic. In this study, we show that NPP enabled the rapid characterization of over ten chemically diverse ribosomal and nonribosomal peptide natural products of previously unidentified composition from Streptomycete bacteria as a proof of concept to begin automating the genome-mining process. We show the identification of lantipeptides, lasso peptides, linardins, formylated peptides and lipopeptides, many of which are from well-characterized model Streptomycetes, highlighting the power of NPP in the discovery of new peptide natural products from even intensely studied organisms.
Characterization of complex natural product mixtures to the absolute structural level of their components often requires significant amounts of starting materials and lengthy purification process, ...followed by arduous structure elucidation efforts. The crystalline sponge (CS) method has demonstrated utility in the absolute structure elucidation of isolated organic compounds at miniscule quantities compared to conventional methods. In this work, we developed a new CS‐based workflow that greatly expedites the in‐depth structural analysis of crude natural product extracts. Using a crude extract of the red alga Laurencia pacifica, we showed that CS affinity screening prior to compound isolation enables prioritization of analytes present in the extract, and we subsequently resolved the molecular structures of six sesquiterpenes with stereochemical clarity from around 10 mg crude extract. This study demonstrates a new chemotyping workflow that can greatly accelerate natural product discovery from complex samples.
Crystal clear: A crystalline sponge (CS)‐based method was developed for the prioritization of target analytes present in a methanolic crude extract from a red alga. Subsequent analysis by the CS method enabled the clarification of the structures of six sesquiterpenoid natural products, starting from only around 10 mg of the crude extract.
The ability to correlate the production of specialized metabolites to the genetic capacity of the organism that produces such molecules has become an invaluable tool in aiding the discovery of ...biotechnologically applicable molecules. Here, we accomplish this task by matching molecular families with gene cluster families, making these correlations to 60 microbes at one time instead of connecting one molecule to one organism at a time, such as how it is traditionally done. We can correlate these families through the use of nanospray desorption electrospray ionization MS/MS, an ambient pressure MS technique, in conjunction with MS/MS networking and peptidogenomics. We matched the molecular families of peptide natural products produced by 42 bacilli and 18 pseudomonads through the generation of amino acid sequence tags from MS/MS data of specific clusters found in the MS/MS network. These sequence tags were then linked to biosynthetic gene clusters in publicly accessible genomes, providing us with the ability to link particular molecules with the genes that produced them. As an example of its use, this approach was applied to two unsequenced Pseudoalteromonas species, leading to the discovery of the gene cluster for a molecular family, the bromoalterochromides, in the previously sequenced strain P. piscicida JCM 20779 ᵀ. The approach itself is not limited to 60 related strains, because spectral networking can be readily adopted to look at molecular family–gene cluster families of hundreds or more diverse organisms in one single MS/MS network.
Many bioactive plant cyclic peptides form side-chain-derived macrocycles. Lyciumins, cyclic plant peptides with tryptophan macrocyclizations, are ribosomal peptides (RiPPs) originating from ...repetitive core peptide motifs in precursor peptides with plant-specific BURP (BNM2, USP, RD22 and PG1beta) domains, but the biosynthetic mechanism for their formation has remained unknown. Here, we characterize precursor-peptide BURP domains as copper-dependent autocatalytic peptide cyclases and use a combination of tandem mass spectrometry-based metabolomics and plant genomics to systematically discover five BURP-domain-derived plant RiPP classes, with mono- and bicyclic structures formed via tryptophans and tyrosines, from botanical collections. As BURP-domain cyclases are scaffold-generating enzymes in plant specialized metabolism that are physically connected to their substrates in the same polypeptide, we introduce a bioinformatic method to mine plant genomes for precursor-peptide-encoding genes by detection of repetitive substrate domains and known core peptide features. Our study sets the stage for chemical, biosynthetic and biological exploration of plant RiPP natural products from BURP-domain cyclases.
Copper is an important transition metal cofactor in plant metabolism, which enables diverse biocatalysis in aerobic environments. Multiple classes of plant metalloenzymes evolved and underwent ...genetic expansions during the evolution of terrestrial plants and, to date, several representatives of these copper enzyme classes have characterized mechanisms. In this review, we give an updated overview of chemistry, structure, mechanism, function and phylogenetic distribution of plant copper metalloenzymes with an emphasis on biosynthesis of aromatic compounds such as phenylpropanoids (lignin, lignan, flavonoids) and cyclic peptides with macrocyclizations via aromatic amino acids. We also review a recent addition to plant copper enzymology in a copper-dependent peptide cyclase called the BURP domain. Given growing plant genetic resources, a large pool of copper biocatalysts remains to be characterized from plants as plant genomes contain on average more than 70 copper enzyme genes. A major challenge in characterization of copper biocatalysts from plant genomes is the identification of endogenous substrates and catalyzed reactions. We highlight some recent and future trends in filling these knowledge gaps in plant metabolism and the potential for genomic discovery of copper-based enzymology from plants.
Ribosomally synthesized and posttranslationally modified peptides (RiPPs), especially from microbial sources, are a large group of bioactive natural products that are a promising source of new ...(bio)chemistry and bioactivity. In light of exponentially increasing microbial genome databases and improved mass spectrometry (MS)-based metabolomic platforms, there is a need for computational tools that connect natural product genotypes predicted from microbial genome sequences with their corresponding chemotypes from metabolomic data sets. Here, we introduce RiPPquest, a tandem mass spectrometry database search tool for identification of microbial RiPPs, and apply it to lanthipeptide discovery. RiPPquest uses genomics to limit search space to the vicinity of RiPP biosynthetic genes and proteomics to analyze extensive peptide modifications and compute p-values of peptide-spectrum matches (PSMs). We highlight RiPPquest by connecting multiple RiPPs from extracts of Streptomyces to their gene clusters and by the discovery of a new class III lanthipeptide, informatipeptin, from Streptomyces viridochromogenes DSM 40736 to reflect that it is a natural product that was discovered by mass spectrometry based genome mining using algorithmic tools rather than manual inspection of mass spectrometry data and genetic information. The presented tool is available at cyclo.ucsd.edu.