Progress in synthetic biology tools has transformed the way we engineer living cells. Applications of circuit design have reached a new level, offering solutions for metabolic engineering challenges ...that include developing screening approaches for libraries of pathway variants. The use of transcription‐factor‐based biosensors for screening has shown promising results, but the quantitative relationship between the sensors and the sensed molecules still needs more rational understanding. Herein, we have successfully developed a novel biosensor to detect pinocembrin based on a transcriptional regulator. The FdeR transcription factor (TF), known to respond to naringenin, was combined with a fluorescent reporter protein. By varying the copy number of its plasmid and the concentration of the biosensor TF through a combinatorial library, different responses have been recorded and modeled. The fitted model provides a tool to understand the impact of these parameters on the biosensor behavior in terms of dose–response and time curves and offers guidelines to build constructs oriented to increased sensitivity and or ability of linear detection at higher titers. Our model, the first to explicitly take into account the impact of plasmid copy number on biosensor sensitivity using Hill‐based formalism, is able to explain uncharacterized systems without extensive knowledge of the properties of the TF. Moreover, it can be used to model the response of the biosensor to different compounds (here naringenin and pinocembrin) with minimal parameter refitting.
Quantitative analysis of hPSC differentiation systems to identify bottlenecks to differentiation efficiency is described. Selekman and coworkers have compartmentalized an epithelial hPSC differentiation system by identifying stable cell states and distinguishing these states using different marker proteins. By fitting an ODE‐based model to data representing dynamics of these cell states, the authors have estimated the rates of various cell fate decisions and determined which decisions are potentially limiting to the differentiation process. A novel strategy combining directed evolution and promoter engineering for rapid and efficient multigene pathway optimization was developed in this study, which provides potential applications in balancing and increasing the flux through an engineered heterologous pathway in a target organism to achieve high yield and productivity. By using this strategy, an industrial Saccharomyces cerevisiae strain capable of efficiently utilizing cellobiose was obtained. Synthetic interspecies communication between Gram‐negative Escherichia coli and Gram‐positive Bacillus megaterium was established using components from the Staphylococcus aureus agr quorum‐sensing system. Here, Marchand and Collins show that this communication system, which uses a novel peptide signal molecule, allows E. coli to control B. megaterium gene expression during coculture.
Cell-free transcription-translation systems have great potential for biosensing, yet the range of detectable chemicals is limited. Here we provide a workflow to expand the range of molecules ...detectable by cell-free biosensors through combining synthetic metabolic cascades with transcription factor-based networks. These hybrid cell-free biosensors have a fast response time, strong signal response, and a high dynamic range. In addition, they are capable of functioning in a variety of complex media, including commercial beverages and human urine, in which they can be used to detect clinically relevant concentrations of small molecules. This work provides a foundation to engineer modular cell-free biosensors tailored for many applications.
Synthetic biology applied to industrial biotechnology is transforming the way we produce chemicals. However, despite advances in the scale and scope of metabolic engineering, the research and ...development process still remains costly. In order to expand the chemical repertoire for the production of next generation compounds, a major engineering biology effort is required in the development of novel design tools that target chemical diversity through rapid and predictable protocols. Addressing that goal involves retrosynthesis approaches that explore the chemical biosynthetic space. However, the complexity associated with the large combinatorial retrosynthesis design space has often been recognized as the main challenge hindering the approach. Here, we provide RetroPath2.0, an automated open source workflow for retrosynthesis based on generalized reaction rules that perform the retrosynthesis search from chassis to target through an efficient and well-controlled protocol. Its easiness of use and the versatility of its applications make this tool a valuable addition to the biological engineer bench desk. We show through several examples the application of the workflow to biotechnological relevant problems, including the identification of alternative biosynthetic routes through enzyme promiscuity or the development of biosensors. We demonstrate in that way the ability of the workflow to streamline retrosynthesis pathway design and its major role in reshaping the design, build, test and learn pipeline by driving the process toward the objective of optimizing bioproduction. The RetroPath2.0 workflow is built using tools developed by the bioinformatics and cheminformatics community, because it is open source we anticipate community contributions will likely expand further the features of the workflow.
Display omitted
•State-of-the-art Computer-Aided Design retrosynthesis solutions lack open source tools and easiness of use.•We propose RetroPath2.0 a modular and open-source workflow to perform retrosynthesis.•RetroPath2.0 computes reaction network between Source and Sink sets of compounds.•RetroPath2.0 is distributed as a KNIME workflow for desktop computers.•RetroPath2.0 is ready-for-use and distributed with reaction rules.
Lysate-based cell-free systems have become a major platform to study gene expression but batch-to-batch variation makes protein production difficult to predict. Here we describe an active learning ...approach to explore a combinatorial space of ~4,000,000 cell-free buffer compositions, maximizing protein production and identifying critical parameters involved in cell-free productivity. We also provide a one-step-method to achieve high quality predictions for protein production using minimal experimental effort regardless of the lysate quality.
Among the main learning methods reviewed in this study and used in synthetic biology and metabolic engineering are supervised learning, reinforcement and active learning, and in vitro or in vivo ...learning.
In the context of biosynthesis, supervised machine learning is being exploited to predict biological sequence activities, predict structures and engineer sequences, and optimize culture conditions.
Active and reinforcement learning methods use training sets acquired through an iterative process generally involving experimental measurements. They are applied to design, engineer, and optimize metabolic pathways and bioprocesses.
The nascent but promising developments with in vitro and in vivo learning comprise molecular circuits performing simple tasks such as pattern recognition and classification.
Constraint-based metabolic models have been used for decades to predict the phenotype of microorganisms in different environments. However, quantitative predictions are limited unless labor-intensive ...measurements of media uptake fluxes are performed. We show how hybrid neural-mechanistic models can serve as an architecture for machine learning providing a way to improve phenotype predictions. We illustrate our hybrid models with growth rate predictions of Escherichia coli and Pseudomonas putida grown in different media and with phenotype predictions of gene knocked-out Escherichia coli mutants. Our neural-mechanistic models systematically outperform constraint-based models and require training set sizes orders of magnitude smaller than classical machine learning methods. Our hybrid approach opens a doorway to enhancing constraint-based modeling: instead of constraining mechanistic models with additional experimental measurements, our hybrid models grasp the power of machine learning while fulfilling mechanistic constrains, thus saving time and resources in typical systems biology or biological engineering projects.
Abstract
RetroRules is a database of reaction rules for metabolic engineering (https://retrorules.org). Reaction rules are generic descriptions of chemical reactions that can be used in ...retrosynthesis workflows in order to enumerate all possible biosynthetic routes connecting a target molecule to its precursors. The use of such rules is becoming increasingly important in the context of synthetic biology applied to de novo pathway discovery and in systems biology to discover underground metabolism due to enzyme promiscuity. Here, we provide for the first time a complete set containing >400 000 stereochemistry-aware reaction rules extracted from public databases and expressed in the community-standard SMARTS (SMIRKS) format, augmented by a rule representation at different levels of specificity (the atomic environment around the reaction center). Such numerous representations of reactions expand natural chemical diversity by predicting de novo reactions of promiscuous enzymes.
Metabolic engineering aims to produce chemicals of interest from living organisms, to advance toward greener chemistry. Despite efforts, the research and development process is still long and costly, ...and efficient computational design tools are required to explore the chemical biosynthetic space. Here, we propose to explore the bioretrosynthesis space using an artificial intelligence based approach relying on the Monte Carlo Tree Search reinforcement learning method, guided by chemical similarity. We implement this method in RetroPath RL, an open-source and modular command line tool. We validate it on a golden data set of 20 manually curated experimental pathways as well as on a larger data set of 152 successful metabolic engineering projects. Moreover, we provide a novel feature that suggests potential media supplements to complement the enzymatic synthesis plan.
Synthetic biological circuits are promising tools for developing sophisticated systems for medical, industrial, and environmental applications. So far, circuit implementations commonly rely on gene ...expression regulation for information processing using digital logic. Here, we present a different approach for biological computation through metabolic circuits designed by computer-aided tools, implemented in both whole-cell and cell-free systems. We first combine metabolic transducers to build an analog adder, a device that sums up the concentrations of multiple input metabolites. Next, we build a weighted adder where the contributions of the different metabolites to the sum can be adjusted. Using a computational model fitted on experimental data, we finally implement two four-input perceptrons for desired binary classification of metabolite combinations by applying model-predicted weights to the metabolic perceptron. The perceptron-mediated neural computing introduced here lays the groundwork for more advanced metabolic circuits for rapid and scalable multiplex sensing.
Highlights • Transcription Factors represent a starting material of choice to build new biosensors. • Evolution and rational design are making advances toward tailor-made TFs. • Metabolism indirectly ...expands the scope of chemicals that can be detected.