The fitness effects of all possible mutations available to an organism largely shape the dynamics of evolutionary adaptation. Yet, whether and how this adaptive landscape changes over evolutionary ...times, especially upon ecological diversification and changes in community composition, remains poorly understood. We sought to fill this gap by analyzing a stable community of two closely related ecotypes ("L" and "S") shortly after they emerged within the E. coli Long-Term Evolution Experiment (LTEE). We engineered genome-wide barcoded transposon libraries to measure the invasion fitness effects of all possible gene knockouts in the coexisting strains as well as their ancestor, for many different, ecologically relevant conditions. We find consistent statistical patterns of fitness effect variation across both genetic background and community composition, despite the idiosyncratic behavior of individual knockouts. Additionally, fitness effects are correlated with evolutionary outcomes for a number of conditions, possibly revealing shifting patterns of adaptation. Together, our results reveal how ecological and epistatic effects combine to shape the adaptive landscape in a nascent ecological community.
Although most organisms synthesize methionine from homocysteine and methyl folates, some have "core" methionine synthases that lack folate-binding domains and use other methyl donors. In vitro, the ...characterized core synthases use methylcobalamin as a methyl donor, but in vivo, they probably rely on corrinoid (vitamin B12-binding) proteins. We identified four families of core methionine synthases that are distantly related to each other (under 30% pairwise amino acid identity). From the characterized enzymes, we identified the families MesA, which is found in methanogens, and MesB, which is found in anaerobic bacteria and archaea with the Wood-Ljungdahl pathway. A third uncharacterized family, MesC, is found in anaerobic archaea that have the Wood-Ljungdahl pathway and lack known forms of methionine synthase. We predict that most members of the MesB and MesC families accept methyl groups from the iron-sulfur corrinoid protein of that pathway. The fourth family, MesD, is found only in aerobic bacteria. Using transposon mutants and complementation, we show that MesD does not require 5-methyltetrahydrofolate or cobalamin. Instead, MesD requires an uncharacterized protein family (DUF1852) and oxygen for activity.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Economic bioconversion of plant cell wall hydrolysates into fuels and chemicals has been hampered mainly due to the inability of microorganisms to efficiently co-ferment pentose and hexose sugars, ...especially glucose and xylose, which are the most abundant sugars in cellulosic hydrolysates. Saccharomyces cerevisiae cannot metabolize xylose due to a lack of xylose-metabolizing enzymes. We developed a rapid and efficient xylose-fermenting S. cerevisiae through rational and inverse metabolic engineering strategies, comprising the optimization of a heterologous xylose-assimilating pathway and evolutionary engineering. Strong and balanced expression levels of the XYL1, XYL2, and XYL3 genes constituting the xylose-assimilating pathway increased ethanol yields and the xylose consumption rates from a mixture of glucose and xylose with little xylitol accumulation. The engineered strain, however, still exhibited a long lag time when metabolizing xylose above 10 g/l as a sole carbon source, defined here as xylose toxicity. Through serial-subcultures on xylose, we isolated evolved strains which exhibited a shorter lag time and improved xylose-fermenting capabilities than the parental strain. Genome sequencing of the evolved strains revealed that mutations in PHO13 causing loss of the Pho13p function are associated with the improved phenotypes of the evolved strains. Crude extracts of a PHO13-overexpressing strain showed a higher phosphatase activity on xylulose-5-phosphate (X-5-P), suggesting that the dephosphorylation of X-5-P by Pho13p might generate a futile cycle with xylulokinase overexpression. While xylose consumption rates by the evolved strains improved substantially as compared to the parental strain, xylose metabolism was interrupted by accumulated acetate. Deletion of ALD6 coding for acetaldehyde dehydrogenase not only prevented acetate accumulation, but also enabled complete and efficient fermentation of xylose as well as a mixture of glucose and xylose by the evolved strain. These findings provide direct guidance for developing industrial strains to produce cellulosic fuels and chemicals.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
An inability to reliably predict quantitative behaviors for novel combinations of genetic elements limits the rational engineering of biological systems. We developed an expression cassette ...architecture for genetic elements controlling transcription and translation initiation in Escherichia coli: transcription elements encode a common mRNA start, and translation elements use an overlapping genetic motif found in many natural systems. We engineered libraries of constitutive and repressor-regulated promoters along with translation initiation elements following these definitions. We measured activity distributions for each library and selected elements that collectively resulted in expression across a 1,000-fold observed dynamic range. We studied all combinations of curated elements, demonstrating that arbitrary genes are reliably expressed to within twofold relative target expression windows with ∼93% reliability. We expect the genetic element definitions validated here can be collectively expanded to create collections of public-domain standard biological parts that support reliable forward engineering of gene expression at genome scales.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Most cellular processes depend on intracellular locations and random collisions of individual protein molecules. To model these processes, we developed algorithms to simulate the diffusion, membrane ...interactions, and reactions of individual molecules, and implemented these in the Smoldyn program. Compared to the popular MCell and ChemCell simulators, we found that Smoldyn was in many cases more accurate, more computationally efficient, and easier to use. Using Smoldyn, we modeled pheromone response system signaling among yeast cells of opposite mating type. This model showed that secreted Bar1 protease might help a cell identify the fittest mating partner by sharpening the pheromone concentration gradient. This model involved about 200,000 protein molecules, about 7000 cubic microns of volume, and about 75 minutes of simulated time; it took about 10 hours to run. Over the next several years, as faster computers become available, Smoldyn will allow researchers to model and explore systems the size of entire bacterial and smaller eukaryotic cells.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
The inability to predict heterologous gene expression levels precisely hinders our ability to engineer biological systems. Using well-characterized regulatory elements offers a potential solution ...only if such elements behave predictably when combined. We synthesized 12,563 combinations of common promoters and ribosome binding sites and simultaneously measured DNA, RNA, and protein levels from the entire library. Using a simple model, we found that RNA and protein expression were within twofold of expected levels 80% and 64% of the time, respectively. The large dataset allowed quantitation of global effects, such as translation rate on mRNA stability and mRNA secondary structure on translation rate. However, the worst 5% of constructs deviated from prediction by 13-fold on average, which could hinder large-scale genetic engineering projects. The ease and scale this of approach indicates that rather than relying on prediction or standardization, we can screen synthetic libraries for desired behavior.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK
Genome sequencing has revealed an incredible diversity of bacteria and archaea, but there are no fast and convenient tools for browsing across these genomes. It is cumbersome to view the prevalence ...of homologs for a protein of interest, or the gene neighborhoods of those homologs, across the diversity of the prokaryotes. We developed a web-based tool, fast.genomics, that uses two strategies to support fast browsing across the diversity of prokaryotes. First, the database of genomes is split up. The main database contains one representative from each of the 6,377 genera that have a high-quality genome, and additional databases for each taxonomic order contain up to 10 representatives of each species. Second, homologs of proteins of interest are identified quickly by using accelerated searches, usually in a few seconds. Once homologs are identified, fast.genomics can quickly show their prevalence across taxa, view their neighboring genes, or compare the prevalence of two different proteins. Fast.genomics is available at https://fast.genomics.lbl.gov.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Natural lipids can be used to make biodiesel and many other value-added compounds. In this work, we explored a number of different metabolic engineering strategies for increasing lipid production in ...the oleaginous yeast
Rhodosporidium toruloides
IFO0880. These included increasing the expression of enzymes involved in different aspects of lipid biosynthesis—malic enzyme (
ME
), pyruvate carboxylase (
PYC1
), glycerol-3-P dehydrogenase (
GPD
), and stearoyl-CoA desaturase (
SCD
)—and deleting the gene
PEX10
, required for peroxisome biogenesis. Only malic enzyme and stearoyl-CoA desaturase, when overexpressed, were found to significantly increase lipid production. Only stearoyl-CoA desaturase, when overexpressed, further increased lipid production in a strain previously engineered to overexpress acetyl-CoA carboxylase (
ACC1
) and diacylglycerol acyltransferase (
DGA1
). Our best strain produced 27.4 g/L lipid with an average productivity of 0.31 g/L/h during batch growth on glucose and 89.4 g/L lipid with an average productivity of 0.61 g/L/h during fed-batch growth on glucose. These results further establish
R. toruloides
as a platform organism for the production of lipids and potentially other lipid-derived compounds from sugars.
Full text
Available for:
CEKLJ, DOBA, EMUNI, FZAB, GEOZS, IJS, IMTLJ, IZUM, KILJ, KISLJ, MFDPS, NUK, OILJ, PILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UILJ, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
19.
The essential gene set of a photosynthetic organism Rubin, Benjamin E.; Wetmore, Kelly M.; Price, Morgan N. ...
Proceedings of the National Academy of Sciences - PNAS,
12/2015, Volume:
112, Issue:
48
Journal Article
Peer reviewed
Open access
Synechococcus elongatusPCC 7942 is a model organism used for studying photosynthesis and the circadian clock, and it is being developed for the production of fuel, industrial chemicals, and ...pharmaceuticals. To identify a comprehensive set of genes and intergenic regions that impacts fitness inS. elongatus,we created a pooled library of ∼250,000 transposon mutants and used sequencing to identify the insertion locations. By analyzing the distribution and survival of these mutants, we identified 718 of the organism’s 2,723 genes as essential for survival under laboratory conditions. The validity of the essential gene set is supported by its tight overlap with well-conserved genes and its enrichment for core biological processes. The differences noted between our dataset and these predictors of essentiality, however, have led to surprising biological insights. One such finding is that genes in a large portion of the TCA cycle are dispensable, suggesting thatS. elongatusdoes not require a cyclic TCA process. Furthermore, the density of the transposon mutant library enabled individual and global statements about the essentiality of noncoding RNAs, regulatory elements, and other intergenic regions. In this way, a group I intron located in tRNALeu, which has been used extensively for phylogenetic studies, was shown here to be essential for the survival ofS. elongatus.Our survey of essentiality for every locus in theS. elongatusgenome serves as a powerful resource for understanding the organism’s physiology and defines the essential gene set required for the growth of a photosynthetic organism.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK
Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every ...sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with any transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative d-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling.
A large challenge in microbiology is the functional assessment of the millions of uncharacterized genes identified by genome sequencing. Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach to assign phenotypes and functions to genes. However, the current strategies for TnSeq are too laborious to be applied to hundreds of experimental conditions across multiple bacteria. Here, we describe an approach, random bar code transposon-site sequencing (RB-TnSeq), which greatly simplifies the measurement of gene fitness by using bar code sequencing (BarSeq) to monitor the abundance of mutants. We performed 387 genome-wide fitness assays across five bacteria and identified phenotypes for over 5,000 genes. RB-TnSeq can be applied to diverse bacteria and is a powerful tool to annotate uncharacterized genes using phenotype data.