The fitness effects of all possible mutations available to an organism largely shape the dynamics of evolutionary adaptation. Yet, whether and how this adaptive landscape changes over evolutionary times, especially upon ecological diversification and changes in community composition, remains poorly understood. We sought to fill this gap by analyzing a stable community of two closely related ecotypes ("L" and "S") shortly after they emerged within the E. coli Long-Term Evolution Experiment (LTEE). We engineered genome-wide barcoded transposon libraries to measure the invasion fitness effects of all possible gene knockouts in the coexisting strains as well as their ancestor, for many different, ecologically relevant conditions. We find consistent statistical patterns of fitness effect variation across both genetic background and community composition, despite the idiosyncratic behavior of individual knockouts. Additionally, fitness effects are correlated with evolutionary outcomes for a number of conditions, possibly revealing shifting patterns of adaptation. Together, our results reveal how ecological and epistatic effects combine to shape the adaptive landscape in a nascent ecological community.
Metagenomics facilitates the study of the genetic information from uncultured microbes and complex microbial communities. Assembling complete genomes from metagenomics data is difficult because most samples have high organismal complexity and strain diversity. Some studies have attempted to extract complete bacterial, archaeal, and viral genomes, often focusing on species with circular genomes so that circularity can help confirm completeness. However, fewer than 100 circularized bacterial and archaeal genomes have been assembled and published from metagenomics data despite the thousands of datasets that are available. Circularized genomes are important for (1) building a reference collection as scaffolds for future assemblies, (2) providing complete gene content of a genome, (3) confirming little or no contamination of a genome, (4) studying the genomic context and synteny of genes, and (5) linking protein coding genes to ribosomal RNA genes to aid metabolic inference in 16S rRNA gene sequencing studies. We developed a semi-automated method called Jorg to help circularize small bacterial, archaeal, and viral genomes using iterative assembly, binning, and read mapping. In addition, this method exposes potential misassemblies from k-mer based assemblies. We chose species of the Candidate Phyla Radiation (CPR) to focus our initial efforts because they have small genomes and are only known to have one ribosomal RNA operon. In addition to 34 circular CPR genomes, we present one circular Margulisbacteria genome, one circular Chloroflexi genome, and two circular megaphage genomes from 19 public and published datasets. We demonstrate findings that would likely be difficult without circularizing genomes, including that ribosomal genes are likely not operonic in the majority of CPR, and that some CPR harbor diverged forms of RNase P RNA.
Code and a tutorial for this method are available at https://github.com/lmlui/Jorg and on the DOE Systems Biology KnowledgeBase as a beta app.
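One common signal that a contig represents a circular genome is that its two ends carry the same sequence, so that the end of the assembly runs back into its start. The following is a minimal sketch of that check only, not Jorg's actual implementation; the function names `end_overlap` and `trim_circular` and the `min_len` threshold are hypothetical, and a real workflow would also need read mapping to confirm the junction.

```python
def end_overlap(contig: str, min_len: int = 20) -> int:
    """Return the length of the longest prefix of `contig` that also
    appears as its suffix, or 0 if none reaches `min_len` bases."""
    max_len = len(contig) // 2  # ends longer than half the contig are not considered
    for k in range(max_len, min_len - 1, -1):
        if contig[:k] == contig[-k:]:
            return k
    return 0


def trim_circular(contig: str, min_len: int = 20) -> str:
    """Drop the duplicated end of a contig whose ends overlap.

    Contigs without a qualifying overlap are returned unchanged.
    """
    k = end_overlap(contig, min_len)
    return contig[:-k] if k else contig
```

Exact string matching keeps the sketch short; tolerating sequencing errors in the overlap would require an alignment step instead.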
Bacteriophages (phages) are critical players in the dynamics and function of microbial communities and drive processes as diverse as global biogeochemical cycles and human health. Phages tend to be predators finely tuned to attack specific hosts, even down to the strain level, which in turn defend themselves using an array of mechanisms. However, to date, efforts to rapidly and comprehensively identify bacterial host factors important in phage infection and resistance have yet to be fully realized. Here, we globally map the host genetic determinants involved in resistance to 14 phylogenetically diverse double-stranded DNA phages using two model Escherichia coli strains (K-12 and BL21) with known sequence divergence to demonstrate strain-specific differences. Using genome-wide loss-of-function and gain-of-function genetic technologies, we are able to confirm previously described phage receptors as well as uncover a number of previously unknown host factors that confer resistance to one or more of these phages. We uncover differences in resistance factors that strongly align with the susceptibility of K-12 and BL21 to specific phage. We also identify both phage-specific mechanisms, such as the unexpected role of cyclic-di-GMP in host sensitivity to phage N4, and more generic defenses, such as the overproduction of colanic acid capsular polysaccharide that defends against a wide array of phages. Our results indicate that host responses to phages can occur via diverse cellular mechanisms. Our systematic and high-throughput genetic workflow to characterize phage-host interaction determinants can be extended to diverse bacteria to generate datasets that allow predictive models of how phage-mediated selection will shape bacterial phenotype and evolution.
The results of this study and future efforts to map the phage resistance landscape will lead to new insights into the coevolution of hosts and their phage, which can ultimately be used to design better phage therapeutic treatments and tools for precision microbiome engineering.
Economic bioconversion of plant cell wall hydrolysates into fuels and chemicals has been hampered mainly by the inability of microorganisms to efficiently co-ferment pentose and hexose sugars, especially glucose and xylose, which are the most abundant sugars in cellulosic hydrolysates. Saccharomyces cerevisiae cannot metabolize xylose due to a lack of xylose-metabolizing enzymes. We developed a rapid and efficient xylose-fermenting S. cerevisiae through rational and inverse metabolic engineering strategies, comprising the optimization of a heterologous xylose-assimilating pathway and evolutionary engineering. Strong and balanced expression levels of the XYL1, XYL2, and XYL3 genes constituting the xylose-assimilating pathway increased ethanol yields and the xylose consumption rates from a mixture of glucose and xylose with little xylitol accumulation. The engineered strain, however, still exhibited a long lag time when metabolizing xylose above 10 g/l as a sole carbon source, defined here as xylose toxicity. Through serial subcultures on xylose, we isolated evolved strains that exhibited a shorter lag time and better xylose-fermenting capabilities than the parental strain. Genome sequencing of the evolved strains revealed that mutations in PHO13 causing loss of the Pho13p function are associated with the improved phenotypes of the evolved strains. Crude extracts of a PHO13-overexpressing strain showed a higher phosphatase activity on xylulose-5-phosphate (X-5-P), suggesting that the dephosphorylation of X-5-P by Pho13p might generate a futile cycle with xylulokinase overexpression. While xylose consumption rates by the evolved strains improved substantially as compared to the parental strain, xylose metabolism was interrupted by accumulated acetate.
Deletion of ALD6 coding for acetaldehyde dehydrogenase not only prevented acetate accumulation, but also enabled complete and efficient fermentation of xylose as well as a mixture of glucose and xylose by the evolved strain. These findings provide direct guidance for developing industrial strains to produce cellulosic fuels and chemicals.
The essential gene set of a photosynthetic organism
Rubin, Benjamin E.; Wetmore, Kelly M.; Price, Morgan N. ...
Proceedings of the National Academy of Sciences (PNAS), 12/2015, Volume 112, Issue 48
Journal Article, peer reviewed, open access
Synechococcus elongatus PCC 7942 is a model organism used for studying photosynthesis and the circadian clock, and it is being developed for the production of fuel, industrial chemicals, and pharmaceuticals. To identify a comprehensive set of genes and intergenic regions that impacts fitness in S. elongatus, we created a pooled library of ∼250,000 transposon mutants and used sequencing to identify the insertion locations. By analyzing the distribution and survival of these mutants, we identified 718 of the organism’s 2,723 genes as essential for survival under laboratory conditions. The validity of the essential gene set is supported by its tight overlap with well-conserved genes and its enrichment for core biological processes. The differences noted between our dataset and these predictors of essentiality, however, have led to surprising biological insights. One such finding is that genes in a large portion of the TCA cycle are dispensable, suggesting that S. elongatus does not require a cyclic TCA process. Furthermore, the density of the transposon mutant library enabled individual and global statements about the essentiality of noncoding RNAs, regulatory elements, and other intergenic regions. In this way, a group I intron located in tRNA-Leu, which has been used extensively for phylogenetic studies, was shown here to be essential for the survival of S. elongatus. Our survey of essentiality for every locus in the S. elongatus genome serves as a powerful resource for understanding the organism’s physiology and defines the essential gene set required for the growth of a photosynthetic organism.
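The general idea behind calling essential genes from a dense transposon library is that genes tolerating few or no insertions among surviving mutants are candidates for essentiality. The sketch below illustrates that idea on toy data; it is not the authors' analysis, and the `Gene` class, the function names, and the `max_density` cutoff are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Gene:
    name: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


def insertion_density(gene: Gene, insertion_sites: Iterable[int]) -> float:
    """Number of insertions per kilobase falling inside the gene."""
    hits = sum(1 for pos in insertion_sites if gene.start <= pos <= gene.end)
    length_kb = (gene.end - gene.start + 1) / 1000.0
    return hits / length_kb


def candidate_essential(genes: List[Gene], insertion_sites: List[int],
                        max_density: float = 1.0) -> List[str]:
    """Names of genes whose insertion density is below `max_density`
    (an arbitrary illustrative cutoff, not a published threshold)."""
    return [g.name for g in genes
            if insertion_density(g, insertion_sites) < max_density]
```

A real analysis would also have to account for gene length, insertion-site bias, and insertions near gene ends that may not disrupt function; the fixed cutoff here ignores all of these.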
Most cellular processes depend on intracellular locations and random collisions of individual protein molecules. To model these processes, we developed algorithms to simulate the diffusion, membrane interactions, and reactions of individual molecules, and implemented these in the Smoldyn program. Compared to the popular MCell and ChemCell simulators, we found that Smoldyn was in many cases more accurate, more computationally efficient, and easier to use. Using Smoldyn, we modeled pheromone response system signaling among yeast cells of opposite mating type. This model showed that secreted Bar1 protease might help a cell identify the fittest mating partner by sharpening the pheromone concentration gradient. This model involved about 200,000 protein molecules, about 7000 cubic microns of volume, and about 75 minutes of simulated time; it took about 10 hours to run. Over the next several years, as faster computers become available, Smoldyn will allow researchers to model and explore systems the size of entire bacterial and smaller eukaryotic cells.
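Particle-based simulators of this kind track each molecule individually. As a minimal sketch of one ingredient, free diffusion, each coordinate of a molecule can be displaced per time step by a Gaussian random number with standard deviation sqrt(2·D·dt), which gives a mean squared displacement of 6·D·t in three dimensions. This is a generic Brownian-dynamics step, not Smoldyn's code or API; the function names are hypothetical, and reactions and membrane interactions are left out.

```python
import math
import random


def diffuse(positions, diff_coeff, dt, rng):
    """Advance each (x, y, z) position by one Brownian step of duration dt."""
    sigma = math.sqrt(2.0 * diff_coeff * dt)  # per-axis RMS displacement
    return [tuple(c + rng.gauss(0.0, sigma) for c in pos) for pos in positions]


def mean_squared_displacement(start, end):
    """Average squared distance travelled between two position lists."""
    total = 0.0
    for p0, p1 in zip(start, end):
        total += sum((b - a) ** 2 for a, b in zip(p0, p1))
    return total / len(start)
```

Running many molecules for a total time t and comparing the measured mean squared displacement with 6·D·t is a quick sanity check on the step size.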
New regulatory roles continue to emerge for both natural and engineered noncoding RNAs, many of which have specific secondary and tertiary structures essential to their function. Thus there is a growing need to develop technologies that enable rapid characterization of structural features within complex RNA populations. We have developed a high-throughput technique, SHAPE-Seq, that can simultaneously measure quantitative, single nucleotide-resolution secondary and tertiary structural information for hundreds of RNA molecules of arbitrary sequence. SHAPE-Seq combines selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistry with multiplexed paired-end deep sequencing of primer extension products. This generates millions of sequencing reads, which are then analyzed using a fully automated data analysis pipeline, based on a rigorous maximum likelihood model of the SHAPE-Seq experiment. We demonstrate the ability of SHAPE-Seq to accurately infer secondary and tertiary structural information, detect subtle conformational changes due to single nucleotide point mutations, and simultaneously measure the structures of a complex pool of different RNA molecules. SHAPE-Seq thus represents a powerful step toward making the study of RNA secondary and tertiary structures high throughput and accessible to a wide array of scientific pursuits, from fundamental biological investigations to engineering RNA for synthetic biological systems.
The widespread natural ability of RNA to sense small molecules and regulate genes has become an important tool for synthetic biology in applications as diverse as environmental sensing and metabolic engineering. Previous work in RNA synthetic biology has engineered RNA mechanisms that independently regulate multiple targets and integrate regulatory signals. However, intracellular regulatory networks built with these systems have required proteins to propagate regulatory signals. In this work, we remove this requirement and expand the RNA synthetic biology toolkit by engineering three unique features of the plasmid pT181 antisense-RNA-mediated transcription attenuation mechanism. First, because the antisense RNA mechanism relies on RNA-RNA interactions, we show how the specificity of the natural system can be engineered to create variants that independently regulate multiple targets in the same cell. Second, because the pT181 mechanism controls transcription, we show how independently acting variants can be configured in tandem to integrate regulatory signals and perform genetic logic. Finally, because both the input and output of the attenuator are RNA, we show how these variants can be configured to directly propagate RNA regulatory signals by constructing an RNA-mediated transcriptional cascade. The combination of these three features within a single RNA-based regulatory mechanism has the potential to simplify the design and construction of genetic networks by directly propagating signals as RNA molecules.
Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with any transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative d-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling.
A large challenge in microbiology is the functional assessment of the millions of uncharacterized genes identified by genome sequencing. Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach to assign phenotypes and functions to genes. However, the current strategies for TnSeq are too laborious to be applied to hundreds of experimental conditions across multiple bacteria. Here, we describe an approach, random bar code transposon-site sequencing (RB-TnSeq), which greatly simplifies the measurement of gene fitness by using bar code sequencing (BarSeq) to monitor the abundance of mutants. We performed 387 genome-wide fitness assays across five bacteria and identified phenotypes for over 5,000 genes. RB-TnSeq can be applied to diverse bacteria and is a powerful tool to annotate uncharacterized genes using phenotype data.
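In a BarSeq-style assay, the abundance of each bar code is counted before and after growth, so a mutant's fitness can be summarized as the log2 change in its bar code count, and a gene's fitness as an average over the mutants carrying insertions in that gene. The sketch below is a simplified illustration under stated assumptions, not the published RB-TnSeq pipeline: the pseudocount, the unweighted mean, the median centering, and all function names are hypothetical choices.

```python
import math
from collections import defaultdict
from statistics import mean, median


def strain_fitness(start_counts, end_counts, pseudocount=1.0):
    """log2 ratio of end to start counts for every bar code seen at the start.

    The pseudocount (an assumption) avoids division by zero and log of zero.
    """
    return {
        bc: math.log2((end_counts.get(bc, 0) + pseudocount)
                      / (start_counts[bc] + pseudocount))
        for bc in start_counts
    }


def gene_fitness(strain_fit, barcode_to_gene):
    """Average strain fitness per gene, centered so the median gene is 0."""
    per_gene = defaultdict(list)
    for bc, fit in strain_fit.items():
        gene = barcode_to_gene.get(bc)
        if gene is not None:  # skip bar codes not mapped to any gene
            per_gene[gene].append(fit)
    raw = {gene: mean(values) for gene, values in per_gene.items()}
    center = median(raw.values())
    return {gene: value - center for gene, value in raw.items()}
```

Centering on the median gene absorbs differences in sequencing depth between the two samples, on the assumption that most gene disruptions have little effect on fitness in a given condition.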