The organization of the cellular interior gives rise to properties including metabolic channeling and micro‐compartmentalization of signaling. Here, we use a lattice model of molecular crowding, ...together with literature‐derived protein interactions and abundances, to describe the molecular organization and stoichiometry of local cellular regions, showing that physical protein–protein interactions induce emergent structures not seen when random interaction networks are modeled. Specifically, we find that the lattices give rise to micro‐groups of enzymes on the surfaces of protein clusters. These arrangements of proteins are also robust to protein overexpression, while still showing evidence for expression tuning. Our results indicate that some of the complex organization of the cell may derive from simple rules of molecular aggregation and interaction.
We use a lattice model of cells’ interior to describe the organization of local regions, showing that protein‐protein interactions give rise to emergent structures. For instance, the lattices contain micro‐groups of enzymes on protein cluster surfaces. These lattices are also robust to over‐expression. We suggest that some of the cell's organization derives from simple rules of aggregation and interaction.
The members of the tribe Brassiceae share a whole-genome triplication (WGT), and one proposed model for its formation is a two-step pair of hybridizations producing hexaploid descendants. However, ...evidence for this model is incomplete, and the evolutionary and functional constraints that drove evolution after the hexaploidy are even less understood. Here, we report a new genome sequence of
, a species sister to most sequenced Brassiceae. Using this new genome and three others that share the hexaploidy, we traced the history of gene loss after the WGT using the Polyploidy Orthology Inference Tool (POInT). We confirm the two-step formation model and infer that there was a significant temporal gap between those two allopolyploidizations, with about a third of the gene losses from the first two subgenomes occurring before the arrival of the third. We also, for the 90,000 individual genes in our study, make parental subgenome assignments, inferring, with measured uncertainty, from which of the progenitor genomes of the allohexaploidy each gene derives. We further show that each subgenome has a statistically distinguishable rate of homoeolog losses. There is little indication of functional distinction between the three subgenomes: the individual subgenomes show no patterns of functional enrichment, no excess of shared protein-protein or metabolic interactions between their members, and no biases in their likelihood of having experienced a recent selective sweep. We propose a "mix and match" model of allopolyploidy, in which subgenome origin drives homoeolog loss propensities but where genes from different subgenomes function together without difficulty.
The past few years have witnessed a paradigm shift in molecular systematics from phylogenetic methods (using one or a few genes) to those that can be described as phylogenomics (phylogenetic ...inference with entire genomes). One approach that has recently emerged is phylo-transcriptomics (transcriptome-based phylogenetic inference). As in any phylogenetics experiment, accurate orthology inference is critical to phylo-transcriptomics. To date, most analyses have inferred orthology based either on pure sequence similarity or using gene-tree approaches. The use of conserved genome synteny in orthology detection has been relatively under-employed in phylogenetics, mainly due to the cost of sequencing genomes. While current trends focus on the quantity of genes included in an analysis, the use of synteny is likely to improve the quality of ortholog inference. In this study, we combine de novo transcriptome data and sequenced genomes from an economically important group of grass species, the tribe Paniceae, to make phylogenomic inferences. This method, which we call "genome-guided phylo-transcriptomics", is compared to other recently published orthology inference pipelines, and benchmarked using a set of sequenced genomes from across the grasses. These comparisons provide a framework for future researchers to evaluate the costs and benefits of adding sequenced genomes to transcriptome data sets.
Premise
Whole‐genome duplications (WGDs) are prevalent throughout the evolutionary history of plants. For example, dozens of WGDs have been phylogenetically localized across the order Brassicales, ...specifically, within the family Brassicaceae. A WGD event has also been identified in the Cleomaceae, the sister family to Brassicaceae, yet its placement, as well as that of WGDs in other families in the order, remains unclear.
Methods
Phylo‐transcriptomic data were generated and used to infer a nuclear phylogeny for 74 Brassicales taxa. Genome survey sequencing was also performed on 66 of those taxa to infer a chloroplast phylogeny. These phylogenies were used to assess and confirm relationships among the major families of the Brassicales and within Brassicaceae. Multiple WGD inference methods were then used to assess the placement of WGDs on the nuclear phylogeny.
Results
Well‐supported chloroplast and nuclear phylogenies for the Brassicales and the putative placement of the Cleomaceae‐specific WGD event Th‐ɑ are presented. This work also provides evidence for previously hypothesized WGDs, including a well‐supported event shared by at least two members of the Resedaceae family, and a possible event within the Capparaceae.
Conclusions
Phylogenetics and the placement of WGDs within highly polyploid lineages continues to be a major challenge. This study adds to the conversation on WGD inference difficulties by demonstrating that sampling is especially important for WGD identification and phylogenetic placement. Given its economic importance and genomic resources, the Brassicales continues to be an ideal group for assessing WGD inference methods.
Convergent evolution is a potent indicator of optimal design. We show here that convergent evolution occurs in genetic networks. Specifically, we show that multiple types of transcriptional ...regulation circuitry in Escherichia coli and the yeast Saccharomyces cerevisiae have evolved independently and not by duplication of one or a few ancestral circuits.
Celotno besedilo
Dostopno za:
DOBA, IJS, IZUM, KILJ, NUK, PILJ, PNG, SAZU, UILJ, UKNU, UL, UM, UPUK
Model species continue to underpin groundbreaking plant science research. At the same time, the phylogenetic resolution of the land plant tree of life continues to improve. The intersection of these ...2 research paths creates a unique opportunity to further extend the usefulness of model species across larger taxonomic groups. Here we promote the utility of the Arabidopsis thaliana model species, especially the ability to connect its genetic and functional resources, to species across the entire Brassicales order. We focus on the utility of using genomics and phylogenomics to bridge the evolution and diversification of several traits across the Brassicales to the resources in Arabidopsis, thereby extending scope from a model species by establishing a "model clade." These Brassicales-wide traits are discussed in the context of both the model species Arabidopsis and the family Brassicaceae. We promote the utility of such a "model clade" and make suggestions for building global networks to support future studies in the model order Brassicales.
I study the reorganization of the yeast transcriptional regulatory network after whole-genome duplication (WGD). Individual transcription factors (TFs) were computationally removed from the ...regulatory network, and the resulting networks were analysed. TF gene pairs that survive in duplicate from WGD show detectable redundancy as a result of that duplication. However, in most other respects, these duplicated TFs are indistinguishable from other TFs in the genome, suggesting that the duplicate TFs produced by WGD were rapidly diverted to distinct functional roles in the regulatory network. Separately, I find that genes targeted by many TFs appear to be preferentially retained in duplicate after WGD, an effect I attribute to selection to maintain dosage balance in the regulatory network after WGD.
Ultraconserved elements (UCEs) are DNA sequences that are 100% identical (no base substitutions, insertions, or deletions) and located in syntenic positions in at least two genomes. Although hundreds ...of UCEs have been found in animal genomes, little is known about the incidence of ultraconservation in plant genomes. Using an alignment-free information-retrieval approach, we have comprehensively identified all long identical multispecies elements (LIMEs), which include both syntenic and nonsyntenic regions, of at least 100 identical base pairs shared by at least two genomes. Among six animal genomes, we found the previously known syntenic UCEs as well as previously undescribed nonsyntenic elements. In contrast, among six plant genomes, we only found nonsyntenic LIMEs. LIMEs can also be classified as either simple (repetitive) or complex (nonrepetitive), they may occur in multiple copies in a genome, and they are often spread across multiple chromosomes. Although complex LIMEs were found in both animal and plant genomes, they differed significantly in their composition and copy number. Further analyses of plant LIMEs revealed their functional diversity, encompassing elements found near rRNA and enzyme-coding genes, as well as those found in transposons and noncoding DNA. We conclude that despite the common presence of LIMEs in both animal and plant lineages, the evolutionary processes involved in the creation and maintenance of these elements differ in the two groups and are likely attributable to several mechanisms, including transfer of genetic material from organellar to nuclear genomes, de novo sequence manufacturing, and purifying selection.
Convergent evolution has fascinated and occasionally mystified biologists since the principle of universal common ancestry was accepted. Similar phenotypes can arise by common ancestry (including ...preadaptations) or through constraints in the space of possible phenotypes and can increase in a population via either drift or selection. Assessing which of these mechanisms to invoke for any given example remains challenging for both simple and complex phenotypes. However, barriers in this area are slowly breaking down with recent advances in genomics and systems biology. A renaissance in the study of convergent evolution may be on its way, as surprising explanations for similar phenotypes, such as the metabolic similarities between yeast and cancer cells, are uncovered with network and metabolic models. We argue that although examples of convergence are known from many domains of life, green plants in particular have remarkable promise for the study of convergence because they are experimentally tractable, have considerable “-omics” and systems biology resources available, and show convergence in a number of important and complex traits. Four such examples include the domestication syndrome, duplicate loss and retention patterns following whole-genome duplication, the multiple appearances of C4 and crassulacean acid metabolism photosynthesis, and hybrid vigor.
Abstract
It has long been challenging to uncover the molecular mechanisms behind striking morphological innovations such as mammalian pregnancy. We studied the power of a robust comparative orthology ...pipeline based on gene synteny to address such problems. We inferred orthology relations between human genes and genes from each of 43 other vertebrate genomes, resulting in ∼18,000 orthologous pairs for each genome comparison. By identifying genes that first appear coincident with origin of the placental mammals, we hypothesized that we would define a subset of the genome enriched for genes that played a role in placental evolution. We thus pinpointed orthologs that appeared before and after the divergence of eutherian mammals from marsupials. Reinforcing previous work, we found instead that much of the genetic toolkit of mammalian pregnancy evolved through the repurposing of preexisting genes to new roles. These genes acquired regulatory controls for their novel roles from a group of regulatory genes, many of which did in fact originate at the appearance of the eutherians. Thus, orthologs appearing at the origin of the eutherians are enriched in functions such as transcriptional regulation by Krüppel-associated box-zinc-finger proteins, innate immune responses, keratinization, and the melanoma-associated antigen protein class. Because the cellular mechanisms of invasive placentae are similar to those of metastatic cancers, we then used our orthology inferences to explore the association between placenta invasion and cancer metastasis. Again echoing previous work, we find that genes that are phylogenetically older are more likely to be implicated in cancer development.