The ancestor of most teleost fishes underwent a whole-genome duplication event three hundred million years ago. Despite its antiquity, the effects of this event are evident both in the structure of ...teleost genomes and in how the surviving duplicated genes still operate to drive form and function. I inferred a set of shared syntenic regions that survive from the teleost genome duplication (TGD) using eight teleost genomes and the outgroup gar genome (which lacks the TGD). I then phylogenetically modeled the TGD's resolution via shared and independent gene losses and applied a new simulation-based statistical test for the presence of bias toward the preservation of genes from one parental subgenome. On the basis of that test, I argue that the TGD was likely an allopolyploidy. I find that duplicate genes surviving from this duplication in zebrafish are less likely to function in early embryo development than are genes that have returned to single copy at some point in this species' history. The tissues these ohnologs are expressed in, as well as their biological functions, lend support to recent suggestions that the TGD was the source of a morphological innovation in the structure of the teleost retina. Surviving duplicates also appear less likely to be essential than singletons, despite the fact that their single-copy orthologs in mouse are no less essential than other genes.
We describe POInT.sub.browse, a web portal that gives access to the orthology inferences made for polyploid genomes with POInT, the Polyploidy Orthology Inference Tool. Ancient, or paleo-, polyploidy ...events are widely distributed across the eukaryotic phylogeny, and the combination of duplicated and lost duplicated genes that these polyploidies produce can confound the identification of orthologous genes between genomes. POInT uses conserved synteny and phylogenetic models to infer orthologous genes between genomes with a shared polyploidy. It also gives confidence estimates for those orthology inferences. POInT.sub.browse gives both graphical and query-based access to these inferences from 12 different polyploidy events, allowing users to visualize genomic regions produced by polyploidies and perform batch queries for each polyploidy event, downloading genes trees and coding sequences for orthologous genes meeting user-specified criteria. POInT.sub.browse and the associated data are online at https://wgd.statgen.ncsu.edu.
Using a phylogenetic model of evolution after genome duplication (i.e., polyploidy) and 12 yeast genomes with a shared genome duplication, I show that the loss of duplicate genes after that ...duplication occurred in three phases. First, losses that occurred immediately after the event were biased toward genes functioning in DNA repair and organellar functions. Then, the main group of duplicate losses appear to have been shaped by a requirement to maintain balance in protein levels: There is a strong statistical association between the number of protein interactions a gene's product is involved in and its propensity to have remained in duplicate. Moreover, when duplicated genes with interactions were lost, it was more common than expected for both members of an interaction pair to have been lost on the same branch of the phylogeny. Finally, in the third phase of the resolution process, overretention of duplicated enzymes carrying high flux and of duplicated genes involved in transcriptional regulation became dominant. I speculate that initial retention of such genes by a requirement to maintain gene dosage set the stage for the later functional changes that then maintained these duplicates for long periods.
Gene duplication provides raw material for functional innovation. Recent advances have shed light on two fundamental questions regarding gene duplication: which genes tend to undergo duplication? And ...how does natural selection subsequently act on them? Genomic data suggest that different gene classes tend to be retained after single-gene and whole-genome duplications. We also know that functional differences between duplicate genes can originate in several different ways, including mutations that directly impart new functions, subdivision of ancestral functions and selection for changes in gene dosage. Interestingly, in many cases the 'new' function of one copy is a secondary property that was always present, but that has been co-opted to a primary role after the duplication.
We model the post-hexaploidy evolution of four genomes from the Solanaceae, a group of flowering plants comprising tomatoes, potatoes and their relatives. The hexaploidy that these genomes descend ...from occurred through two sequential allopolyploidy events and was marked by the unequal losses of duplicated genes from the different progenitor subgenomes. In contrast with the hexaploid Brassiceae (broccoli and its relatives), where the subgenome with the most surviving genes arrived last in the hexaploidy, among the Solanaceae the most preserved subgenome descends from one of the original two tetraploid progenitors. In fact, the last-arriving subgenome in these plants actually has the fewest surviving genes in the modern genomes. We explore whether the distribution of repetitive elements (REs) in these genomes can explain the biases in gene losses, but while the signals we find are broadly consistent with a role for high RE density in driving gene losses, the REs turn over so quickly that little signal of the RE condition at the time of paleopolyploidy is extant in the modern genomes.
We describe GenomeVx, a web-based tool for making editable, publication-quality, maps of mitochondrial and chloroplast genomes and of large plasmids. These maps show the location of genes and ...chromosomal features as well as a position scale. The program takes as input either raw feature positions or GenBank records. In the latter case, features are automatically extracted and colored, an example of which is given. Output is in the Adobe Portable Document Format (PDF) and can be edited by programs such as Adobe Illustrator. Availability: GenomeVx is available at http://wolfe.gen.tcd.ie/GenomeVx Contact: conantg@tcd.ie
Coevolutionary interactions are thought to have spurred the evolution of key innovations and driven the diversification of much of life on Earth. However, the genetic and evolutionary basis of the ...innovations that facilitate such interactions remains poorly understood. We examined the coevolutionary interactions between plants (Brassicales) and butterflies (Pieridae), and uncovered evidence for an escalating evolutionary arms-race. Although gradual changes in trait complexity appear to have been facilitated by allelic turnover, key innovations are associated with gene and genome duplications. Furthermore, we show that the origins of both chemical defenses and of molecular counter adaptations were associated with shifts in diversification rates during the arms-race. These findings provide an important connection between the origins of biodiversity, coevolution, and the role of gene and genome duplications as a substrate for novel traits.
•Dosage balance is fluid and will change over time.•Duplicate genes are preserved by interplay of multiple overlapping mechanisms.•Distinguishing duplicate gene retention mechanisms requires more ...than transcriptomes.•Dosage selection and subfunctionalization occur in the context of biological networks.
Requirements to maintain dosage balance shape many genome-scale patterns in organisms, including the resolution of whole genome duplications (WGD), as well as the varied effects of aneuploidy, segmental duplications, tandem duplications, gene copy number variations (CNV), and epigenetic marks. Like neofunctionalization and subfunctionalization, the impact of absolute and relative dosage varies over time. These variations are of particular importance in understanding the role of dosage in the evolution of polyploid organisms. Numerous investigations have found the consequences of polyploidy remain distinct from small-scale duplications (SSD). This observation is significant as all flowering plants have experienced at least two ancient polyploid events, and many angiosperm lineages have undergone additional rounds of polyploidy. Intriguingly, recent studies indicate a link between how epigenetic marks in recent allopolyploids may induce immediate changes in gene expression and the longer-term patterns of biased fractionation and chromosomal evolution. We argue that dosage effects represent one aspect of an emerging pluralistic framework, a framework that will use biophysics, genomic technologies, and systems-level models of cells to broaden our view of how genomes evolve.
Hybridization coupled to polyploidy, or allopolyploidy, has dramatically shaped the evolution of flowering plants, teleost fishes, and other lineages. Studies of recently formed allopolyploid plants ...have shown that the two subgenomes that merged to form that new allopolyploid do not generally express their genes equally. Instead, one of the two subgenomes expresses its paralogs more highly on average. Meanwhile, older allopolyploidy events tend to show biases in duplicate losses, with one of the two subgenomes retaining more genes than the other. Since reduced expression is a pathway to duplicate loss, understanding the origins of expression biases may help explain the origins of biased losses. Because we expect gene expression levels to experience stabilizing selection, our conceptual frameworks for how allopolyploid organisms form tend to assume that the new allopolyploid will show balanced expression between its subgenomes. It is then necessary to invoke phenomena such as differences in the suppression of repetitive elements to explain the observed expression imbalances. Here we show that, even for phenotypically identical diploid progenitors, the inherent kinetics of gene expression give rise to biases between the expression levels of the progenitor genes in the hybrid. Some of these biases are expected to be gene-specific and not give rise to global differences in progenitor gene expression. However, particularly in the case of allopolyploids formed from progenitors with different genome sizes, global expression biases favoring one subgenome are expected immediately on formation. Hence, expression biases are arguably the expectation upon allopolyploid formation rather than a phenomenon needing explanation. In the future, a deeper understanding of the kinetics of allopolyploidy may allow us to better understand both biases in duplicate losses and hybrid vigor.
Brassica napus, an allotetraploid crop, is hypothesized to be a hybrid from unknown varieties of Brassica rapa and Brassica oleracea. Despite the economic importance of B. napus, much is unresolved ...regarding its phylogenomic relationships, genetic structure, and diversification. Here we conduct a comprehensive study among diverse accessions from 183 B. napus (including rapeseed, rutabaga, and Siberian kale), 112 B. rapa, and 62 B. oleracea and its wild relatives. Using RNA-seq of B. napus accessions, we define the genetic diversity and sub-genome variance of six genetic clusters. Nuclear and organellar phylogenies for B. napus and its progenitors reveal varying patterns of inheritance and post-formation introgression. We discern regions with signatures of selective sweeps and detect 8,187 differentially expressed genes with implications for B. napus diversification. This study highlights the complex origin and evolution of B. napus providing insights that can further facilitate B. napus breeding and germplasm preservation.