Whole-genome duplications (WGDs) are rare evolutionary events with profound consequences. They double an organism's genetic content, immediately creating a reproductive barrier between it and its ...ancestors and providing raw material for the divergence of gene functions between paralogs. Almost all eukaryotic genome sequences bear evidence of ancient WGDs, but the causes of these events and the timing of intermediate steps have been difficult to discern. One of the best-characterized WGDs occurred in the lineage leading to the baker's yeast Saccharomyces cerevisiae. Marcet-Houben and Gabaldón now show that, rather than simply doubling the DNA of a single ancestor, the yeast WGD likely involved mating between two different ancestral species followed by a doubling of the genome to restore fertility.
It is often anticipated that many of today's diploid plant species are in fact paleopolyploids. Given that an ancient large-scale duplication will result in an excess of relatively old duplicated ...genes with similar ages, we analyzed the timing of duplication of pairs of paralogous genes in 14 model plant species. Using EST contigs (unigenes), we identified pairs of paralogous genes in each species and used the level of synonymous nucleotide substitution to estimate the relative ages of gene duplication. For nine of the investigated species (wheat Triticum aestivum, maize Zea mays, tetraploid cotton Gossypium hirsutum, diploid cotton G. arboretum, tomato Lycopersicon esculentum, potato Solanum tuberosum, soybean Glycine max, barrel medic Medicago truncatula, and Arabidopsis thaliana), the age distributions of duplicated genes contain peaks corresponding to short evolutionary periods during which large numbers of duplicated genes were accumulated. Large-scale duplications (polyploidy or aneuploidy) are strongly suspected to be the cause of these temporal peaks of gene duplication. However, the unusual age profile of tandem gene duplications in Arabidopsis indicates that other scenarios, such as variation in the rate at which duplicated genes are deleted, must also be considered.
To study the evolutionary effects of polyploidy on plant gene functions, we analyzed functional genomics data for a large number of duplicated gene pairs formed by ancient polyploidy events in ...Arabidopsis thaliana. Genes retained in duplicate are not distributed evenly among Gene Ontology or Munich Information Center for Protein Sequences functional categories, which indicates a nonrandom process of gene loss. Genes involved in signal transduction and transcription have been preferentially retained, and those involved in DNA repair have been preferentially lost. Although the two members of each gene pair must originally have had identical transcription profiles, less than half of the pairs formed by the most recent polyploidy event still retain significantly correlated profiles. We identified several cases where groups of duplicated gene pairs have diverged in concert, forming two parallel networks, each containing one member of each gene pair. In these cases, the expression of each gene is strongly correlated with the other nonhomologous genes in its network but poorly correlated with its paralog in the other network. We also find that the rate of protein sequence evolution has been significantly asymmetric in >20% of duplicate pairs. Together, these results suggest that functional diversification of the surviving duplicated genes is a major feature of the long-term evolution of polyploids.
Cell differentiation in yeast species is controlled by a reversible, programmed DNA-rearrangement process called mating-type switching. Switching is achieved by two functionally similar but ...structurally distinct processes in the budding yeast
and the fission yeast
In both species, haploid cells possess one active and two silent copies of the mating-type locus (a three-cassette structure), the active locus is cleaved, and synthesis-dependent strand annealing is used to replace it with a copy of a silent locus encoding the opposite mating-type information. Each species has its own set of components responsible for regulating these processes. In this review, we summarize knowledge about the function and evolution of mating-type switching components in these species, including mechanisms of heterochromatin formation,
locus cleavage, donor bias, lineage tracking, and environmental regulation of switching. We compare switching in these well-studied species to others such as
and the methylotrophic yeasts
and
We focus on some key questions: Which cells switch mating type? What molecular apparatus is required for switching? Where did it come from? And what is the evolutionary purpose of switching?
We developed the Yeast Gene Order Browser (YGOB; http://wolfe.gen.tcd.ie/ygob) to facilitate visual comparisons and computational analysis of synteny relationships in yeasts. The data presented in ...YGOB, currently covering seven species, are based on sets of homologous genes that have been intensively manually curated based on both sequence similarity and genomic context (synteny). We reconciled different laboratories' lists of paralogous Saccharomyces cerevisiae gene pairs formed by genome duplication (ohnologs), and present near-exhaustive lists of the ohnolog pairs retained in S. cerevisiae (551, including 22 previously unidentified), Saccharomyces castellii (599), and Candida glabrata (404).
We describe GenomeVx, a web-based tool for making editable, publication-quality, maps of mitochondrial and chloroplast genomes and of large plasmids. These maps show the location of genes and ...chromosomal features as well as a position scale. The program takes as input either raw feature positions or GenBank records. In the latter case, features are automatically extracted and colored, an example of which is given. Output is in the Adobe Portable Document Format (PDF) and can be edited by programs such as Adobe Illustrator. Availability: GenomeVx is available at http://wolfe.gen.tcd.ie/GenomeVx Contact: conantg@tcd.ie
Many interspecies hybrids have been discovered in yeasts, but most of these hybrids are asexual and can replicate only mitotically. Whole-genome duplication has been proposed as a mechanism by which ...interspecies hybrids can regain fertility, restoring their ability to perform meiosis and sporulate. Here, we show that this process occurred naturally during the evolution of Zygosaccharomyces parabailii, an interspecies hybrid that was formed by mating between 2 parents that differed by 7% in genome sequence and by many interchromosomal rearrangements. Surprisingly, Z. parabailii has a full sexual cycle and is genetically haploid. It goes through mating-type switching and autodiploidization, followed by immediate sporulation. We identified the key evolutionary event that enabled Z. parabailii to regain fertility, which was breakage of 1 of the 2 homeologous copies of the mating-type (MAT) locus in the hybrid, resulting in a chromosomal rearrangement and irreparable damage to 1 MAT locus. This rearrangement was caused by HO endonuclease, which normally functions in mating-type switching. With 1 copy of MAT inactivated, the interspecies hybrid now behaves as a haploid. Our results provide the first demonstration that MAT locus damage is a naturally occurring evolutionary mechanism for whole-genome duplication and restoration of fertility to interspecies hybrids. The events that occurred in Z. parabailii strongly resemble those postulated to have caused ancient whole-genome duplication in an ancestor of Saccharomyces cerevisiae.
Saccharomyces genomes are highly collinear and show relatively little structural variation, both within and between species of this yeast genus. We investigated the only common inversion polymorphism ...known in S. cerevisiae, which affects a 24-kb 'flip/flop' region containing 15 genes near the centromere of chromosome XIV. The region exists in two orientations, called reference (REF) and inverted (INV). Meiotic recombination in this region is suppressed in crosses between REF and INV orientation strains such as the BY x RM cross. We find that the inversion polymorphism is at least 17 million years old because it is conserved across the genus Saccharomyces. However, the REF and INV isomers are not ancient alleles but are continually being re-created by re-inversion of the region within each species. Inversion occurs due to continual homogenization of two almost identical 4-kb sequences that form an inverted repeat (IR) at the ends of the flip/flop region. The IR consists of two pairs of genes that are specifically and strongly expressed during the late stages of sporulation. We show that one of these gene pairs, YNL018C/YNL034W, codes for a protein that is essential for spore formation. YNL018C and YNL034W are the founder members of a gene family, Centroid, whose members in other Saccharomycetaceae species evolve fast, duplicate frequently, and are preferentially located close to centromeres. We tested the hypothesis that Centroid genes are a meiotic drive system, but found no support for this idea.
We investigated genomic diversity of a yeast species that is both an opportunistic pathogen and an important industrial yeast. Under the name Candida krusei, it is responsible for about 2% of yeast ...infections caused by Candida species in humans. Bloodstream infections with C. krusei are problematic because most isolates are fluconazole-resistant. Under the names Pichia kudriavzevii, Issatchenkia orientalis and Candida glycerinogenes, the same yeast, including genetically modified strains, is used for industrial-scale production of glycerol and succinate. It is also used to make some fermented foods. Here, we sequenced the type strains of C. krusei (CBS573T) and P. kudriavzevii (CBS5147T), as well as 30 other clinical and environmental isolates. Our results show conclusively that they are the same species, with collinear genomes 99.6% identical in DNA sequence. Phylogenetic analysis of SNPs does not segregate clinical and environmental isolates into separate clades, suggesting that C. krusei infections are frequently acquired from the environment. Reduced resistance of strains to fluconazole correlates with the presence of one gene instead of two at the ABC11-ABC1 tandem locus. Most isolates are diploid, but one-quarter are triploid. Loss of heterozygosity is common, including at the mating-type locus. Our PacBio/Illumina assembly of the 10.8 Mb CBS573T genome is resolved into 5 complete chromosomes, and was annotated using RNAseq support. Each of the 5 centromeres is a 35 kb gene desert containing a large inverted repeat. This species is a member of the genus Pichia and family Pichiaceae (the methylotrophic yeasts clade), and so is only distantly related to other pathogenic Candida species.
The yeast Komagataella phaffii (formerly called Pichia pastoris) is used widely as a host for secretion of heterologous proteins, but only a few isolates of this species exist and all the commonly ...used expression systems are derived from a single genetic background, CBS7435 (NRRL Y-11430). We hypothesized that other genetic backgrounds could harbor variants that affect yields of secreted proteins. We crossed CBS7435 with 2 other K. phaffii isolates and mapped quantitative trait loci (QTLs) for secretion of a heterologous protein, β-glucosidase, by sequencing individual segregant genomes. A major QTL mapped to a frameshift mutation in the mannosyltransferase gene HOC1, which gives CBS7435 a weaker cell wall and higher protein secretion than the other isolates. Inactivation of HOC1 in the other isolates doubled β-glucosidase secretion. A second QTL mapped to an amino acid substitution in IRA1 that tripled β-glucosidase secretion in 1-week batch cultures but reduced cell viability, and its effects are specific to this heterologous protein. Our results demonstrate that QTL analysis is a powerful method for dissecting the basis of biotechnological traits in nonconventional yeasts, and a route to improving their industrial performance.