We produced a reference sequence of the 1-gigabase chromosome 3B of hexaploid bread wheat. By sequencing 8452 bacterial artificial chromosomes in pools, we assembled a sequence of 774 megabases ...carrying 5326 protein-coding genes, 1938 pseudogenes, and 85% of transposable elements. The distribution of structural and functional features along the chromosome revealed partitioning correlated with meiotic recombination. Comparative analyses indicated high wheat-specific inter- and intrachromosomal gene duplication activities that are potential sources of variability for adaption. In addition to providing a better understanding of the organization, function, and evolution of a large and polyploid genome, the availability of a high-quality sequence anchored to genetic maps will accelerate the identification of genes underlying important agronomic traits.
Usual methods for inferring species boundaries from molecular sequence data rely either on gene trees or on population genetic analyses. Another way of delimiting species, based on a view of species ...as "fields for recombination" (FFRs) characterized by mutual allelic exclusivity, was suggested in 1995 by Doyle. Here we propose to use haplowebs (haplotype networks with additional connections between haplotypes found co-occurring in heterozygous individuals) to visualize and delineate single-locus FFRs (sl-FFRs). Furthermore, we introduce a method to quantify the reliability of putative species boundaries according to the number of independent markers that support them, and illustrate this approach with a case study of taxonomically difficult corals of the genus Pocillopora collected around Clipperton Island (far eastern Pacific).
One haploweb built from intron sequences of the ATP synthase β subunit gene revealed the presence of two sl-FFRs among our 74 coral samples, whereas a second one built from ITS sequences turned out to be composed of four sl-FFRs. As a third independent marker, we performed a combined analysis of two regions of the mitochondrial genome: since haplowebs are not suited to analyze non-recombining markers, individuals were sorted into four haplogroups according to their mitochondrial sequences. Among all possible bipartitions of our set of samples, thirteen were supported by at least one molecular dataset, none by two and only one by all three datasets: this congruent pattern obtained from independent nuclear and mitochondrial markers indicates that two species of Pocillopora are present in Clipperton.
Our approach builds on Doyle's method and extends it by introducing an intuitive, user-friendly graphical representation and by proposing a conceptual framework to analyze and quantify the congruence between sl-FFRs obtained from several independent markers. Like delineation methods based on population-level statistical approaches, our method can distinguish closely-related species that have not yet reached reciprocal monophyly at most or all of their loci; like tree-based approaches, it can yield meaningful conclusions using a number of independent markers as low as three. Future efforts will aim to develop programs that speed up the construction of haplowebs from FASTA sequence alignments and help perform the congruence analysis outlined in this article.
To improve our understanding of the organization and evolution of the wheat (Triticum aestivum) genome, we sequenced and annotated 13-Mb contigs (18.2 Mb) originating from different regions of its ...largest chromosome, 3B (1 Gb), and produced a 2x chromosome survey by shotgun Illumina/Solexa sequencing. All regions carried genes irrespective of their chromosomal location. However, gene distribution was not random, with 75% of them clustered into small islands containing three genes on average. A twofold increase of gene density was observed toward the telomeres likely due to high tandem and interchromosomal duplication events. A total of 3222 transposable elements were identified, including 800 new families. Most of them are complete but showed a highly nested structure spread over distances as large as 200 kb. A succession of amplification waves involving different transposable element families led to contrasted sequence compositions between the proximal and distal regions. Finally, with an estimate of 50,000 genes per diploid genome, our data suggest that wheat may have a higher gene number than other cereals. Indeed, comparisons with rice (Oryza sativa) and Brachypodium revealed that a high number of additional noncollinear genes are interspersed within a highly conserved ancestral grass gene backbone, supporting the idea of an accelerated evolution in the Triticeae lineages.
Although an increasing number of horizontal gene transfers have been reported in eukaryotes, experimental evidence for their adaptive value is lacking. Here, we report the recent transfer of a 158-kb ...genomic region between Torulaspora microellipsoides and Saccharomyces cerevisiae wine yeasts or closely related strains. This genomic region has undergone several rearrangements in S. cerevisiae strains, including gene loss and gene conversion between two tandemly duplicated FOT genes encoding oligopeptide transporters. We show that FOT genes confer a strong competitive advantage during grape must fermentation by increasing the number and diversity of oligopeptides that yeast can utilize as a source of nitrogen, thereby improving biomass formation, fermentation efficiency, and cell viability. Thus, the acquisition of FOT genes has favored yeast adaptation to the nitrogen-limited wine fermentation environment. This finding indicates that anthropic environments offer substantial ecological opportunity for evolutionary diversification through gene exchange between distant yeast species.
Root-knot nematodes (genus Meloidogyne) exhibit a diversity of reproductive modes ranging from obligatory sexual to fully asexual reproduction. Intriguingly, the most widespread and devastating ...species to global agriculture are those that reproduce asexually, without meiosis. To disentangle this surprising parasitic success despite the absence of sex and genetic exchanges, we have sequenced and assembled the genomes of three obligatory ameiotic and asexual Meloidogyne. We have compared them to those of relatives able to perform meiosis and sexual reproduction. We show that the genomes of ameiotic asexual Meloidogyne are large, polyploid and made of duplicated regions with a high within-species average nucleotide divergence of ~8%. Phylogenomic analysis of the genes present in these duplicated regions suggests that they originated from multiple hybridization events and are thus homoeologs. We found that up to 22% of homoeologous gene pairs were under positive selection and these genes covered a wide spectrum of predicted functional categories. To biologically assess functional divergence, we compared expression patterns of homoeologous gene pairs across developmental life stages using an RNAseq approach in the most economically important asexually-reproducing nematode. We showed that >60% of homoeologous gene pairs display diverged expression patterns. These results suggest a substantial functional impact of the genome structure. Contrasting with high within-species nuclear genome divergence, mitochondrial genome divergence between the three ameiotic asexuals was very low, signifying that these putative hybrids share a recent common maternal ancestor. Transposable elements (TE) cover a ~1.7 times higher proportion of the genomes of the ameiotic asexual Meloidogyne compared to the sexual relative and might also participate in their plasticity. The intriguing parasitic success of asexually-reproducing Meloidogyne species could be partly explained by their TE-rich composite genomes, resulting from allopolyploidization events, and promoting plasticity and functional divergence between gene copies in the absence of sex and meiosis.
The order Cetartiodactyla includes cetaceans (whales, dolphins and porpoises) that are found in all oceans and seas, as well as in some rivers, and artiodactyls (ruminants, pigs, peccaries, hippos, ...camels and llamas) that are present on all continents, except Antarctica and until recent invasions, Australia. There are currently 332 recognized cetartiodactyl species, which are classified into 132 genera and 22 families. Most phylogenetic studies have focused on deep relationships, and no comprehensive time-calibrated tree for the group has been published yet. In this study, 128 new complete mitochondrial genomes of Cetartiodactyla were sequenced and aligned with those extracted from nucleotide databases. Our alignment includes 14,902 unambiguously aligned nucleotide characters for 210 taxa, representing 183 species, 107 genera, and all cetartiodactyl families. Our mtDNA data produced a statistically robust tree, which is largely consistent with previous classifications. However, a few taxa were found to be para- or polyphyletic, including the family Balaenopteridae, as well as several genera and species. Accordingly, we propose several taxonomic changes in order to render the classification compatible with our molecular phylogeny. In some cases, the results can be interpreted as possible taxonomic misidentification or evidence for mtDNA introgression. The existence of three new cryptic species of Ruminantia should therefore be confirmed by further analyses using nuclear data. We estimate divergence times using Bayesian relaxed molecular clock models. The deepest nodes appeared very sensitive to prior assumptions leading to unreliable estimates, primarily because of the misleading effects of rate heterogeneity, saturation and divergent outgroups. In addition, we detected that Whippomorpha contains slow-evolving taxa, such as large whales and hippos, as well as fast-evolving taxa, such as river dolphins. Our results nevertheless indicate that the evolutionary history of cetartiodactyls was punctuated by four main phases of rapid radiation during the Cenozoic era: the sudden occurrence of the three extant lineages within Cetartiodactyla (Cetruminantia, Suina and Tylopoda); the basal diversification of Cetacea during the Early Oligocene; and two radiations that involve Cetacea and Pecora, one at the Oligocene/Miocene boundary and the other in the Middle Miocene. In addition, we show that the high species diversity now observed in the families Bovidae and Cervidae accumulated mainly during the Late Miocene and Plio-Pleistocene.
It is thought that speciation in phytophagous insects is often due to colonization of novel host plants, because radiations of plant and insect lineages are typically asynchronous. Recent ...phylogenetic comparisons have supported this model of diversification for both insect herbivores and specialized pollinators. An exceptional case where contemporaneous plant-insect diversification might be expected is the obligate mutualism between fig trees (Ficus species, Moraceae) and their pollinating wasps (Agaonidae, Hymenoptera). The ubiquity and ecological significance of this mutualism in tropical and subtropical ecosystems has long intrigued biologists, but the systematic challenge posed by > 750 interacting species pairs has hindered progress toward understanding its evolutionary history. In particular, taxon sampling and analytical tools have been insufficient for large-scale cophylogenetic analyses. Here, we sampled nearly 200 interacting pairs of fig and wasp species from across the globe. Two supermatrices were assembled: on an average, wasps had sequences from 77% of 6 genes (5.6 kb), figs had sequences from 60% of 5 genes (5.5 kb), and overall 850 new DNA sequences were generated for this study. We also developed a new analytical tool, Jane 2, for event-based phylogenetic reconciliation analysis of very large data sets. Separate Bayesian phylogenetic analyses for figs and fig wasps under relaxed molecular clock assumptions indicate Cretaceous diversification of crown groups and contemporaneous divergence for nearly half of all fig and pollinator lineages. Event-based cophylogenetic analyses further support the codiversification hypothesis. Biogeographic analyses indicate that the present-day distribution of fig and pollinator lineages is consistent with a Eurasian origin and subsequent dispersal, rather than with Gondwanan vicariance. Overall, our findings indicate that the fig-pollinator mutualism represents an extreme case among plant-insect interactions of coordinated dispersal and long-term codiversification.
Ciliates are unicellular eukaryotes with both a germline genome and a somatic genome in the same cytoplasm. The somatic macronucleus (MAC), responsible for gene expression, is not sexually ...transmitted but develops from a copy of the germline micronucleus (MIC) at each sexual generation. In the MIC genome of
Paramecium tetraurelia
, genes are interrupted by tens of thousands of unique intervening sequences called internal eliminated sequences (IESs), which have to be precisely excised during the development of the new MAC to restore functional genes. To understand the evolutionary origin of this peculiar genomic architecture, we sequenced the MIC genomes of 9
Paramecium
species (from approximately 100 Mb in
Paramecium aurelia
species to >1.5 Gb in
Paramecium caudatum
). We detected several waves of IES gains, both in ancestral and in more recent lineages. While the vast majority of IESs are single copy in present-day genomes, we identified several families of mobile IESs, including nonautonomous elements acquired via horizontal transfer, which generated tens to thousands of new copies. These observations provide the first direct evidence that transposable elements can account for the massive proliferation of IESs in
Paramecium
. The comparison of IESs of different evolutionary ages indicates that, over time, IESs shorten and diverge rapidly in sequence while they acquire features that allow them to be more efficiently excised. We nevertheless identified rare cases of IESs that are under strong purifying selection across the
aurelia
clade. The cases examined contain or overlap cellular genes that are inactivated by excision during development, suggesting conserved regulatory mechanisms. Similar to the evolution of introns in eukaryotes, the evolution of
Paramecium
IESs highlights the major role played by selfish genetic elements in shaping the complexity of genome architecture and gene expression.
The Q gene encodes an AP2-like transcription factor that played an important role in domestication of polyploid wheat. The chromosome 5A Q alleles (5AQ and 5Aq) have been well studied, but much less ...is known about the q alleles on wheat homoeologous chromosomes 5B (5Bq) and 5D (5Dq). We investigated the organization, evolution, and function of the Q/q homoeoalleles in hexaploid wheat (Triticum aestivum L.). Q/q gene sequences are highly conserved within and among the A, B, and D genomes of hexaploid wheat, the A and B genomes of tetraploid wheat, and the A, S, and D genomes of the diploid progenitors, but the intergenic regions of the Q/q locus are highly divergent among homoeologous genomes. Duplication of the q gene 5.8 Mya was likely followed by selective loss of one of the copies from the A genome progenitor and the other copy from the B, D, and S genomes. A recent V329-to-I mutation in the A lineage is correlated with the Q phenotype. The 5Bq homoeoalleles became a pseudogene after allotetraploidization. Expression analysis indicated that the homoeoalleles are coregulated in a complex manner. Combined phenotypic and expression analysis indicated that, whereas 5AQ plays a major role in conferring domestication-related traits, 5Dq contributes directly and 5Bq indirectly to suppression of the speltoid phenotype. The evolution of the Q/q loci in polyploid wheat resulted in the hyperfunctionalization of 5AQ, pseudogenization of 5Bq, and subfunctionalization of 5Dq, all contributing to the domestication traits.