Summary
Until recently, achieving a reference‐quality genome sequence for bread wheat was long thought beyond the limits of genome sequencing and assembly technology, primarily due to the large ...genome size and > 80% repetitive sequence content. The release of the chromosome scale 14.5‐Gb IWGSC RefSeq v1.0 genome sequence of bread wheat cv. Chinese Spring (CS) was, therefore, a milestone. Here, we used a direct label and stain (DLS) optical map of the CS genome together with a prior nick, label, repair and stain (NLRS) optical map, and sequence contigs assembled with Pacific Biosciences long reads, to refine the v1.0 assembly. Inconsistencies between the sequence and maps were reconciled and gaps were closed. Gap filling and anchoring of 279 unplaced scaffolds increased the total length of pseudomolecules by 168 Mb (excluding Ns). Positions and orientations were corrected for 233 and 354 scaffolds, respectively, representing 10% of the genome sequence. The accuracy of the remaining 90% of the assembly was validated. As a result of the increased contiguity, the numbers of transposable elements (TEs) and intact TEs have increased in IWGSC RefSeq v2.1 compared with v1.0. In total, 98% of the gene models identified in v1.0 were mapped onto this new assembly through development of a dedicated approach implemented in the MAGAAT pipeline. The numbers of high‐confidence genes on pseudomolecules have increased from 105 319 to 105 534. The reconciled assembly enhances the utility of the sequence for genetic mapping, comparative genomics, gene annotation and isolation, and more general studies on the biology of wheat.
Significance Statement
This new release of bread wheat cv. Chinese Spring reference genome sequence, IWGSC RefSeq v2.1, features correction of assembly errors affecting approximately 10% of the prior IWGSC RefSeq v1.0 release using genome‐wide optical maps and filling of gaps with single‐molecule long‐reads as well as incorporating re‐annotation of TEs and re‐computation of gene coordinates. These refinements enhance the sequence utility for breeding and research applications.
Transposable elements (TEs) are major components of large plant genomes and main drivers of genome evolution. The most recent assembly of hexaploid bread wheat recovered the highly repetitive TE ...space in an almost complete chromosomal context and enabled a detailed view into the dynamics of TEs in the A, B, and D subgenomes.
The overall TE content is very similar between the A, B, and D subgenomes, although we find no evidence for bursts of TE amplification after the polyploidization events. Despite the near-complete turnover of TEs since the subgenome lineages diverged from a common ancestor, 76% of TE families are still present in similar proportions in each subgenome. Moreover, spacing between syntenic genes is also conserved, even though syntenic TEs have been replaced by new insertions over time, suggesting that distances between genes, but not sequences, are under evolutionary constraints. The TE composition of the immediate gene vicinity differs from the core intergenic regions. We find the same TE families to be enriched or depleted near genes in all three subgenomes. Evaluations at the subfamily level of timed long terminal repeat-retrotransposon insertions highlight the independent evolution of the diploid A, B, and D lineages before polyploidization and cases of concerted proliferation in the AB tetraploid.
Even though the intergenic space is changed by the TE turnover, an unexpected preservation is observed between the A, B, and D subgenomes for features like TE family proportions, gene spacing, and TE enrichment near genes.
The Wheat@URGI portal has been developed to provide the international community of researchers and breeders with access to the bread wheat reference genome sequence produced by the International ...Wheat Genome Sequencing Consortium. Genome browsers, BLAST, and InterMine tools have been established for in-depth exploration of the genome sequence together with additional linked datasets including physical maps, sequence variations, gene expression, and genetic and phenomic data from other international collaborative projects already stored in the GnpIS information system. The portal provides enhanced search and browser features that will facilitate the deployment of the latest genomics resources in wheat improvement.
We produced a reference sequence of the 1-gigabase chromosome 3B of hexaploid bread wheat. By sequencing 8452 bacterial artificial chromosomes in pools, we assembled a sequence of 774 megabases ...carrying 5326 protein-coding genes, 1938 pseudogenes, and 85% of transposable elements. The distribution of structural and functional features along the chromosome revealed partitioning correlated with meiotic recombination. Comparative analyses indicated high wheat-specific inter- and intrachromosomal gene duplication activities that are potential sources of variability for adaption. In addition to providing a better understanding of the organization, function, and evolution of a large and polyploid genome, the availability of a high-quality sequence anchored to genetic maps will accelerate the identification of genes underlying important agronomic traits.
Because of their abundance and their amenability to high-throughput genotyping techniques, Single Nucleotide Polymorphisms (SNPs) are powerful tools for efficient genetics and genomics studies, ...including characterization of genetic resources, genome-wide association studies and genomic selection. In wheat, most of the previous SNP discovery initiatives targeted the coding fraction, leaving almost 98% of the wheat genome largely unexploited. Here we report on the use of whole-genome resequencing data from eight wheat lines to mine for SNPs in the genic, the repetitive and non-repetitive intergenic fractions of the wheat genome. Eventually, we identified 3.3 million SNPs, 49% being located on the B-genome, 41% on the A-genome and 10% on the D-genome. We also describe the development of the TaBW280K high-throughput genotyping array containing 280,226 SNPs. Performance of this chip was examined by genotyping a set of 96 wheat accessions representing the worldwide diversity. Sixty-nine percent of the SNPs can be efficiently scored, half of them showing a diploid-like clustering. The TaBW280K was proven to be a very efficient tool for diversity analyses, as well as for breeding as it can discriminate between closely related elite varieties. Finally, the TaBW280K array was used to genotype a population derived from a cross between Chinese Spring and Renan, leading to the construction a dense genetic map comprising 83,721 markers. The results described here will provide the wheat community with powerful tools for both basic and applied research.
Physical Map of the 1-Gigabase Bread Wheat Chromosome 3B Paux, Etienne; Sourdille, Pierre; Salse, Jérôme ...
Science (American Association for the Advancement of Science),
10/2008, Letnik:
322, Številka:
5898
Journal Article
Recenzirano
As the staple food for 35% of the world's population, wheat is one of the most important crop species. To date, sequence-based tools to accelerate wheat improvement are lacking. As part of the ...international effort to sequence the 17-billion-base-pair hexaploid bread wheat genome (2n = 6x = 42 chromosomes), we constructed a bacterial artificial chromosome (BAC)-based integrated physical map of the largest chromosome, 3B, that alone is 995 megabases. A chromosome-specific BAC library was used to assemble 82% of the chromosome into 1036 contigs that were anchored with 1443 molecular markers, providing a major resource for genetic and genomic studies. This physical map establishes a template for the remaining wheat chromosomes and demonstrates the feasibility of constructing physical maps in large, complex, polyploid genomes with a chromosome-based approach.
Because of its size, allohexaploid nature, and high repeat content, the bread wheat genome is a good model to study the impact of the genome structure on gene organization, function, and regulation. ...However, because of the lack of a reference genome sequence, such studies have long been hampered and our knowledge of the wheat gene space is still limited. The access to the reference sequence of the wheat chromosome 3B provided us with an opportunity to study the wheat transcriptome and its relationships to genome and gene structure at a level that has never been reached before.
By combining this sequence with RNA-seq data, we construct a fine transcriptome map of the chromosome 3B. More than 8,800 transcription sites are identified, that are distributed throughout the entire chromosome. Expression level, expression breadth, alternative splicing as well as several structural features of genes, including transcript length, number of exons, and cumulative intron length are investigated. Our analysis reveals a non-monotonic relationship between gene expression and structure and leads to the hypothesis that gene structure is determined by its function, whereas gene expression is subject to energetic cost. Moreover, we observe a recombination-based partitioning at the gene structure and function level.
Our analysis provides new insights into the relationships between gene and genome structure and function. It reveals mechanisms conserved with other plant species as well as superimposed evolutionary forces that shaped the wheat gene space, likely participating in wheat adaptation.
Meiotic recombination (or crossover, CO) is essential for gamete fertility as well as for alleles and genes reshuffling that is at the heart of plant breeding. However, CO remains a limited event, ...which strongly hampers the rapid production of original and improved cultivars.
is a gene encoding a helicase protein that, when mutated, contributes to improve recombination rate in all species where it has been evaluated so far.
In this study, we developed wheat (
L.) triple mutant (TM) for the three homoeologous copies of
as well as mutants for two copies and heterozygous for the last one (Htz-A, Htz-B, Htz-D).
Phenotypic observation revealed a significant reduction of fertility and pollen viability in TM and Htz-B plants compared to wild type plants suggesting major defects during meiosis. Cytogenetic analyses of these plants showed that complete absence of
as observed in TM plants, leads to chromosome fragmentation during the pachytene stage, resulting in problems in the segregation of chromosomes during meiosis. Htz-A and Htz-D mutants had an almost normal meiotic progression indicating that both
and
copies are functional and that there is no dosage effect for
in bread wheat. On the contrary, the
copy seems knocked-out, probably because of a SNP leading to a Threonine>Alanine change at position 539 (T539A) of the protein, that occurs in the crucial helicase ATP bind/DEAD/ResIII domain which unwinds nucleic acids. Occurrence of numerous multivalents in TM plants suggests that
could also play a role in the control of homoeologous recombination.
These findings provide a foundation for further molecular investigations into wheat meiosis regulation to fully understand the underlying mechanisms of how
affects chiasma formation, as well as to identify ways to mitigate these defects and enhance both homologous and homoeologous recombination efficiency in wheat.
Since its domestication in the Fertile Crescent ~8000 to 10,000 years ago, wheat has undergone a complex history of spread, adaptation, and selection. To get better insights into the wheat ...phylogeography and genetic diversity, we describe allele distribution through time using a set of 4506 landraces and cultivars originating from 105 different countries genotyped with a high-density single-nucleotide polymorphism array. Although the genetic structure of landraces is collinear to ancient human migration roads, we observe a reshuffling through time, related to breeding programs, with the appearance of new alleles enriched with structural variations that may be the signature of introgressions from wild relatives after 1960.
During meiosis, crossovers (COs) create new allele associations by reciprocal exchange of DNA. In bread wheat (
L.), COs are mostly limited to subtelomeric regions of chromosomes, resulting in a ...substantial loss of breeding efficiency in the proximal regions, though these regions carry ∼60-70% of the genes. Identifying sequence and/or chromosome features affecting recombination occurrence is thus relevant to improve and drive recombination. Using the recent release of a reference sequence of chromosome 3B and of the draft assemblies of the 20 other wheat chromosomes, we performed fine-scale mapping of COs and revealed that 82% of COs located in the distal ends of chromosome 3B representing 19% of the chromosome length. We used 774 SNPs to genotype 180 varieties representative of the Asian and European genetic pools and a segregating population of 1270 F
lines. We observed a common location for ancestral COs (predicted through linkage disequilibrium) and the COs derived from the segregating population. We delineated 73 small intervals (<26 kb) on chromosome 3B that contained 252 COs. We observed a significant association of COs with genic features (73 and 54% in recombinant and nonrecombinant intervals, respectively) and with those expressed during meiosis (67% in recombinant intervals and 48% in nonrecombinant intervals). Moreover, while the recombinant intervals contained similar amounts of retrotransposons and DNA transposons (42 and 53%), nonrecombinant intervals had a higher level of retrotransposons (63%) and lower levels of DNA transposons (28%). Consistent with this, we observed a higher frequency of a DNA motif specific to the
DNA transposon in recombinant intervals.