Because of their abundance and their amenability to high-throughput genotyping techniques, Single Nucleotide Polymorphisms (SNPs) are powerful tools for efficient genetics and genomics studies, ...including characterization of genetic resources, genome-wide association studies and genomic selection. In wheat, most of the previous SNP discovery initiatives targeted the coding fraction, leaving almost 98% of the wheat genome largely unexploited. Here we report on the use of whole-genome resequencing data from eight wheat lines to mine for SNPs in the genic, the repetitive and non-repetitive intergenic fractions of the wheat genome. Eventually, we identified 3.3 million SNPs, 49% being located on the B-genome, 41% on the A-genome and 10% on the D-genome. We also describe the development of the TaBW280K high-throughput genotyping array containing 280,226 SNPs. Performance of this chip was examined by genotyping a set of 96 wheat accessions representing the worldwide diversity. Sixty-nine percent of the SNPs can be efficiently scored, half of them showing a diploid-like clustering. The TaBW280K was proven to be a very efficient tool for diversity analyses, as well as for breeding as it can discriminate between closely related elite varieties. Finally, the TaBW280K array was used to genotype a population derived from a cross between Chinese Spring and Renan, leading to the construction a dense genetic map comprising 83,721 markers. The results described here will provide the wheat community with powerful tools for both basic and applied research.
The metabolomic profiling analyses of 11 vitamins' statuses of wheat grain in a subsample of 167 accessions from the INRAE worldwide bread wheat core collection planted in two contrasting ...environments in France (Le Moulon and Clermont-Ferrand) have been evaluated using a high-throughput liquid chromatography-tandem mass spectrometry (LC-MS/MS) procedure. This has allowed us to perform a genome-wide association study (GWAS) for these nutritional traits of interest combining the phenotypic data with the genotypic data derived from the TaBW280K SNP chip. Considering both thresholds (
< 0.0003 and
≥ 8%), the GWAS identified between 1 and 22 marker-trait associations (MTAs) for the individual vitamins at the individual locations, and 12 SNP markers were stable and associated with vitamin contents across two environments. Desirable alleles and superior genotypes identified in the current analysis provide novel genetic data that can be used for future research on the genetics of vitamins and their application in wheat breeding.
We produced a reference sequence of the 1-gigabase chromosome 3B of hexaploid bread wheat. By sequencing 8452 bacterial artificial chromosomes in pools, we assembled a sequence of 774 megabases ...carrying 5326 protein-coding genes, 1938 pseudogenes, and 85% of transposable elements. The distribution of structural and functional features along the chromosome revealed partitioning correlated with meiotic recombination. Comparative analyses indicated high wheat-specific inter- and intrachromosomal gene duplication activities that are potential sources of variability for adaption. In addition to providing a better understanding of the organization, function, and evolution of a large and polyploid genome, the availability of a high-quality sequence anchored to genetic maps will accelerate the identification of genes underlying important agronomic traits.
Wheat grain storage protein (GSP) content and composition are the main determinants of the end-use value of bread wheat (Triticum aestivum L.) grain. The accumulation of glutenins and gliadins, the ...two main classes of GSP in wheat, is believed to be mainly controlled at the transcriptional level through a network of transcription factors. This regulation network could lead to stable cross-environment allometric scaling relationships between the quantity of GSP classes/subunits and the total quantity of nitrogen per grain. This work conducted a genetic mapping study of GSP content and composition and allometric scaling parameters of grain N allocation using a bread wheat worldwide core collection grown in three environments. The core collection was genotyped with 873 markers for genome-wide association and 167 single nucleotide polymorphism markers in 51 candidate genes for candidate association. The candidate genes included 35 transcription factors (TFs) expressed in grain. This work identified 74 loci associated with 38 variables, of which 19 were candidate genes or were tightly linked with candidate genes. Besides structural GSP genes, several loci putatively trans-regulating GSP accumulation were identified. Seven candidate TFs, including four wheat orthologues of barley TFs that control hordein gene expression, were associated or in strong linkage disequilibrium with markers associated with the composition or quantity of glutenin or gliadin, or allometric grain N allocation parameters, confirming the importance of the transcriptional control of GSP accumulation. Genome-wide association results suggest that the genes regulating glutenin and gliadin compositions are mostly distinct from each other and operate differently.
Since its Neolithic domestication in the Fertile Crescent, barley has spread to all continents and represents a major cereal in many modern agrarian systems. Current barley diversity includes ...thousands of varieties divided into four main categories corresponding to 2-row and 6-row subspecies and naked and hulled types, each of them with winter and spring varieties. This diversity is associated to different uses and allow cultivation in diverse environments. We used a large dataset of 58 varieties of French origin, (1) to assess the taxonomic signal in barley grain measurements comparing 2-row and 6-row subspecies, and naked and hulled types; (2) to test the impact of the sowing period and interannual variation on the grains size and shape; (3) to investigate the existence of morphological differences between winter and spring types; and finally (4) to contrast the relationship between the morphometric and genetic proximity. Size and shape of 1980 modern barley caryopses were quantified through elliptic Fourier Transforms and traditional size measurements. Our results indicate that barley grains record morphological diversity of the ear (89.3% classification accuracy between 2-row/6-row subspecies; 85.2% between hulled and naked type), sowing time of the grains (from 65.6% to 73.3% within barley groups), and environmental conditions during its cultivation and varietal diversity. This study opens perspectives for studying archaeological barley seeds and tracing the barley diversity and evolution since the Neolithic.
Since its domestication in the Fertile Crescent ~8000 to 10,000 years ago, wheat has undergone a complex history of spread, adaptation, and selection. To get better insights into the wheat ...phylogeography and genetic diversity, we describe allele distribution through time using a set of 4506 landraces and cultivars originating from 105 different countries genotyped with a high-density single-nucleotide polymorphism array. Although the genetic structure of landraces is collinear to ancient human migration roads, we observe a reshuffling through time, related to breeding programs, with the appearance of new alleles enriched with structural variations that may be the signature of introgressions from wild relatives after 1960.
During meiosis, crossovers (COs) create new allele associations by reciprocal exchange of DNA. In bread wheat (
L.), COs are mostly limited to subtelomeric regions of chromosomes, resulting in a ...substantial loss of breeding efficiency in the proximal regions, though these regions carry ∼60-70% of the genes. Identifying sequence and/or chromosome features affecting recombination occurrence is thus relevant to improve and drive recombination. Using the recent release of a reference sequence of chromosome 3B and of the draft assemblies of the 20 other wheat chromosomes, we performed fine-scale mapping of COs and revealed that 82% of COs located in the distal ends of chromosome 3B representing 19% of the chromosome length. We used 774 SNPs to genotype 180 varieties representative of the Asian and European genetic pools and a segregating population of 1270 F
lines. We observed a common location for ancestral COs (predicted through linkage disequilibrium) and the COs derived from the segregating population. We delineated 73 small intervals (<26 kb) on chromosome 3B that contained 252 COs. We observed a significant association of COs with genic features (73 and 54% in recombinant and nonrecombinant intervals, respectively) and with those expressed during meiosis (67% in recombinant intervals and 48% in nonrecombinant intervals). Moreover, while the recombinant intervals contained similar amounts of retrotransposons and DNA transposons (42 and 53%), nonrecombinant intervals had a higher level of retrotransposons (63%) and lower levels of DNA transposons (28%). Consistent with this, we observed a higher frequency of a DNA motif specific to the
DNA transposon in recombinant intervals.
Bread wheat (Triticum aestivum), one of the world's major crops, is genetically very diverse. In order to select a representative sample of the worldwide wheat diversity, 3,942 accessions originating ...from 73 countries were analysed with a set of 38 genomic simple sequence repeat (SSR) markers. The number of alleles at each locus ranged from 7 to 45 with an average of 23.9 alleles per locus. The 908 alleles detected were used together with passport data to select increasingly large sub-samples that maximised both the number of observed alleles at SSR loci and the number of geographical origins. A final core of 372 accessions (372CC) was selected with this M strategy. All the different geographical areas and more than 98% of the allelic diversity at the 38 polymorphic loci were represented in this core. The method used to build the core was validated, by using a second set of independent markers 44 expressed sequence tag (EST)-SSR markers on a larger sample of 744 accessions: 96.74% of the alleles observed at these loci had already been captured in the 372CC. So maximizing the diversity with a first set of markers also maximised the diversity at a second independent set of locus. To relate the genetic structure of wheat germplasm to its geographical origins, the two sets of markers were used to compute a dissimilarity matrix between geographical groups. Current worldwide wheat diversity is clearly divided according to wheat's European and Asian origins, whereas the diversity within each geographical group might be the result of the combined effects of adaptation of an initial germplasm to different environmental conditions and specific breeding practices. Seeds from each accession of the 372CC were multiplied and are now available to the scientific community. The genomic DNA of the 372CC, which can be entirely contained in a 384-deep-well storage plate, will be a useful tool for future studies of wheat genetic diversity.
A previous study provided an in-depth understanding of molecular population genetics of European and Asian wheat gene pools using a sequenced 3.1-Mb contig (ctg954) on chromosome 3BS. This region is ...believed to carry the Fhb1 gene for response to Fusarium head blight. In this study, 266 wheat accessions were evaluated in three environments for Type II FHB response based on the single floret inoculation method. Hierarchical clustering (UPGMA) based on a Manhattan dissimilarity matrix divided the accessions into eight groups according to five FHB-related traits which have a high correlation between them; Group VIII comprised six accessions with FHB response levels similar to variety Sumai 3. Based on the compressed mixed linear model (MLM), association analysis between five FHB-related traits and 42 molecular markers along the 3.1-Mb region revealed 12 significant association signals at a threshold of P<0.05. The highest proportion of phenotypic variation (6.2%) in number of diseased spikelets (NDS) occurred at locus cfb6059, and the physical distance was about 2.9 Kb between umn10 and this marker. Haplotype block (HapB) analysis using a sliding window LD of 5 markers, detected six HapBs in the 3.1-Mb region at r(2)>0.1 and P<0.001 between random closely linked markers. F-tests among Haps with frequencies >0.05 within each HapB at r(2)>0.1 and P<0.001 showed significant differences between the Hap carried by FHB resistant resources, such as Sumai 3 and Wangshuibai, and susceptible genotypes in HapB3 and HapB6. These results suggest that Fhb1 is located within HapB6, with the possibility that another gene is located at or near HapB3. SSR markers and Haps detected in this study will be helpful in further understanding the genetic basis of FHB resistance, and provide useful information for marker-assisted selection of Fhb1 in wheat breeding.
Structural variations (SVs) such as copy number and presence–absence variations are polymorphisms that are known to impact genome composition at the species level and are associated with phenotypic ...variations. In the absence of a reference genome sequence, their study has long been hampered in wheat. The recent production of new wheat genomic resources has led to a paradigm shift, making possible to investigate the extent of SVs among cultivated and wild accessions. We assessed SVs affecting genes and transposable elements (TEs) in a Triticeae diversity panel of 45 accessions from seven tetraploid and hexaploid species using high-coverage shotgun sequencing of sorted chromosome 3B DNA and dedicated bioinformatics approaches. We showed that 23% of the genes are variable within this panel, and we also identified 330 genes absent from the reference accession Chinese Spring. In addition, 60% of the TE-derived reference markers were absent in at least one accession, revealing a high level of intraspecific and interspecific variability affecting the TE space. Chromosome extremities are the regions where we observed most of the variability, confirming previous hypotheses made when comparing wheat with the other grasses. This study provides deeper insights into the genomic variability affecting the complex Triticeae genomes at the intraspecific and interspecific levels and suggests a phylogeny with independent hybridization events leading to different hexaploid species.