Microsatellites are a widely-used marker system in plant genetics and forensics. The development of reliable microsatellite markers from resequencing data is challenging.
We extended MISA, a ...computational tool assisting the development of microsatellite markers, and reimplemented it as a web-based application. We improved compound microsatellite detection and added the possibility to display and export MISA results in GFF3 format for downstream analysis.
MISA-web can be accessed under http://misaweb.ipk-gatersleben.de/. The website provides tutorials, usage note as well as download links to the source code.
scholz@ipk-gatersleben.de.
Understanding how crop plants evolved from their wild relatives and spread around the world can inform about the origins of agriculture. Here, we review how the rapid development of genomic resources ...and tools has made it possible to conduct genetic mapping and population genetic studies to unravel the molecular underpinnings of domestication and crop evolution in diverse crop species. We propose three future avenues for the study of crop evolution: establishment of high-quality reference genomes for crops and their wild relatives; genomic characterization of germplasm collections; and the adoption of novel methodologies such as archaeogenetics, epigenomics, and genome editing.
Genebanks have the long-term mission of preserving plant genetic resources as an agricultural legacy for future crop improvement. Operating procedures for seed storage and plant propagation have been ...in place for decades, but there is a lack of effective means for the discovery and transfer of beneficial alleles from landraces and wild relatives into modern varieties. Here, we review the prospects of using molecular passport data derived from genomic sequence information as a universal monitoring tool at the single-plant level within and between genebanks. Together with recent advances in breeding methodologies, the transformation of genebanks into bio-digital resource centers will facilitate the selection of useful genetic variation and its use in breeding programs, thus providing easy access to past crop diversity. We propose linking catalogs of natural genetic variation and enquiries into biological mechanisms of plant performance as a long-term joint research goal of genebanks, plant geneticists and breeders.
Floret fertility is a key determinant of the number of grains per inflorescence in cereals. During the evolution of wheat (Triticum sp.), floret fertility has increased, such that current bread wheat ...(Triticum aestivum) cultivars set three to five grains per spikelet. However, little is known regarding the genetic basis of floret fertility. The locus Grain Number Increase 1 (GNI1) is shown here to be an important contributor to floret fertility. GNI1 evolved in the Triticeae through gene duplication. The gene, which encodes a homeodomain leucine zipper class I (HD-Zip I) transcription factor, was expressed most abundantly in the most apical floret primordia and in parts of the rachilla, suggesting that it acts to inhibit rachilla growth and development. The level of GNI1 expression has decreased over the course of wheat evolution under domestication, leading to the production of spikes bearing more fertile florets and setting more grains per spikelet. Genetic analysis has revealed that the reduced-function allele GNI-A1 contributes to the increased number of fertile florets per spikelet. The RNAi-based knockdown of GNI1 led to an increase in the number of both fertile florets and grains in hexaploid wheat. Mutants carrying an impaired GNI-A1 allele out-yielded WT allele carriers under field conditions. The data show that gene duplication generated evolutionary novelty affecting floret fertility while mutations favoring increased grain production have been under selection during wheat evolution under domestication.
Abstract
Meiotic recombination generates genetic diversity upon which selection can act. Recombination rates are highly variable between species, populations, individuals, sexes, chromosomes, and ...chromosomal regions. The underlying mechanisms are controlled at the genetic and epigenetic level and show plasticity toward the environment. Environmental plasticity may be divided into short- and long-term responses. We estimated recombination rates in natural populations of wild barley and domesticated landraces using a population genetics approach. We analyzed recombination landscapes in wild barley and domesticated landraces at high resolution. In wild barley, high recombination rates are found in more interstitial chromosome regions in contrast to distal chromosome regions in domesticated barley. Among subpopulations of wild barley, natural variation in effective recombination rate is correlated with temperature, isothermality, and solar radiation in a nonlinear manner. A positive linear correlation was found between effective recombination rate and annual precipitation. We discuss our findings with respect to how the environment might shape effective recombination rates in natural populations. Higher recombination rates in wild barley populations subjected to specific environmental conditions could be a means to maintain fitness in a strictly inbreeding species.
Summary
Until recently, achieving a reference‐quality genome sequence for bread wheat was long thought beyond the limits of genome sequencing and assembly technology, primarily due to the large ...genome size and > 80% repetitive sequence content. The release of the chromosome scale 14.5‐Gb IWGSC RefSeq v1.0 genome sequence of bread wheat cv. Chinese Spring (CS) was, therefore, a milestone. Here, we used a direct label and stain (DLS) optical map of the CS genome together with a prior nick, label, repair and stain (NLRS) optical map, and sequence contigs assembled with Pacific Biosciences long reads, to refine the v1.0 assembly. Inconsistencies between the sequence and maps were reconciled and gaps were closed. Gap filling and anchoring of 279 unplaced scaffolds increased the total length of pseudomolecules by 168 Mb (excluding Ns). Positions and orientations were corrected for 233 and 354 scaffolds, respectively, representing 10% of the genome sequence. The accuracy of the remaining 90% of the assembly was validated. As a result of the increased contiguity, the numbers of transposable elements (TEs) and intact TEs have increased in IWGSC RefSeq v2.1 compared with v1.0. In total, 98% of the gene models identified in v1.0 were mapped onto this new assembly through development of a dedicated approach implemented in the MAGAAT pipeline. The numbers of high‐confidence genes on pseudomolecules have increased from 105 319 to 105 534. The reconciled assembly enhances the utility of the sequence for genetic mapping, comparative genomics, gene annotation and isolation, and more general studies on the biology of wheat.
Significance Statement
This new release of bread wheat cv. Chinese Spring reference genome sequence, IWGSC RefSeq v2.1, features correction of assembly errors affecting approximately 10% of the prior IWGSC RefSeq v1.0 release using genome‐wide optical maps and filling of gaps with single‐molecule long‐reads as well as incorporating re‐annotation of TEs and re‐computation of gene coordinates. These refinements enhance the sequence utility for breeding and research applications.
Chromosome-scale genome sequence assemblies underpin pan-genomic studies. Recent genome assembly efforts in the large-genome Triticeae crops wheat and barley have relied on the commercial ...closed-source assembly algorithm DeNovoMagic. We present TRITEX, an open-source computational workflow that combines paired-end, mate-pair, 10X Genomics linked-read with chromosome conformation capture sequencing data to construct sequence scaffolds with megabase-scale contiguity ordered into chromosomal pseudomolecules. We evaluate the performance of TRITEX on publicly available sequence data of tetraploid wild emmer and hexaploid bread wheat, and construct an improved annotated reference genome sequence assembly of the barley cultivar Morex as a community resource.
Sequence assembly of large and repeat-rich plant genomes has been challenging, requiring substantial computational resources and often several complementary sequence assembly and genome mapping ...approaches. The recent development of fast and accurate long-read sequencing by circular consensus sequencing (CCS) on the PacBio platform may greatly increase the scope of plant pan-genome projects. Here, we compare current long-read sequencing platforms regarding their ability to rapidly generate contiguous sequence assemblies in pan-genome studies of barley (Hordeum vulgare). Most long-read assemblies are clearly superior to the current barley reference sequence based on short-reads. Assemblies derived from accurate long reads excel in most metrics, but the CCS approach was the most cost-effective strategy for assembling tens of barley genomes. A downsampling analysis indicated that 20-fold CCS coverage can yield very good sequence assemblies, while even five-fold CCS data may capture the complete sequence of most genes. We present an updated reference genome assembly for barley with near-complete representation of the repeat-rich intergenic space. Long-read assembly can underpin the construction of accurate and complete sequences of multiple genomes of a species to build pan-genome infrastructures in Triticeae crops and their wild relatives.
Polyploid species have long been thought to be recalcitrant to whole-genome assembly. By combining high-throughput sequencing, recent developments in parallel computing, and genetic mapping, we ...derive, de novo, a sequence assembly representing 9.1 Gbp of the highly repetitive 16 Gbp genome of hexaploid wheat, Triticum aestivum, and assign 7.1 Gb of this assembly to chromosomal locations. The genome representation and accuracy of our assembly is comparable or even exceeds that of a chromosome-by-chromosome shotgun assembly. Our assembly and mapping strategy uses only short read sequencing technology and is applicable to any species where it is possible to construct a mapping population.
The rapid development of next-generation sequencing platforms has enabled the use of sequencing for routine genotyping across a range of genetics studies and breeding applications. ...Genotyping-by-sequencing (GBS), a low-cost, reduced representation sequencing method, is becoming a common approach for whole-genome marker profiling in many species. With quickly developing sequencing technologies, adapting current GBS methodologies to new platforms will leverage these advancements for future studies. To test new semiconductor sequencing platforms for GBS, we genotyped a barley recombinant inbred line (RIL) population. Based on a previous GBS approach, we designed bar code and adapter sets for the Ion Torrent platforms. Four sets of 24-plex libraries were constructed consisting of 94 RILs and the two parents and sequenced on two Ion platforms. In parallel, a 96-plex library of the same RILs was sequenced on the Illumina HiSeq 2000. We applied two different computational pipelines to analyze sequencing data; the reference-independent TASSEL pipeline and a reference-based pipeline using SAMtools. Sequence contigs positioned on the integrated physical and genetic map were used for read mapping and variant calling. We found high agreement in genotype calls between the different platforms and high concordance between genetic and reference-based marker order. There was, however, paucity in the number of SNP that were jointly discovered by the different pipelines indicating a strong effect of alignment and filtering parameters on SNP discovery. We show the utility of the current barley genome assembly as a framework for developing very low-cost genetic maps, facilitating high resolution genetic mapping and negating the need for developing de novo genetic maps for future studies in barley. Through demonstration of GBS on semiconductor sequencing platforms, we conclude that the GBS approach is amenable to a range of platforms and can easily be modified as new sequencing technologies, analysis tools and genomic resources develop.