Premise
Wild species are strategic sources of valuable traits to be introduced into crops through hybridization. For peanut, the 33 currently described wild species in the section Arachis are ...particularly important because of their sexual compatibility with the domesticated species, Arachis hypogaea. Although numerous wild accessions are carefully preserved in seed banks, their morphological similarities pose challenges to routine classification.
Methods
Using a high‐density array, we genotyped 272 accessions encompassing all diploid species in section Arachis. Detailed relationships between accessions and species were revealed through phylogenetic analyses and interpreted using the expertise of germplasm collectors and curators.
Results
Two main groups were identified: one with A genome species and the other with B, D, F, G, and K genomes. Species groupings generally showed clear boundaries. Structure within groups was informative, for instance, revealing the history of the proto‐domesticate A. stenosperma. However, some groupings suggested multiple sibling species. Others were polyphyletic, indicating the need for taxonomic revision. Annual species were better defined than perennial ones, revealing limitations in applying classical and phylogenetic species concepts to the genus. We suggest new species assignments for several accessions.
Conclusions
Curated by germplasm collectors and curators, this analysis of species relationships lays the foundation for future species descriptions, classification of unknown accessions, and germplasm use for peanut improvement. It supports the conservation and curation of current germplasm, both critical tasks considering the threats to the genus posed by habitat loss and the current restrictions on new collections and germplasm transfer.
Understanding the relationship between genotype and phenotype is a major biological question and being able to predict phenotypes based on molecular genotypes is integral to molecular breeding. ...Whole- genome duplications have shaped the history of all flowering plants and present challenges to elucidating the relationship between genotype and phenotype, especially in neopolyploid species. Although single nucleotide polymorphisms (SNPs) have become popular tools for genetic mapping, discovery and appli- cation of SNPs in polyploids has been difficult. Here, we summarize common experimental approaches to SNP calling, highlighting recent polyploid successes. To examine the impact of software choice on these analyses, we called SNPs among five peanut genotypes using different alignment programs (BWA-mem and Bowtie 2) and variant callers (SAMtools, GATK, and Freebayes). Alignments produced by Bowtie 2 and BWA-mem and analyzed in SAMtools shared 24.5% concordant SNPs, and SAMtools, GATK, and Freebayes shared 1.4% concordant SNPs. A subsequent analysis of simulated Brassica napus chromosome 1A and 1C genotypes demonstrated that, of the three software programs, SAMtools performed with the highest sensitivity and specificity on Bowtie 2 alignments. These results, however, are likely to vary among species, and we therefore propose a series of best practices for SNP calling in polyploids.
Background Sequencing of cDNA libraries for the development of expressed sequence tags (ESTs) as well as for the discovery of simple sequence repeats (SSRs) has been a common method of developing ...microsatellites or SSR-based markers. In this research, our objective was to further sequence and develop common bean microsatellites from leaf and root cDNA libraries derived from the Andean gene pool accession G19833 and the Mesoamerican gene pool accession DOR364, mapping parents of a commonly used reference map. The root libraries were made from high and low phosphorus treated plants. Results A total of 3,123 EST sequences from leaf and root cDNA libraries were screened and used for direct simple sequence repeat discovery. From these EST sequences we found 184 microsatellites; the majority containing tri-nucleotide motifs, many of which were GC rich (ACC, AGC and AGG in particular). Di-nucleotide motif microsatellites were about half as common as the tri-nucleotide motif microsatellites but most of these were AGn microsatellites with a moderate number of ATn microsatellites in root ESTs followed by few ACn and no GCn microsatellites. Out of the 184 new SSR loci, 120 new microsatellite markers were developed in the BMc (Bean Microsatellites from cDNAs) series and these were evaluated for their capacity to distinguish bean diversity in a germplasm panel of 18 genotypes. We developed a database with images of the microsatellites and their polymorphism information content (PIC), which averaged 0.310 for polymorphic markers. Conclusions The present study produced information about microsatellite frequency in root and leaf tissues of two important genotypes for common bean genomics: namely G19833, the Andean genotype selected for whole genome shotgun sequencing from race Peru, and DOR364 a race Mesoamerica subgroup 2 genotype that is a small-red seeded, released variety in Central America. Both race Peru and Mesoamerica subgroup 2 (small red beans) have been understudied in comparison to race Nueva Granada and Mesoamerica subgroup 1 (black beans) both with regards to gene expression and as sources of markers. However, we found few differences between SSR type and frequency between the G19833 leaf and DOR364 root tissue-derived ESTs. Overall, our work adds to the analysis of microsatellite frequency evaluation for common bean and provides a new set of 120 BMc markers which combined with the 248 previously developed BMc markers brings the total in this series to 368 markers. Once we include BMd markers, which are derived from GenBank sequences, the current total of gene-based markers from our laboratory surpasses 500 markers. These markers are basic for studies of the transcriptome of common bean and can form anchor points for genetic mapping studies in the future.
Late leaf spot (LLS;
) is a major fungal disease of cultivated peanut (
). A recombinant inbred line population segregating for quantitative field resistance was used to identify quantitative trait ...loci (QTL) using QTL-seq. High rates of false positive SNP calls using established methods in this allotetraploid crop obscured significant QTLs. To resolve this problem, robust parental SNPs were first identified using polyploid-specific SNP identification pipelines, leading to discovery of significant QTLs for LLS resistance. These QTLs were confirmed over 4 years of field data. Selection with markers linked to these QTLs resulted in a significant increase in resistance, showing that these markers can be immediately applied in breeding programs. This study demonstrates that QTL-seq can be used to rapidly identify QTLs controlling highly quantitative traits in polyploid crops with complex genomes. Markers identified can then be deployed in breeding programs, increasing the efficiency of selection using molecular tools.
Field resistance to late leaf spot is a quantitative trait controlled by many QTLs. Using polyploid-specific methods, QTL-seq is faster and more cost effective than QTL mapping.
Arachis hypogaea L. (cultivated peanut) is an allotetraploid (2n = 4x = 40) with an AABB genome type. Based on cytogenetic studies it has been assumed that peanut and wild-derived induced AABB ...allotetraploids have classic allotetraploid genetic behavior with diploid-like disomic recombination only between homologous chromosomes, at the exclusion of recombination between homeologous chromosomes. Using this assumption, numerous linkage map and quantitative trait loci studies have been carried out. Here, with a systematic analysis of genotyping and gene expression data, we show that this assumption is not entirely valid. In fact, autotetraploid-like tetrasomic recombination is surprisingly frequent in recombinant inbred lines generated from a cross of cultivated peanut and an induced allotetraploid derived from peanut's most probable ancestral species. We suggest that a better, more predictive genetic model for peanut is that of a "segmental allotetraploid" with partly disomic, partly tetrasomic genetic behavior. This intermediate genetic behavior has probably had a previously overseen, but significant, impact on the genome and genetics of cultivated peanut.
Key message
A total of 33 additive stem rot QTLs were identified in peanut genome with nine of them consistently detected in multiple years or locations. And 12 pairs of epistatic QTLs were firstly ...reported for peanut stem rot disease.
Stem rot in peanut (
Arachis hypogaea
) is caused by the
Sclerotium rolfsii
and can result in great economic loss during production. In this study, a recombinant inbred line population from the cross between NC 3033 (stem rot resistant) and Tifrunner (stem rot susceptible) that consists of 156 lines was genotyped by using 58 K peanut single nucleotide polymorphism (SNP) array and phenotyped for stem rot resistance at multiple locations and in multiple years. A linkage map consisting of 1451 SNPs and 73 simple sequence repeat (SSR) markers was constructed. A total of 33 additive quantitative trait loci (QTLs) for stem rot resistance were detected, and six of them with phenotypic variance explained of over 10% (
qSR.A01
-
2
,
qSR.A01
-
5
,
qSR.A05/B05
-
1
,
qSR.A05/B05
-
2
,
qSR.A07/B07
-
1
and
qSR.B05
-
1
) can be consistently detected in multiple years or locations. Besides, 12 pairs of QTLs with epistatic (additive × additive) interaction were identified. An additive QTL
qSR.A01
-
2
also with an epistatic effect interacted with a novel locus
qSR.B07_1
-
1
to affect the percentage of asymptomatic plants in a row. A total of 193 candidate genes within 38 stem rot QTLs intervals were annotated with functions of biotic stress resistance such as chitinase, ethylene-responsive transcription factors and pathogenesis-related proteins. The identified stem rot resistance QTLs, candidate genes, along with the associated SNP markers in this study, will benefit peanut molecular breeding programs for improving stem rot resistance.
Abstract
Cytonuclear coevolution is a common feature among plants, which coordinates gene expression and protein products between the nucleus and organelles. Consequently, lineage-specific ...differences may result in incompatibilities between the nucleus and cytoplasm in hybrid taxa. Allopolyploidy is also a common phenomenon in plant evolution. The hybrid nature of allopolyploids may result in cytonuclear incompatibilities, but the massive nuclear redundancy created during polyploidy affords additional avenues for resolving cytonuclear conflict (i.e. cytonuclear accommodation). Here we evaluate expression changes in organelle-targeted nuclear genes for 6 allopolyploid lineages that represent 4 genera (i.e. Arabidopsis, Arachis, Chenopodium, and Gossypium) and encompass a range in polyploid ages. Because incompatibilities between the nucleus and cytoplasm could potentially result in biases toward the maternal homoeolog and/or maternal expression level, we evaluate patterns of homoeolog usage, expression bias, and expression-level dominance in cytonuclear genes relative to the background of noncytonuclear expression changes and to the diploid parents. Although we find subsets of cytonuclear genes in most lineages that match our expectations of maternal preference, these observations are not consistent among either allopolyploids or categories of organelle-targeted genes. Our results indicate that cytonuclear expression evolution may be subtle and variable among genera and genes, likely reflecting a diversity of mechanisms to resolve nuclear-cytoplasmic incompatibilities in allopolyploid species.
Peanut (Arachis hypogaea; 2n = 4x = 40) is a nutritious food and a good source of vitamins, minerals, and healthy fats. Expansion of genetic and genomic resources for genetic enhancement of ...cultivated peanut has gained momentum from the sequenced genomes of the diploid ancestors of cultivated peanut. To facil- itate high-throughput genotyping of Arachis species, 20 genotypes were re-sequenced and genome-wide single nucleotide poiymorphisms (SNPs) were selected to develop a large-scale SNP genotyping array. For flexibility in genotyping applications, SNPs polymorphic between tetraploid and diploid species were included for use in cultivated and interspecific populations. A set of 384 accessions was used to test the array resulting in 54 564 markers that produced high-quality polymorphic clusters between diploid species, 47 116 polymorphic markers between cultivated and interspecific hybrids, and 15 897 polymorphic markers within A. hypogaea germplasm. An additional 1193 markers were identified that illuminated genomic re- gions exhibiting tetrasomic recombination. Furthermore, a set of elite cultivars that make up the pedigree of US runner germplasm were genotyped and used to identify genomic regions that have undergone pos- itive selection. These observations provide key insights on the inclusion of new genetic diversity in culti- vated peanut and will inform the development of high-resolution mapping populations. Due to its efficiency, scope, and flexibility, the newly developed SNP array will be very useful for further genetic and breeding applications in Arachis.
Common beans are an important food legume faced with a series of abiotic stresses the most severe of which is drought. The crop is interesting as a model for the analysis of gene phylogenies due to ...its domestication process, race structure, and origins in a group of wild common beans found along the South American Andes and the region of Mesoamerica. Meanwhile, the DREB2 transcription factors have been implicated in controlling non-ABA dependent responses to drought stress. With this in mind our objective was to study in depth the genetic diversity for two DREB2 genes as possible candidates for association with drought tolerance through a gene phylogenetic analysis. In this genetic diversity assessment, we analyzed nucleotide diversity at the two candidate genes
Dreb2A
and
Dreb2B
, in partial core collections of 104 wild and 297 cultivated common beans with a total of 401 common bean genotypes from world-wide germplasm analyzed. Our wild population sample covered a range of semi-mesic to very dry habitats, while our cultivated samples presented a wide spectrum of low to high drought tolerance. Both genes showed very different patterns of nucleotide variation.
Dreb2B
exhibited very low nucleotide diversity relative to neutral reference loci previously surveyed in these populations. This suggests that strong purifying selection has been acting on this gene. In contrast,
Dreb2A
exhibited higher levels of nucleotide diversity, which is indicative of adaptive selection and population expansion. These patterns were more distinct in wild compared to cultivated common beans. These approximations suggested the importance of
Dreb2
genes in the context of drought tolerance, and constitute the first steps towards an association study between genetic polymorphism of this gene family and variation in drought tolerance traits. We discuss the utility of allele mining in the DREB gene family for the discovery of new drought tolerance traits from wild common bean.
Although seed and pod traits are important for peanut breeding, little is known about the inheritance of these traits. A recombinant inbred line (RIL) population of 156 lines from a cross of ...Tifrunner x NC 3033 was genotyped with the Axiom_Arachis1 SNP array and SSRs to generate a genetic map composed of 1524 markers in 29 linkage groups (LG). The genetic positions of markers were compared with their physical positions on the peanut genome to confirm the validity of the linkage map and explore the distribution of recombination and potential chromosomal rearrangements. This linkage map was then used to identify Quantitative Trait Loci (QTL) for seed and pod traits that were phenotyped over three consecutive years for the purpose of developing trait-associated markers for breeding. Forty-nine QTL were identified in 14 LG for seed size index, kernel percentage, seed weight, pod weight, single-kernel, double-kernel, pod area and pod density. Twenty QTL demonstrated phenotypic variance explained (PVE) greater than 10% and eight more than 20%. Of note, seven of the eight major QTL for pod area, pod weight and seed weight (PVE >20% variance) were attributed to NC 3033 and located in a single linkage group, LG B06_1. In contrast, the most consistent QTL for kernel percentage were located on A07/B07 and derived from Tifrunner.