Soybean (Glycine max) is one of the most important crop plants for seed protein and oil content, and for its capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms. We sequenced the 1.1-gigabase genome by a whole-genome shotgun approach and integrated it with physical and high-density genetic maps to create a chromosome-scale draft sequence assembly. We predict 46,430 protein-coding genes, 70% more than Arabidopsis and similar to the poplar genome which, like soybean, is an ancient polyploid (palaeopolyploid). About 78% of the predicted genes occur in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. Genome duplications occurred at approximately 59 and 13 million years ago, resulting in a highly duplicated genome with nearly 75% of the genes present in multiple copies. The two duplication events were followed by gene diversification and loss, and numerous chromosome rearrangements. An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.
The Soybean Consensus Map 4.0 facilitated the anchoring of 95.6% of the soybean whole genome sequence developed by the Joint Genome Institute, Department of Energy, but its marker density was only sufficient to properly orient 66% of the sequence scaffolds. The discovery and genetic mapping of more single nucleotide polymorphism (SNP) markers were needed to anchor and orient the remaining genome sequence. To that end, next generation sequencing and high-throughput genotyping were combined to obtain a much higher resolution genetic map that could be used to anchor and orient most of the remaining sequence and to help validate the integrity of the existing scaffold builds.
A total of 7,108 to 25,047 predicted SNPs were discovered using a reduced representation library that was subsequently sequenced by the Illumina sequence-by-synthesis method on the clonal single molecule array platform. Using multiple SNP prediction methods, the validation rate of these SNPs ranged from 79% to 92.5%. A high resolution genetic map using 444 recombinant inbred lines was created with 1,790 SNP markers. Of the 1,790 mapped SNP markers, 1,240 markers had been selectively chosen to target existing unanchored or un-oriented sequence scaffolds, thereby increasing the amount of anchored sequence to 97%.
We have demonstrated how next generation sequencing was combined with high-throughput SNP detection assays to quickly discover large numbers of SNPs. Those SNPs were then used to create a high resolution genetic map that assisted in the assembly of scaffolds from the 8x whole genome shotgun sequences into pseudomolecules corresponding to chromosomes of the organism.
Rust and downy mildew (DM) are two important sunflower diseases that lead to significant yield losses globally. The use of resistant hybrids to control rust and DM in sunflower has a long history. The rust resistance genes, R13a and R16, were previously mapped to a 3.4 Mb region at the lower end of sunflower chromosome 13, while the DM resistance gene, Pl33, was previously mapped to a 4.2 Mb region located at the upper end of chromosome 4. High-resolution fine mapping was conducted using whole genome sequencing of HA-R6 (R13a) and TX16R (R16 and Pl33) and large segregated populations. R13a and R16 were fine mapped to a 0.48 cM region in chromosome 13 corresponding to a 790 kb physical interval on the XRQr1.0 genome assembly. Four disease defense-related genes with nucleotide-binding leucine-rich repeat (NLR) motifs were found in this region from XRQr1.0 gene annotation as candidate genes for R13a and R16. Pl33 was fine mapped to a 0.04 cM region in chromosome 4 corresponding to a 63 kb physical interval. One NLR gene, HanXRQChr04g0095641, was predicted as the candidate gene for Pl33. The diagnostic SNP markers developed for each gene in the current study will facilitate marker-assisted selections of resistance genes in sunflower breeding programs.
The objective of this research was to identify single nucleotide polymorphisms (SNPs) and to develop an Illumina Infinium BeadChip that contained over 50,000 SNPs from soybean (Glycine max L. Merr.). A total of 498,921,777 reads 35-45 bp in length were obtained from DNA sequence analysis of reduced representation libraries from several soybean accessions which included six cultivated and two wild soybean (G. soja Sieb. et Zucc.) genotypes. These reads were mapped to the soybean whole genome sequence and 209,903 SNPs were identified. After applying several filters, a total of 146,161 of the 209,903 SNPs were determined to be ideal candidates for Illumina Infinium II BeadChip design. To equalize the distance between selected SNPs, increase assay success rate, and minimize the number of SNPs with low minor allele frequency, an iteration algorithm based on a selection index was developed and used to select 60,800 SNPs for Infinium BeadChip design. Of the 60,800 SNPs, 50,701 were targeted to euchromatic regions and 10,000 to heterochromatic regions of the 20 soybean chromosomes. In addition, 99 SNPs were targeted to unanchored sequence scaffolds. Of the 60,800 SNPs, a total of 52,041 passed Illumina's manufacturing phase to produce the SoySNP50K iSelect BeadChip. Validation of the SoySNP50K chip with 96 landrace genotypes, 96 elite cultivars and 96 wild soybean accessions showed that 47,337 SNPs were polymorphic and generated successful SNP allele calls. In addition, 40,841 of the 47,337 SNPs (86%) had minor allele frequencies ≥ 10% among the landraces, elite cultivars and the wild soybean accessions. A total of 620 and 42 candidate regions which may be associated with domestication and recent selection were identified, respectively. The SoySNP50K iSelect SNP beadchip will be a powerful tool for characterizing soybean genetic diversity and linkage disequilibrium, and for constructing high resolution linkage maps to improve the soybean whole genome sequence assembly.
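As a rough illustration of the minor allele frequency (MAF) criterion mentioned in the abstract above, the following Python sketch computes MAF for a single biallelic SNP and applies a 10% cutoff. The function names, the allele encoding, and the toy allele calls are assumptions made for illustration; they are not taken from the study's pipeline. The reported counts are included only as an arithmetic check.

```python
from collections import Counter

MAF_THRESHOLD = 0.10  # cutoff quoted in the abstract (MAF >= 10%)


def minor_allele_frequency(alleles):
    """Return the frequency of the rarer allele at one biallelic SNP.

    `alleles` is a hypothetical flat list of allele calls (e.g. "A", "G");
    missing calls are encoded as None and ignored.
    """
    counts = Counter(a for a in alleles if a is not None)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no called alleles")
    if len(counts) > 2:
        raise ValueError("expected a biallelic SNP")
    if len(counts) == 1:
        return 0.0  # monomorphic in this sample
    return min(counts.values()) / total


def passes_maf(alleles, threshold=MAF_THRESHOLD):
    """True if the SNP's MAF is at or above the threshold."""
    return minor_allele_frequency(alleles) >= threshold


# Counts reported in the abstract, used only for an arithmetic check.
TARGETED = {"euchromatic": 50_701, "heterochromatic": 10_000, "unanchored": 99}
POLYMORPHIC = 47_337
MAF_AT_LEAST_10 = 40_841

total_selected = sum(TARGETED.values())        # SNPs selected for chip design
share_common = MAF_AT_LEAST_10 / POLYMORPHIC   # fraction with MAF >= 10%
```

With the reported counts, `total_selected` comes to 60,800 and `share_common` is about 0.86, which agrees with the figures quoted in the abstract.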
Seed protein, oil content and yield are highly correlated agronomically important traits that essentially account for the economic value of soybean. The underlying molecular mechanisms and selection of these correlated seed traits during soybean domestication are, however, less known. Here, we demonstrate that a CCT gene, POWR1, underlies a large-effect protein/oil QTL. A causative TE insertion truncates its CCT domain and substantially increases seed oil content, weight, and yield while decreasing protein content. POWR1 pleiotropically controls these traits likely through regulating seed nutrient transport and lipid metabolism genes. POWR1 is also a domestication gene. We hypothesize that the TE insertion allele is exclusively fixed in cultivated soybean due to selection for larger seeds during domestication, which significantly contributes to shaping soybean with increased yield/seed weight/oil but reduced protein content. This study provides insights into soybean domestication and is significant in improving seed quality and yield in soybean and other crop species.
Association analysis is an alternative to conventional family-based methods to detect the location of gene(s) or quantitative trait loci (QTL) and provides relatively high resolution in terms of defining the genome position of a gene or QTL. Seed protein and oil concentration are quantitative traits which are determined by the interaction among many genes with small to moderate genetic effects and their interaction with the environment. In this study, a genome-wide association study (GWAS) was performed to identify QTL controlling seed protein and oil concentration in 298 soybean germplasm accessions exhibiting a wide range of seed protein and oil content.
A total of 55,159 single nucleotide polymorphisms (SNPs) were genotyped using various methods including Illumina Infinium and GoldenGate assays and 31,954 markers with minor allele frequency >0.10 were used to estimate linkage disequilibrium (LD) in heterochromatic and euchromatic regions. In euchromatic regions, the mean LD (r²) rapidly declined to 0.2 within 360 Kbp, whereas the mean LD declined to 0.2 at 9,600 Kbp in heterochromatic regions. The GWAS results identified 40 SNPs in 17 different genomic regions significantly associated with seed protein. Of these, the five SNPs with the highest associations and seven adjacent SNPs were located in the 27.6-30.0 Mbp region of Gm20. A major seed protein QTL has been previously mapped to the same location and potential candidate genes have recently been identified in this region. The GWAS results also detected 25 SNPs in 13 different genomic regions associated with seed oil. Of these markers, seven SNPs had a significant association with both protein and oil.
This research indicated that GWAS not only identified most of the previously reported QTL controlling seed protein and oil, but also resulted in narrower genomic regions than the regions reported as containing these QTL. The narrower GWAS-defined genome regions will allow more precise marker-assisted allele selection and will expedite positional cloning of the causal gene(s).
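The LD decay figures reported in this abstract rest on a pairwise r² statistic between SNPs. As a minimal sketch, assuming phased haplotypes coded 0/1 (a simplification that is plausible for largely homozygous inbred accessions, but is not stated in the abstract), r² and a decay distance could be computed as follows. The function names and the binned input format are hypothetical, and the abstract does not say how its own estimates were obtained.

```python
def ld_r2(hap_a, hap_b):
    """Squared correlation (r^2) between two biallelic loci.

    `hap_a` and `hap_b` are equal-length lists of 0/1 allele codes,
    one entry per haplotype (assumed encoding).
    """
    n = len(hap_a)
    if n == 0 or n != len(hap_b):
        raise ValueError("haplotype lists must be non-empty and equal length")
    p_a = sum(hap_a) / n                      # frequency of allele 1 at locus A
    p_b = sum(hap_b) / n                      # frequency of allele 1 at locus B
    p_ab = sum(1 for a, b in zip(hap_a, hap_b) if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                      # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined for a monomorphic locus")
    return d * d / denom


def decay_distance(binned, threshold=0.2):
    """Return the first distance at which mean r^2 is at or below `threshold`.

    `binned` is a list of (distance_kbp, mean_r2) pairs sorted by distance.
    Returns None if the threshold is never reached.
    """
    for distance_kbp, mean_r2 in binned:
        if mean_r2 <= threshold:
            return distance_kbp
    return None
```

For example, two loci with identical haplotype patterns give an r² of 1, while loci whose alleles are statistically independent give 0. The 0.2 default mirrors the threshold quoted in the abstract.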
Downy mildew (DM) is one of the severe biotic threats to sunflower production worldwide. The inciting pathogen, Plasmopara halstedii, can overwinter in the field for years, creating a persistent threat to sunflower. The dominant genes Pl18 and Pl20 conferring resistance to known DM races have been previously mapped to 1.5 and 1.8 cM intervals on sunflower chromosomes 2 and 8, respectively. Utilizing a whole-genome resequencing strategy combined with reference sequence-based chromosome walking and high-density mapping in the present study, Pl18 was placed in a 0.7 cM interval on chromosome 2. A candidate gene HanXRQChr02g0048181 for Pl18 was identified from the XRQ reference genome and predicted to encode a protein with typical NLR domains for disease resistance. The Pl20 gene was placed in a 0.2 cM interval on chromosome 8. The putative gene with the NLR domain for Pl20, HanXRQChr08g0210051, was identified within the Pl20 interval. SNP markers closely linked to Pl18 and Pl20 were evaluated with 96 diverse sunflower lines, and a total of 13 diagnostic markers for Pl18 and four for Pl20 were identified. These markers will facilitate the transfer of these new genes to elite sunflower lines and the pyramiding of these genes with broad-spectrum DM resistance in sunflower breeding.
Soybean (Glycine max) is a photoperiod-sensitive and self-pollinated species. Days to flowering (DTF) and maturity (DTM), duration of flowering-to-maturity (DFTM) and plant height (PH) are crucial for soybean adaptability and yield. To dissect the genetic architecture of these agronomically important traits, a population consisting of 309 early maturity soybean germplasm accessions was genotyped with the Illumina Infinium SoySNP50K BeadChip and phenotyped in multiple environments. A genome-wide association study (GWAS) was conducted using a mixed linear model that involves both relative kinship and population structure.
The linkage disequilibrium (LD) decayed slowly in soybean, and a substantial difference in LD pattern was observed between euchromatic and heterochromatic regions. A total of 27, 6, 18 and 27 loci for DTF, DTM, DFTM and PH were detected via GWAS, respectively. The Dt1 gene was identified in the locus strongly associated with both DTM and PH. Ten candidate genes homologous to Arabidopsis flowering genes were identified near the peak single nucleotide polymorphisms (SNPs) associated with DTF. Four of them encode MADS-domain containing proteins. Additionally, a pectin lyase-like gene was also identified in a major-effect locus for PH where LD decayed rapidly.
This study identified multiple new loci and refined chromosomal regions of known loci associated with DTF, DTM, DFTM and/or PH in soybean. It demonstrates that GWAS is powerful in dissecting complex traits and identifying candidate genes even though LD decayed slowly in soybean. The loci and trait-associated SNPs identified in this study can be used for soybean genetic improvement; in particular, the major-effect loci associated with PH could be used to improve soybean yield potential. The candidate genes may serve as promising targets for studies of molecular mechanisms underlying the related traits in soybean.
Anthracnose is a seed-borne disease of common bean (Phaseolus vulgaris L.) caused by the fungus Colletotrichum lindemuthianum, and the pathogen is cosmopolitan in distribution. The objectives of this study were to identify new sources of anthracnose resistance in a diverse panel of 230 Andean beans comprised of multiple seed types and market classes from the Americas, Africa, and Europe, and explore the genetic basis of this resistance using genome-wide association mapping analysis (GWAS). Twenty-eight of the 230 lines tested were resistant to six out of the eight races screened, but only one cultivar, Uyole98, was resistant to all eight races (7, 39, 55, 65, 73, 109, 2047, and 3481) included in the study. Outputs from the GWAS indicated major quantitative trait loci (QTL) for resistance on chromosomes Pv01, Pv02, and Pv04 and two minor QTL on Pv10 and Pv11. Candidate genes associated with the significant SNPs were detected on all five chromosomes. An independent QTL study was conducted to confirm the physical location of the Co-1 locus identified on Pv01 in an F4:6 recombinant inbred line (RIL) population. Resistance was determined to be conditioned by the single dominant gene Co-1 that mapped between 50.16 and 50.30 Mb on Pv01, and an InDel marker (NDSU_IND_1_50.2219) tightly linked to the gene was developed. The information reported will provide breeders with new and diverse sources of resistance and genomic regions to target in the development of anthracnose resistance in Andean beans.
Symbiotic nitrogen fixation differs among Bradyrhizobium japonicum strains. Soybean inoculated with USDA123 has a lower yield than soybean inoculated with strains known to have high nitrogen fixation efficiency, such as USDA110. In the main soybean-producing area in the Midwest of the United States, USDA123 has a high nodule incidence in field-grown soybean and is competitive but inefficient in nitrogen fixation. In this study, a high-throughput system was developed to characterize nodule number among 1,321 Glycine max and 69 Glycine soja accessions singly inoculated with USDA110 and USDA123.
Seventy-three G. max accessions with significantly different nodule numbers between USDA110 and USDA123 were identified. After double inoculating 35 of the 73 accessions, it was observed that PI189939, PI317335, PI324187B, PI548461, PI562373, and PI628961 were occupied by USDA110 and double-strain nodules but not by USDA123 nodules alone. PI567624 was only occupied by USDA110 nodules, and PI507429 restricted all strains. Analysis showed that 35 loci were associated with nodule number in G. max when inoculated with strain USDA110 and 35 loci with USDA123. Twenty-three loci were identified in G. soja when inoculated with strain USDA110 and 34 with USDA123. Only four loci were common across the two treatments, and each locus could only explain 0.8 to 1.5% of phenotypic variation.
High-throughput phenotyping systems to characterize nodule number and occupancy were developed, and soybean germplasm restricting rhizobium strain USDA123 but preferring USDA110 was identified. The large number of minor-effect loci and the small number of common loci controlling nodule number indicated genetic complexity of the trait and strain-dependent nodulation restriction. The information from the present study will contribute to the development of cultivars that limit USDA123, thereby increasing nitrogen fixation efficiency and productivity.