Genomic selection (GS) facilitates the rapid selection of superior genotypes and accelerates the breeding cycle. In this review, we discuss the history, principles, and basis of GS and ...genomic-enabled prediction (GP) as well as the genetics and statistical complexities of GP models, including genomic genotype×environment (G×E) interactions. We also examine the accuracy of GP models and methods for two cereal crops and two legume crops based on random cross-validation. GS applied to maize breeding has shown tangible genetic gains. Based on GP results, we speculate how GS in germplasm enhancement (i.e., prebreeding) programs could accelerate the flow of genes from gene bank accessions to elite lines. Recent advances in hyperspectral image technology could be combined with GS and pedigree-assisted breeding.
In recent years, the global climate has changed, resulting in drastic fluctuations in rainfall patterns and increasing temperature. Sudden climate changes can cause significant economic losses to countries worldwide.
Genetic improvement of several economically important crops during the 20th century using phenotypic, pedigree, and performance data was very successful. However, signs of grain yield stagnation in some crops, especially in drought-stressed and semi-arid regions, are evident.
Genomic selection offers the opportunity to increase grain production in less time. International Maize and Wheat Improvement Center (CIMMYT) maize breeding research in Sub-Saharan Africa, India, and Mexico has shown that genomic selection can reduce the breeding interval cycle to at least half the conventional time and produces lines that, in hybrid combinations, significantly increase grain yield performance over that of commercial checks.
Public and private investment in crop genomic selection research should increase to successfully develop in less time germplasm that is adapted to sudden climate change.
Although hybrid crop varieties are among the most popular agricultural innovations, the rationale for hybrid crop breeding is sometimes misunderstood. Hybrid breeding is slower and more ...resource-intensive than inbred breeding, but it allows systematic improvement of a population by recurrent selection and exploitation of heterosis simultaneously. Inbred parental lines can identically reproduce both themselves and their F
progeny indefinitely, whereas outbred lines cannot, so uniform outbred lines must be bred indirectly through their inbred parents to harness heterosis. Heterosis is an expected consequence of whole-genome non-additive effects at the population level over evolutionary time. Understanding heterosis from the perspective of molecular genetic mechanisms alone may be elusive, because heterosis is likely an emergent property of populations. Hybrid breeding is a process of recurrent population improvement to maximize hybrid performance. Hybrid breeding is not maximization of heterosis
, nor testing random combinations of individuals to find an exceptional hybrid, nor using heterosis in place of population improvement. Though there are methods to harness heterosis other than hybrid breeding, such as use of open-pollinated varieties or clonal propagation, they are not currently suitable for all crops or production environments. The use of genomic selection can decrease cycle time and costs in hybrid breeding, particularly by rapidly establishing heterotic pools, reducing testcrossing, and limiting the loss of genetic variance. Open questions in optimal use of genomic selection in hybrid crop breeding programs remain, such as how to choose founders of heterotic pools, the importance of dominance effects in genomic prediction, the necessary frequency of updating the training set with phenotypic information, and how to maintain genetic variance and prevent fixation of deleterious alleles.
Genome-wide association study (GWAS) and genomic prediction/selection (GP/GS) are the two essential enterprises in genomic research. Due to the great magnitude and complexity of genomic and ...phenotypic data, analytical methods and their associated software packages are frequently advanced. GAPIT is a widely-used genomic association and prediction integrated tool as an R package. The first version was released to the public in 2012 with the implementation of the general linear model (GLM), mixed linear model (MLM), compressed MLM (CMLM), and genomic best linear unbiased prediction (gBLUP). The second version was released in 2016 with several new implementations, including enriched CMLM (ECMLM) and settlement of MLMs under progressively exclusive relationship (SUPER). All the GWAS methods are based on the single-locus test. For the first time, in the current release of GAPIT, version 3 implemented three multi-locus test methods, including multiple loci mixed model (MLMM), fixed and random model circulating probability unification (FarmCPU), and Bayesian-information and linkage-disequilibrium iteratively nested keyway (BLINK). Additionally, two GP/GS methods were implemented based on CMLM (named compressed BLUP; cBLUP) and SUPER (named SUPER BLUP; sBLUP). These new implementations not only boost statistical power for GWAS and prediction accuracy for GP/GS, but also improve computing speed and increase the capacity to analyze big genomic data. Here, we document the current upgrade of GAPIT by describing the selection of the recently developed methods, their implementations, and potential impact. All documents, including source code, user manual, demo data, and tutorials, are freely available at the GAPIT website (http://zzlab.net/GAPIT).
The Finnish Ayrshire (FAY) belongs to the Nordic Red breeds and is characterized by high milk yield, high milk components, good fertility, and functional conformation. The FAY breeding program is ...based on genomic selection. Despite the benefits of selection on breeding values, autozygosity in the genome may increase due to selection, and increased autozygosity may cause inbreeding depression in selected traits. However, there is lack of studies concerning selection signatures in the FAY after genomic selection introduction.
The aim of this study was to identify signatures of selection in FAY after the introduction of genomic selection. Genomic data included 45,834 SNPs. The genotyped animals were divided into 2 groups: animals born before genomic selection introduction (6,108 cows) and animals born after genomic selection introduction (47,361 cows). We identified the selection signatures using 3 complementary methods: 2 based on identification of selection signatures from runs of homozygosity (ROH) islands and one based on the decay of site-specific extended haplotype between populations at SNP sites (Rsb).
In total, we identified 34 ROH islands on chromosomes 1, 3, 6, 8, 12–15, 17, 19, 22, and 26 in FAY animals born before genomic selection (between 1980 and 2011) and 30 ROH islands on chromosomes 1–3, 13–17, 22, and 25–26 in FAY animals born after genomic selection introduction (between 2015 and 2020). We additionally detected 22 ΔROH islands on chromosomes 2–3, 11, 13, 14, 16, 18, 20, and 25–26. Finally, a total of 31 Rsb regions on chromosomes 2, 3, 14, 18, 20, and 25 were identified.
Based on the results, genomic selection has favored certain alleles and haplotypes on genomic regions related to traits relevant in the FAY breeding program: milk production, fertility, growth, beef production traits, and feed efficiency. Several genes related to these traits, e.g., PLA2G4A, MECR, CHUK, COX15, RICTOR, SHISA9, and SEMA4G overlapped or partially overlapped the observed selection signature regions. The association of genotypes within these regions and their effects on traits relevant in the FAY breeding program should be studied and genetic regions undergoing selection monitored in the FAY population.
With marker and phenotype information from observed populations, genomic selection (GS) can be used to establish associations between markers and phenotypes. It aims to use genome-wide markers to ...estimate the effects of all loci and thereby predict the genetic values of untested populations, so as to achieve more comprehensive and reliable selection and to accelerate genetic progress in crop breeding. GS models usually face the problem that the number of markers is much higher than the number of phenotypic observations. To overcome this issue and improve prediction accuracy, many models and algorithms, including GBLUP, Bayes, and machine learning have been employed for GS. As hot issues in GS research, the estimation of non-additive genetic effects and the combined analysis of multiple traits or multiple environments are also important for improving the accuracy of prediction. In recent years, crop breeding has taken advantage of the development of GS. The principles and characteristics of current popular GS methods and research progress in these methods for crop improvement are reviewed in this paper.
Genomic selection (GS) models use genome-wide genetic information to predict genetic values of candidates of selection. Originally, these models were developed without considering genotype × ...environment interaction(G×E). Several authors have proposed extensions of the single-environment GS model that accommodate G×E using either covariance functions or environmental covariates. In this study, we model G×E using a marker × environment interaction (M×E) GS model; the approach is conceptually simple and can be implemented with existing GS software. We discuss how the model can be implemented by using an explicit regression of phenotypes on markers or using co-variance structures (a genomic best linear unbiased prediction-type model). We used the M×E model to analyze three CIMMYT wheat data sets (W1, W2, and W3), where more than 1000 lines were genotyped using genotyping-by-sequencing and evaluated at CIMMYT's research station in Ciudad Obregon, Mexico, under simulated environmental conditions that covered different irrigation levels, sowing dates and planting systems. We compared the M×E model with a stratified (i.e., within-environment) analysis and with a standard (across-environment) GS model that assumes that effects are constant across environments (i.e., ignoring G×E). The prediction accuracy of the M×E model was substantially greater of that of an across-environment analysis that ignores G×E. Depending on the prediction problem, the M×E model had either similar or greater levels of prediction accuracy than the stratified analyses. The M×E model decomposes marker effects and genomic values into components that are stable across environments (main effects) and others that are environment-specific (interactions). Therefore, in principle, the interaction model could shed light over which variants have effects that are stable across environments and which ones are responsible for G×E. The data set and the scripts required to reproduce the analysis are publicly available as Supporting Information.
IntroductionSoybean stem diameter (SD) and branch diameter (BD) are closely related traits, and genetic clarification of SD and BD is crucial for soybean breeding.MethodsSD and BD were genetically ...analyzed by a population of 363 RIL derived from the cross between Zhongdou41 (ZD41) and ZYD02878 using restricted two-stage multi-locus genome-wide association, inclusive composite interval mapping, and three-variance component multi-locus random SNP effect mixed linear modeling. Then candidate genes of major QTLs were selected and genetic selection model of SD and BD were constructed respectively.Results and discussionThe results showed that SD and BD were significantly correlated (r = 0.74, P < 0.001). A total of 93 and 84 unique quantitative trait loci (QTL) were detected for SD and BD, respectively by three different methods. There were two and ten major QTLs for SD and BD, respectively, with phenotypic variance explained (PVE) by more than 10%. Within these loci, seven genes involved in the regulation of phytohormones (IAA and GA) and cell proliferation and showing extensive expression of shoot apical meristematic genes were selected as candidate genes. Genomic selection (GS) analysis showed that the trait-associated markers identified in this study reached 0.47-0.73 in terms of prediction accuracy, which was enhanced by 6.56-23.69% compared with genome-wide markers. These results clarify the genetic basis of SD and BD, which laid solid foundation in regulation gene cloning, and GS models constructed could be potentially applied in future breeding programs.
In the last decade, genomic selection has become a standard in the genetic evaluation of livestock populations. However, most procedures for the implementation of genomic selection only consider the ...additive effects associated with SNP (Single Nucleotide Polymorphism) markers used to calculate the prediction of the breeding values of candidates for selection. Nevertheless, the availability of estimates of non-additive effects is of interest because: (i) they contribute to an increase in the accuracy of the prediction of breeding values and the genetic response; (ii) they allow the definition of mate allocation procedures between candidates for selection; and (iii) they can be used to enhance non-additive genetic variation through the definition of appropriate crossbreeding or purebred breeding schemes. This study presents a review of methods for the incorporation of non-additive genetic effects into genomic selection procedures and their potential applications in the prediction of future performance, mate allocation, crossbreeding, and purebred selection. The work concludes with a brief outline of some ideas for future lines of that may help the standard inclusion of non-additive effects in genomic selection.
•Genome selection (GS) considers marker effects across the whole genome.•The use of high-density markers is one of the features of GS.•GS is based on two distinct and related groups: training and ...breeding populations.•Phenotyping is a key informant in GS to build up accuracy of statistical models.•GS may revolutionize plant and tree breeding practices.
Association analysis is used to measure relations between markers and quantitative trait loci (QTL). Their estimation ignores genes with small effects that trigger underpinning quantitative traits. By contrast, genome-wide selection estimates marker effects across the whole genome on the target population based on a prediction model developed in the training population (TP). Whole-genome prediction models estimate all marker effects in all loci and capture small QTL effects. Here, we review several genomic selection (GS) models with respect to both the prediction accuracy and genetic gain from selection. Phenotypic selection or marker-assisted breeding protocols can be replaced by selection, based on whole-genome predictions in which phenotyping updates the model to build up the prediction accuracy.
Plantation‐grown trees have to cope with an increasing pressure of pest and disease in the context of climate change, and breeding approaches using genomics may offer efficient and flexible tools to ...face this pressure. In the present study, we targeted genetic improvement of resistance of an introduced conifer species in Canada, Norway spruce (Picea abies (L.) Karst.), to the native white pine weevil (Pissodes strobi Peck). We developed single‐ and multi‐trait genomic selection (GS) models and selection indices considering the relationships between weevil resistance, intrinsic wood quality, and growth traits. Weevil resistance, acoustic velocity as a proxy for mechanical wood stiffness, and average wood density showed moderate‐to‐high heritability and low genotype‐by‐environment interactions. Weevil resistance was genetically positively correlated with tree height, height‐to‐diameter at breast height (DBH) ratio, and acoustic velocity. The accuracy of the different GS models tested (GBLUP, threshold GBLUP, Bayesian ridge regression, BayesCπ) was high and did not differ among each other. Multi‐trait models performed similarly as single‐trait models when all trees were phenotyped. However, when weevil attack data were not available for all trees, weevil resistance was more accurately predicted by integrating genetically correlated growth traits into multi‐trait GS models. A GS index that corresponded to the breeders’ priorities achieved near maximum gains for weevil resistance, acoustic velocity, and height growth, but a small decrease for DBH. The results of this study indicate that it is possible to breed for high‐quality, weevil‐resistant Norway spruce reforestation stock with high accuracy achieved from single‐trait or multi‐trait GS.