Genomic prediction is applicable to individuals of different breeds. Empirical results to date, however, show limited benefits in using information on multiple breeds in the context of genomic ...prediction. We investigated a multitask Bayesian model, presented previously by others, implemented in a Bayesian stochastic search variable selection (BSSVS) model. This model allowed for evidence of quantitative trait loci (QTL) to be accumulated across breeds or for both QTL that segregate across breeds and breed-specific QTL. In both cases, single nucleotide polymorphism effects were estimated with information from a single breed. Other models considered were a single-trait and multitrait genomic residual maximum likelihood (GREML) model, with breeds considered as different traits, and a single-trait BSSVS model. All single-trait models were applied to each of the 2 breeds separately and to the pooled data of both breeds. The data used included a training data set of 6,278 Holstein and 722 Jersey bulls, as well as 374 Jersey validation bulls. All animals had genotypes for 474,773 single nucleotide polymorphisms after editing and phenotypes for milk, fat, and protein yields. Using the same training data, BSSVS consistently outperformed GREML. The multitask BSSVS, however, did not outperform single-trait BSSVS, which used pooled Holstein and Jersey data for training. Thus, the rigorous assumption that the traits are the same in both breeds yielded a slightly better prediction than a model that had to estimate the correlation between the breeds from the data. Adding the Holstein data significantly increased the accuracy of the single-trait GREML and BSSVS in predicting the Jerseys for milk and protein, in line with estimated correlations between the breeds of 0.66 and 0.47 for milk and protein yields, whereas only the BSSVS model significantly improved the accuracy for fat yield with an estimated correlation between breeds of only 0.05. The relatively high genetic correlations for milk and protein yields, and the superiority of the pooling strategy, is likely the result of the observed admixture between both breeds in our data. The Bayesian model was able to detect several QTL in Holsteins, which likely enabled it to outperform GREML. The inability of the multitask Bayesian models to outperform a simple pooling strategy may be explained by the fact that the pooling strategy assumes equal effects in both breeds; furthermore, this assumption may be valid for moderate- to large-sized QTL, which are important for multibreed genomic prediction.
In livestock populations, estimation of breeding values for selection requires a matrix describing the additive relationship between individuals in the population. This matrix can be derived from ...pedigree information. In some livestock populations, pedigree information may be unavailable, incomplete, or in error. Here we use simulated data to demonstrate that marker-derived relationship matrices can be used to predict breeding values and estimate additive variance components, provided the markers are sufficiently dense. The approach is demonstrated for an Angus data set with 9,323 SNP markers genotyped.
Multiple-trait genome-wide association study (GWAS) analyses were compared with single-trait GWAS for power to discover and subsequently validate genetic markers (single nucleotide polymorphisms; ...SNP) associated with dairy traits. The SNP associations were discovered in 1 Holstein population and validated in both a Holstein population consisting of bulls younger than those in the discovery population and a Jersey population. The multivariate methods used were a principal component analysis and a series of bivariate analyses. The statistical power of detecting associations using multiple-trait GWAS was as good as or better than that of the best single-trait GWAS. Additional SNP associations were found with the multivariate methods that had not been discovered in the single-trait analyses; this was achieved without an increase in the false discovery rate. From the multivariate analysis, 4 common pleiotropic patterns were identified among the putative quantitative trait loci (QTL) affecting the Australian selection index. These patterns could be interpreted as a primary effect of the putative QTL on 1 or more milk components and secondary effects on other components. The multivariate analysis did not appear to increase the precision with which putative QTL were mapped.
Can we make genomic selection 100% accurate? Goddard, M.E.
Journal of animal breeding and genetics (1986),
August 2017, 2017-Aug, 2017-08-00, 20170801, Letnik:
134, Številka:
4
Journal Article
A deterministic model to calculate rates of genetic gain and inbreeding was used to compare a range of breeding scheme designs under genomic selection (GS) for a population of 140,000 cows. For most ...schemes it was assumed that the reliability of genomic breeding values (GEBV) was 0.6 across 4 pathways of selection. In addition, the effect of varying reliability on the ranking of schemes was also investigated. The schemes considered included intense selection in male pathways and genotyping of 1,000 young bulls (GS-Y). This scheme was extended to include selection in females and to include a “worldwide” scheme similar to GS-Y, but 6 times as large and assuming genotypes were freely exchanged between 6 countries. An additional worldwide scheme was modeled where GEBV were available through international genetic evaluations without exchange of genotypes. Finally, a closed nucleus herd that used juvenile in vitro embryo transfer in heifers was modeled so that the generation interval in female pathways was reduced to 1 or 2 yr. When the breeding schemes were compared using a GEBV reliability of 0.6, the rates of genetic gain were between 59 and 130% greater than the rate of genetic gain achieved in progeny testing. This was mainly through reducing the generation interval and increasing selection intensity. Genomic selection of females resulted in a 50% higher rate of genetic gain compared with restricting GS to young bulls only. The annual rates of inbreeding were, in general, 60% lower than with progeny testing, because more sires of bulls and sires of cows were selected, thus increasing the effective population size. The exception was in nucleus breeding schemes that had very short generation intervals, resulting in higher rates of both gain and inbreeding. It is likely that breeding companies will move rapidly to alter their breeding schemes to make use of genomic selection because benefits to the breeding companies and to the industry are considerable.
We propose a method (GREML-LDMS) to estimate heritability for human complex traits in unrelated individuals using whole-genome sequencing data. We demonstrate using simulations based on whole-genome ...sequencing data that ∼97% and ∼68% of variation at common and rare variants, respectively, can be captured by imputation. Using the GREML-LDMS method, we estimate from 44,126 unrelated individuals that all ∼17 million imputed variants explain 56% (standard error (s.e.) = 2.3%) of variance for height and 27% (s.e. = 2.5%) of variance for body mass index (BMI), and we find evidence that height- and BMI-associated variants have been under natural selection. Considering the imperfect tagging of imputation and potential overestimation of heritability from previous family-based studies, heritability is likely to be 60-70% for height and 30-40% for BMI. Therefore, the missing heritability is small for both traits. For further discovery of genes associated with complex traits, a study design with SNP arrays followed by imputation is more cost-effective than whole-genome sequencing at current prices.
A number of cattle breeds have become highly specialized for milk or beef production, following strong artificial selection for these traits. In this paper, we compare allele frequencies from 9323 ...single nucleotide polymorphism (SNP) markers genotyped in dairy and beef cattle breeds averaged in sliding windows across the genome, with the aim of identifying divergently selected regions of the genome between the production types. The value of the method for identifying selection signatures was validated by four sources of evidence. First, differences in allele frequencies between dairy and beef cattle at individual SNPs were correlated with the effects of those SNPs on production traits. Secondly, large differences in allele frequencies generally occurred in the same location for two independent data sets (correlation 0.45) between sliding window averages. Thirdly, the largest differences in sliding window average difference in allele frequencies were found on chromosome 20 in the region of the growth hormone receptor gene, which carries a mutation known to have an effect on milk production traits in a number of dairy populations. Finally, for the chromosome tested, the location of selection signatures between dairy and beef cattle was correlated with the location of selection signatures within dairy cattle.
The aim of this study was to assess the accuracy of genomic predictions for 19 traits including feed efficiency, growth, and carcass and meat quality traits in beef cattle. The 10,181 cattle in our ...study had real or imputed genotypes for 729,068 SNP although not all cattle were measured for all traits. Animals included Bos taurus, Brahman, composite, and crossbred animals. Genomic EBV (GEBV) were calculated using 2 methods of genomic prediction BayesR and genomic BLUP (GBLUP) either using a common training dataset for all breeds or using a training dataset comprising only animals of the same breed. Accuracies of GEBV were assessed using 5-fold cross-validation. The accuracy of genomic prediction varied by trait and by method. Traits with a large number of recorded and genotyped animals and with high heritability gave the greatest accuracy of GEBV. Using GBLUP, the average accuracy was 0.27 across traits and breeds, but the accuracies between breeds and between traits varied widely. When the training population was restricted to animals from the same breed as the validation population, GBLUP accuracies declined by an average of 0.04. The greatest decline in accuracy was found for the 4 composite breeds. The BayesR accuracies were greater by an average of 0.03 than GBLUP accuracies, particularly for traits with known genes of moderate to large effect mutations segregating. The accuracies of 0.43 to 0.48 for IGF-I traits were among the greatest in the study. Although accuracies are low compared with those observed in dairy cattle, genomic selection would still be beneficial for traits that are hard to improve by conventional selection, such as tenderness and residual feed intake. BayesR identified many of the same quantitative trait loci as a genomewide association study but appeared to map them more precisely. All traits appear to be highly polygenic with thousands of SNP independently associated with each trait.
Selection of animals for breeding ranked on estimated breeding value maximizes genetic gain in the next generation but does not necessarily maximize long-term response. An alternative method, as ...practiced by plant breeders, is to build a desired genotype by selection on specific loci. Maximal long-term response in animal breeding requires selection on estimated breeding values with constraints on coancestry. In this paper, we compared long-term genetic response using either a genotype building or a genomic estimated breeding value (GEBV) strategy for the Australian Selection Index (ASI), a measure of profit. First, we used real marker effects from the Australian Dairy Herd Improvement Scheme to estimate breeding values for chromosome segments (approximately 25cM long) for 2,650 Holstein bulls. Second, we selected 16 animals to be founders for a simulated breeding program where, between them, founders contain the best possible combination of 2 segments from 2 animals at each position in the genome. Third, we mated founder animals and their descendants over 30 generations with 2 breeding objectives: (1) to create a population with the “ideal genotype,” where the best 2 segments from the founders segregate at each position, or (2) obtain the highest possible response in ASI with coancestry lower than that achieved under breeding objective 1. Results show that genotype building achieved the ideal genotype for breeding objective 1 and obtained a large gain in ASI over the current population (+A$864.99). However, selection on overall GEBV had greater short-term response and almost as much long-term gain (+A$820.42). When coancestry was lowered under breeding objective 2, selection on overall GEBV achieved a higher response in ASI than the genotype building strategy. Selection on overall GEBV seems more flexible in its selection decisions and was therefore better able to precisely control coancestry while maximizing ASI. We conclude that selection on overall GEBV while minimizing average coancestry is the more practical strategy for dairy cattle where selection is for highly polygenic traits, the reproductive rate is relatively low, and there is low tolerance of coancestry. The outcome may be different for traits controlled by few loci of relatively large effects or for different species. In contrast to other simulations, our results indicate that response to selection on overall GEBV may continue for several generations. This is because long-term genetic change in complex traits requires favorable changes to allele frequencies for many loci located throughout the genome.
The aim of this study was to investigate the feasibility of using mid-infrared (MIR) spectroscopy analysis of milk samples to increase the power and precision of genome-wide association studies ...(GWAS) for milk composition and to better distinguish linked quantitative trait loci (QTL). To achieve this goal, we analyzed phenotypic data of milk composition traits, related MIR spectra, and genotypic data comprising 626,777 SNP on 5,202 Holstein, Jersey, and crossbred cows. We performed a conventional GWAS on protein, lactose, fat, and fatty acid concentrations in milk, a GWAS on individual MIR wavenumbers, and a partial least squares regression (PLS), which is equivalent to a multi-trait GWAS, exploiting MIR data simultaneously to predict SNP genotypes. The PLS detected most of the QTL identified using single-trait GWAS, usually with a higher significance value, as well as previously undetected QTL for milk composition. Each QTL tends to have a different pattern of effects across the MIR spectrum and this explains the increased power. Because SNP tracking different QTL tend to have different patterns of effect, it was possible to distinguish closely linked QTL. Overall, the results of this study suggest that using MIR data through either GWAS or PLS analysis applied to genomic data can provide a powerful tool to distinguish milk composition QTL.