Sequencing data enable the detection of causal loci or single nucleotide polymorphisms (SNPs) highly linked to causal loci to improve genomic prediction. However, until now, studies on integrating ...such SNPs using a single-step genomic best linear unbiased prediction (ssGBLUP) model are scarce. We investigated the integration of sequencing SNPs selected by association (1262 SNPs) and bioinformatics (2359 SNPs) analyses into the currently used 54K-SNP chip, using three ssGBLUP models which make different assumptions on the distribution of SNP effects: a basic ssGBLUP model, a so-called featured ssGBLUP (ssFGBLUP) model that considered selected sequencing SNPs as a feature genetic component, and a weighted ssGBLUP (ssWGBLUP) model in which the genomic relationship matrix was weighted by the SNP variances estimated from a Bayesian whole-genome regression model, with every 1, 30, or 100 adjacent SNPs within a chromosome region sharing the same variance. We used data on milk production and female fertility in Danish Jersey. In total, 15,823 genotyped and 528,981 non-genotyped females born between 1990 and 2013 were used as reference population and 7415 genotyped females and 33,040 non-genotyped females born between 2014 and 2016 were used as validation population.
With basic ssGBLUP, integrating SNPs selected from sequencing data improved prediction reliabilities for milk and protein yields, but resulted in limited or no improvement for fat yield and female fertility. Model performances depended on the SNP set used. When using ssWGBLUP with the 54K SNPs, reliabilities for milk and protein yields improved by 0.028 for genotyped animals and by 0.006 for non-genotyped animals compared with ssGBLUP. However, with the SNP set that included SNPs selected from sequencing data, no statistically significant difference in prediction reliability was observed between the three ssGBLUP models.
In summary, when using 54K SNPs, a ssWGBLUP model with a common weight on the SNPs in a given region is a feasible approach for single-trait genetic evaluation. Integrating relevant SNPs selected from sequencing data into the standard SNP chip can improve the reliability of genomic prediction. Based on such SNP data, a basic ssGBLUP model was suggested since no significant improvement was observed from using alternative models such as ssWGBLUP and ssFGBLUP.
Genotype imputation, often focused on SNP and small insertions and deletions (indels; size ≤50 bp), is a crucial step for association mapping and estimation of genomic breeding values. Here, we ...present strategies to impute genotypes for large chromosomal deletions (size >50 bp), along with SNP and indels in cattle. The pipelines include a strategy for extending the whole-genome sequence reference panel for large deletions, a 2-step genotype refinement approach using Beagle4 and SHAPEIT2 software, and finally, joint imputation of SNP, indels, and large deletions to the existing SNP array-typed population using Minimac3 software. Using these pipelines we achieved an imputation accuracy of the squared Pearson correlation (r2) > 0.6 at minor allele frequencies as low as 0.7% for SNP and indels, and 0.2% for large deletions. This highlights the potential of our approach to build a haplotype reference panel and impute different classes of sequence variants across a wide allele frequency spectrum with high accuracy.
The objective of this study was to evaluate the improvement of the accuracy of estimated breeding values for ability to recycle after calving by using information of genomic markers and phenotypic ...information of correlated traits. The traits in this study were the interval from calving to first insemination (CFI), based on artificial insemination data, and the interval from calving to first high activity (CFHA), recorded from activity tags, which could better measure ability to recycle after caving. The phenotypic data set included 1,472,313 records from 820,218 cows for CFI, and 36,504 records from 25,733 cows for CFHA. The genomic information was available for 3,159 progeny-tested sires, which were genotyped using Illumina Bovine SNP50 BeadChip (Illumina, San Diego, CA). Heritability estimates were 0.06 for the interval from calving to first insemination and 0.14 for the interval from calving to first high activity, and the genetic correlation between both traits was strong (0.87). Breeding values were obtained using 4 models: conventional single-trait BLUP; conventional multitrait BLUP with pedigree-based relationship matrix; single-trait single-step genomic BLUP; and multitrait single-step genomic BLUP model with joint relationship matrix combining pedigree and genomic information. The results showed that reliabilities of estimated breeding values (EBV) from single-step genomic BLUP models were about 40% higher than those from conventional BLUP models for both traits. Furthermore, using a multitrait model doubled the reliability of breeding values for CFHA, whereas no gain was observed for CFI. The best model was the multitrait single-step genomic BLUP, which resulted in a reliability of EBV 0.19 for CFHA and 0.14 for CFI. The results indicate that even though a relatively small number of records for CFHA were available, with genomic information and using multitrait model, the reliability of EBV for CFHA is acceptable. Thus, it is feasible to include CFHA in Nordic Holstein breeding evaluations to improve fertility performance.
Background: The objective of the present study was to estimate (co)variance components of female fertility traits in Chinese Holsteins, considering fertility traits in different parities as different ...traits. Data on 88,647 females with 215,632 records (parities) were collected during 2000 to 2014 from 32 herds in the Sanyuan Lvhe Dairy Cattle Center, Beijing, China. The analyzed female fertility traits included interval from calving to first insemination, interval from first to last insemination, days open, conception rate at first insemination, number of inseminations per conception and non-return rates within 56 days after first insemination.Results: The descriptive statistics showed that the average fertility of heifers was superior to that of cows. Moreover,the genetic correlations between the performances of a trait in heifers and in cows were all moderate to high but far from one, which suggested that the performances of a trait in heifers and cows should be considered as different but genetically correlated traits in genetic evaluations. On the other hand, genetic correlations between performances of a trait in different parities of cows were greater than 0.87, with only a few exceptions, but variances were not homogeneous across parities for some traits. The estimated heritabilities of female fertility traits were low; all were below 0.049 (except for interval from calving to first insemination). Additionally, the heritabilities of the heifer interval traits were lower than those of the corresponding cow interval traits. Moreover, the heritabilities of the interval traits were higher than those of the threshold traits when measuring similar fertility functions. In general, estimated genetic correlations between traits were highly consistent with the biological categories of the female fertility traits.Conclusions: Interval from calving to first insemination, interval from first to last insemination and non-return rates within 56 days after first insemination are recommended to be included in the selection index of the Chinese Holstein population. The parameters estimated in the present study will facilitate the development of a genetic evaluation system for female fertility traits to improve the reproduction efficiency of Chinese Holsteins.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
In order to obtain accurate genomic breeding values a large number of reference animals with both phenotype and genotype data are needed. This poses a challenge for breeds with small reference ...populations. One option to overcome this obstacle is to use a multi-breed reference population. However, combining populations across breeds is not straightforward due to differences in linkage disequilibrium structure and weak relationships between breeds. This study offers a review of the available literature on the use of reference populations compiled from different cattle breeds. Results show that the effect of multi-breed reference populations on the accuracy of genomic prediction is highly affected by the genetic distance between breeds. When combining populations of the same breeds from different countries, large increases in accuracy are seen, whereas for admixed populations with some exchange of sires, substantial but smaller gains are found. Little or no benefit is found when combining distantly related breeds such as Holstein and Jersey and using the widely used genomic BLUP model. By using more sophisticated Bayesian variable selection models that put more focus on genomic markers in strong linkage disequilibrium with causative variants in combination with denser markers sets or functional subsets of markers, it is however possible to utilize information across distantly related breeds to increase the accuracy of genomic prediction. The further development of multi-breed genomic prediction models offers not only increases in the accuracy of genomic breeding values for small breeds, but will also give a stronger persistence of the accuracy over generations within larger breeds.
The availability of whole genome sequencing (WGS) data enables the discovery of causative single nucleotide polymorphisms (SNPs) or SNPs in high linkage disequilibrium with causative SNPs. This study ...investigated effects of integrating SNPs selected from imputed WGS data into the data of 54K chip on genomic prediction in Danish Jersey. The WGS SNPs, mainly including peaks of quantitative trait loci, structure variants, regulatory regions of genes, and SNPs within genes with strong effects predicted with variant effect predictor, were selected in previous analyses for dairy breeds in Denmark-Finland-Sweden (DFS) and France (FRA). Animals genotyped with 54K chip, standard LD chip, and customized LD chip which covered selected WGS SNPs and SNPs in the standard LD chip, were imputed to 54K together with DFS and FRA SNPs. Genomic best linear unbiased prediction (GBLUP) and Bayesian four-distribution mixture models considering 54K and selected WGS SNPs as one (a one-component model) or two separate genetic components (a two-component model) were used to predict breeding values. For milk production traits and mastitis, both DFS (0.025) and FRA (0.029) sets of additional WGS SNPs improved reliabilities, and inclusions of all selected WGS SNPs generally achieved highest improvements of reliabilities (0.034). A Bayesian four-distribution model yielded higher reliabilities than a GBLUP model for milk and protein, but extra gains in reliabilities from using selected WGS SNPs were smaller for a Bayesian four-distribution model than a GBLUP model. Generally, no significant difference was observed between one-component and two-component models, except for using GBLUP models for milk.
Domestic animals can serve as model systems of adaptive introgression and their genomic signatures. In part, their usefulness as model systems is due to their well-known histories. Different breeding ...strategies such as introgression and artificial selection have generated numerous desirable phenotypes and superior performance in domestic animals. The modern Danish Red Dairy Cattle is studied as an example of an introgressed population. It originates from crossing the traditional Danish Red Dairy Cattle with the Holstein and Brown Swiss breeds, both known for high milk production. This crossing happened, among other things due to changes in the production system, to raise milk production and overall performance. The genomes of modern Danish Red Dairy Cattle are heavily influenced by regions introgressed from the Holstein and Brown Swiss breeds and under subsequent selection in the admixed population. The introgressed proportion of the genome was found to be highly variable across the genome. Haplotypes introgressed from Holstein and Brown Swiss contained or overlapped known genes affecting milk production, as well as protein and fat content (
,
,
, and
for Holstein origin and
,
,
, and
for Brown Swiss origin). Genomic regions with high introgression signals also contained genes and enriched QTL associated with calving traits, body confirmation, feed efficiency, carcass, and fertility traits. These introgressed signals with relative identity-by-descent scores larger than the median showing Holstein or Brown Swiss introgression are mostly significantly correlated with the corresponding test statistics from signatures of selection analyses in modern Danish Red Dairy Cattle. Meanwhile, the putative significant introgressed signals have a significant dependency with the putative significant signals from signatures of selection analyses. Artificial selection has played an important role in the genomic footprints of introgression in the genome of modern Danish Red Dairy Cattle. Our study on a modern cattle breed contributes to an understanding of genomic consequences of selective introgression by demonstrating the extent to which adaptive effects contribute to shape the specific genomic consequences of introgression.
Selective genotyping of phenotypically superior animals may lead to bias and less accurate genomic breeding values (GEBV). Performing selective genotyping based on phenotypes measured in the breeding ...environment (B) is not necessarily a good strategy when the aim of a breeding program is to improve animals’ performance in the commercial environment (C). Our simulation study compared different genotyping strategies for selection candidates and for fish in C in a breeding program for rainbow trout in the presence of genotype-by-environment interactions when the program had limited genotyping resources and unregistered pedigrees of individuals. For the reference population, selective genotyping of top and bottom individuals in C based on phenotypes measured in C led to the highest genetic gains, followed by random genotyping and then selective genotyping of top individuals in C. For selection candidates, selective genotyping of top individuals in B based on phenotypes measured in B led to the highest genetic gains, followed by selective genotyping of top and bottom individuals and then random genotyping. Selective genotyping led to bias in predicting GEBV. However, in scenarios that used selective genotyping of top fish in B and random genotyping of fish in C, predictions of GEBV were unbiased, with genetic correlations of 0.2 and 0.5 between traits measured in B and C. Estimates of variance components were sensitive to genotyping strategy, with an overestimation of the variance with selective genotyping of top and bottom fish and an underestimation of the variance with selective genotyping of top fish. Unbiased estimates of variance components were obtained when fish in B and C were genotyped at random. In conclusion, we recommend phenotypic genotyping of top and bottom fish in C and top fish in B for the purpose of selecting breeding animals and random genotyping of individuals in B and C for the purpose of estimating variance components when a genomic breeding program for rainbow trout aims to improve animals’ performance in C.
Identification of the genetic variants associated with calf survival in dairy cattle will aid in the elimination of harmful mutations from the cattle population and the reduction of calf and young ...stock mortality rates. We used de-regressed estimated breeding values for the young stock survival (YSS) index as response variables in a genome-wide association study with imputed whole-genome sequence variants. A total of 4,610 bulls with estimated breeding values were genotyped with the Illumina BovineSNP50 (Illumina, San Diego, CA) single nucleotide polymorphism (SNP) genotyping array. Genotypes were imputed to whole-genome sequence variants. After quality control, 15,419,550 SNP on 29 Bos taurus autosomes (BTA) were used for association analysis. A modified mixed-model association analysis was used for a genome scan, followed by a linear mixed-model analysis for selected genetic variants. We identified 498 SNP on BTA5 and BTA18 that were associated with the YSS index in Nordic Holstein. The SNP rs440345507 (Chr5:94721790) on BTA5 was the putative causal mutation affecting YSS. Two haplotype-based models were used to identify haplotypes with the largest detrimental effects on YSS index. For each association signal, 1 haplotype region with harmful effects and the lead associated SNP were identified. Detected haplotypes on BTA5 and BTA18 explained 1.16 and 1.20%, respectively, of genetic variance for the YSS index. We examined whether YSS quantitative trait loci (QTL) on BTA5 and BTA18 were associated with stillbirth. YSS QTL on BTA18 overlapped a QTL region for stillbirth, but most likely 2 different causal variants were responsible for these 2 QTL. Four component traits of the YSS index, defined by sex and age, were analyzed separately by the modified mixed-model approach. The same genomic regions were associated with both bull and heifer calf mortality. Several genes (EPS8, LOC100138951, and KLK family genes) contained a lead associated SNP or were included in haplotypes with large detrimental effects on YSS in Nordic Holstein cattle.
Newcastle disease (ND) and avian influenza (AI) are the most feared diseases in the poultry industry worldwide. They can cause flock mortality up to 100%, resulting in a catastrophic economic loss. ...This is the first study to investigate the feasibility of genomic selection for antibody response to Newcastle disease virus (Ab-NDV) and antibody response to Avian Influenza virus (Ab-AIV) in chickens. The data were collected from a crossbred population. Breeding values for Ab-NDV and Ab-AIV were estimated using a pedigree-based best linear unbiased prediction model (BLUP) and a genomic best linear unbiased prediction model (GBLUP). Single-trait and multiple-trait analyses were implemented. According to the analysis using the pedigree-based model, the heritability for Ab-NDV estimated from the single-trait and multiple-trait models was 0.478 and 0.487, respectively. The heritability for Ab-AIV estimated from the two models was 0.301 and 0.291, respectively. The estimated genetic correlation between the two traits was 0.438. A four-fold cross-validation was used to assess the accuracy of the estimated breeding values (EBV) in the two validation scenarios. In the family sample scenario each half-sib family is randomly allocated to one of four subsets and in the random sample scenario the individuals are randomly divided into four subsets. In the family sample scenario, compared with the pedigree-based model, the accuracy of the genomic prediction increased from 0.086 to 0.237 for Ab-NDV and from 0.080 to 0.347 for Ab-AIV. In the random sample scenario, the accuracy was improved from 0.389 to 0.427 for Ab-NDV and from 0.281 to 0.367 for Ab-AIV. The multiple-trait GBLUP model led to a slightly higher accuracy of genomic prediction for both traits. These results indicate that genomic selection for antibody response to ND and AI in chickens is promising.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK