A large proportion of pregnancy losses occur during the pre-implantation period, when the developing embryo is elongating rapidly and signalling its presence to the maternal system. The molecular mechanisms that prevent luteolysis and support embryo survival within the maternal environment are not well understood. To gain a more complete picture of these molecular events, genome-wide transcriptional profiles of reproductive day 17 endometrial tissue were determined in pregnant and cyclic Holstein-Friesian dairy cattle.
Microarray analyses revealed 1,839 and 1,189 differentially expressed transcripts between pregnant and cyclic animals (≥ 1.5-fold change in expression; P-value < 0.05 after Benjamini-Hochberg multiple testing correction) in caruncular and intercaruncular endometrium, respectively. Gene ontology and biological pathway analysis of differentially expressed genes revealed enrichment for genes involved in interferon signalling and modulation of the immune response in pregnant animals.
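The selection criteria above (a fold-change threshold combined with a Benjamini-Hochberg adjusted P-value cutoff) can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the input layout (name, fold change as a positive pregnant/cyclic ratio, raw P-value) and the symmetric treatment of up- and down-regulation are assumptions made for illustration.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def select_differential(transcripts, min_fold=1.5, alpha=0.05):
    """Keep transcripts passing both the fold-change and adjusted-p filters.

    transcripts: list of (name, fold_change, raw_p), where fold_change is a
    positive ratio (hypothetical layout, assumed for this sketch).
    """
    adjusted = benjamini_hochberg([t[2] for t in transcripts])
    selected = []
    for (name, fold, _), q in zip(transcripts, adjusted):
        # Treat a 2-fold decrease (0.5) like a 2-fold increase (2.0).
        magnitude = fold if fold >= 1 else 1 / fold
        if magnitude >= min_fold and q < alpha:
            selected.append(name)
    return selected
```

With the toy input `[("A", 2.0, 0.01), ("B", 1.2, 0.04), ("C", 0.5, 0.03), ("D", 1.6, 0.005)]`, transcript B is dropped because its fold change is below 1.5, while A, C and D are retained.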
The maternal immune system actively surveys the uterine environment during early pregnancy. The embryo modulates this response by inducing the expression of endometrial molecules that suppress the immune response and promote maternal tolerance to the embryo. During this period of local immune suppression, genes of the innate immune response (in particular, antimicrobial genes) may function to protect the uterus against infection.
We report mapping of a quantitative trait locus (QTL) with a major effect on bovine stature to a ∼780-kb interval using a Hidden Markov Model-based approach that simultaneously exploits linkage and linkage disequilibrium. We re-sequenced the interval in six sires with known QTL genotype and identified 13 clustered candidate quantitative trait nucleotides (QTNs) out of >9,572 discovered variants. We eliminated five candidate QTNs by studying the phenotypic effect of a recombinant haplotype identified in a breed diversity panel. We show that the QTL influences fetal expression of seven of the nine genes mapping to the ∼780-kb interval. We further show that two of the eight candidate QTNs, mapping to the PLAG1-CHCHD7 intergenic region, influence bidirectional promoter strength and affect binding of nuclear factors. By performing expression QTL analyses, we identified a splice site variant in CHCHD7 and exploited this naturally occurring null allele to exclude CHCHD7 as single causative gene.
Causal variants for complex traits, such as eQTL, are often found in non-coding regions of the genome, where they are hypothesised to influence phenotypes by regulating gene expression. Many regulatory regions are marked by histone modifications, which can be assayed by chromatin immunoprecipitation followed by sequencing (ChIP-seq). Sequence reads from ChIP-seq form peaks at putative regulatory regions, which may reflect the amount of regulatory activity at these regions. Therefore, eQTL that are also associated with differences in histone modifications are excellent candidate causal variants.
We assayed the histone modifications H3K4me3, H3K4me1 and H3K27ac and mRNA in the mammary gland of up to 400 animals. We identified QTL for peak height (histone QTL), exon expression (eeQTL), allele specific expression (aseQTL) and allele specific binding (asbQTL). By intersecting these results, we identified variants which may influence gene expression by altering regulatory regions of the genome, and may be causal variants for other traits. Lastly, we found that these variants lie in putative transcription factor binding sites, identifying a mechanism for the effect of many eQTL.
We find that allele specific and traditional QTL analysis often identify the same genetic variants and provide evidence that many eQTL are regulatory variants which alter activity at regulatory regions of the bovine genome. Our work provides methodological and biological updates on how regulatory mechanisms interplay at multi-omics levels.
Cerebellar hypoplasia is a heterogeneous neurological condition in which the cerebellum is smaller than usual or not completely developed. The condition can have genetic origins, with Mendelian-effect mutations described in several mammalian species. Here, we describe a genetic investigation of cerebellar hypoplasia in White Swiss Shepherd dogs, where two affected puppies were identified from a litter with a recent common ancestor on both sides of their pedigree. Whole genome sequencing was conducted for 10 dogs in this family, and filtering of these data based on a recessive transmission hypothesis highlighted five protein-altering candidate variants, including a frameshift deletion in the Reelin (RELN) gene (p.Val947*). Given the status of RELN as a gene responsible for cerebellar hypoplasia in humans, sheep and mice, these data strongly suggest the loss-of-function variant as underlying these effects. This variant has not been found in other dog breeds nor in a cohort of European White Swiss Shepherds, suggesting a recent mutation event. This finding will support the genotyping of a more diverse sample of dogs, and should aid future management of the harmful allele through optimised mating schemes.
Genome sequence variants affecting complex traits (quantitative trait loci, QTL) are enriched in functional regions of the genome, such as those marked by certain histone modifications. These variants are believed to influence gene expression. However, due to the linkage disequilibrium among nearby variants, pinpointing the precise location of QTL is challenging. We aimed to identify allele-specific binding (ASB) QTL (asbQTL) that cause variation in the level of histone modification, as measured by the height of peaks assayed by ChIP-seq (chromatin immunoprecipitation sequencing). We identified DNA sequences that predict the difference between alleles in ChIP-seq peak height in H3K4me3 and H3K27ac histone modifications in the mammary glands of cows. We used a gapped k-mer support vector machine, a novel best linear unbiased prediction model, and a multiple linear regression model that combines the other two approaches to predict variant impacts on peak height. For each method, a subset of 1000 sites with the highest magnitude of predicted ASB was considered as candidate asbQTL. The accuracy of this prediction was measured by the proportion where the predicted direction matched the observed direction. Prediction accuracy ranged between 0.59 and 0.74, suggesting that these 1000 sites are enriched for asbQTL. Using independent data, we investigated functional enrichment in the candidate asbQTL set and three control groups, including non-causal ASB sites, non-ASB variants under a peak, and SNPs (single nucleotide polymorphisms) not under a peak. For H3K4me3, a higher proportion of the candidate asbQTL were confirmed as ASB when compared to the non-causal ASB sites (P < 0.01). However, these candidate asbQTL did not enrich for the other annotations, including expression QTL (eQTL), allele-specific expression QTL (aseQTL) and sites conserved across mammals (P > 0.05). We identified putatively causal sites for asbQTL using the DNA sequence surrounding these sites.
Our results suggest that many sites influencing histone modifications may not directly affect gene expression. However, it is important to acknowledge that distinguishing between putative causal ASB sites and other non-causal ASB sites in high linkage disequilibrium with the causal sites regarding their impact on gene expression may be challenging due to limitations in statistical power.
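The accuracy measure described in this abstract (the proportion of top-ranked sites where the predicted direction of ASB matches the observed direction) can be sketched as follows. This is an illustrative sketch only: the function names are hypothetical, ASB is assumed to be a signed number per site, and dropping sites with a zero predicted or observed value is an assumption, not something stated in the source.

```python
def top_sites_by_magnitude(predicted, n):
    """Return indices of the n sites with the largest absolute predicted ASB."""
    ranked = sorted(range(len(predicted)), key=lambda i: abs(predicted[i]), reverse=True)
    return ranked[:n]


def direction_match_accuracy(predicted, observed):
    """Proportion of sites whose predicted and observed ASB share a sign."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    # Assumption: sites with no direction (value of zero) are excluded.
    pairs = [(p, o) for p, o in zip(predicted, observed) if p != 0 and o != 0]
    if not pairs:
        raise ValueError("no sites with a defined direction")
    matches = sum(1 for p, o in pairs if (p > 0) == (o > 0))
    return matches / len(pairs)


def accuracy_of_top_sites(predicted, observed, n):
    """Direction-match accuracy restricted to the n highest-magnitude predictions."""
    top = top_sites_by_magnitude(predicted, n)
    return direction_match_accuracy([predicted[i] for i in top],
                                    [observed[i] for i in top])
```

Under this definition a value of 0.5 corresponds to chance agreement, which is why accuracies of 0.59 to 0.74 are read as enrichment for true asbQTL among the selected sites.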
The DGAT1 gene encodes an enzyme responsible for catalysing the terminal reaction in mammary triglyceride synthesis, and underpins a well-known pleiotropic quantitative trait locus (QTL) with a large influence on milk composition phenotypes. Since first described over 15 years ago, a protein-coding variant K232A has been assumed as the causative variant underlying these effects, following in vitro studies that demonstrated differing levels of triglyceride synthesis between the two protein isoforms.
We used a large RNAseq dataset to re-examine the underlying mechanisms of this large milk production QTL, and hereby report novel expression-based functions of the chr14 g.1802265AA > GC variant that encodes the DGAT1 K232A substitution. Using expression QTL (eQTL) mapping, we demonstrate a highly significant mammary eQTL for DGAT1, where the K232A mutation appears as one of the top associated variants for this effect. By conducting in vitro expression and splicing experiments in bovine mammary cell culture, we further show modulation of splicing efficiency by this mutation, likely through disruption of an exon splice enhancer as a consequence of the allele encoding the 232A variant.
The relative contributions of the enzymatic and transcription-based mechanisms now attributed to K232A remain unclear; however, these results suggest that transcriptional impacts contribute to the diversity of lactation effects observed at the DGAT1 locus.
Fourier-transform mid-infrared (FT-MIR) spectroscopy is a high-throughput and inexpensive methodology used to evaluate concentrations of fat and protein in dairy cattle milk samples. The objective of this study was to compare the genetic characteristics of FT-MIR predicted fatty acids and individual milk proteins with those that had been measured directly using gas and liquid chromatography methods. The data used in this study were based on 2,005 milk samples collected from 706 Holstein-Friesian × Jersey animals that were managed in a seasonal, pasture-based dairy system, with milk samples collected across 2 consecutive seasons. Concentrations of fatty acids and protein fractions in milk samples were directly determined by gas chromatography and high-performance liquid chromatography, respectively. Models to predict each directly measured trait based on FT-MIR spectra were developed using partial least squares regression, with spectra from a random selection of half the cows used to train the models, and predictions for the remaining cows used as validation. Variance parameters for each trait and genetic correlations for each pair of measured/predicted traits were estimated from pedigree-based bivariate models using REML procedures. A genome-wide association study was undertaken using imputed whole-genome sequence, and quantitative trait loci (QTL) from directly measured traits were compared with QTL from the corresponding FT-MIR predicted traits. Cross-validation prediction accuracies based on partial least squares for individual and grouped fatty acids ranged from 0.18 to 0.65. Trait prediction accuracies in cross-validation for protein fractions were 0.53, 0.19, and 0.48 for α-casein, β-casein, and κ-casein, 0.31 for α-lactalbumin, 0.68 for β-lactoglobulin, and 0.36 for lactoferrin. Heritability estimates for directly measured traits ranged from 0.07 to 0.55 for fatty acids, and from 0.14 to 0.63 for individual milk proteins.
For FT-MIR predicted traits, heritability estimates were mostly higher than for the corresponding measured traits, ranging from 0.14 to 0.46 for fatty acids, and from 0.30 to 0.70 for individual proteins. Genetic correlations between directly measured and FT-MIR predicted fatty acid and protein fractions were consistently above 0.75, with the exceptions of the fatty acids C18:0 and C18:3 cis-3, which had genetic correlations of 0.72 and 0.74, respectively. The GWAS identified trait QTL for fatty acids with likely candidates in the DGAT1, CCDC57, SCD, and GPAT4 genes. Notably, QTL for SCD were largely absent in the FT-MIR predicted traits, and QTL for GPAT4 were absent in directly measured traits. Similarly, for directly measured individual proteins, we identified QTL with likely candidates in the CSN1S1, CSN3, PAEP, and LTF genes, but the QTL for CSN3 and LTF were absent in the FT-MIR predicted traits. Our study indicates that genetic correlations between directly measured and FT-MIR predicted fatty acid and protein fractions are typically high, but that phenotypic variation in these traits may be underpinned by differing genetic architecture.
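The validation design described in this abstract splits the data by cow rather than by sample, so that repeated milk samples from one animal never appear in both the training and validation sets. A minimal sketch of such a cow-level split is shown below. The function name, the `(cow_id, record)` layout and the fixed seed are hypothetical choices for illustration, and the partial least squares model itself is not reproduced here.

```python
import random


def split_by_cow(samples, seed=0):
    """Split samples into training and validation sets at the cow level.

    samples: list of (cow_id, record) tuples, where several samples may
    share a cow_id (hypothetical layout, assumed for this sketch).
    A random half of the cows goes to training; all samples from the
    remaining cows form the validation set.
    """
    cows = sorted({cow_id for cow_id, _ in samples})
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    rng.shuffle(cows)
    train_cows = set(cows[: len(cows) // 2])
    train = [s for s in samples if s[0] in train_cows]
    validation = [s for s in samples if s[0] not in train_cows]
    return train, validation
```

Splitting on cows rather than on individual samples avoids an optimistic bias in prediction accuracy that would arise if the model had already seen other spectra from the same animal.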
Lactose provides an easily digested energy source for neonates, and is the primary carbohydrate in milk in most species. Bovine lactose is also a key component of many human food products. However, compared to analyses of other milk components, the genetic control of lactose has been little studied. Here we present the first GWAS focussed on analysis of milk lactose traits.
Using a discovery population of 12,000 taurine dairy cattle, we detail 27 QTL for lactose concentration and yield, and subsequently validate the effects of 26 of these loci in a distinct population of 18,000 cows. We next present data implicating causative genes and variants for these QTL. Fine mapping of these regions using imputed, whole genome sequence-resolution genotypes reveals protein-coding candidate causative variants affecting the ABCG2, DGAT1, STAT5B, KCNH4, NPFFR2 and RNF214 genes. Eleven of the remaining QTL appear to be driven by regulatory effects, suggested by the presence of co-locating, co-segregating eQTL discovered using mammary RNA sequence data from a population of 357 lactating cows. Pathway analysis of genes representing all lactose-associated loci shows significant enrichment of genes located in the endoplasmic reticulum, with functions related to ion channel activity mediated through the LRRC8C, P2RX4, KCNJ2 and ANKH genes. A number of the validated QTL are also found to be associated with additional milk volume, fat and protein phenotypes.
Overall, these findings highlight novel candidate genes and variants involved in milk lactose regulation, whose impacts on membrane transport mechanisms reinforce the key osmo-regulatory roles of lactose in milk.
Mammalian species carry ~100 loss-of-function variants per individual, where ~1-5 of these impact essential genes and cause embryonic lethality or severe disease when homozygous. The functions of the remainder are more difficult to resolve, although the assumption is that these variants impact fitness in less manifest ways. Here we report one of the largest sequence-resolution screens of cattle to date, targeting discovery and validation of non-additive effects in 130,725 animals. We highlight six novel recessive loci with impacts generally exceeding the largest-effect variants identified from additive genome-wide association studies, presenting analogs of human diseases and hitherto-unrecognized disorders. These loci present compelling missense (PLCD4, MTRF1 and DPF2), premature stop (MUS81) and splice-disrupting (GALNT2 and FGD4) mutations, together explaining substantial proportions of inbreeding depression. These results demonstrate that the frequency distribution of deleterious alleles segregating in selected species can afford sufficient power to directly map novel disorders, presenting selection opportunities to minimize the incidence of genetic disease.
Lactation, hair development and homeothermy are characteristic evolutionary features that distinguish mammals from other vertebrate species. Here we describe the discovery of two autosomal dominant mutations with antagonistic, pleiotropic effects on all three of these biological processes, mediated through the prolactin signalling pathway. Most conspicuously, mutations in prolactin (PRL) and its receptor (PRLR) have an impact on thermoregulation and hair morphology phenotypes, giving prominence to this pathway outside of its classical roles in lactation.