Epigenetic changes are important for understanding complex trait variation and inheritance in pigs that are also a valuable biomedical model for human health research. Testis is the main organ for ...reproduction and boar taint in pigs; however, there have been no studies to-date on adult pig testis epigenome. The main objective of this study was to establish a genome-wide DNA methylation map of pig testis that would help identify candidate epigenetic biomarkers and methylated genes for complex traits such as male reproduction, fertility or boar taint. Reduced Representation Bisulfite Sequencing (RRBS) was used to study methylation levels of cytosine in nine pig testis samples. The results showed that genome-wide methylation status of nine samples overlapped greatly and their variation among pigs were low. The methylation levels of promoter, exon, intron, cytosine and guanine dinucleotide (CpG) islands and CpG island shores regions were 0.15, 0.47, 0.55, 0.39, and 0.53, respectively. Cytosines binding to CpG islands showed different methylation levels between exon and intron regions. All methylation levels of CpG islands were lower than CpG island shores in different genic features. The distribution of 12,738 differentially methylated cytosines (DMCs) within CpG islands, CpG island shores and other regions was 36.86, 21.65, and 41.49%, respectively, and was 0.33, 1.71, 5.95, and 92.01% in promoter, exon, intron and intergenic regions, respectively. Methylation levels of DMCs in promoter, exon and intron regions were significantly different between CpG islands and CpG island shores (
< 0.05). A total of 898 genes with 2089 DMCs were enriched in 112 Gene Ontology (GO) terms. Fifteen methylated genes from our study were associated with fertility or boar taint traits. Our analysis revealed the methylation patterns in different genic features and CpG island regions of testis in pigs, and summarized several candidate genes associated with DMCs and the involved GO terms. These findings are helpful to understand the relationship between DNA methylation and genic CpG islands, to provide candidate epigenetic regions or biomarkers for pig production and welfare and for translational epigenomic studies that use pigs as an animal model for human research.
Boar taint is an offensive odour and/or taste from a proportion of non-castrated male pigs caused by skatole and androstenone accumulation during sexual maturity. Castration is widely used to avoid ...boar taint but is currently under debate because of animal welfare concerns. This study aimed to identify expression quantitative trait loci (eQTLs) with potential effects on boar taint compounds to improve breeding possibilities for reduced boar taint. Danish Landrace male boars with low, medium and high genetic merit for skatole and human nose score (HNS) were slaughtered at ~100 kg. Gene expression profiles were obtained by RNA-Seq, and genotype data were obtained by an Illumina 60K Porcine SNP chip. Following quality control and filtering, 10,545 and 12,731 genes from liver and testis were included in the eQTL analysis, together with 20,827 SNP variants. A total of 205 and 109 single-tissue eQTLs associated with 102 and 58 unique genes were identified in liver and testis, respectively. By employing a multivariate Bayesian hierarchical model, 26 eQTLs were identified as significant multi-tissue eQTLs. The highest densities of eQTLs were found on pig chromosomes SSC12, SSC1, SSC13, SSC9 and SSC14. Functional characterisation of eQTLs revealed functions within regulation of androgen and the intracellular steroid hormone receptor signalling pathway and of xenobiotic metabolism by cytochrome P450 system and cellular response to oestradiol. A QTL enrichment test revealed 89 QTL traits curated by the Animal Genome PigQTL database to be significantly overlapped by the genomic coordinates of cis-acting eQTLs. Finally, a subset of 35 cis-acting eQTLs overlapped with known boar taint QTL traits. These eQTLs could be useful in the development of a DNA test for boar taint but careful monitoring of other overlapping QTL traits should be performed to avoid any negative consequences of selection.
Cattle production is one of the key contributors to global warming due to methane emission, which is a by-product of converting feed stuff into milk and meat for human consumption. Rumen hosts ...numerous microbial communities that are involved in the digestive process, leading to notable amounts of methane emission. The key factors underlying differences in methane emission between individual animals are due to, among other factors, both specific enrichments of certain microbial communities and host genetic factors that influence the microbial abundances. The detection of such factors involves various biostatistical and bioinformatics methods. In this study, our main objective was to reanalyze a publicly available data set using our proprietary
that is based on novel combinatorial network and machine learning methods to detect key metagenomic and host genetic features for methane emission and residual feed intake (RFI) in dairy cattle. The other objective was to compare the results with publicly available standard tools, such as those found in the microbiome bioinformatics platform QIIME2 and classic GWAS analysis. The data set used was publicly available and comprised 1,016 dairy cows with 16S short read sequencing data from two dairy cow breeds: Holstein and Nordic Reds. Host genomic data consisted of both 50 k and 150 k SNP arrays. Although several traits were analyzed by the original authors, here, we considered only methane emission as key phenotype for associating microbial communities and host genetic factors. The Synomics Insights platform is based on combinatorial methods that can identify taxa that are differentially abundant between animals showing high or low methane emission or RFI. Focusing exclusively on enriched taxa, for methane emission, the study identified 26 order-level taxa that combinatorial networks reported as significantly enriched either in high or low emitters. Additionally, a Z-test on proportions found 21/26 (81%) of these taxa were differentially enriched between high and low emitters (
value <.05). In particular, the phylum of Proteobacteria and the order Desulfovibrionales were found enriched in high emitters while the order Veillonellales was found to be more abundant in low emitters as previously reported for cattle (Wallace et al., 2015). In comparison, using the publicly available tool ANCOM only the order Methanosarcinales could be identified as differentially abundant between the two groups. We also investigated a link between host genome and rumen microbiome by applying our Synomics Insights platform and comparing it with an industry standard GWAS method. This resulted in the identification of genetic determinants in cows that are associated with changes in heritable components of the rumen microbiome. Only four key SNPs were found by both our platform and GWAS, whereas the Synomics Insights platform identified 1,290 significant SNPs that were not found by GWAS. Gene Ontology (GO) analysis found transcription factor as the dominant biological function. We estimated heritability of a core 73 taxa from the original set of 150 core order-level taxonomies and showed that some species are medium to highly heritable (0.25-0.62), paving the way for selective breeding of animals with desirable core microbiome characteristics. We identified a set of 113 key SNPs associated with >90% of these core heritable taxonomies. Finally, we have characterized a small set (<10) of SNPs strongly associated with key heritable bacterial orders with known role in methanogenesis, such as Desulfobacterales and Methanobacteriales.
This study was aimed at identifying genomic regions controlling feeding behavior in Danish Duroc boars and its potential implications for eating behavior in humans. Data regarding individual daily ...feed intake (DFI), total daily time spent in feeder (TPD), number of daily visits to feeder (NVD), average duration of each visit (TPV), mean feed intake per visit (FPV) and mean feed intake rate (FR) were available for 1130 boars. All boars were genotyped using the Illumina Porcine SNP60 BeadChip. The association analyses were performed using the GenABEL package in the R program. Sixteen SNPs were found to have moderate genome-wide significance (p<5E-05) and 76 SNPs had suggestive (p<5E-04) association with feeding behavior traits. MSI2 gene on chromosome (SSC) 14 was very strongly associated with NVD. Thirty-six SNPs were located in genome regions where QTLs have previously been reported for behavior and/or feed intake traits in pigs. The regions: 64-65 Mb on SSC 1, 124-130 Mb on SSC 8, 63-68 Mb on SSC 11, 32-39 Mb and 59-60 Mb on SSC 12 harbored several signifcant SNPs. Synapse genes (GABRR2, PPP1R9B, SYT1, GABRR1, CADPS2, DLGAP2 and GOPC), dephosphorylation genes (PPM1E, DAPP1, PTPN18, PTPRZ1, PTPN4, MTMR4 and RNGTT) and positive regulation of peptide secretion genes (GHRH, NNAT and TCF7L2) were highly significantly associated with feeding behavior traits. This is the first GWAS to identify genetic variants and biological mechanisms for eating behavior in pigs and these results are important for genetic improvement of pig feed efficiency. We have also conducted pig-human comparative gene mapping to reveal key genomic regions and/or genes on the human genome that may influence eating behavior in human beings and consequently affect the development of obesity and metabolic syndrome. This is the first translational genomics study of its kind to report potential candidate genes for eating behavior in humans.
Pregnancy rates for in vitro produced (IVP) embryos are usually lower than for embryos produced in vivo after ovarian superovulation (MOET). This is potentially due to alterations in their ...trophectoderm (TE), the outermost layer in physical contact with the maternal endometrium. The main objective was to apply a multi-omics data integration approach to identify both temporally differentially expressed and differentially methylated genes (DEG and DMG), between IVP and MOET embryos, that could impact TE function. To start, four and five published transcriptomic and epigenomic datasets, respectively, were processed for data integration. Second, DEG from day 7 to days 13 and 16 and DMG from day 7 to day 17 were determined in the TE from IVP vs. MOET embryos. Third, genes that were both DE and DM were subjected to hierarchical clustering and functional enrichment analysis. Finally, findings were validated through a machine learning approach with two additional datasets from day 15 embryos. There were 1535 DEG and 6360 DMG, with 490 overlapped genes, whose expression profiles at days 13 and 16 resulted in three main clusters. Cluster 1 (188) and Cluster 2 (191) genes were down-regulated at day 13 or day 16, respectively, while Cluster 3 genes (111) were up-regulated at both days, in IVP embryos compared to MOET embryos. The top enriched terms were the KEGG pathway "focal adhesion" in Cluster 1 (FDR = 0.003), and the cellular component: "extracellular exosome" in Cluster 2 (FDR<0.0001), also enriched in Cluster 1 (FDR = 0.04). According to the machine learning approach, genes in Cluster 1 showed a similar expression pattern between IVP and less developed (short) MOET conceptuses; and between MOET and DKK1-treated (advanced) IVP conceptuses. In conclusion, these results suggest that early conceptuses derived from IVP embryos exhibit epigenomic and transcriptomic changes that later affect its elongation and focal adhesion, impairing post-transfer survival.
Obesity is associated with severe co-morbidities such as type 2 diabetes and nonalcoholic steatohepatitis. However, studies have shown that 10-25 percent of the severely obese individuals are ...metabolically healthy. To date, the identification of genetic factors underlying the metabolically healthy obese (MHO) state is limited. Systems genetics approaches have led to the identification of genes and pathways in complex diseases. Here, we have used such approaches across tissues to detect genes and pathways involved in obesity-induced disease development.
Expression data of 60 severely obese individuals was accessible, of which 28 individuals were MHO and 32 were metabolically unhealthy obese (MUO). A whole genome expression profile of four tissues was available: liver, muscle, subcutaneous adipose tissue and visceral adipose tissue. Using insulin-related genes, we used the weighted gene co-expression network analysis (WGCNA) method to build within- and inter-tissue gene networks. We identified genes that were differentially connected between MHO and MUO individuals, which were further investigated by homing in on the modules they were active in. To identify potentially causal genes, we integrated genomic and transcriptomic data using an eQTL mapping approach.
Both IL-6 and IL1B were identified as highly differentially co-expressed genes across tissues between MHO and MUO individuals, showing their potential role in obesity-induced disease development. WGCNA showed that those genes were clustering together within tissues, and further analysis showed different co-expression patterns between MHO and MUO subnetworks. A potential causal role for metabolic differences under similar obesity state was detected for PTPRE, IL-6R and SLC6A5.
We used a novel integrative approach by integration of co-expression networks across tissues to elucidate genetic factors related to obesity-induced metabolic disease development. The identified genes and their interactions give more insight into the genetic architecture of obesity and the association with co-morbidities.
Feed efficiency (FE) is a key trait in pig production, as improvement in FE has positive economic and environmental impact. FE is a complex phenotype and testing animals for FE is costly. Therefore, ...there has been a desire to find functionally relevant single nucleotide polymorphisms (SNPs) as biomarkers, to improve our biological understanding of FE as well as accuracy of genomic prediction for FE. We have performed a cis- and trans- eQTL (expression quantitative trait loci) analysis, in a population of Danbred Durocs (N = 11) and Danbred Landrace (N = 27) using both a linear and ANOVA model based on muscle tissue RNA-seq. We analyzed a total of 1425x19179 or 2.7x10.sup.7 Gene-SNP combinations in eQTL detection models for FE. The 1425 genes were from RNA-Seq based differential gene expression analyses using 25880 genes related to FE and additionally combined with mitochondrial genes. The 19179 SNPs were from applying stringent quality control and linkage disequilibrium filtering on genotype data using a GGP Porcine HD 70k SNP array. We applied 1000 fold bootstrapping and enrichment analysis to further validate and analyze our detected eQTLs. We identified 13 eQTLs with FDR 0.1, affecting several genes found in previous studies of commercial pig breeds. Examples include MYO19, CPT1B, ACSL1, IER5L, CPT1A, SUCLA2, CSRNP1, PARK7 and MFF. The bootstrapping results showed statistically significant enrichment (p-value2.2x10.sup.-16) of eQTLs with p-value 0.01 in both cis and trans-eQTLs. Enrichment analysis of top trans-eQTLs revealed high enrichment for gene categories and gene ontologies associated with genomic context and expression regulation. This included transcription factors (p-value = 1.0x10.sup.-13 ), DNA-binding (GO:0003677, p-value = 8.9x10.sup.-14 ), DNA-binding transcription factor activity (GO:0003700,) nucleus gene (GO:0005634, p-value2.2x10.sup.-16 ), negative regulation of expression (GO:0010629, p-value2.2x10.sup.-16). These results would be useful for future genome assisted breeding of pigs to improve FE, and in the improved understanding of the functional mechanism of trans eQTLs.
Improving feed efficiency (FE) is a major goal of pig breeding, reducing production costs and providing sustainability to the pig industry. Reliable predictors for FE could assist pig producers. We ...carried out untargeted blood metabolite profiling in uncastrated males from Danbred Duroc (n = 59) and Danbred Landrace (n = 50) pigs at the beginning and end of a FE testing phase to identify biomarkers and biological processes underlying FE and related traits. By applying linear modeling and clustering analyses coupled with WGCNA framework, we identified 102 and 73 relevant metabolites in Duroc and Landrace based on two sampling time points. Among them, choline and pyridoxamine were hub metabolites in Duroc in early testing phase, while, acetoacetate, cholesterol sulfate, xanthine, and deoxyuridine were identified in the end of testing. In Landrace, cholesterol sulfate, thiamine, L-methionine, chenodeoxycholate were identified at early testing phase, while, D-glutamate, pyridoxamine, deoxycytidine, and L-2-aminoadipate were found at the end of testing. Validation of these results in larger populations could establish FE prediction using metabolomics biomarkers. We conclude that it is possible to identify a link between blood metabolite profiles and FE. These results could lead to improved nutrient utilization, reduced production costs, and increased FE.
The main goal was to apply machine learning (ML) methods on integrated multi-transcriptomic data, to identify endometrial genes capable of predicting uterine receptivity according to their expression ...patterns in the cow. Public data from five studies were re-analyzed. In all of them, endometrial samples were obtained at day 6-7 of the estrous cycle, from cows or heifers of four different European breeds, classified as pregnant (n = 26) or not (n = 26). First, gene selection was performed through supervised and unsupervised ML algorithms. Then, the predictive ability of potential key genes was evaluated through support vector machine as classifier, using the expression levels of the samples from all the breeds but one, to train the model, and the samples from that one breed, to test it. Finally, the biological meaning of the key genes was explored. Fifty genes were identified, and they could predict uterine receptivity with an overall 96.1% accuracy, despite the animal's breed and category. Genes with higher expression in the pregnant cows were related to circadian rhythm, Wnt receptor signaling pathway, and embryonic development. This novel and robust combination of computational tools allowed the identification of a group of biologically relevant endometrial genes that could support pregnancy in the cattle.