High-throughput genotyping methods have increased the analytical power to study complex traits but high cost has remained a barrier for large scale use in animal improvement. We have adapted ...genotyping-by-sequencing (GBS) used in plants for genotyping 47 animals representing 7 taurine and indicine breeds of cattle from the US and Africa. Genomic DNA was digested with different enzymes, ligated to adapters containing one of 48 unique bar codes and sequenced by the Illumina HiSeq 2000. PstI was the best enzyme producing 1.4 million unique reads per animal and initially identifying a total of 63,697 SNPs. After removal of SNPs with call rates of less than 70%, 51,414 SNPs were detected throughout all autosomes with an average distance of 48.1 kb, and 1,143 SNPs on the X chromosome at an average distance of 130.3 kb, as well as 191 on unmapped contigs. If we consider only the SNPs with call rates of 90% and over, we identified 39,751 on autosomes, 850 on the X chromosome and 124 on unmapped contigs. Of these SNPs, 28,843 were not tightly linked to other SNPs. Average marker density per autosome was highly correlated with chromosome size (coefficient of correlation = -0.798, r(2) = 0.637) with higher density in smaller chromosomes. Average SNP call rate was 86.5% for all loci, with 53.0% of the loci having call rates >90% and the average minor allele frequency being 0.212. Average observed heterozygosity ranged from 0.046-0.294 among individuals, and from 0.064-0.197 among breeds, with Brangus showing the highest diversity as expected. GBS technique is novel, flexible, sufficiently high-throughput, and capable of providing acceptable marker density for genomic selection or genome-wide association studies at roughly one third of the cost of currently available genotyping technologies.
High-throughput sequencing technologies have increased the ability to detect sequence variations for complex trait improvement. A high throughput genome wide genotyping-by-sequencing (GBS) method was ...used to generate 515,787 single nucleotide polymorphisms (SNPs), from which 76,355 SNPs with call rates >85% and minor allele frequency ≥1.5% were used in genome wide association study (GWAS) of 44 milk traits in 1,246 Canadian Holstein cows. GWAS was accomplished with a mixed linear model procedure implementing the additive and dominant models. A strong signal within the centromeric region of bovine chromosome 14 was associated with test day fat percentage. Several SNPs were associated with eicosapentaenoic acid, docosapentaenoic acid, arachidonic acid, CLA:9c11t and gamma linolenic acid. Most of the significant SNPs for 44 traits studied are novel and located in intergenic regions or introns of genes. Novel potential candidate genes for milk traits or mammary gland functions include ERCC6, TONSL, NPAS2, ACER3, ITGB4, GGT6, ACOX3, MECR, ADAM12, ACHE, LRRC14, FUK, NPRL3, EVL, SLCO3A1, PSMA4, FTO, ADCK5, PP1R16A and TEP1. Our study further demonstrates the utility of the GBS approach for identifying population-specific SNPs for use in improvement of complex dairy traits.
The immune systems are fundamentally vital for evolution and survival of species; as such, selection patterns in innate immune loci are of special interest in molecular evolutionary research. The ...interferon regulatory factor (IRF) gene family control many different aspects of the innate and adaptive immune responses in vertebrates. Among these, IRF3 is known to take active part in very many biological processes. We assembled and evaluated 1356 base pairs of the IRF3 gene coding region in domesticated goats from Africa (Nigeria, Ethiopia and South Africa) and Asia (Iran and China) and the wild goat (Capra aegagrus). Five segregating sites with θ value of 0.0009 for this gene demonstrated a low diversity across the goats' populations. Fu and Li tests were significantly positive but Tajima's D test was significantly negative, suggesting its deviation from neutrality. Neighbor joining tree of IRF3 gene in domesticated goats, wild goat and sheep showed that all domesticated goats have a closer relationship than with the wild goat and sheep. Maximum likelihood tree of the gene showed that different domesticated goats share a common ancestor and suggest single origin. Four unique haplotypes were observed across all the sequences, of which, one was particularly common to African goats (MOCH-K14-0425, Poitou and WAD). In assessing the evolution mode of the gene, we found that the codon model dN/dS ratio for all goats was greater than one. Phylogenetic Analysis by Maximum Likelihood (PAML) gave a ω0 (dN/dS) value of 0.067 with LnL value of -6900.3 for the first Model (M1) while ω2 = 1.667 in model M2 with LnL value of -6900.3 with positive selection inferred in 3 codon sites. Mechanistic empirical combination (MEC) model for evaluating adaptive selection pressure on particular codons also confirmed adaptive selection pressure in three codons (207, 358 and 408) in IRF3 gene. Positive diversifying selection inferred with recent evolutionary changes in domesticated goat IRF3 led us to conclude that the gene evolution may have been influenced by domestication processes in goats.
Interferons are secretory proteins induced in response to specific extracellular stimuli which stimulate intra- and intercellular networks for regulating innate and acquired immunity, resistance to ...viral infections, and normal and tumor cell survival and death. Type 1 interferons plays a major role in the CD8 T-cell response to viral infection. The genomic analysis carried out here for type I interferons within Bovidae family shows that cattle, bison, water buffalo, goat, and sheep (all Bovidae), have different number of genes of the different subtypes, with a large increase in the numbers, compared to human and mouse genomes. A phylogenetic analysis of the interferon alpha (IFNA) proteins in this group shows that the genes do not follow the evolutionary pattern of the species, but rather a cycle of duplications and deletions in the different species. In this study we also studied the genetic diversity of the bovine interferon alpha A (IFNAA), as an example of the IFNA genes in cattle, sequencing a fragment of the coding sequence in 18 breeds of cattle from Pakistan, Nigeria and USA. Similarity analysis allowed the allocation of sequences into 22 haplotypes. Bhagnari, Brangus, Sokoto Gudali, and White Fulani, had the highest number of haplotypes, while Angus, Hereford and Nari Master had the least. However, when analyzed by the average haplotype count, Angus, Bhagnari, Hereford, Holstein, Muturu showed the highest values, while Cholistani, Lohani, and Nari Master showed the lowest values. Haplotype 4 was found in the highest number of individuals (74), and in 15 breeds. Sequences for yak, bison, and water buffalo, were included within the bovine haplotypes. Medium Joining network showed that the sequences could be divided into 4 groups: one with highly similar haplotypes containing mostly Asian and African breeds, one with almost all of the
American breeds, one mid-diverse group with mostly Asian and African sequences, and one group with highly divergent haplotypes with five N'Dama sequences and one from each of White Fulani, Dhanni, Tharparkar, and Bhagnari. The large genetic diversity found in IFNAA could be a very good indication of the genetic variation among the different genes of IFNA and could be an adaptation for these species in response to viral challenges they face.
The present study was aimed at identifying causative hub genes within modules formed by co-expression and protein–protein interaction (PPI) networks, followed by Bayesian network (BN) construction in ...the liver transcriptome of starved zebrafish. To this end, the GSE11107 and GSE112272 datasets from the GEO databases were downloaded and meta-analyzed using the MetaDE package, an add-on R package. Differentially expressed genes (DEGs) were identified based upon expression intensity N(µ = 0.2, σ2 = 0.4). Reconstruction of BNs was performed by the bnlearn R package on genes within modules using STRINGdb and CEMiTool. ndufs5 (shared among PPI, BN and COEX), rps26, rpl10, sdhc (shared between PPI and BN), ndufa6, ndufa10, ndufb8 (shared between PPI and COEX), skp1, atp5h, ndufb10, rpl5b, zgc:193613, zgc:123327, zgc:123178, wu:fc58f10, zgc:111986, wu:fc37b12, taldo1, wu:fb62f08, zgc:64133 and acp5a (shared between COEX and BN) were identified as causative hub genes affecting gene expression in the liver of starving zebrafish. Future work will shed light on using integrative analyses of miRNA and DNA microarrays simultaneously, and performing in silico and experimental validation of these hub-causative (CST) genes affecting starvation in zebrafish.
Milk yield (MY) is highly heritable and an economically important trait in dairy livestock species. To increase power to detect candidate genomic regions for this trait, we carried out a ...meta-analysis of genome-wide association studies (GWAS). In the present study, we identified 19 studies in PubMed for the meta-analysis. After review of the studies, 16 studies passed the filters for meta-analysis, and the number of chromosomes, detected markers and their positions, number of animals, and p-values were extracted from these studies and recorded. The final data set based on 16 GWAS studies had 353,698 cows and 3950 markers and was analyzed using METAL software. Our findings revealed 1712 significant (p-value < 2.5 × 10−6) genomic loci related to MY, with markers associated with MY found on all autosomes and sex chromosomes and the majority of them found on chromosome 14. Furthermore, gene ontology (GO) annotation was used to explore biological functions of the genes associated with MY; therefore, different regions of this chromosome may be suitable as genomic regions for further research into gene expression.
Bayesian gene networks are powerful for modelling causal relationships and incorporating prior knowledge for making inferences about relationships. We used three algorithms to construct Bayesian gene ...networks around genes expressed in the bovine uterus and compared the efficacies of the algorithms. Dataset GSE33030 from the Gene Expression Omnibus (GEO) repository was analyzed using different algorithms for hub gene expression due to the effect of progesterone on bovine endometrial tissue following conception. Six different algorithms (grow-shrink, max-min parent children, tabu search, hill-climbing, max-min hill-climbing and restricted maximum) were compared in three higher categories, including constraint-based, score-based and hybrid algorithms. Gene network parameters were estimated using the bnlearn bundle, which is a Bayesian network structure learning toolbox implemented in R. The results obtained indicated the tabu search algorithm identified the highest degree between genes (390), Markov blankets (25.64), neighborhood sizes (8.76) and branching factors (4.38). The results showed that the highest number of shared hub genes (e.g., proline dehydrogenase 1 (
), Sam-pointed domain containing Ets transcription factor (
), monocyte-to-macrophage differentiation associated 2 (
), semaphorin 3E (
), solute carrier family 27 member 6 (
) and actin gamma 2 (
)) was seen between the hybrid and the constraint-based algorithms, and these genes could be recommended as central to the GSE33030 data series. Functional annotation of the hub genes in uterine tissue during progesterone treatment in the pregnancy period showed that the predicted hub genes were involved in extracellular pathways, lipid and protein metabolism, protein structure and post-translational processes. The identified hub genes obtained by the score-based algorithms had a role in 2-arachidonoylglycerol and enzyme modulation. In conclusion, different algorithms and subsequent topological parameters were used to identify hub genes to better illuminate pathways acting in response to progesterone treatment in the bovine uterus, which should help with our understanding of gene regulatory networks in complex trait expression.
Microscopy and polymerase chain reaction (PCR) were used to survey pathogenic trypanosome infection in naturally infected Nigerian cattle. In 411 animals sampled, microscopy detected 15.1% positive ...infection of at least one of Trypanosoma brucei, Trypanosoma congolense or Trypanosoma vivax, while PCR detected 63.7% positive infections of at least one of those species and Trypanosoma evansi. PCR detected 4.4%, 48.7%, 26.0% and 0.5% respectively of T. brucei, T. congolense, T. vivax and T. evansi infections. All of the T. congolense detected were savannah-type, except for two forest-type infections. Prevalence of mixed infections was 13.9%, being primarily co-infection by T. congolense and T. vivax while prevalence of mixed infections by T. evansi, T. vivax and T. congolense was 1.5%. Microscopy showed poor sensitivity but specificity greater than 94%. Infection rates were much higher in Southern than in Northern Nigeria. Infections were lowest in N’dama compared to Muturu, Sokoto Gudali and White Fulani breeds. Animals with T. vivax monoinfection and mixed infections showed significantly lower packed cell volume (PCV) values. Those infected with any Trypanosoma species with <200 parasites/μl showed higher PCV values than those infected with >200 parasites/μl. The new finding of savannah- and forest- type T. congolense in Nigeria and the relatively high abundance of mixed infections are of significant clinical relevance. This study also suggests that T. congolense is the most prevalent species in Nigeria.
Type II melanoma-associated antigens (MAGE) are a subgroup of about a dozen proteins found in various locations in the genome and expressed in normal tissues, thus are not related to cancer as the ...type I MAGE genes. This gene family exists as a single copy in non-mammals and monotremata, but found as two copies in metatherians and occur as a diverse group in all eutherians. Our studies suggest MAGED2 as the ancestor of this subfamily and the most likely evolutionary history of eutherian type II MAGE genes is hereby proposed based on synteny conservation, phylogenetic relations, genome location, homology conservation, and the protein and gene structures. Type II genes can be divided into two: those with 13 exons (MAGED1, MAGED2, TRO, and MAGED4) and those with only one exon (MAGEE1, MAGEE2, MAGEF1, NSMCE3, MAGEH1, MAGEL2, and NDN) with different evolutionary patterns. Our results suggest a need to change the gene nomenclature to MAGE1 (the ancestral gene), currently designated as LOC103095671 and LOC100935086, in opossum and Tasmanian devil, respectively, and MAGE2 (the duplicated one), currently designated as LOC100617402 and NDNL2, respectively, to avoid confusion. We reconstructed the phylogenetic relationships among 23 mammalian species using the combined sequences of MAGED1, MAGED2, MAGEL2, and NDN, because of their high divergence, and found high levels of support, being able to resolve the phylogenetic relationships among Euarchontoglires, Laurasiatheria, Afrotheria, and Xenarthra, as an example that small, but phylogenetically informative sequences, can be very useful for resolving basal mammalian clades.
Skin is a major thermoregulatory organ in the body controlling homeothermy, a critical function for climate adaptation. We compared genes expressed between tropical- and temperate-adapted cattle to ...better understand genes involved in climate adaptation and hence thermoregulation. We profiled the skin of representative tropical and temperate cattle using RNA-seq. A total of 214,754,759 reads were generated and assembled into 72,993,478 reads and were mapped to unique regions in the bovine genome. Gene coverage of unique regions of the reference genome showed that of 24,616 genes, only 13,130 genes (53.34%) displayed more than one count per million reads for at least two libraries and were considered suitable for downstream analyses. Our results revealed that of 255 genes expressed differentially, 98 genes were upregulated in tropically-adapted White Fulani (WF;
) and 157 genes were down regulated in WF compared to Angus, AG (
). Fifteen pathways were identified from the differential gene sets through gene ontology and pathway analyses. These include the significantly enriched melanin metabolic process, proteinaceous extracellular matrix, inflammatory response, defense response, calcium ion binding and response to wounding. Quantitative PCR was used to validate six representative genes which are associated with skin thermoregulation and epithelia dysfunction (mean correlation 0.92;
< 0.001). Our results contribute to identifying genes and understanding molecular mechanisms of skin thermoregulation that may influence strategic genomic selection in cattle to withstand climate adaptation, microbial invasion and mechanical damage.