We present a systematic review of pleiotropy among SNPs and genes reported to show genome-wide association with common complex diseases and traits. We find abundant evidence of pleiotropy; 233 ...(16.9%) genes and 77 (4.6%) SNPs show pleiotropic effects. SNP pleiotropic status was associated with gene location (p = 0.024; pleiotropic SNPs more often exonic 14.5% versus 4.9% for nonpleiotropic, trait-associated SNPs and less often intergenic 15.8% versus 23.6%), “predicted transcript consequence” (p = 0.001; pleiotropic SNPs more often predicted to be structurally deleterious 5% versus 0.4% but not more often in regulatory sequences), and certain disease classes. We develop a method to calculate the likelihood that pleiotropic links between traits occurred more often than expected and demonstrate that this approach can identify etiological links that are already known (such as between fetal hemoglobin and malaria risk) and those that are not yet established (e.g., between plasma campesterol levels and gallstones risk; and between immunoglobulin A and juvenile idiopathic arthritis). Examples of pleiotropy will accumulate over time, but it is already clear that pleiotropy is a common property of genes and SNPs associated with disease traits, and this will have implications for identification of molecular targets for drug development, future genetic risk-profiling, and classification of diseases.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
DNA supercoiling is an inherent consequence of twisting DNA and is critical for regulating gene expression and DNA replication. However, DNA supercoiling at a genomic scale in human cells is ...uncharacterized. To map supercoiling, we used biotinylated trimethylpsoralen as a DNA structure probe to show that the human genome is organized into supercoiling domains. Domains are formed and remodeled by RNA polymerase and topoisomerase activities and are flanked by GC-AT boundaries and CTCF insulator protein-binding sites. Underwound domains are transcriptionally active and enriched in topoisomerase I, 'open' chromatin fibers and DNase I sites, but they are depleted of topoisomerase II. Furthermore, DNA supercoiling affects additional levels of chromatin compaction as underwound domains are cytologically decondensed, topologically constrained and decompacted by transcription of short RNAs. We suggest that supercoiling domains create a topological environment that facilitates gene activation, providing an evolutionary purpose for clustering genes along chromosomes.
Full text
Available for:
DOBA, IJS, IZUM, KILJ, NUK, PILJ, PNG, SAZU, UILJ, UKNU, UL, UM, UPUK
Differential isoform usage is an important driver of inter-individual phenotypic diversity and is linked to various diseases and traits. However, accurately detecting the differential usage of ...different gene transcripts between groups can be difficult, in particular in less well annotated genomes where the spectrum of transcript isoforms is largely unknown.
We investigated whether machine learning approaches can detect differential isoform usage based purely on the distribution of reads across a gene region. We illustrate that gradient boosting and elastic net approaches can successfully identify large numbers of genes showing potential differential isoform usage between Europeans and Africans, that are enriched among relevant biological pathways and significantly overlap those identified by previous approaches. We demonstrate that diversity at the 3' and 5' ends of genes are primary drivers of these differences between populations.
Machine learning methods can effectively detect differential isoform usage from read fraction data, and can provide novel insights into the biological differences between groups.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Understanding the variation between well and poorly adapted cattle breeds to local environments and pathogens is essential for breeding cattle with improved climate and disease-resistant phenotypes. ...Although considerable progress has been made towards identifying genetic differences between breeds, variation at the epigenetic and chromatin levels remains poorly characterized. Here, we generate, sequence and analyse over 150 libraries at base-pair resolution to explore the dynamics of DNA methylation and chromatin accessibility of the bovine immune system across three distinct cattle lineages.
We find extensive epigenetic divergence between the taurine and indicine cattle breeds across immune cell types, which is linked to the levels of local DNA sequence divergence between the two cattle sub-species. The unique cell type profiles enable the deconvolution of complex cellular mixtures using digital cytometry approaches. Finally, we show distinct sub-categories of CpG islands based on their chromatin and methylation profiles that discriminate between classes of distal and gene proximal islands linked to discrete transcriptional states.
Our study provides a comprehensive resource of DNA methylation, chromatin accessibility and RNA expression profiles of three diverse cattle populations. The findings have important implications, from understanding how genetic editing across breeds, and consequently regulatory backgrounds, may have distinct impacts to designing effective cattle epigenome-wide association studies in non-European breeds.
Abstract
The Victoria Cross is the United Kingdom’s premier military award for bravery, presented for gallantry during active operations. Since its inception in 1856 just 1358 have been awarded, and, ...due to their rarity and historic interest, have become highly prized amongst private and public collections. Unresolved, however, is a debate about the source material of the medals. Some authorities adhere to a traditional belief that all medals have been cast from the bronze of guns captured from the Russians at Sebastopol. Furthermore, controversy is attached to the authenticity of some VCs. In this study we used X-ray fluorescence spectrometry data to compare the metal compositions of 100 Victoria Crosses, covering 7% of those ever issued. Using Gaussian mixture modelling we identify that Victoria Crosses fall into four distinct clusters, confirming that the primary split occurred between medals issued prior to and after 1914. Using these data we investigate the potential of X-ray fluorescence to inform the study of medals whose authenticity have been queried, showing some have unusually similar compositions to other VCs. This paper highlights how X-ray fluorescence data in conjunction with clustering approaches can be used to effectively and non-destructively investigate the authenticity and history of Victoria Crosses.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Second generation sequencing has prompted a number of groups to re-interrogate the transcriptomes of several bacterial and archaeal species. One of the central findings has been the identification of ...complex networks of small non-coding RNAs that play central roles in transcriptional regulation in all growth conditions and for the pathogen's interaction with and survival within host cells. Legionella pneumophila is a gram-negative facultative intracellular human pathogen with a distinct biphasic lifestyle. One of its primary environmental hosts in the free-living amoeba Acanthamoeba castellanii and its infection by L. pneumophila mimics that seen in human macrophages. Here we present analysis of strand specific sequencing of the transcriptional response of L. pneumophila during exponential and post-exponential broth growth and during the replicative and transmissive phase of infection inside A. castellanii. We extend previous microarray based studies as well as uncovering evidence of a complex regulatory architecture underpinned by numerous non-coding RNAs. Over seventy new non-coding RNAs could be identified; many of them appear to be strain specific and in configurations not previously reported. We discover a family of non-coding RNAs preferentially expressed during infection conditions and identify a second copy of 6S RNA in L. pneumophila. We show that the newly discovered putative 6S RNA as well as a number of other non-coding RNAs show evidence for antisense transcription. The nature and extent of the non-coding RNAs and their expression patterns suggests that these may well play central roles in the regulation of Legionella spp. specific traits and offer clues as to how L. pneumophila adapts to its intracellular niche. The expression profiles outlined in the study have been deposited into Genbank's Gene Expression Omnibus (GEO) database under the series accession GSE27232.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
BACKGROUND: DNA methylation and the Polycomb repression system are epigenetic mechanisms that play important roles in maintaining transcriptional repression. Recent evidence suggests that DNA ...methylation can attenuate the binding of Polycomb protein components to chromatin and thus plays a role in determining their genomic targeting. However, whether this role of DNA methylation is important in the context of transcriptional regulation is unclear. RESULTS: By genome-wide mapping of the Polycomb Repressive Complex 2-signature histone mark, H3K27me3, in severely DNA hypomethylated mouse somatic cells, we show that hypomethylation leads to widespread H3K27me3 redistribution, in a manner that reflects the local DNA methylation status in wild-type cells. Unexpectedly, we observe striking loss of H3K27me3 and Polycomb Repressive Complex 2 from Polycomb target gene promoters in DNA hypomethylated cells, including Hox gene clusters. Importantly, we show that many of these genes become ectopically expressed in DNA hypomethylated cells, consistent with loss of Polycomb-mediated repression. CONCLUSIONS: An intact DNA methylome is required for appropriate Polycomb-mediated gene repression by constraining Polycomb Repressive Complex 2 targeting. These observations identify a previously unappreciated role for DNA methylation in gene regulation and therefore influence our understanding of how this epigenetic mechanism contributes to normal development and disease.
Large-scale gene expression datasets are providing an increasing understanding of the location of cis-eQTLs in the human genome and their role in disease. However, little is currently known regarding ...the extent of regulatory site-sharing between genes. This is despite it having potentially wide-ranging implications, from the determination of the way in which genetic variants may shape multiple phenotypes to the understanding of the evolution of human gene order. By first identifying the location of non-redundant cis-eQTLs, we show that regulatory site-sharing is a relatively common phenomenon in the human genome, with over 10% of non-redundant regulatory variants linked to the expression of multiple nearby genes. We show that these shared, local regulatory sites are linked to high levels of chromatin looping between the regulatory sites and their associated genes. In addition, these co-regulated gene modules are found to be strongly conserved across mammalian species, suggesting that shared regulatory sites have played an important role in shaping human gene order. The association of these shared cis-eQTLs with multiple genes means they also appear to be unusually important in understanding the genetics of human phenotypes and pleiotropy, with shared regulatory sites more often linked to multiple human phenotypes than other regulatory variants. This study shows that regulatory site-sharing is likely an underappreciated aspect of gene regulation and has important implications for the understanding of various biological phenomena, including how the two and three dimensional structures of the genome have been shaped and the potential causes of disease pleiotropy outside coding regions.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Background:
Germline variation in the 71 Crohn's disease (CD) loci implicated by genome‐wide association studies (GWAS) only accounts for approximately 25% of estimated heritability. The contribution ...of epigenetic alterations to disease pathogenesis is emerging as a research priority.
Materials and Methods:
The methylation status of 27,578 CpG sites across the genome was analyzed using the Illumina Human Methylation27 assay in DNA extracted from whole blood samples from 40 adult females (21 ileal CD, 19 healthy controls) and 16 girls with childhood‐onset CD, all nonsmokers. Our primary analysis compared methylation profiles in adult cases and controls.
Results:
Our data define a global methylation profile characteristic of ileal CD. In all, 1117 sites were differentially methylated (corrected P < 0.01); 50 showed significantly altered methylation in cases compared with controls (uncorrected P < 10−6, corrected P < 0.0006), including genes altering immune activation: MAPK13, FASLG, PRF1, S100A13, RIPK3, and IL‐21R. Gene ontology analyses implicated immunity‐related pathways as targets of epigenetic modification (immune system processes P = 1.3 × 10−22, immune response P = 8.1 × 10−16, defense responses to bacteria P = 1.8 × 10−15). Ingenuity canonical pathway analyses implicated dendritic cell activity (P = 2.4 × 10−8) and differential regulation of cytokines by interleukin (IL)‐17A and IL‐17F (P = 5.8 × 10−7). We identified a significant enrichment of methylation changes within 50 kb of CD GWAS loci (8.6‐fold P = 0.021 in adults; 2.4‐fold P = 0.009 in adults and children combined), including IL‐27, IL‐19, TNF, MST1, and NOD2. Methylation status was predictive of disease status (sensitivity 0.71, specificity 0.83). Disease activity, drug therapy, NOD2 and DNMT3A genotypes were not associated with methylation changes.
Conclusions:
These data provide an important insight into the impact of epigenetic mechanisms in the pathogenesis of CD. (Inflamm Bowel Dis 2011;)
Abstract
More people globally depend on the water buffalo than any other domesticated species, and as the most closely related domesticated species to cattle they can provide important insights into ...the shared evolutionary basis of domestication. Here, we sequence the genomes of 79 water buffalo across seven breeds and compare patterns of between breed selective sweeps with those seen for 294 cattle genomes representing 13 global breeds. The genomic regions under selection between cattle breeds significantly overlap regions linked to stature in human genetic studies, with a disproportionate number of these loci also shown to be under selection between water buffalo breeds. Investigation of potential functional variants in the water buffalo genome identifies a rare example of convergent domestication down to the same mutation having independently occurred and been selected for across domesticated species. Cross-species comparisons of recent selective sweeps can consequently help identify and refine important loci linked to domestication.