Functional annotation represents a key step toward the understanding and interpretation of germline and somatic variation as revealed by genome-wide association studies (GWAS) and The Cancer Genome ...Atlas (TCGA), respectively. GWAS have revealed numerous genetic risk variants residing in non-coding DNA associated with complex diseases. For sequences that lie within enhancers or promoters of transcription, it is not straightforward to assess the effects of variants on likely transcription factor binding sites. Consequently we introduce motifbreakR, which allows the biologist to judge whether the sequence surrounding a polymorphism or mutation is a good match, and how much information is gained or lost in one allele of the polymorphism or mutation relative to the other. MotifbreakR is flexible, giving a choice of algorithms for interrogation of genomes with motifs from many public sources that users can choose from. MotifbreakR can predict effects for novel or previously described variants in public databases, making it suitable for tasks beyond the scope of its original design. Lastly, it can be used to interrogate any genome curated within bioconductor.
https://github.com/Simon-Coetzee/MotifBreakR, www.bioconductor.org.
dennis.hazelett@cshs.org.
Genome-wide association studies (GWAS) have linked dozens of single nucleotide polymorphisms (SNPs) with Parkinson's disease (PD) risk. Ascertaining the functional and eventual causal mechanisms ...underlying these relationships has proven difficult. The majority of risk SNPs, and nearby SNPs in linkage disequilibrium (LD), are found in intergenic or intronic regions and confer risk through allele-dependent expression of multiple unknown target genes. Combining GWAS results with publicly available GTEx data, generated through eQTL (expression quantitative trait loci) identification studies, enables a direct association of SNPs to gene expression levels and aids in narrowing the large population of potential genetic targets for hypothesis-driven experimental cell biology. Separately, overlapping of SNPs with putative enhancer segmentations can strengthen target filtering. We report here the results of analyzing 7,607 PD risk SNPs along with an additional 23,759 high linkage disequilibrium-associated variants paired with eQTL gene expression. We found that enrichment analysis on the set of genes following target filtering pointed to a single large LD block at 6p21 that contained multiple HLA-MHC-II genes. These MHC-II genes remain associated with PD when the genes were filtered for correlation between GWAS significance and eQTL levels, strongly indicating a direct effect on PD etiology.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Genome-wide association studies (GWAS) have revolutionized the field of cancer genetics, but the causal links between increased genetic risk and onset/progression of disease processes remain to be ...identified. Here we report the first step in such an endeavor for prostate cancer. We provide a comprehensive annotation of the 77 known risk loci, based upon highly correlated variants in biologically relevant chromatin annotations--we identified 727 such potentially functional SNPs. We also provide a detailed account of possible protein disruption, microRNA target sequence disruption and regulatory response element disruption of all correlated SNPs at r(2) ≥ 0.88%. 88% of the 727 SNPs fall within putative enhancers, and many alter critical residues in the response elements of transcription factors known to be involved in prostate biology. We define as risk enhancers those regions with enhancer chromatin biofeatures in prostate-derived cell lines with prostate-cancer correlated SNPs. To aid the identification of these enhancers, we performed genomewide ChIP-seq for H3K27-acetylation, a mark of actively engaged enhancers, as well as the transcription factor TCF7L2. We analyzed in depth three variants in risk enhancers, two of which show significantly altered androgen sensitivity in LNCaP cells. This includes rs4907792, that is in linkage disequilibrium (r(2) = 0.91) with an eQTL for NUDT11 (on the X chromosome) in prostate tissue, and rs10486567, the index SNP in intron 3 of the JAZF1 gene on chromosome 7. Rs4907792 is within a critical residue of a strong consensus androgen response element that is interrupted in the protective allele, resulting in a 56% decrease in its androgen sensitivity, whereas rs10486567 affects both NKX3-1 and FOXA-AR motifs where the risk allele results in a 39% increase in basal activity and a 28% fold-increase in androgen stimulated enhancer activity. Identification of such enhancer variants and their potential target genes represents a preliminary step in connecting risk to disease process.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Alpha-synuclein accumulation in dopaminergic neurons is one of the primary features of Parkinson’s disease (PD). Despite its toxic properties during PD, alpha-synuclein has some important ...physiological functions. Although the activity of the protein has been extensively studied in neurons, the protein is also expressed in other cell types including immune cells and glia. Genetic studies show that mutations in synuclein alpha (
SNCA
), the gene that encodes alpha-synuclein, and alterations in its expression levels are a significant risk factor for PD, which likely impact the functions of a broad range of cell types. The consequences of altered
SNCA
expression in other cell types is beginning to be explored. Microglia, the primary macrophage population in the Central Nervous System (CNS), for example, are affected by variations in alpha-synuclein levels and functions. Studies suggest that deviations of alpha-synuclein’s normal activity influence hematopoiesis, the process that gives rise to microglia, and microglia’s immune functions. Alpha-synuclein levels also dictate the efficiency of SNARE-mediated vesicle formation, which could influence autophagy and cytokine release in microglia. Starting from the time of conception, these effects could impact one’s risk for developing PD. Further studies are needed to determine the physiological role of alpha-synuclein and how the protein is affected during PD in non-neuronal cells such as microglia. In this review we will discuss the known roles of alpha-synuclein in differentiation, immune responses, and vesicle formation, with insights into how abnormal alpha-synuclein expression and activity are linked to altered functions of microglia during PD.
Multiple discrete regions at 8q24 were recently shown to contain alleles that predispose to many cancers including prostate, breast, and colon. These regions are far from any annotated gene and their ...biological activities have been unknown. Here we profiled a 5-megabase chromatin segment encompassing all the risk regions for RNA expression, histone modifications, and locations occupied by RNA polymerase II and androgen receptor (AR). This led to the identification of several transcriptional enhancers, which were verified using reporter assays. Two enhancers in one risk region were occupied by AR and responded to androgen treatment; one contained a single nucleotide polymorphism (rs11986220) that resides within a FoxA1 binding site, with the prostate cancer risk allele facilitating both stronger FoxA1 binding and stronger androgen responsiveness. The study reported here exemplifies an approach that may be applied to any risk-associated allele in non-protein coding regions as it emerges from genome-wide association studies to better understand the genetic predisposition of complex diseases.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Genome-wide association studies have consistently shown that the alpha-synuclein locus is significantly associated with Parkinson’s disease. The mechanism by which this locus modulates the disease ...pathology and etiology remains largely under-investigated. This is due to the assumption that
SNCA
is the only driver of the functional aspects of several single nucleotide polymorphism (SNP) risk-signals at this locus. Recent evidence has shown that the risk associated with the top GWAS-identified variant within this locus is independent of
SNCA
expression, calling into question the validity of assigning function to the nearest gene,
SNCA
. In this review, we examine additional genes and risk variants present at the
SNCA
locus and how they may contribute to Parkinson’s disease. Using the
SNCA
locus as an example, we hope to demonstrate that deeper and detailed functional validations are required for high impact disease-linked variants.
The precise nature of how cell type specific chromatin structures at enhancer sites affect gene expression is largely unknown. Here we identified cell type specific enhancers coupled with gene ...expression in two different types of breast epithelial cells, HMEC (normal breast epithelial cells) and MDAMB231 (triple negative breast cancer cell line).
Enhancers were defined by modified neighboring histones using chromatin immunoprecipitation followed by sequencing (ChIP-seq) and nucleosome depletion using formaldehyde-assisted isolation of regulatory elements followed by sequencing (FAIRE-seq). Histone modifications at enhancers were related to the expression levels of nearby genes up to 750 kb away. These expression levels were correlated with enhancer status (poised or active), defined by surrounding histone marks. Furthermore, about fifty percent of poised and active enhancers contained nucleosome-depleted regions. We also identified response element motifs enriched at these enhancer sites that revealed key transcription factors (e.g. TP63) likely involved in regulating breast epithelial enhancer-mediated gene expression. By utilizing expression data, potential target genes of more than 600 active enhancers were identified. These genes were involved in proteolysis, epidermis development, cell adhesion, mitosis, cell cycle, and DNA replication.
These findings facilitate the understanding of epigenetic regulation specifically, such as the relationships between regulatory elements and gene expression and generally, how breast epithelial cellular phenotypes are determined by cell type specific enhancers.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Recent genome-wide association studies (GWAS) have identified more than 100 loci associated with increased risk of prostate cancer, most of which are in non-coding regions of the genome. ...Understanding the function of these non-coding risk loci is critical to elucidate the genetic susceptibility to prostate cancer.
We generate genome-wide regulatory element maps and performed genome-wide chromosome confirmation capture assays (in situ Hi-C) in normal and tumorigenic prostate cells. Using this information, we annotate the regulatory potential of 2,181 fine-mapped prostate cancer risk-associated SNPs and predict a set of target genes that are regulated by prostate cancer risk-related H3K27Ac-mediated loops. We next identify prostate cancer risk-associated CTCF sites involved in long-range chromatin loops. We use CRISPR-mediated deletion to remove prostate cancer risk-associated CTCF anchor regions and the CTCF anchor regions looped to the prostate cancer risk-associated CTCF sites, and we observe up to 100-fold increases in expression of genes within the loops when the prostate cancer risk-associated CTCF anchor regions are deleted.
We identify GWAS risk loci involved in long-range loops that function to repress gene expression within chromatin loops. Our studies provide new insights into the genetic susceptibility to prostate cancer.
Breast Cancer (BCa) genome-wide association studies revealed allelic frequency differences between cases and controls at index single nucleotide polymorphisms (SNPs). To date, 71 loci have thus been ...identified and replicated. More than 320,000 SNPs at these loci define BCa risk due to linkage disequilibrium (LD). We propose that BCa risk resides in a subgroup of SNPs that functionally affects breast biology. Such a shortlist will aid in framing hypotheses to prioritize a manageable number of likely disease-causing SNPs. We extracted all the SNPs, residing in 1 Mb windows around breast cancer risk index SNP from the 1000 genomes project to find correlated SNPs. We used FunciSNP, an R/Bioconductor package developed in-house, to identify potentially functional SNPs at 71 risk loci by coinciding them with chromatin biofeatures. We identified 1,005 SNPs in LD with the index SNPs (r(2)≥0.5) in three categories; 21 in exons of 18 genes, 76 in transcription start site (TSS) regions of 25 genes, and 921 in enhancers. Thirteen SNPs were found in more than one category. We found two correlated and predicted non-benign coding variants (rs8100241 in exon 2 and rs8108174 in exon 3) of the gene, ANKLE1. Most putative functional LD SNPs, however, were found in either epigenetically defined enhancers or in gene TSS regions. Fifty-five percent of these non-coding SNPs are likely functional, since they affect response element (RE) sequences of transcription factors. Functionality of these SNPs was assessed by expression quantitative trait loci (eQTL) analysis and allele-specific enhancer assays. Unbiased analyses of SNPs at BCa risk loci revealed new and overlooked mechanisms that may affect risk of the disease, thereby providing a valuable resource for follow-up studies.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
This opinion paper highlights strategies for a better understanding of non-Mendelian genetic risk that was revealed by genome-wide association studies (GWAS) of complex diseases. The genetic risk ...resides predominantly in non-coding regulatory DNA, such as in enhancers. The identification of mechanisms, the causal variants (mainly SNPs), and their target genes are, however, not always apparent but are likely involved in a network of risk determinants; the identification presents a bottle-neck in the full understanding of the genetics of complex phenotypes. Here, we propose strategies to identify functional SNPs and link risk enhancers with their target genes. The strategies are 1) identifying finemapped SNPs that break/form response elements within chromatin bio-features in relevant cell types 2) considering the nearest gene on linear DNA, 3) analyzing eQTLs, 4) mapping differential DNA methylation regions and relating them to gene expression, 5) employing genomic editing with CRISPR/cas9 and 6) identifying topological associated chromatin domains using chromatin conformation capture.