Structural variants (SVs) and short tandem repeats (STRs) comprise a broad group of diverse DNA variants which vastly differ in their sizes and distributions across the genome. Here, we identify ...genomic features of SV classes and STRs that are associated with gene expression and complex traits, including their locations relative to eGenes, likelihood of being associated with multiple eGenes, associated eGene types (e.g., coding, noncoding, level of evolutionary constraint), effect sizes, linkage disequilibrium with tagging single nucleotide variants used in GWAS, and likelihood of being associated with GWAS traits. We identify a set of high-impact SVs/STRs associated with the expression of three or more eGenes via chromatin loops and show that they are highly enriched for being associated with GWAS traits. Our study provides insights into the genomic properties of structural variant classes and short tandem repeats that are associated with gene expression and human traits.
The liver plays a central role in the maintenance of homeostasis and health in general. However, there is substantial inter-individual variation in hepatic gene expression, and although numerous ...genetic factors have been identified, less is known about the epigenetic factors.
By analyzing the methylomes and transcriptomes of 14 fetal and 181 adult livers, we identified 657 differentially methylated genes with adult-specific expression, these genes were enriched for transcription factor binding sites of HNF1A and HNF4A. We also identified 1,000 genes specific to fetal liver, which were enriched for GATA1, STAT5A, STAT5B and YY1 binding sites. We saw strong liver-specific effects of single nucleotide polymorphisms on both methylation levels (28,447 unique CpG sites (meQTL)) and gene expression levels (526 unique genes (eQTL)), at a false discovery rate (FDR) < 0.05. Of the 526 unique eQTL associated genes, 293 correlated significantly not only with genetic variation but also with methylation levels. The tissue-specificities of these associations were analyzed in muscle, subcutaneous adipose tissue and visceral adipose tissue. We observed that meQTL were more stable between tissues than eQTL and a very strong tissue-specificity for the identified associations between CpG methylation and gene expression.
Our analyses generated a comprehensive resource of factors involved in the regulation of hepatic gene expression, and allowed us to estimate the proportion of variation in gene expression that could be attributed to genetic and epigenetic variation, both crucial to understanding differences in drug response and the etiology of liver diseases.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Proton pump inhibitors (PPIs) are among the top 10 most widely used drugs in the world. PPI use has been associated with an increased risk of enteric infections, most notably Clostridium difficile. ...The gut microbiome plays an important role in enteric infections, by resisting or promoting colonisation by pathogens. In this study, we investigated the influence of PPI use on the gut microbiome.
The gut microbiome composition of 1815 individuals, spanning three cohorts, was assessed by tag sequencing of the 16S rRNA gene. The difference in microbiota composition in PPI users versus non-users was analysed separately in each cohort, followed by a meta-analysis.
211 of the participants were using PPIs at the moment of stool sampling. PPI use is associated with a significant decrease in Shannon's diversity and with changes in 20% of the bacterial taxa (false discovery rate <0.05). Multiple oral bacteria were over-represented in the faecal microbiome of PPI-users, including the genus Rothia (p=9.8×10(-38)). In PPI users we observed a significant increase in bacteria: genera Enterococcus, Streptococcus, Staphylococcus and the potentially pathogenic species Escherichia coli.
The differences between PPI users and non-users observed in this study are consistently associated with changes towards a less healthy gut microbiome. These differences are in line with known changes that predispose to C. difficile infections and can potentially explain the increased risk of enteric infections in PPI users. On a population level, the effects of PPI are more prominent than the effects of antibiotics or other commonly used drugs.
Despite continuous efforts, not a single predictor of breast cancer chemotherapy resistance has made it into the clinic yet. However, it has become clear in recent years that breast cancer is a ...collection of molecularly distinct diseases. With ever increasing amounts of breast cancer data becoming available, we set out to study if gene expression based predictors of chemotherapy resistance that are specific for breast cancer subtypes can improve upon the performance of generic predictors.
We trained predictors of resistance that were specific for a subtype and generic predictors that were not specific for a particular subtype, i.e. trained on all subtypes simultaneously. Through a rigorous double-loop cross-validation we compared the performance of these two types of predictors on the different subtypes on a large set of tumors all profiled on the same expression platform (n = 394). We evaluated predictors based on either mRNA gene expression or clinical features.
For HER2+, ER- breast cancer, subtype specific predictor based on clinical features outperformed the generic, non-specific predictor. This can be explained by the fact that the generic predictor included HER2 and ER status, features that are predictive over the whole set, but not within this subtype. In all other scenarios the generic predictors outperformed the subtype specific predictors or showed equal performance.
Since it depends on the specific context which type of predictor - subtype specific or generic- performed better, it is highly recommended to evaluate both specific and generic predictors when attempting to predict treatment response in breast cancer.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
BACKGROUNDApproximately 5% of patients with celiac disease (CeD) do not respond to a gluten-free diet and progress to refractory celiac disease (RCD), a severe progression that is characterized by ...infiltration of intraepithelial T lymphocytes. Patients with RCD type II (RCDII) show clonal expansions of intraepithelial T lymphocytes that result in a poor prognosis and a high mortality rate through development of aggressive enteropathy-associated T-cell lymphoma. It is not known whether genetic variations play a role in severe progression of CeD to RCDII.
PATIENTS AND METHODSWe performed the first genome-wide association study to identify the causal genes for RCDII and the molecular pathways perturbed in RCDII. The genome-wide association study was performed in 38 Dutch patients with RCDII, and the 15 independent top-associated single nucleotide polymorphism (SNP) variants (P<5×10) were replicated in 56 independent French and Dutch patients with RCDII.
RESULTSAfter replication, SNP rs2041570 on chromosome 7 was significantly associated with progression to RCDII (P=2.37×10, odds ratio=2.36) but not with CeD susceptibility. SNP rs2041570 risk allele A was associated with lower levels of FAM188B expression in blood and small intestinal biopsies. Stratification of RCDII biopsies based on rs2041570 genotype showed differential expression of innate immune and antibacterial genes that are expressed in Paneth cells.
CONCLUSIONWe have identified a novel SNP associated with the severe progression of CeD to RCDII. Our data suggest that genetic susceptibility to CeD might be distinct from the progression to RCDII and suggest a role for Paneth cells in RCDII progression.
To gain statistical power or to allow fine mapping, researchers typically want to pool data before meta-analyses or genotype imputation. However, the necessary harmonization of genetic datasets is ...currently error-prone because of many different file formats and lack of clarity about which genomic strand is used as reference.
Genotype Harmonizer (GH) is a command-line tool to harmonize genetic datasets by automatically solving issues concerning genomic strand and file format. GH solves the unknown strand issue by aligning ambiguous A/T and G/C SNPs to a specified reference, using linkage disequilibrium patterns without prior knowledge of the used strands. GH supports many common GWAS/NGS genotype formats including PLINK, binary PLINK, VCF, SHAPEIT2 & Oxford GEN. GH is implemented in Java and a large part of the functionality can also be used as Java 'Genotype-IO' API. All software is open source under license LGPLv3 and available from http://www.molgenis.org/systemsgenetics.
GH can be used to harmonize genetic datasets across different file formats and can be easily integrated as a step in routine meta-analysis and imputation pipelines.
The mitochondrial and nuclear genomes coordinate and co-evolve in eukaryotes in order to adapt to environmental changes. Variation in the mitochondrial genome is capable of affecting expression of ...genes on the nuclear genome. Sex-specific mitochondrial genetic control of gene expression has been demonstrated in Drosophila melanogaster, where males were found to drive most of the total variation in gene expression. This has potential implications for male-related health and disease resulting from variation in mtDNA solely inherited from the mother. We used a family-based study comprised of 47,323 gene expression probes and 78 mitochondrial SNPs (mtSNPs) from n = 846 individuals to examine the extent of mitochondrial genetic control of gene expression in humans. This identified 15 significant probe-mtSNP associations (P<10-8) corresponding to 5 unique genes on the mitochondrial and nuclear genomes, with three of these genes corresponding to mitochondrial genetic control of gene expression in the nuclear genome. The associated mtSNPs for three genes (one cis and two trans associations) were replicated (P < 0.05) in an independent dataset of n = 452 unrelated individuals. There was no evidence for sexual dimorphic gene expression in any of these five probes. Sex-specific effects were examined by applying our analysis to males and females separately and testing for differences in effect size. The MEST gene was identified as having the most significantly different effect sizes across the sexes (P≈10-7). MEST was similarly expressed in males and females with the G allele; however, males with the C allele are highly expressed for MEST, while females show no expression of the gene. This study provides evidence for the mitochondrial genetic control of expression of several genes in humans, with little evidence found for sex-specific effects.
Type 2 diabetes (T2D) is a very common disease in humans. Here we conduct a meta-analysis of genome-wide association studies (GWAS) with ~16 million genetic variants in 62,892 T2D cases and 596,424 ...controls of European ancestry. We identify 139 common and 4 rare variants associated with T2D, 42 of which (39 common and 3 rare variants) are independent of the known variants. Integration of the gene expression data from blood (n = 14,115 and 2765) with the GWAS results identifies 33 putative functional genes for T2D, 3 of which were targeted by approved drugs. A further integration of DNA methylation (n = 1980) and epigenomic annotation data highlight 3 genes (CAMK1D, TP53INP1, and ATP5G1) with plausible regulatory mechanisms, whereby a genetic variant exerts an effect on T2D through epigenetic regulation of gene expression. Our study uncovers additional loci, proposes putative genetic regulatory mechanisms for T2D, and provides evidence of purifying selection for T2D-associated variants.
Deep sequencing of the gut microbiomes of 1135 participants from a Dutch population-based cohort shows relations between the microbiome and 126 exogenous and intrinsic host factors, including 31 ...intrinsic factors, 12 diseases, 19 drug groups, 4 smoking categories, and 60 dietary factors. These factors collectively explain 18.7% of the variation seen in the interindividual distance of microbial composition. We could associate 110 factors to 125 species and observed that fecal chromogranin A (CgA), a protein secreted by enteroendocrine cells, was exclusively associated with 61 microbial species whose abundance collectively accounted for 53% of microbial composition. Low CgA concentrations were seen in individuals with a more diverse microbiome. These results are an important step toward a better understanding of environment-diet-microbe-host interactions.