Microbiome studies suggest the presence of an interaction between the human gut microbiome and soil-transmitted helminth. Upon deworming, a complex interaction between the anthelminthic drug, ...helminths and microbiome composition might occur. To dissect this, we analyse the changes that take place in the gut bacteria profiles in samples from a double blind placebo controlled trial conducted in an area endemic for soil transmitted helminths in Indonesia.
Either placebo or albendazole were given every three months for a period of one and a half years. Helminth infection was assessed before and at 3 months after the last treatment round. In 150 subjects, the bacteria were profiled using the 454 pyrosequencing. Statistical analysis was performed cross-sectionally at pre-treatment to assess the effect of infection, and at post-treatment to determine the effect of infection and treatment on microbiome composition using the Dirichlet-multinomial regression model.
At a phylum level, at pre-treatment, no difference was seen in microbiome composition in terms of relative abundance between helminth-infected and uninfected subjects and at post-treatment, no differences were found in microbiome composition between albendazole and placebo group. However, in subjects who remained infected, there was a significant difference in the microbiome composition of those who had received albendazole and placebo. This difference was largely attributed to alteration of Bacteroidetes. Albendazole was more effective against Ascaris lumbricoides and hookworms but not against Trichuris trichiura, thus in those who remained infected after receiving albendazole, the helminth composition was dominated by T. trichiura.
We found that overall, albendazole does not affect the microbiome composition. However, there is an interaction between treatment and helminths as in subjects who received albendazole and remained infected there was a significant alteration in Bacteroidetes. This helminth-albendazole interaction needs to be studied further to fully grasp the complexity of the effect of deworming on the microbiome.
ISRCTN Registy, ISRCTN83830814.
Clustered overdispersed multivariate count data are challenging to model due to the presence of correlation within and between samples. Typically, the first source of correlation needs to be ...addressed but its quantification is of less interest. Here, we focus on the correlation between time points. In addition, the effects of covariates on the multivariate counts distribution need to be assessed. To fulfill these requirements, a regression model based on the Dirichlet‐multinomial distribution for association between covariates and the categorical counts is extended by using random effects to deal with the additional clustering. This model is the Dirichlet‐multinomial mixed regression model. Alternatively, a negative binomial regression mixed model can be deployed where the corresponding likelihood is conditioned on the total count. It appears that these two approaches are equivalent when the total count is fixed and independent of the random effects. We consider both subject‐specific and categorical‐specific random effects. However, the latter has a larger computational burden when the number of categories increases. Our work is motivated by microbiome data sets obtained by sequencing of the amplicon of the bacterial 16S rRNA gene. These data have a compositional structure and are typically overdispersed. The microbiome data set is from an epidemiological study carried out in a helminth‐endemic area in Indonesia. The conclusions are as follows: time has no statistically significant effect on microbiome composition, the correlation between subjects is statistically significant, and treatment has a significant effect on the microbiome composition only in infected subjects who remained infected.
Protein N-glycosylation patterns are known to show vast genetic as well as physiological and pathological variation and represent a large pool of potential biomarkers. Large-scale studies are needed ...for the identification and validation of biomarkers, and the analytical techniques required have recently been developed. Such methods have up to now mainly been applied to complex mixtures of glycoproteins in biofluids (e.g. plasma). Here, we analyzed N-glycosylation profiles of alpha1-antitrypsin (AAT) and immunoglobulin A (IgA) enriched fractions by 96-well microtitration plate based high-throughput immuno-affinity capturing and N-glycan analysis using multiplexed capillary gel electrophoresis with laser-induced fluorescence detection (CGE-LIF). Human plasma samples were from the Leiden Longevity Study comprising 2415 participants of different chronological and biological ages. Glycosylation patterns of AAT enriched fractions were found to be associated with chronological (calendar) age and they differed between females and males. Moreover, several glycans in the AAT enriched fraction were associated with physiological parameters marking cardiovascular and metabolic diseases. Pronounced differences were found between males and females in the glycosylation profiles of IgA enriched fractions. Our results demonstrate that large-scale immuno-affinity capturing of proteins from human plasma using a bead-based method combined with high-throughput N-glycan analysis is a powerful tool for the discovery of glycosylation-based biomarker candidates.
Objective
To identify osteoarthritis (OA) progression–modulating pathways in articular cartilage and their respective regulatory epigenetic and genetic determinants in end‐stage disease.
Methods
...Transcriptional activity of CpG was assessed using gene expression data and DNA methylation data for preserved and lesional articular cartilage samples. Disease‐responsive transcriptionally active CpG were identified by means of differential methylation between preserved and lesional cartilage. Transcriptionally relevant genetic determinants were addressed by means of single‐nucleotide polymorphisms (SNPs) proximal to the OA‐responsive transcriptionally active CpG. Statistical analyses were corrected for age, sex, joint, and technical covariates. A random effect was included to correct for possible correlations between paired samples.
Results
Of 9,838 transcribed genes in articular cartilage, 2,324 correlated with the methylation status of 3,748 transcriptionally active CpG; both negative (n = 1,741) and positive (n = 2,007) correlations were observed. Hypomethylation and hypermethylation (false discovery rate of <0.05, |Δβ| > 0.05) were observed for 62 and 25 transcriptionally active CpG, respectively, covering 70 unique genes. Enrichment for developmental and extracellular matrix maintenance pathways indicated possible reactivation of endochondral ossification. Finally, we observed 31 and 26 genes for which methylation and expression, respectively, were additionally affected by genetic variation.
Conclusion
We identified tissue‐specific genes involved in OA disease progression, reflected by genetic and pathologic epigenetic regulation of transcription, primarily at genes involved in development. Therefore, transcriptionally active SNPs near these genes may serve as putative susceptibility alleles. Our results constitute an important step toward understanding the reported widespread epigenetic changes occurring in OA articular cartilage and toward subsequent development of treatments targeting disease‐driving pathways.
Marginal tests based on individual SNPs are routinely used in genetic association studies. Studies have shown that haplotype‐based methods may provide more power in disease mapping than methods based ...on single markers when, for example, multiple disease‐susceptibility variants occur within the same gene. A limitation of haplotype‐based methods is that the number of parameters increases exponentially with the number of SNPs, inducing a commensurate increase in the degrees of freedom and weakening the power to detect associations. To address this limitation, we introduce a hierarchical linkage disequilibrium model for disease mapping, based on a reparametrization of the multinomial haplotype distribution, where every parameter corresponds to the cumulant of each possible subset of a set of loci. This hierarchy present in the parameters enables us to employ flexible testing strategies over a range of parameter sets: from standard single SNP analyses through the full haplotype distribution tests, reducing degrees of freedom and increasing the power to detect associations. We show via extensive simulations that our approach maintains the type I error at nominal level and has increased power under many realistic scenarios, as compared to single SNP and standard haplotype‐based studies. To evaluate the performance of our proposed methodology in real data, we analyze genome‐wide data from the Wellcome Trust Case‐Control Consortium.
Summary
By studying the loci that contribute to human longevity, we aim to identify mechanisms that contribute to healthy aging. To identify such loci, we performed a genome‐wide association study ...(GWAS) comparing 403 unrelated nonagenarians from long‐living families included in the Leiden Longevity Study (LLS) and 1670 younger population controls. The strongest candidate SNPs from this GWAS have been analyzed in a meta‐analysis of nonagenarian cases from the Rotterdam Study, Leiden 85‐plus study, and Danish 1905 cohort. Only one of the 62 prioritized SNPs from the GWAS analysis (P < 1 × 10−4) showed genome‐wide significance with survival into old age in the meta‐analysis of 4149 nonagenarian cases and 7582 younger controls OR = 0.71 (95% CI 0.65–0.77), P = 3.39 × 10−17. This SNP, rs2075650, is located in TOMM40 at chromosome 19q13.32 close to the apolipoprotein E (APOE) gene. Although there was only moderate linkage disequilibrium between rs2075650 and the ApoE ε4 defining SNP rs429358, we could not find an APOE‐independent effect of rs2075650 on longevity, either in cross‐sectional or in longitudinal analyses. As expected, rs429358 associated with metabolic phenotypes in the offspring of the nonagenarian cases from the LLS and their partners. In addition, we observed a novel association between this locus and serum levels of IGF‐1 in women (P = 0.005). In conclusion, the major locus determining familial longevity up to high age as detected by GWAS was marked by rs2075650, which tags the deleterious effects of the ApoE ε4 allele. No other major longevity locus was found.
Glycosylation of immunoglobulin G (IgG) influences IgG effector function by modulating binding to Fc receptors. To identify genetic loci associated with IgG glycosylation, we quantitated N-linked IgG ...glycans using two approaches. After isolating IgG from human plasma, we performed 77 quantitative measurements of N-glycosylation using ultra-performance liquid chromatography (UPLC) in 2,247 individuals from four European discovery populations. In parallel, we measured IgG N-glycans using MALDI-TOF mass spectrometry (MS) in a replication cohort of 1,848 Europeans. Meta-analysis of genome-wide association study (GWAS) results identified 9 genome-wide significant loci (P<2.27 × 10(-9)) in the discovery analysis and two of the same loci (B4GALT1 and MGAT3) in the replication cohort. Four loci contained genes encoding glycosyltransferases (ST6GAL1, B4GALT1, FUT8, and MGAT3), while the remaining 5 contained genes that have not been previously implicated in protein glycosylation (IKZF1, IL6ST-ANKRD55, ABCF2-SMARCD3, SUV420H1, and SMARCB1-DERL3). However, most of them have been strongly associated with autoimmune and inflammatory conditions (e.g., systemic lupus erythematosus, rheumatoid arthritis, ulcerative colitis, Crohn's disease, diabetes type 1, multiple sclerosis, Graves' disease, celiac disease, nodular sclerosis) and/or haematological cancers (acute lymphoblastic leukaemia, Hodgkin lymphoma, and multiple myeloma). Follow-up functional experiments in haplodeficient Ikzf1 knock-out mice showed the same general pattern of changes in IgG glycosylation as identified in the meta-analysis. As IKZF1 was associated with multiple IgG N-glycan traits, we explored biomarker potential of affected N-glycans in 101 cases with SLE and 183 matched controls and demonstrated substantial discriminative power in a ROC-curve analysis (area under the curve = 0.842). Our study shows that it is possible to identify new loci that control glycosylation of a single plasma protein using GWAS. The results may also provide an explanation for the reported pleiotropy and antagonistic effects of loci involved in autoimmune diseases and haematological cancer.
This study indicates that glycosylation of immunoglobulin G, the most abundant antibody in human blood, may convey useful information with regard to inflammation and metabolic health. IgG occurs in ...the form of different subclasses, of which the effector functions show significant variation. Our method provides subclass-specific IgG glycosylation profiling, while previous large-scale studies neglected to measure IgG2-specific glycosylation. We analysed the plasma Fc glycosylation profiles of IgG1, IgG2 and IgG4 in a cohort of 1826 individuals by liquid chromatography-mass spectrometry. For all subclasses, a low level of galactosylation and sialylation and a high degree of core fucosylation associated with poor metabolic health, i.e. increased inflammation as assessed by C-reactive protein, low serum high-density lipoprotein cholesterol and high triglycerides, which are all known to indicate increased risk of cardiovascular disease. IgG2 consistently showed weaker associations of its galactosylation and sialylation with the metabolic markers, compared to IgG1 and IgG4, while the direction of the associations were overall similar for the different IgG subclasses. These findings demonstrate the potential of IgG glycosylation as a biomarker for inflammation and metabolic health, and further research is required to determine the additive value of IgG glycosylation on top of biomarkers which are currently used.
Hydrogen/deuterium exchange monitored by mass spectrometry is a promising technique for rapidly fingerprinting structural and dynamical properties of proteins. The time-dependent change in the mass ...of any fragment of the polypeptide chain depends uniquely on the rate of exchange of its amide hydrogens, but determining the latter from the former is generally not possible. Here, we show that, if time-resolved measurements are available for a number of overlapping peptides that cover the whole sequence, rate constants for each amide hydrogen exchange (or equivalently, their protection factors) may be extracted and the uniqueness of the solutions obtained depending on the degree of peptide overlap. However, in most cases, the solution is not unique, and multiple alternatives must be considered. We provide a statistical method that clusters the solutions to further reduce their number. Such analysis always provides meaningful constraints on protection factors and can be used in situations in which obtaining more refined experimental data is impractical. It also provides a systematic way to improve data collection strategies to obtain unambiguous information at single-residue level (e.g., for assessing protein structure predictions at atomistic level).