Antibodies are known to have an important role in the development of rheumatoid arthritis (RA), one of the most prevalent chronic inflammatory diseases which primarily involves the joints. Most RA ...patients develop autoantibodies against immunoglobulin G (IgG) and changes in IgG glycosylation have been associated with RA. We undertook this study to determine whether altered IgG glycosylation precedes the disease diagnosis. We studied IgG glycosylation in RA in two prospective cohorts (N = 14,749) by measuring 28 IgG glycan traits in 179 subjects who developed RA within 10-years follow-up and 358 matched controls. Ultra-performance liquid chromatography method based on hydrophilic interactions (HILIC-UPLC) was used to analyse IgG glycans. Future RA diagnosis associated with traits related to lower galactosylation and sialylation of IgG when comparing the cases to the matched controls. In RA cases, these traits did not correlate with the time between being recruited to the study and being diagnosed with RA (median time 4.31 years). The difference in IgG glycosylation was relatively stable and present years before diagnosis. This indicates that long-acting factors affecting IgG glycome composition are among the underlying mechanisms of RA and that decreased galactosylation is a pre-existing risk factor involved in the disease development.
•Future diagnosis of rheumatoid arthritis is associated with lower galactosylation of IgG.•IgG glycosylation alterations are present years before diagnosis.•Glycosylation is a pre-existing risk factor involved in the disease development.
Evaluation of O2PLS in Omics data integration Bouhaddani, Said El; Houwing-Duistermaat, Jeanine; Salo, Perttu ...
BMC bioinformatics,
2016-Jan-20, 2016-1-20, 20160120, Letnik:
17 Suppl 2, Številka:
29
Journal Article
Recenzirano
Odprti dostop
Rapid computational and technological developments made large amounts of omics data available in different biological levels. It is becoming clear that simultaneous data analysis methods are needed ...for better interpretation and understanding of the underlying systems biology. Different methods have been proposed for this task, among them Partial Least Squares (PLS) related methods. To also deal with orthogonal variation, systematic variation in the data unrelated to one another, we consider the Two-way Orthogonal PLS (O2PLS): an integrative data analysis method which is capable of modeling systematic variation, while providing more parsimonious models aiding interpretation.
A simulation study to assess the performance of O2PLS showed positive results in both low and higher dimensions. More noise (50 % of the data) only affected the systematic part estimates. A data analysis was conducted using data on metabolomics and transcriptomics from a large Finnish cohort (DILGOM). A previous sequential study, using the same data, showed significant correlations between the Lipo-Leukocyte (LL) module and lipoprotein metabolites. The O2PLS results were in agreement with these findings, identifying almost the same set of co-varying variables. Moreover, our integrative approach identified other associative genes and metabolites, while taking into account systematic variation in the data. Including orthogonal components enhanced overall fit, but the orthogonal variation was difficult to interpret.
Simulations showed that the O2PLS estimates were close to the true parameters in both low and higher dimensions. In the presence of more noise (50 %), the orthogonal part estimates could not distinguish well between joint and unique variation. The joint estimates were not systematically affected. Simultaneous analysis with O2PLS on metabolome and transcriptome data showed that the LL module, together with VLDL and HDL metabolites, were important for the metabolomic and transcriptomic relation. This is in agreement with an earlier study. In addition more gene expression and metabolites are identified being important for the joint covariation.
Techniques enabling targeted re-sequencing of the protein coding sequences of the human genome on next generation sequencing instruments are of great interest. We conducted a systematic comparison of ...the solution-based exome capture kits provided by Agilent and Roche NimbleGen. A control DNA sample was captured with all four capture methods and prepared for Illumina GAII sequencing. Sequence data from additional samples prepared with the same protocols were also used in the comparison.
We developed a bioinformatics pipeline for quality control, short read alignment, variant identification and annotation of the sequence data. In our analysis, a larger percentage of the high quality reads from the NimbleGen captures than from the Agilent captures aligned to the capture target regions. High GC content of the target sequence was associated with poor capture success in all exome enrichment methods. Comparison of mean allele balances for heterozygous variants indicated a tendency to have more reference bases than variant bases in the heterozygous variant positions within the target regions in all methods. There was virtually no difference in the genotype concordance compared to genotypes derived from SNP arrays. A minimum of 11× coverage was required to make a heterozygote genotype call with 99% accuracy when compared to common SNPs on genome-wide association arrays.
Libraries captured with NimbleGen kits aligned more accurately to the target regions. The updated NimbleGen kit most efficiently covered the exome with a minimum coverage of 20×, yet none of the kits captured all the Consensus Coding Sequence annotated exons.
Elevated serum uric acid levels cause gout and are a risk factor for cardiovascular disease and diabetes. To investigate the polygenetic basis of serum uric acid levels, we conducted a meta-analysis ...of genome-wide association scans from 14 studies totalling 28,141 participants of European descent, resulting in identification of 954 SNPs distributed across nine loci that exceeded the threshold of genome-wide significance, five of which are novel. Overall, the common variants associated with serum uric acid levels fall in the following nine regions: SLC2A9 (p = 5.2x10(-201)), ABCG2 (p = 3.1x10(-26)), SLC17A1 (p = 3.0x10(-14)), SLC22A11 (p = 6.7x10(-14)), SLC22A12 (p = 2.0x10(-9)), SLC16A9 (p = 1.1x10(-8)), GCKR (p = 1.4x10(-9)), LRRC16A (p = 8.5x10(-9)), and near PDZK1 (p = 2.7x10(-9)). Identified variants were analyzed for gender differences. We found that the minor allele for rs734553 in SLC2A9 has greater influence in lowering uric acid levels in women and the minor allele of rs2231142 in ABCG2 elevates uric acid levels more strongly in men compared to women. To further characterize the identified variants, we analyzed their association with a panel of metabolites. rs12356193 within SLC16A9 was associated with DL-carnitine (p = 4.0x10(-26)) and propionyl-L-carnitine (p = 5.0x10(-8)) concentrations, which in turn were associated with serum UA levels (p = 1.4x10(-57) and p = 8.1x10(-54), respectively), forming a triangle between SNP, metabolites, and UA levels. Taken together, these associations highlight additional pathways that are important in the regulation of serum uric acid levels and point toward novel potential targets for pharmacological intervention to prevent or treat hyperuricemia. In addition, these findings strongly support the hypothesis that transport proteins are key in regulating serum uric acid levels.
Short sleep duration or insomnia may lead to an increased risk of various psychiatric and cardio-metabolic conditions. Since DNA methylation plays a critical role in the regulation of gene ...expression, studies of differentially methylated positions (DMPs) might be valuable for understanding the mechanisms underlying insomnia. We performed a cross-sectional genome-wide analysis of DNA methylation in relation to self-reported insufficient sleep in individuals from a community-based sample (79 men, aged 39.3 ± 7.3), and in relation to shift work disorder in an occupational cohort (26 men, aged 44.9 ± 9.0). The analysis of DNA methylation data revealed that genes corresponding to selected DMPs form a distinctive pathway: "Nervous System Development" (FDR P value < 0.05). We found that 78% of the DMPs were hypomethylated in cases in both cohorts, suggesting that insufficient sleep may be associated with loss of DNA methylation. A karyoplot revealed clusters of DMPs at various chromosomal regions, including 12 DMPs on chromosome 17, previously associated with Smith-Magenis syndrome, a rare condition comprising disturbed sleep and inverse circadian rhythm. Our findings give novel insights into the DNA methylation patterns associated with sleep loss, possibly modifying processes related to neuroplasticity and neurodegeneration. Future prospective studies are needed to confirm the observed associations.
Myocardial infarction (MI) is divided into either ST elevation MI (STEMI) or non-ST elevation MI (NSTEMI), differing in a number of clinical characteristics. We sought to identify genetic variants ...conferring risk to NSTEMI or STEMI by conducting a genome-wide association study (GWAS) of MI stratified into NSTEMI and STEMI in a consecutive sample of 1,579 acute MI cases with 1,576 controls. Subsequently, we followed the results in an independent population-based sample of 562 cases and 566 controls, a partially independent prospective cohort (N = 16,627 with 163 incident NSTEMI cases), and examined the effect of disease-associated variants on gene expression in 513 healthy participants. Genetic variants on chromosome 1p13.3 near the damage-regulated autophagy modulator 2 gene DRAM2 associated with NSTEMI (rs656843; odds ratio 1.57, P = 3.11 × 10(-10)) in the case-control analysis with a consistent but not statistically significant effect in the prospective cohort (rs656843; hazard ratio 1.13, P = 0.43). These variants were not associated with STEMI (rs656843; odds ratio, 1.11, P = 0.20; hazard ratio 0.97, P = 0.87), appearing to have a pronounced effect on NSTEMI risk. A majority of the variants at 1p13.3 associated with NSTEMI were also associated with the expression level of DRAM2 in blood leukocytes of healthy controls (top-ranked variant rs325927, P = 1.50 × 10(-12)). The results suggest that genetic factors may in part influence whether coronary artery disease results in NSTEMI rather than STEMI.
Cardiomyocytes secrete atrial natriuretic peptide (ANP) and B-type natriuretic peptide (BNP) in response to mechanical stretching, making them useful clinical biomarkers of cardiac stress. Both human ...and animal studies indicate a role for ANP as a regulator of blood pressure with conflicting results for BNP.
We used genome-wide association analysis (n=6296) to study the effects of genetic variants on circulating natriuretic peptide concentrations and compared the impact of natriuretic peptide-associated genetic variants on blood pressure (n=27 059). Eight independent genetic variants in 2 known (
and
) and 1 novel locus (
) associated with midregional proANP (MR-proANP), BNP, aminoterminal proBNP (NT-proBNP), or BNP:NT-proBNP ratio. The
locus containing the adjacent genes encoding ANP and BNP harbored 4 independent
variants with effects specific to either midregional proANP or BNP and a rare missense single nucleotide polymorphism in NT-proBNP seriously altering its measurement. Variants near the calcineurin catalytic subunit gamma gene
and the polypeptide N-acetylgalactosaminyltransferase 4 gene
associated with BNP:NT-proBNP ratio but not with BNP or midregional proANP, suggesting effects on the post-translational regulation of proBNP. Out of the 8 individual variants, only those correlated with midregional proANP had a statistically significant albeit weak impact on blood pressure. The combined effect of these 3 single nucleotide polymorphisms also associated with hypertension risk (
=8.2×10
).
Common genetic differences affecting the circulating concentration of ANP associated with blood pressure, whereas those affecting BNP did not, highlighting the blood pressure-lowering effect of ANP in the general population.
Investigating whether metabolites regulate the co-expression of a predefined gene module is one of the relevant questions posed in the integrative analysis of metabolomic and transcriptomic data. ...This article concerns the integrative analysis of the two high-dimensional datasets by means of multivariate models and statistical tests for the dependence between metabolites and the co-expression of a gene module. The general linear model (GLM) for correlated data that we propose models the dependence between adjusted gene expression values through a block-diagonal variance-covariance structure formed by metabolic-subset specific general variance-covariance blocks. Performance of statistical tests for the inference of conditional co-expression are evaluated through a simulation study. The proposed methodology is applied to the gene expression data of the previously characterized lipid-leukocyte module. Our results show that the GLM approach improves on a previous approach by being less prone to the detection of spurious conditional co-expression.
Abstract
Study Objectives:
Low or excessive sleep duration has been associated with multiple outcomes, but the biology behind these associations remains elusive. Specifically, genetic studies in ...children are scarce. In this study, we aimed to: (1) estimate the proportion of genetic variance of sleep duration in children attributed to common single nucleotide polymorphisms (SNPs), (2) identify novel SNPs associated with sleep duration in children, and (3) investigate the genetic overlap of sleep duration in children and related metabolic and psychiatric traits.
Methods:
We performed a population-based molecular genetic study, using data form the EArly Genetics and Life course Epidemiology (EAGLE) Consortium. 10,554 children of European ancestry were included in the discovery, and 1,250 children in the replication phase.
Results:
We found evidence of significant but modest SNP heritability of sleep duration in children (SNP h2
0.14, 95% CI 0.05, 0.23) using the LD score regression method. A novel region at chromosome 11q13.4 (top SNP: rs74506765, P = 2.27e-08) was associated with sleep duration in children, but this was not replicated in independent studies. Nominally significant genetic overlap was only found (rG = 0.23, P = 0.05) between sleep duration in children and type 2 diabetes in adults, supporting the hypothesis of a common pathogenic mechanism.
Conclusions:
The significant SNP heritability of sleep duration in children and the suggestive genetic overlap with type 2 diabetes support the search for genetic mechanisms linking sleep duration in children to multiple outcomes in health and disease.
The pathogenesis of Hirschsprung disease is complex. Although the RET proto-oncogene is the most frequently affected gene in Hirschsprung disease, rare coding sequence variants explain only a small ...part of Hirschsprung disease cases. We aimed to assess the genetic background of Hirschsprung disease using a genome-wide association analysis combined with sequencing all RET exons in samples from 105 Hirschsprung disease cases (30 familial and 75 sporadic) and 386 controls.
As expected, variants in or near RET showed the strongest overall association with Hirschsprung disease and the most statistically significant association was observed when using a recessive genetic model (rs2435357, NC_000010.10:g.43582056T > C; genotype TT, OR = 17.31, P = 1.462 × 10−21). Previously published associations in variants in SEMA (rs11766001, NC_000007.13:g.84145202A > C; allele C, OR = 2.268, P = 0.009533) and NRG1 (rs4541858, NC_000008.10:g.32410309A > G; allele G, OR = 1.567, P = 0.015; rs7835688, NC_000008.10:g.32411499G > C; allele C, OR = 1.567, P = 0.015) were also replicated in the genome-wide association analysis. Sequencing revealed a total of 12 exonic RET rare variants. Of these, eight amino acid changing rare variants and two frameshift variants caused or possibly caused Hirschsprung disease.
Only a minority of the Hirschsprung disease cases (9/30 familial; 7/75 sporadic) carried one of the rare variants. Excluding the rare variant carriers from the genome-wide association analysis did not appreciably change the association of rs2435357 with Hirschsprung disease. We estimate that approximately two thirds of the sporadic cases may be statistically attributed to the recessive action of the common non-coding RET variants. Thus, even though most cases do not carry rare RET variants, combinations of rare variants and the common non-coding RET variant cause the majority of the cases in our population.