Epigenome-wide association studies (EWAS) and differential gene expression analyses are generally performed on tissue samples, which consist of multiple cell types. Cell-type-specific effects of a ...trait, such as disease, on the omics expression are of interest but difficult or costly to measure experimentally. By measuring omics data for the bulk tissue, cell type composition of a sample can be inferred statistically. Subsequently, cell-type-specific effects are estimated by linear regression that includes terms representing the interaction between the cell type proportions and the trait. This approach involves two issues, scaling and multicollinearity.
First, although cell composition is analyzed in linear scale, differential methylation/expression is analyzed suitably in the logit/log scale. To simultaneously analyze two scales, we applied nonlinear regression. Second, we show that the interaction terms are highly collinear, which is obstructive to ordinary regression. To cope with the multicollinearity, we applied ridge regularization. In simulated data, nonlinear ridge regression attained well-balanced sensitivity, specificity and precision. Marginal model attained the lowest precision and highest sensitivity and was the only algorithm to detect weak signal in real data.
Nonlinear ridge regression performed cell-type-specific association test on bulk omics data with well-balanced performance. The omicwas package for R implements nonlinear ridge regression for cell-type-specific EWAS, differential gene expression and QTL analyses. The software is freely available from https://github.com/fumi-github/omicwas.
Background: Tobacco smoking is a leading preventable cause of morbidity and mortality worldwide; still, the success rate of smoking cessation is low in general. From the viewpoint of public health ...and clinical care, an objective biomarker of long-term smoking behavior is sought.Methods and Results: This study assessed DNA methylation as a biomarker of smoking in a hospital setting through a combination of molecular approaches including genetic, DNA methylation and mRNA expression analyses. First, in an epigenome-wide association study involving Japanese individuals with chronic cardiovascular disease (n=94), genome-wide significant smoking association was identified at 2 CpG sites on chromosome 5, with the strongest signal at cg05575921 located in intron 3 of the aryl-hydrocarbon receptor repressor (AHRR) gene. Highly significant (P<1×10−27) smoking–cg05575921 association was validated in 2 additional panels (n=339 and n=300). For the relationship of cg05575921 methylation extent with time after smoking cessation and cumulative cigarette consumption among former smokers, smoking-related hypomethylation was found to remain for ≥20 years after smoking cessation and to be affected by multiple factors, such as cis-interaction of genetic variation. There was a significant inverse correlation (P=0.0005) between cg05575921 methylation extent and AHRR mRNA expression.Conclusions: The present study results support that reversion of AHRR hypomethylation can be a quantifiable biomarker for progress in and observance of smoking cessation, although some methodological points need to be considered.
Large bowel preparation may cause a substantial change in the gut microbiota and metabolites. Here, we included a bowel prep group and a no-procedure control group and evaluated the effects of bowel ...prep on the stability of the gut microbiome and metabolome as well as on recovery. Gut microbiota and metabolome compositions were analyzed by 16S rRNA sequencing and capillary electrophoresis time-of-flight mass spectrometry, respectively. Analysis of coefficients at the genus and species level and weighted UniFrac distance showed that, compared with controls, microbiota composition was significantly reduced immediately after the prep but not at 14 days after it. For the gut metabolome profiles, correlation coefficients between before and immediately after the prep were significantly lower than those between before and 14 days after prep and were not significantly different compared with those for between-subject differences. Thirty-two metabolites were significantly changed before and immediately after the prep, but these metabolites recovered within 14 days. In conclusion, bowel preparation has a profound effect on the gut microbiome and metabolome, but the overall composition recovers to baseline within 14 days. To properly conduct studies of the human gut microbiome and metabolome, fecal sampling should be avoided immediately after bowel prep.
Certain classes of antihypertensive drug may exert specific, blood pressure (BP)-independent protective effects on end-organ damages such as left ventricular hypertrophy, although the overall ...evidence has not been definitive in clinical trials. To unravel antihypertensive drug-induced gene expression changes that are potentially related to the amelioration of end-organ damages, we performed in vivo phenotypic evaluation and transcriptomic analysis on the heart and the kidney, with administration of antihypertensive drugs to two inbred strains (ie, hypertensive and normotensive) of rats. We chose 6 antihypertensive classes: enalapril (angiotensin-converting enzyme inhibitor), candesartan (angiotensin receptor blocker), hydrochlorothiazide (diuretics), amlodipine (calcium-channel blocker), carvedilol (vasodilating β-blocker), and hydralazine. In the tested rat strains, 4 of 6 drugs, including 2 renin-angiotensin system inhibitors, were effective for BP lowering, whereas the remaining 2 drugs were not. Besides BP lowering, there appeared to be some interdrug heterogeneity in phenotypic changes, such as suppressed body weight gain and body weight-adjusted heart weight reduction. For the transcriptomic response, a considerable number of genes showed prominent mRNA expression changes either in a BP-dependent or BP-independent manner with substantial diversity between the target organs. Noticeable changes of mRNA expression were induced particularly by renin-angiotensin system blockade, for example, for genes in the natriuretic peptide system (
and
) in the heart and for those in the renin-angiotensin system/kallikrein-kinin system (
and rat
paralogs) and those related to calcium ion binding (
and
) in the kidney. The research resources constructed here will help corroborate occasionally inconclusive evidence in clinical settings.
We report the first genome-wide association study (GWAS) whose sample size (1,053 Swedish subjects) is sufficiently powered to detect genome-wide significance (p<1.5 x 10(-7)) for polymorphisms that ...modestly alter therapeutic warfarin dose. The anticoagulant drug warfarin is widely prescribed for reducing the risk of stroke, thrombosis, pulmonary embolism, and coronary malfunction. However, Caucasians vary widely (20-fold) in the dose needed for therapeutic anticoagulation, and hence prescribed doses may be too low (risking serious illness) or too high (risking severe bleeding). Prior work established that approximately 30% of the dose variance is explained by single nucleotide polymorphisms (SNPs) in the warfarin drug target VKORC1 and another approximately 12% by two non-synonymous SNPs (*2, *3) in the cytochrome P450 warfarin-metabolizing gene CYP2C9. We initially tested each of 325,997 GWAS SNPs for association with warfarin dose by univariate regression and found the strongest statistical signals (p<10(-78)) at SNPs clustering near VKORC1 and the second lowest p-values (p<10(-31)) emanating from CYP2C9. No other SNPs approached genome-wide significance. To enhance detection of weaker effects, we conducted multiple regression adjusting for known influences on warfarin dose (VKORC1, CYP2C9, age, gender) and identified a single SNP (rs2108622) with genome-wide significance (p = 8.3 x 10(-10)) that alters protein coding of the CYP4F2 gene. We confirmed this result in 588 additional Swedish patients (p<0.0029) and, during our investigation, a second group provided independent confirmation from a scan of warfarin-metabolizing genes. We also thoroughly investigated copy number variations, haplotypes, and imputed SNPs, but found no additional highly significant warfarin associations. We present power analysis of our GWAS that is generalizable to other studies, and conclude we had 80% power to detect genome-wide significance for common causative variants or markers explaining at least 1.5% of dose variance. These GWAS results provide further impetus for conducting large-scale trials assessing patient benefit from genotype-based forecasting of warfarin dose.
Blood pressure (BP) is a major risk factor for cardiovascular disease and more than 200 genetic loci associated with BP are known. Here, we perform a multi-stage genome-wide association study for BP ...(max N = 289,038) principally in East Asians and meta-analysis in East Asians and Europeans. We report 19 new genetic loci and ancestry-specific BP variants, conforming to a common ancestry-specific variant association model. At 10 unique loci, distinct non-rare ancestry-specific variants colocalize within the same linkage disequilibrium block despite the significantly discordant effects for the proxy shared variants between the ethnic groups. The genome-wide transethnic correlation of causal-variant effect-sizes is 0.898 and 0.851 for systolic and diastolic BP, respectively. Some of the ancestry-specific association signals are also influenced by a selective sweep. Our results provide new evidence for the role of common ancestry-specific variants and natural selection in ethnic differences in complex traits such as BP.
Large-scale meta-analyses of genome-wide association studies (GWAS) have identified >175 loci associated with fasting cholesterol levels, including total cholesterol (TC), high-density lipoprotein ...cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), and triglycerides (TG). With differences in linkage disequilibrium (LD) structure and allele frequencies between ancestry groups, studies in additional large samples may detect new associations. We conducted staged GWAS meta-analyses in up to 69,414 East Asian individuals from 24 studies with participants from Japan, the Philippines, Korea, China, Singapore, and Taiwan. These meta-analyses identified (P < 5 × 10-8) three novel loci associated with HDL-C near CD163-APOBEC1 (P = 7.4 × 10-9), NCOA2 (P = 1.6 × 10-8), and NID2-PTGDR (P = 4.2 × 10-8), and one novel locus associated with TG near WDR11-FGFR2 (P = 2.7 × 10-10). Conditional analyses identified a second signal near CD163-APOBEC1. We then combined results from the East Asian meta-analysis with association results from up to 187,365 European individuals from the Global Lipids Genetics Consortium in a trans-ancestry meta-analysis. This analysis identified (log10Bayes Factor ≥6.1) eight additional novel lipid loci. Among the twelve total loci identified, the index variants at eight loci have demonstrated at least nominal significance with other metabolic traits in prior studies, and two loci exhibited coincident eQTLs (P < 1 × 10-5) in subcutaneous adipose tissue for BPTF and PDGFC. Taken together, these analyses identified multiple novel lipid loci, providing new potential therapeutic targets.
Background: Normative alcohol use (or drinking behavior) influences the risk of cardiovascular disease in a multi-faceted manner. To identify susceptibility gene variants for drinking behavior, a ...2-staged genome-wide association study was performed in a Japanese population. Methods and Results: In the stage-1 scan, 733 cases and 729 controls were genotyped with 456,827 SNP markers. The associated loci without redundancy of linkage disequilibrium were further examined in the stage-2 general population panel comprising 2,794 drinkers (≥once per week), 1,521 chance drinkers (<once per week), and 1,351 non-drinkers. Along with genome-wide exploration, we aimed to replicate the trait association of a candidate gene SNP previously reported (rs1229984 in ADH1B). A cluster of 12 SNPs on 12q24 were found to significantly (P<5×10-8) associate with drinking behavior in stage 1, among which rs671 (a Glu-to-Lys substitution at position 504) in the ALDH2 gene showed the strongest association (odds ratio (OR)=0.16, P=3.6×10-211 in the joint analysis). The association was also replicated for rs1229984 (OR=1.20, P<3.6×10-4). Furthermore, ALDH2 504Lys was associated with several metabolic traits, eg, lower levels of high-density lipoprotein cholesterol and liver enzymes-AST, ALT, and γGTP-by interacting with alcohol intake. Conclusions: Our results confirm ALDH2 as a major locus regulating drinking behavior in the Japanese, indicating that the ALDH2 504Lys variant exerts pleiotropic effects on risk factors of cardiovascular disease among drinkers. (Circ J 2011; 75: 911-918)