Coronary artery disease (CAD) has substantial heritability and a polygenic architecture. However, the potential of genomic risk scores to help predict CAD outcomes has not been evaluated ...comprehensively, because available studies have involved limited genomic scope and limited sample sizes.
This study sought to construct a genomic risk score for CAD and to estimate its potential as a screening tool for primary prevention.
Using a meta-analytic approach to combine large-scale, genome-wide, and targeted genetic association data, we developed a new genomic risk score for CAD (metaGRS) consisting of 1.7 million genetic variants. We externally tested metaGRS, both by itself and in combination with available data on conventional risk factors, in 22,242 CAD cases and 460,387 noncases from the UK Biobank.
The hazard ratio (HR) for CAD was 1.71 (95% confidence interval CI: 1.68 to 1.73) per SD increase in metaGRS, an association larger than any other externally tested genetic risk score previously published. The metaGRS stratified individuals into significantly different life course trajectories of CAD risk, with those in the top 20% of metaGRS distribution having an HR of 4.17 (95% CI: 3.97 to 4.38) compared with those in the bottom 20%. The corresponding HR was 2.83 (95% CI: 2.61 to 3.07) among individuals on lipid-lowering or antihypertensive medications. The metaGRS had a higher C-index (C = 0.623; 95% CI: 0.615 to 0.631) for incident CAD than any of 6 conventional factors (smoking, diabetes, hypertension, body mass index, self-reported high cholesterol, and family history). For men in the top 20% of metaGRS with >2 conventional factors, 10% cumulative risk of CAD was reached by 48 years of age.
The genomic score developed and evaluated here substantially advances the concept of using genomic information to stratify individuals with different trajectories of CAD risk and highlights the potential for genomic screening in early life to complement conventional risk prediction.
Display omitted
Early prediction of risk of cardiovascular disease (CVD), including stroke, is a cornerstone of disease prevention. Clinical risk scores have been widely used for predicting CVD risk from known risk ...factors. Most CVDs have a substantial genetic component, which also has been confirmed for stroke in recent gene discovery efforts. However, the role of genetics in prediction of risk of CVD, including stroke, has been limited to testing for highly penetrant monogenic disorders. In contrast, the importance of polygenic variation, the aggregated effect of many common genetic variants across the genome with individually small effects, has become more apparent in the last 5 to 10 years, and powerful polygenic risk scores for CVD have been developed. Here we review the current state of the field of polygenic risk scores for CVD including stroke, and their potential to improve CVD risk prediction. We present findings and lessons from diseases such as coronary artery disease as these will likely be useful to inform future research in stroke polygenic risk prediction.
Many different microarray experiments are publicly available today. It is natural to ask whether different experiments for the same phenotypic conditions can be combined using meta-analysis, in order ...to increase the overall sample size. However, some genes are not measured in all experiments, hence they cannot be included or their statistical significance cannot be appropriately estimated in traditional meta-analysis. Nonetheless, these genes, which we refer to as incomplete genes, may also be informative and useful.
We propose a meta-analysis framework, called "Incomplete Gene Meta-analysis", which can include incomplete genes by imputing the significance of missing replicates, and computing a meta-score for every gene across all datasets. We demonstrate that the incomplete genes are worthy of being included and our method is able to appropriately estimate their significance in two groups of experiments. We first apply the Incomplete Gene Meta-analysis and several comparable methods to five breast cancer datasets with an identical set of probes. We simulate incomplete genes by randomly removing a subset of probes from each dataset and demonstrate that our method consistently outperforms two other methods in terms of their false discovery rate. We also apply the methods to three gastric cancer datasets for the purpose of discriminating diffuse and intestinal subtypes.
Meta-analysis is an effective approach that identifies more robust sets of differentially expressed genes from multiple studies. The incomplete genes that mainly arise from the use of different platforms may also have statistical and biological importance but are ignored or are not appropriately involved by previous studies. Our Incomplete Gene Meta-analysis is able to incorporate the incomplete genes by estimating their significance. The results on both breast and gastric cancer datasets suggest that the highly ranked genes and associated GO terms produced by our method are more significant and biologically meaningful according to the previous literature.
Coronary artery disease (CAD) has substantial heritability and a polygenic architecture. However, the potential of genomic risk scores to help predict CAD outcomes has not been evaluated ...comprehensively, because available studies have involved limited genomic scope and limited sample sizes.
This study sought to construct a genomic risk score for CAD and to estimate its potential as a screening tool for primary prevention.
Using a meta-analytic approach to combine large-scale, genome-wide, and targeted genetic association data, we developed a new genomic risk score for CAD (metaGRS) consisting of 1.7 million genetic variants. We externally tested metaGRS, both by itself and in combination with available data on conventional risk factors, in 22,242 CAD cases and 460,387 noncases from the UK Biobank.
The hazard ratio (HR) for CAD was 1.71 (95% confidence interval CI: 1.68 to 1.73) per SD increase in metaGRS, an association larger than any other externally tested genetic risk score previously published. The metaGRS stratified individuals into significantly different life course trajectories of CAD risk, with those in the top 20% of metaGRS distribution having an HR of 4.17 (95% CI: 3.97 to 4.38) compared with those in the bottom 20%. The corresponding HR was 2.83 (95% CI: 2.61 to 3.07) among individuals on lipid-lowering or antihypertensive medications. The metaGRS had a higher C-index (C = 0.623; 95% CI: 0.615 to 0.631) for incident CAD than any of 6 conventional factors (smoking, diabetes, hypertension, body mass index, self-reported high cholesterol, and family history). For men in the top 20% of metaGRS with >2 conventional factors, 10% cumulative risk of CAD was reached by 48 years of age.
The genomic score developed and evaluated here substantially advances the concept of using genomic information to stratify individuals with different trajectories of CAD risk and highlights the potential for genomic screening in early life to complement conventional risk prediction.
Polygenic risk scores (PRSs) can be used to predict ischemic stroke (IS). However, further validation of PRS performance is required in independent populations, particularly older adults in whom the ...majority of strokes occur.
We predicted risk of incident IS events in a population of 12 792 healthy older individuals enrolled in the ASPREE trial (Aspirin in Reducing Events in the Elderly). The PRS was calculated using 3.6 million genetic variants. Participants had no previous history of cardiovascular events, dementia, or persistent physical disability at enrollment. The primary outcome was IS over 5 years, with stroke subtypes as secondary outcomes. A multivariable model including conventional risk factors was applied and reevaluated after adding PRS. Area under the curve and net reclassification were evaluated.
At baseline, mean population age was 75 years. In total, 173 incident IS events occurred over a median follow-up of 4.7 years. When PRS was added to the multivariable model as a continuous variable, it was independently associated with IS (hazard ratio, 1.41 95% CI, 1.20–1.65 per SD of the PRS; P<0.001). The PRS alone was a better discriminator for IS events than most conventional risk factors. PRS as a categorical variable was a significant predictor in the highest tertile (hazard ratio, 1.74; P=0.004) compared with the lowest. The area under the curve of the conventional model was 66.6% (95% CI, 62.2–71.1) and after inclusion of the PRS, improved to 68.5 (95% CI, 64.0–73.0 P=0.095). In subgroup analysis, the continuous PRS remained an independent predictor for large vessel and cardioembolic stroke subtypes but not for small vessel stroke. Reclassification was improved, as the continuous net reclassification index after adding PRS to the conventional model was 0.25 (95% CI, 0.17–0.43).
PRS predicts incident IS in a healthy older population but only moderately improves prediction over conventional risk factors.
URL: https://www.clinicaltrials.gov; Unique identifier: NCT01038583.
Recent advances in our understanding of the genomics of the human metabolome have shed light on the pathways involved in metabolic and cardiovascular disease. Such studies crucially depend on the ...interpretation of complex molecular spectra. A recent study by Suhre and colleagues provides a way to identify potentially clinically relevant biomarkers without a priori information, such as reference spectra, thus aiding the discovery of additional spectral features and corresponding genomic loci associated with metabolism and disease.
Cytokines are essential regulatory components of the immune system, and their aberrant levels have been linked to many disease states. Despite increasing evidence that cytokines operate in concert, ...many of the physiological interactions between cytokines, and the shared genetic architecture that underlies them, remain unknown. Here, we aimed to identify and characterize genetic variants with pleiotropic effects on cytokines. Using three population-based cohorts (n = 9,263), we performed multivariate genome-wide association studies (GWAS) for a correlation network of 11 circulating cytokines, then combined our results in meta-analysis. We identified a total of eight loci significantly associated with the cytokine network, of which two (PDGFRB and ABO) had not been detected previously. In addition, conditional analyses revealed a further four secondary signals at three known cytokine loci. Integration, through the use of Bayesian colocalization analysis, of publicly available GWAS summary statistics with the cytokine network associations revealed shared causal variants between the eight cytokine loci and other traits; in particular, cytokine network variants at the ABO, SERPINE2, and ZFPM2 loci showed pleiotropic effects on the production of immune-related proteins, on metabolic traits such as lipoprotein and lipid levels, on blood-cell-related traits such as platelet count, and on disease traits such as coronary artery disease and type 2 diabetes.
The biomarker glycoprotein acetylation (GlycA) has been shown to predict risk of cardiovascular disease and all-cause mortality. Here, we characterize biological processes associated with GlycA by ...leveraging population-based omics data and health records from >10,000 individuals. Our analyses show that GlycA levels are chronic within individuals for up to a decade. In apparently healthy individuals, elevated GlycA corresponded to elevation of myriad inflammatory cytokines, as well as a gene coexpression network indicative of increased neutrophil activity, suggesting that individuals with high GlycA may be in a state of chronic inflammatory response. Accordingly, analysis of infection-related hospitalization and death records showed that increased GlycA increased long-term risk of severe non-localized and respiratory infections, particularly septicaemia and pneumonia. In total, our work demonstrates that GlycA is a biomarker for chronic inflammation, neutrophil activity, and risk of future severe infection. It also illustrates the utility of leveraging multi-layered omics data and health records to elucidate the molecular and cellular processes associated with biomarkers.
Display omitted
•Elevated GlycA was stable within individuals for up to a decade•GlycA marked the levels of myriad inflammatory cytokines in circulation•A gene network enriched for neutrophil functions was associated with GlycA•GlycA strongly predicted future risk of hospitalization and death from infection
Ritchie et al. investigate the biology of GlycA, a known biomarker for short-term mortality. They reveal GlycA’s long-term behavior in apparently healthy patients: it is stable for >10 years and associated with chronic low-grade inflammation. Accordingly, GlycA predicts death from infection up to 14 years in the future.