The relationship between the development and/or progression of interstitial lung abnormalities (ILA) and clinical outcomes has not been previously investigated.
To determine the risk factors for, and ...the clinical consequences of, having ILA progression in participants from the Framingham Heart Study.
ILA were assessed in 1,867 participants who had serial chest computed tomography (CT) scans approximately 6 years apart. Mixed effect regression (and Cox) models were used to assess the association between ILA progression and pulmonary function decline (and mortality).
During the follow-up period 660 (35%) participants did not have ILA on either CT scan, 37 (2%) had stable to improving ILA, and 118 (6%) had ILA with progression (the remaining participants without ILA were noted to be indeterminate on at least one CT scan). Increasing age and increasing copies of the MUC5B promoter polymorphism were associated with ILA progression. After adjustment for covariates, ILA progression was associated with a greater FVC decline when compared with participants without ILA (20 ml; SE, ±6 ml; P = 0.0005) and with those with ILA without progression (25 ml; SE, ±11 ml; P = 0.03). Over a median follow-up time of approximately 4 years, after adjustment, ILA progression was associated with an increase in the risk of death (hazard ratio, 3.9; 95% confidence interval, 1.3-10.9; P = 0.01) when compared with those without ILA.
These findings demonstrate that ILA progression in the Framingham Heart Study is associated with an increased rate of pulmonary function decline and increased risk of death.
Identification of single nucleotide polymorphisms (SNPs) associated with gene expression levels, known as expression quantitative trait loci (eQTLs), may improve understanding of the functional role ...of phenotype-associated SNPs in genome-wide association studies (GWAS). The small sample sizes of some previous eQTL studies have limited their statistical power. We conducted an eQTL investigation of microarray-based gene and exon expression levels in whole blood in a cohort of 5257 individuals, exceeding the single cohort size of previous studies by more than a factor of 2.
We detected over 19,000 independent lead cis-eQTLs and over 6000 independent lead trans-eQTLs, targeting over 10,000 gene targets (eGenes), with a false discovery rate (FDR) < 5%. Of previously published significant GWAS SNPs, 48% are identified to be significant eQTLs in our study. Some trans-eQTLs point toward novel mechanistic explanations for the association of the SNP with the GWAS-related phenotype. We also identify 59 distinct blocks or clusters of trans-eQTLs, each targeting the expression of sets of six to 229 distinct trans-eGenes. Ten of these sets of target genes are significantly enriched for microRNA targets (FDR < 5%). Many of these clusters are associated in GWAS with multiple phenotypes.
These findings provide insights into the molecular regulatory patterns involved in human physiology and pathophysiology. We illustrate the value of our eQTL database in the context of a recent GWAS meta-analysis of coronary artery disease and provide a list of targeted eGenes for 21 of 58 GWAS loci.
A common promoter polymorphism (rs35705950) in MUC5B, the gene encoding mucin 5B, is associated with idiopathic pulmonary fibrosis. It is not known whether this polymorphism is associated with ...interstitial lung disease in the general population.
We performed a blinded assessment of interstitial lung abnormalities detected in 2633 participants in the Framingham Heart Study by means of volumetric chest computed tomography (CT). We evaluated the relationship between the abnormalities and the genotype at the rs35705950 locus.
Of the 2633 chest CT scans that were evaluated, interstitial lung abnormalities were present in 177 (7%). Participants with such abnormalities were more likely to have shortness of breath and chronic cough and reduced measures of total lung and diffusion capacity, as compared with participants without such abnormalities. After adjustment for covariates, for each copy of the minor rs35705950 allele, the odds of interstitial lung abnormalities were 2.8 times greater (95% confidence interval CI, 2.0 to 3.9; P<0.001), and the odds of definite CT evidence of pulmonary fibrosis were 6.3 times greater (95% CI, 3.1 to 12.7; P<0.001). Although the evidence of an association between the MUC5B genotype and interstitial lung abnormalities was greater among participants who were older than 50 years of age, a history of cigarette smoking did not appear to influence the association.
The MUC5B promoter polymorphism was found to be associated with interstitial lung disease in the general population. Although this association was more apparent in older persons, it did not appear to be influenced by cigarette smoking. (Funded by the National Institutes of Health and others; ClinicalTrials.gov number, NCT00005121.).
Considering relatives' health history in logistic regression for case-control genome-wide association studies (CC-GWAS) may provide new information that increases accuracy and power to detect disease ...associated genetic variants. We conducted simulations and analyzed type 2 diabetes (T2D) data from the Framingham Heart Study (FHS) to compare two methods, liability threshold model conditional on both case-control status and family history (LT-FH) and Fam-meta, which incorporate family history into CC-GWAS. In our simulation scenario of trait with modest T2D heritability (h.sup.2 = 0.28), variant minor allele frequency ranging from 1% to 50%, and 1% of phenotype variance explained by the genetic variants, Fam-meta had the highest overall power, while both methods incorporating family history were more powerful than CC-GWAS. All three methods had controlled type I error rates, while LT-FH was the most conservative with a lower-than-expected error rate. In addition, we observed a substantial increase in power of the two familial history methods compared to CC-GWAS when the prevalence of the phenotype increased with age. Furthermore, we showed that, when only the phenotypes of more distant relatives were available, Fam-meta still remained more powerful than CC-GWAS, confirming that leveraging disease history of both close and distant relatives can increase power of association analyses. Using FHS data, we confirmed the well-known association of TCF7L2 region with T2D at the genome-wide threshold of P-value < 5 x 10.sup.-8, and both familial history methods increased the significance of the region compared to CC-GWAS. We identified two loci at 5q35 (ADAMTS2) and 5q23 (PRR16), not previously reported for T2D using CC-GWAS and Fam-meta; both genes play a role in cardiovascular diseases. Additionally, CC-GWAS detected one more significant locus at 13q31 (GPC6) reported associated with T2D-related traits. Overall, LT-FH and Fam-meta had higher power than CC-GWAS in simulations, especially using phenotypes that were more prevalent in older age groups, and both methods detected known genetic variants with lower P-values in real data application, highlighting the benefits of including family history in genetic association studies.
Genome-wide association studies (GWAS) may have reached their limit of detecting common type 2 diabetes (T2D)-associated genetic variation. We evaluated the performance of current polygenic T2D ...prediction. Using data from the Framingham Offspring (FOS) and the Coronary Artery Risk Development in Young Adults (CARDIA) studies, we tested three hypotheses: 1) a 62-locus genotype risk score (GRSt) improves T2D prediction compared with previous less inclusive GRSt; 2) separate GRS for β-cell (GRSβ) and insulin resistance (GRSIR) independently predict T2D; and 3) the relationships between T2D and GRSt, GRSβ, or GRSIR do not differ between blacks and whites. Among 1,650 young white adults in CARDIA, 820 young black adults in CARDIA, and 3,471 white middle-aged adults in FOS, cumulative T2D incidence was 5.9%, 14.4%, and 12.9%, respectively, over 25 years. The 62-locus GRSt was significantly associated with incident T2D in all three groups. In FOS but not CARDIA, the 62-locus GRSt improved the model C statistic (0.698 and 0.726 for models without and with GRSt, respectively; P < 0.001) but did not materially improve risk reclassification in either study. Results were similar among blacks compared with whites. The GRSβ but not GRSIR predicted incident T2D among FOS and CARDIA whites. At the end of the era of common variant discovery for T2D, polygenic scores can predict T2D in whites and blacks but do not outperform clinical models. Further optimization of polygenic prediction may require novel analytic methods, including less common as well as functional variants.
As omics measurements profiled on different molecular layers are interconnected, integrative approaches that incorporate the regulatory effect from multi-level omics data are needed. When the ...multi-level omics data are from the same individuals, gene expression (GE) clusters can be identified using information from regulators like genetic variants and DNA methylation. When the multi-level omics data are from different individuals, the choice of integration approaches is limited.
We developed an approach to improve GE clustering from microarray data by integrating regulatory data from different but partially overlapping sets of individuals. We achieve this through (1) decomposing gene expression into the regulated component and the other component that is not regulated by measured factors, (2) optimizing the clustering goodness-of-fit objective function. We do not require the availability of different omics measurements on all individuals. A certain amount of individual overlap between GE data and the regulatory data is adequate for modeling the regulation, thus improving GE clustering.
A simulation study shows that the performance of the proposed approach depends on the strength of the GE-regulator relationship, degree of missingness, data dimensionality, sample size, and the number of clusters. Across the various simulation settings, the proposed method shows competitive performance in terms of accuracy compared to the alternative K-means clustering method, especially when the clustering structure is due mostly to the regulated component, rather than the unregulated component. We further validate the approach with an application to 8,902 Framingham Heart Study participants with data on up to 17,873 genes and regulation information of DNA methylation and genotype from different but partially overlapping sets of participants. We identify clustering structures of genes associated with pulmonary function while incorporating the predicted regulation effect from the measured regulators. We further investigate the over-representation of these GE clusters in pathways of other diseases that may be related to lung function and respiratory health.
We propose a novel approach for clustering GE with the assistance of regulatory data that allowed for different but partially overlapping sets of individuals to be included in different omics data.
Because single genetic variants may have pleiotropic effects, one trait can be a confounder in a genome-wide association study (GWAS) that aims to identify loci associated with another trait. A ...typical approach to address this issue is to perform an additional analysis adjusting for the confounder. However, obtaining conditional results can be time-consuming. We propose an approximate conditional phenotype analysis based on GWAS summary statistics, the covariance between outcome and confounder, and the variant minor allele frequency (MAF). GWAS summary statistics and MAF are taken from GWAS meta-analysis results while the traits covariance may be estimated by two strategies: (i) estimates from a subset of the phenotypic data; or (ii) estimates from published studies. We compare our two strategies with estimates using individual level data from the full GWAS sample (gold standard). A simulation study for both binary and continuous traits demonstrates that our approximate approach is accurate. We apply our method to the Framingham Heart Study (FHS) GWAS and to large-scale cardiometabolic GWAS results. We observed a high consistency of genetic effect size estimates between our method and individual level data analysis. Our approach leads to an efficient way to perform approximate conditional analysis using large-scale GWAS summary statistics.
Multiple genetic loci have been convincingly associated with the risk of type 2 diabetes mellitus. We tested the hypothesis that knowledge of these loci allows better prediction of risk than ...knowledge of common phenotypic risk factors alone.
We genotyped single-nucleotide polymorphisms (SNPs) at 18 loci associated with diabetes in 2377 participants of the Framingham Offspring Study. We created a genotype score from the number of risk alleles and used logistic regression to generate C statistics indicating the extent to which the genotype score can discriminate the risk of diabetes when used alone and in addition to clinical risk factors.
There were 255 new cases of diabetes during 28 years of follow-up. The mean (+/-SD) genotype score was 17.7+/-2.7 among subjects in whom diabetes developed and 17.1+/-2.6 among those in whom diabetes did not develop (P<0.001). The sex-adjusted odds ratio for diabetes was 1.12 per risk allele (95% confidence interval, 1.07 to 1.17). The C statistic was 0.534 without the genotype score and 0.581 with the score (P=0.01). In a model adjusted for sex and self-reported family history of diabetes, the C statistic was 0.595 without the genotype score and 0.615 with the score (P=0.11). In a model adjusted for age, sex, family history, body-mass index, fasting glucose level, systolic blood pressure, high-density lipoprotein cholesterol level, and triglyceride level, the C statistic was 0.900 without the genotype score and 0.901 with the score (P=0.49). The genotype score resulted in the appropriate risk reclassification of, at most, 4% of the subjects.
A genotype score based on 18 risk alleles predicted new cases of diabetes in the community but provided only a slightly better prediction of risk than knowledge of common risk factors alone.
Previous reports have implicated multiple genetic loci associated with AF, but the contributions of genome-wide variation to AF susceptibility have not been quantified.
We assessed the contribution ...of genome-wide single-nucleotide polymorphism variation to AF risk (single-nucleotide polymorphism heritability,
) using data from 120 286 unrelated individuals of European ancestry (2987 with AF) in the population-based UK Biobank. We ascertained AF based on self-report, medical record billing codes, procedure codes, and death records. We estimated
using a variance components method with variants having a minor allele frequency ≥1%. We evaluated
in age, sex, and genomic strata of interest. The
for AF was 22.1% (95% confidence interval, 15.6%-28.5%) and was similar for early- versus older-onset AF (≤65 versus >65 years of age), as well as for men and women. The proportion of AF variance explained by genetic variation was mainly accounted for by common (minor allele frequency, ≥5%) variants (20.4%; 95% confidence interval, 15.1%-25.6%). Only 6.4% (95% confidence interval, 5.1%-7.7%) of AF variance was attributed to variation within known AF susceptibility, cardiac arrhythmia, and cardiomyopathy gene regions.
Genetic variation contributes substantially to AF risk. The risk for AF conferred by genomic variation is similar to that observed for several other cardiovascular diseases. Established AF loci only explain a moderate proportion of disease risk, suggesting that further genetic discovery, with an emphasis on common variation, is warranted to understand the causal genetic basis of AF.
Background Limited data are available on the prospective relationship between beverage consumption and plasma lipid and lipoprotein concentrations. Two major sources of sugar in the US diet are ...sugar-sweetened beverages (SSBs) and 100% fruit juices. Low-calorie sweetened beverages are common replacements. Methods and Results Fasting plasma lipoprotein concentrations were measured in the FOS (Framingham Offspring Study) (1991-2014; N=3146) and Generation Three (2002-2001; N=3584) cohorts. Beverage intakes were estimated from food frequency questionnaires and grouped into 5 intake categories. Mixed-effect linear regression models were used to examine 4-year changes in lipoprotein measures, and Cox proportional hazard models were used to estimate hazard ratios for incident dyslipidemia, adjusting for potential confounding factors. We found that regular (>1 serving per day) versus low (<1 serving per month) SSB consumption was associated with a greater mean decrease in high-density lipoprotein cholesterol (β±standard error -1.6±0.4 mg/dL;
<0.0001) and increase in triglyceride (β±standard error: 4.4±2.2 mg/dL;
=0.003) concentrations. Long-term regular SSB consumers also had a higher incidence of high triglyceride (hazard ratio, 1.52; 95% CI, 1.03-2.25) compared with low consumers. Although recent regular low-calorie sweetened beverage consumers had a higher incidence of high non-high-density lipoprotein cholesterol (hazard ratio, 1.40; 95% CI, 1.17-1.69) and low-density lipoprotein cholesterol (hazard ratio, 1.27; 95% CI, 1.05-1.53) concentrations compared with low consumers, cumulative average intakes of low-calorie sweetened beverages were not associated with changes in non-high-density lipoprotein cholesterol, low-density lipoprotein cholesterol concentrations, or incident dyslipidemias. Conclusions SSB intake was associated with adverse changes in high-density lipoprotein cholesterol and triglyceride concentrations, along with a higher risk of incident dyslipidemia, suggesting that increased SSB consumption may contribute to the development of dyslipidemia.