We investigate the extent to which human genetic data are incorporated into studies that hypothesize novel links between genes and metabolic disease. To lower the barriers to using genetic data, we ...present an approach to enable researchers to evaluate human genetic support for experimentally determined hypotheses.
Despite the now widespread availability of human genetic data and their recognized value to complement work in experimental models, they are rarely incorporated into experimental studies. Dornbos et al. discuss how researchers can use human genetic data to support the involvement of genes in common metabolic diseases.
Abstract Combining multiple genetic variants related to obesity into a genetic risk score (GRS) might improve identification of individuals at risk of developing obesity. Moreover, characterizing ...gene–diet interactions is a research challenge to establish dietary recommendations to individuals with higher predisposition to obesity. Our objective was to analyze the association between an obesity GRS and body mass index (BMI) in the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN) population, focusing on gene–diet interactions with total fat and saturated fatty acid (SFA) intake, and to replicate findings in the Multi-Ethnic Study of Atherosclerosis (MESA) population. Cross-sectional analyses included 783 white US participants from GOLDN and 2,035 from MESA. Dietary intakes were estimated with validated food frequency questionnaires. Height and weight were measured. A weighted GRS was calculated on the basis of 63 obesity-associated variants. Multiple linear regression models adjusted by potential confounders were used to examine gene–diet interactions between dietary intake (total fat and SFA) and the obesity GRS in determining BMI. Significant interactions were found between total fat intake and the obesity GRS using these variables as continuous for BMI ( P for interaction=0.010, 0.046, and 0.002 in GOLDN, MESA, and meta-analysis, respectively). These association terms were stronger when assessing interactions between SFA intake and GRS for BMI ( P for interaction=0.005, 0.018, and <0.001 in GOLDN, MESA, and meta-analysis, respectively). SFA intake interacts with an obesity GRS in modulating BMI in two US populations. Although determining the causal direction requires further investigation, these findings suggest that potential dietary recommendations to reduce BMI effectively in populations with high obesity GRS would be to reduce total fat intake mainly by limiting SFAs.
The genetic code has degenerate codons that produce no change in the translated protein sequence and are generally thought to be silent. However, some synonymous variants are clearly not silent. ...Herein, we questioned the frequency of non-silent synonymous variants. We tested how random synonymous variants in the HIV Tat transcription factor effect transcription of an LTR-GFP reporter. Our model system has the advantage of directly measuring the function of the gene in human cells. Approximately, 67% of synonymous variants in Tat were non-silent, either having reduced activity or were full loss-of-function alleles. Eight mutant codons had higher codon usage than wild type, accompanied by reduced transcriptional activity. These were clustered on a loop in the Tat structure. We conclude that most synonymous Tat variants are not silent in human cells, and 25% are associated with changes in codon usage, likely effecting protein folding.
•This is the first analysis of molecular function in human cells for a randomly sampled set of synonymous variants in a gene.•Approximately 59–67% of synonymous Tat variants are not silent with respect to transcriptional function.•Approximately 25% of synonymous variants are clustered and associated with changes in codon usage.
Integrative approaches that simultaneously model multi-omics data have gained increasing popularity because they provide holistic system biology views of multiple or all components in a biological ...system of interest. Canonical correlation analysis (CCA) is a correlation-based integrative method designed to extract latent features shared between multiple assays by finding the linear combinations of features-referred to as canonical variables (CVs)-within each assay that achieve maximal across-assay correlation. Although widely acknowledged as a powerful approach for multi-omics data, CCA has not been systematically applied to multi-omics data in large cohort studies, which has only recently become available. Here, we adapted sparse multiple CCA (SMCCA), a widely-used derivative of CCA, to proteomics and methylomics data from the Multi-Ethnic Study of Atherosclerosis (MESA) and Jackson Heart Study (JHS). To tackle challenges encountered when applying SMCCA to MESA and JHS, our adaptations include the incorporation of the Gram-Schmidt (GS) algorithm with SMCCA to improve orthogonality among CVs, and the development of Sparse Supervised Multiple CCA (SSMCCA) to allow supervised integration analysis for more than two assays. Effective application of SMCCA to the two real datasets reveals important findings. Applying our SMCCA-GS to MESA and JHS, we identified strong associations between blood cell counts and protein abundance, suggesting that adjustment of blood cell composition should be considered in protein-based association studies. Importantly, CVs obtained from two independent cohorts also demonstrate transferability across the cohorts. For example, proteomic CVs learned from JHS, when transferred to MESA, explain similar amounts of blood cell count phenotypic variance in MESA, explaining 39.0% ~ 50.0% variation in JHS and 38.9% ~ 49.1% in MESA. Similar transferability was observed for other omics-CV-trait pairs. This suggests that biologically meaningful and cohort-agnostic variation is captured by CVs. We anticipate that applying our SMCCA-GS and SSMCCA on various cohorts would help identify cohort-agnostic biologically meaningful relationships between multi-omics data and phenotypic traits.
Metabolite signatures of long-term alcohol consumption are lacking. To better understand the molecular basis linking alcohol drinking and cardiovascular disease (CVD), we investigated circulating ...metabolites associated with long-term alcohol consumption and examined whether these metabolites were associated with incident CVD.
Cumulative average alcohol consumption (g/day) was derived from the total consumption of beer, wine, and liquor on average of 19 years in 2428 Framingham Heart Study Offspring participants (mean age 56 years, 52% women). We used linear mixed models to investigate the associations of alcohol consumption with 211 log-transformed plasma metabolites, adjusting for age, sex, batch, smoking, diet, physical activity, BMI, and familial relationship. Cox models were used to test the association of alcohol-related metabolite scores with fatal and nonfatal incident CVD (myocardial infarction, coronary heart disease, stroke, and heart failure).
We identified 60 metabolites associated with cumulative average alcohol consumption (p < 0.05/211 ≈ 0.00024). For example, 1 g/day increase of alcohol consumption was associated with higher levels of cholesteryl esters (e.g., CE 16:1, beta = 0.023 ± 0.002, p = 6.3e - 45) and phosphatidylcholine (e.g., PC 32:1, beta = 0.021 ± 0.002, p = 3.1e - 38). Survival analysis identified that 10 alcohol-associated metabolites were also associated with a differential CVD risk after adjusting for age, sex, and batch. Further, we built two alcohol consumption weighted metabolite scores using these 10 metabolites and showed that, with adjustment age, sex, batch, and common CVD risk factors, the two scores had comparable but opposite associations with incident CVD, hazard ratio 1.11 (95% CI = 1.02, 1.21, p = 0.02) vs 0.88 (95% CI = 0.78, 0.98, p = 0.02).
We identified 60 long-term alcohol consumption-associated metabolites. The association analysis with incident CVD suggests a complex metabolic basis between alcohol consumption and CVD.
By means of a combination of genome-wide and follow-up studies, recent large-scale association studies of populations of European descent have now identified over 46 loci associated with coronary ...artery disease (CAD). As part of the TAICHI Consortium, we have collected and genotyped 8556 subjects from Taiwan, comprising 5423 controls and 3133 cases with coronary artery disease, for 9087 CAD SNPs using the CardioMetaboChip. We applied penalized logistic regression to ascertain the top SNPs that contribute together to CAD susceptibility in Taiwan. We observed that the 9p21 locus contributes to CAD at the level of genome-wide significance (rs1537372, with the presence of C, the major allele, the effect estimate is -0.216, standard error 0.033, p value 5.8x10-10). In contrast to a previous report, we propose that the 9p21 locus is a single genetic contribution to CAD in Taiwan because: 1) the penalized logistic regression and the follow-up conditional analysis suggested that rs1537372 accounts for all of the CAD association in 9p21, and 2) the high linkage disequilibrium observed for all associated SNPs in 9p21. We also observed evidence for the following loci at a false discovery rate >5%: SH2B3, ADAMTS7, PHACTR1, GGCX, HTRA1, COL4A1, and LARP6-LRRC49. We also took advantage of the fact that penalized methods are an efficient approach to search for gene-by-gene interactions, and observed that two-way interactions between the PHACTR1 and ADAMTS7 loci and between the SH2B3 and COL4A1 loci contribute to CAD risk. Both the similarities and differences between the significance of these loci when compared with significance of loci in studies of populations of European descent underscore the fact that further genetic association of studies in additional populations will provide clues to identify the genetic architecture of CAD across all populations worldwide.
The TG/HDL-C ratio is used as a marker of insulin resistance (IR) in Caucasians; however, there is limited data in other ethnic groups. We hypothesized that the TG/HDL-C ratio is associated with IR ...in Hispanics and African Americans (AA).
Data from the Insulin Resistance Atherosclerosis Family Study was examined for associations between TG/HDL-C ratio and IR, β-cell function and incident diabetes in non-diabetic Hispanics (n = 872, 63% female) and AA (n = 371, 61% female). Insulin sensitivity index (SI) and disposition index (DI) from frequently-sampled intravenous glucose tolerance tests were used as markers of IR and β-cell function respectively. Incident type 2 diabetes was determined by fasting glucose ≥ 126 mg/dl or initiation of anti-hyperglycemia agents over 5 year follow-up.
Higher TG/HDL-C ratio was associated with IR in Hispanic and AA men and women (P < 0.0002), as well as β-cell function in Hispanic women and AA men and women (P < 0.02). TG/HDL-C predicted incident type 2 diabetes in women (area under the curves 0.703 and 0.795 for Hispanics and AA respectively).
Similar to Caucasians, the TG/HDL-C ratio can be used to identify IR in Hispanics and AA, and may predict type 2 diabetes in women.
Mitochondria carry their own circular genome and disruption of the mitochondrial genome is associated with various aging-related diseases. Unlike the nuclear genome, mitochondrial DNA (mtDNA) can be ...present at 1000 s to 10,000 s copies in somatic cells and variants may exist in a state of heteroplasmy, where only a fraction of the DNA molecules harbors a particular variant. We quantify mtDNA heteroplasmy in 194,871 participants in the UK Biobank and find that heteroplasmy is associated with a 1.5-fold increased risk of all-cause mortality. Additionally, we functionally characterize mtDNA single nucleotide variants (SNVs) using a constraint-based score, mitochondrial local constraint score sum (MSS) and find it associated with all-cause mortality, and with the prevalence and incidence of cancer and cancer-related mortality, particularly leukemia. These results indicate that mitochondria may have a functional role in certain cancers, and mitochondrial heteroplasmic SNVs may serve as a prognostic marker for cancer, especially for leukemia.
Males and females differ in body composition and fat distribution. Using a mouse model that segregates gonadal sex (ovaries and testes) from chromosomal sex (XX and XY), we showed that XX chromosome ...complement in combination with a high-fat diet led to enhanced weight gain in the presence of male or female gonads. We identified the genomic dosage of Kdm5c, an X chromosome gene that escapes X chromosome inactivation, as a determinant of the X chromosome effect on adiposity. Modulating Kdm5c gene dosage in XX female mice to levels that are normally present in males resulted in reduced body weight, fat content, and food intake to a degree similar to that seen with altering the entire X chromosome dosage. In cultured preadipocytes, the levels of KDM5C histone demethylase influenced chromatin accessibility (ATAC-Seq), gene expression (RNA-Seq), and adipocyte differentiation. Both in vitro and in vivo, Kdm5c dosage influenced gene expression involved in extracellular matrix remodeling, which is critical for adipocyte differentiation and adipose tissue expansion. In humans, adipose tissue KDM5C mRNA levels and KDM5C genetic variants were associated with body mass. These studies demonstrate that the sex-dependent dosage of Kdm5c contributes to male/female differences in adipocyte biology and highlight X-escape genes as a critical component of female physiology.