Abstract
Depression is currently the leading cause of disability around the world. We conducted an epigenome-wide association study (EWAS) in a sample of 58 depression score-discordant monozygotic ...twin pairs, aiming to detect specific epigenetic variants potentially related to depression and further integrate with gene expression profile data. Association between the methylation level of each CpG site and depression score was tested by applying a linear mixed effect model. Weighted gene co-expression network analysis (WGCNA) was performed for gene expression data. The association of DNA methylation levels of 66 CpG sites with depression score reached the level of
P
< 1 × 10
−4
. These top CpG sites were located at 34 genes, especially
PTPRN2
,
HES5
,
GATA2
,
PRDM7
, and
KCNIP1
. Many ontology enrichments were highlighted, including Notch signaling pathway, Huntington disease, p53 pathway by glucose deprivation, hedgehog signaling pathway, DNA binding, and nucleic acid metabolic process. We detected 19 differentially methylated regions (DMRs), some of which were located at
GRIK2
,
DGKA
, and
NIPA2
. While integrating with gene expression data,
HELZ2
,
PTPRN2
,
GATA2
, and
ZNF624
were differentially expressed. In WGCNA, one specific module was positively correlated with depression score (
r
= 0.62,
P
= 0.002). Some common genes (including
BMP2
,
PRDM7
,
KCNIP1
, and
GRIK2
) and enrichment terms (including complement and coagulation cascades pathway, DNA binding, neuron fate specification, glial cell differentiation, and thyroid gland development) were both identified in methylation analysis and WGCNA. Our study identifies specific epigenetic variations which are significantly involved in regions, functional genes, biological function, and pathways that mediate depression disorder.
Mosaicism in blood varies with age, and cross-sectional studies indicate that for women, skewness of X-chromosomal mosaicism increases with age. This pattern could, however, also be due to less ...X-inactivation in more recent birth cohorts. Skewed X-chromosome inactivation was here measured longitudinally by the HUMARA assay in 67 septuagenarian and octogenarian women assessed at 2 time points, 10 years apart, and in 10 centenarian women assessed at 2 time points, 2-7 years apart. Skewed X-chromosome inactivation was also compared in 293 age-matched septuagenarian twins born in 1917-1923 and 1931-1937, and 212 centenarians born in 1895, 1905 and 1915. The longitudinal study of septuagenarians and octogenarians revealed that 16% (95% CI 7-29%) of the women developed skewed X-inactivation over a 10-year period. In the cross-sectional across-birth cohort study, the earlier-born septuagenarian (1917-1923) and centenarian women (1895) had a higher degree of skewness than the respective recent age-matched birth cohorts, which indicates that the women in the more recent cohorts, after the age of 70, had not only changed degree of skewness with age, they had also undergone less age-related hematopoietic sub-clone expansion. This may be a result of improved living conditions and better medical treatment in the more recent birth cohorts.
Expression of
HOX transcript antisense intergenic
RNA (
HOTAIR
)—a long non-coding RNA—has been examined in a variety of human cancers, and overexpression of
HOTAIR
is correlated with poor survival ...among breast, colon, and liver cancer patients. In this retrospective study, we examine
HOTAIR
expression in 164 primary breast tumors, from patients who do not receive adjuvant treatment, in a design that is paired with respect to the traditional prognostic markers. We show that
HOTAIR
expression differs between patients with or without a metastatic endpoint, respectively. Survival analysis shows that high
HOTAIR
expression in primary tumors is significantly associated with worse prognosis independent of prognostic markers (
P
= 0.012, hazard ratio (HR) 1.747). This association is even stronger when looking only at estrogen receptor (ER)-positive tumor samples (
P
= 0.0086, HR 1.985). In ER-negative tumor samples, we are not able to detect a prognostic value of
HOTAIR
expression, probably due to the limited sample size. These results are successfully validated in an independent dataset with similar associations (
P
= 0.018, HR 1.825). In conclusion, our findings suggest that
HOTAIR
expression may serve as an independent biomarker for the prediction of the risk of metastasis in ER-positive breast cancer patients.
Body mass index (BMI) serves as an important measurement of obesity and adiposity, which are highly correlated with cardiometabolic diseases. Although high heritability has been estimated, the ...identified genetic variants by genetic association studies only explain a small proportion of BMI variation. As an active effort for further exploring the molecular basis of BMI variation, large-scale epigenome-wide association studies have been conducted but with limited number of loci reported, perhaps due to poorly controlled confounding factors, including genetic factors. Being genetically identical, monozygotic twins discordant for BMI are ideal subjects for analyzing the epigenetic association between DNA methylation and BMI, providing perfect control on their genetic makeups largely responsible for BMI variation.
We performed an epigenome-wide association study on BMI using 30 identical twin pairs (15 male and 15 female pairs) with age ranging from 39 to 72 years and degree of BMI discordance ranging from 3-7.5 kg/m
. Methylation data from whole blood samples were collected using the reduced representation bisulfite sequencing technique.
After adjusting for blood cell composition and clinical variables, we identified 136 CpGs with p-value < 1e-4, 30 CpGs with p < 1e-05 but no CpGs reached genome-wide significance. Genomic region-based analysis found 11 differentially methylated regions harboring coding and non-coding genes some of which were validated by gene expression analysis on independent samples.
Our DNA methylation sequencing analysis on identical twins provides new references for the epigenetic regulation on BMI and obesity.
The therapeutic management of obesity is challenging, hence further elucidating the underlying mechanisms of obesity development and identifying new diagnostic biomarkers and therapeutic targets are ...urgent and necessary. Here, we performed differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) to identify significant genes and specific modules related to BMI based on gene expression profile data of 7 discordant monozygotic twins.
In the differential gene expression analysis, it appeared that 32 differentially expressed genes (DEGs) were with a trend of up-regulation in twins with higher BMI when compared to their siblings. Categories of positive regulation of nitric-oxide synthase biosynthetic process, positive regulation of NF-kappa B import into nucleus, and peroxidase activity were significantly enriched within GO database and NF-kappa B signaling pathway within KEGG database. DEGs of NAMPT, TLR9, PTGS2, HBD, and PCSK1N might be associated with obesity. In the WGCNA, among the total 20 distinct co-expression modules identified, coral1 module (68 genes) had the strongest positive correlation with BMI (r = 0.56, P = 0.04) and disease status (r = 0.56, P = 0.04). Categories of positive regulation of phospholipase activity, high-density lipoprotein particle clearance, chylomicron remnant clearance, reverse cholesterol transport, intermediate-density lipoprotein particle, chylomicron, low-density lipoprotein particle, very-low-density lipoprotein particle, voltage-gated potassium channel complex, cholesterol transporter activity, and neuropeptide hormone activity were significantly enriched within GO database for this module. And alcoholism and cell adhesion molecules pathways were significantly enriched within KEGG database. Several hub genes, such as GAL, ASB9, NPPB, TBX2, IL17C, APOE, ABCG4, and APOC2 were also identified. The module eigengene of saddlebrown module (212 genes) was also significantly correlated with BMI (r = 0.56, P = 0.04), and hub genes of KCNN1 and AQP10 were differentially expressed.
We identified significant genes and specific modules potentially related to BMI based on the gene expression profile data of monozygotic twins. The findings may help further elucidate the underlying mechanisms of obesity development and provide novel insights to research potential gene biomarkers and signaling pathways for obesity treatment. Further analysis and validation of the findings reported here are important and necessary when more sample size is acquired.
•The epigenome of identical twins can serve as a perfect laboratory for studying the environmental impact on behavioral traits.•The discordant twin design is a powerful tool for epigenetic ...association study.•The use of the design in linking specific environmental exposures with epigenetic modification should be encouraged.•Twins are useful for causal inference in behavioral epigenetics for finding effective biomarkers as targets for intervention and prevention.
The debate upon the relative importance of nature vs nurture in the development of human behaviour can be traced back to ancient times. Traditional epidemiology and genetic epidemiology have confirmed the association of environmental and genetic factors with behavioural traits. Current genomic studies are identifying genetic variants associated with various behavioural traits. However, exploring the relationship of abundant environmental factors with the complex epigenome that mediates human behaviour is just at its beginning. Identical twins can serve as perfect experiments for studying the environmental impact on behavioural epigenetics advantaged by enriched power in association analysis due to controlling of their genetic make-ups. Recent development in causal inference using twin-based models adds more values in twins. This review briefly introduces the various approaches in making use of twins in studying behavioural epigenetics from experiment design to practical applications. Exploring the epigenome of twins using the powerful twin-based study designs and analytical approaches will help identifying causal epigenetic markers mediating environmental exposures and behavioural traits enabling both pharmaceutical intervention and effective prevention.
We analyzed data from twins to determine how much the familial risk of colorectal cancer can be attributed to genetic factors vs environment. We also examined whether heritability is distinct for ...colon vs rectal cancer, given evidence of distinct etiologies.
Our data set included 39,990 monozygotic and 61,443 same-sex dizygotic twins from the Nordic Twin Study of Cancer. We compared each cancer's risk in twins of affected co-twins relative to the cohort risk (familial risk ratio FRR). We then estimated the proportion of variation in risk that could be attributed to genetic factors (heritability).
From earliest registration in 1943 through 2010, there were 1861 individuals diagnosed with colon cancer and 1268 diagnosed with rectal cancer. Monozygotic twins of affected co-twins had an FRR for colorectal cancer of 3.1 (95% confidence interval CI, 2.4-3.8) relative to the cohort risk. Dizygotic twins of affected co-twins had an FRR for colorectal cancer of 2.2 (95% CI, 1.7-2.7). We estimated that 40% (95% CI, 33%-48%) of the variation in colorectal cancer risk could be attributed to genetic factors; unique environment only accounted for the remaining liability. For colon cancer, the FRR was 3.3 (95% CI, 2.1-4.5) for monozygotic twins and 2.6 (95% CI, 1.7-3.5) for dizygotic twins. For rectal cancer, comparable estimates were 3.3 (95% CI, 1.5-5.1) for monozygotic twins and 2.6 (95% CI, 1.2-4.0) for dizygotic twins. Heritability estimates for colon and rectal cancer were 16% (95% CI, 0-46%) and 15% (95% CI, 0-50%), common environment estimates were 15% (95% CI, 0-38%) and 11% (95% CI, 0-38%), and unique environment estimates were 68% (95% CI, 57%-79%) and 75% (95% CI, 61%-88%), respectively.
Interindividual genetic differences could account for 40% of the variation in susceptibility to colorectal cancer; risk for colon and rectal cancers might have less of a genetic component than risk for colorectal cancer. Siblings, and particularly monozygotic co-twins, of individuals with colon or rectal cancer should consider personalized screening.
Semantic fluency impairment has been attributed to a wide range of neurocognitive and psychiatric conditions, especially in the older population. Moderate heritability estimates on semantic fluency ...were obtained from both twin and family-based studies suggesting genetic contributions to the observed variation across individuals. Currently, effort in identifying the genetic variants underlying the heritability estimates for this complex trait remains scarce. Using the semantic fluency scale and genome-wide SNP genotype data from the Long Life Family Study (LLFS), we performed a genome-wide association study (GWAS) and epistasis network analysis on semantic fluency in 2289 individuals aged over 60 years from the American LLFS cohorts and replicated the findings in 1129 individuals aged over 50 years from the Danish LLFS cohort. In the GWAS, two SNPs with genome-wide significance (rs3749683,
= 2.52 × 10
; rs880179,
= 4.83 × 10
) mapped to the
gene on chromosome 5 were detected. The epistasis network analysis identified five modules as significant (4.16 × 10
<
< 7.35 × 10
), of which two were replicated (
< 3.10 × 10
). These two modules revealed significant enrichment of tissue-specific gene expression in brain tissues and high enrichment of GWAS catalog traits, e.g., obesity-related traits, blood pressure, chronotype, sleep duration, and brain structure, that have been reported to associate with verbal performance in epidemiological studies. Our results suggest high tissue specificity of genetic regulation of gene expression in brain tissues with epistatic SNP networks functioning jointly in modifying individual verbal ability and cognitive performance.
Breast cancer cells exhibit complex karyotypic alterations causing deregulation of numerous genes. Some of these genes are probably causal for cancer formation and local growth whereas others are ...causal for the various steps of metastasis. In a fraction of tumors deregulation of the same genes might be caused by epigenetic modulations, point mutations or the influence of other genes. We have investigated the relation of gene expression and chromosomal position, using eight datasets including more than 1200 breast tumors, to identify chromosomal regions and candidate genes possibly causal for breast cancer metastasis. By use of “Gene Set Enrichment Analysis” we have ranked chromosomal regions according to their relation to metastasis. Overrepresentation analysis identified regions with increased expression for chromosome 1q41-42, 8q24, 12q14, 16q22, 16q24, 17q12-21.2, 17q21-23, 17q25, 20q11, and 20q13 among metastasizing tumors and reduced gene expression at 1p31-21, 8p22-21, and 14q24. By analysis of genes with extremely imbalanced expression in these regions we identified DIRAS3 at 1p31, PSD3, LPL, EPHX2 at 8p21-22, and FOS at 14q24 as candidate metastasis suppressor genes. Potential metastasis promoting genes includes RECQL4 at 8q24, PRMT7 at 16q22, GINS2 at 16q24, and AURKA at 20q13.