Identifying sources of variation in DNA methylation levels is important for understanding gene regulation. Recently, bisulfite sequencing has become a popular tool for investigating DNA methylation ...levels. However, modeling bisulfite sequencing data is complicated by dramatic variation in coverage across sites and individual samples, and because of the computational challenges of controlling for genetic covariance in count data. To address these challenges, we present a binomial mixed model and an efficient, sampling-based algorithm (MACAU: Mixed model association for count data via data augmentation) for approximate parameter estimation and p-value computation. This framework allows us to simultaneously account for both the over-dispersed, count-based nature of bisulfite sequencing data, as well as genetic relatedness among individuals. Using simulations and two real data sets (whole genome bisulfite sequencing (WGBS) data from Arabidopsis thaliana and reduced representation bisulfite sequencing (RRBS) data from baboons), we show that our method provides well-calibrated test statistics in the presence of population structure. Further, it improves power to detect differentially methylated sites: in the RRBS data set, MACAU detected 1.6-fold more age-associated CpG sites than a beta-binomial model (the next best approach). Changes in these sites are consistent with known age-related shifts in DNA methylation levels, and are enriched near genes that are differentially expressed with age in the same population. Taken together, our results indicate that MACAU is an efficient, effective tool for analyzing bisulfite sequencing data, with particular salience to analyses of structured populations. MACAU is freely available at www.xzlab.org/software.html.
A long-standing goal of evolutionary biology is to decode how gene regulation contributes to organismal diversity. Doing so is challenging because it is hard to predict function from non-coding ...sequence and to perform molecular research with non-model taxa. Massively parallel reporter assays (MPRAs) enable the testing of thousands to millions of sequences for regulatory activity simultaneously. Here, we discuss the execution, advantages, and limitations of MPRAs, with a focus on evolutionary questions. We propose solutions for extending MPRAs to rare taxa and those with limited genomic resources, and we underscore MPRA's broad potential for driving genome-scale, functional studies across organisms.
Gene expression variance has been linked to organismal function and fitness but remains a commonly neglected aspect of molecular research. As a result, we lack a comprehensive understanding of the ...patterns of transcriptional variance across genes, and how this variance is linked to context-specific gene regulation and gene function. Here, we use 57 large publicly available RNA-seq data sets to investigate the landscape of gene expression variance. These studies cover a wide range of tissues and allowed us to assess if there are consistently more or less variable genes across tissues and data sets and what mechanisms drive these patterns. We show that gene expression variance is broadly similar across tissues and studies, indicating that the pattern of transcriptional variance is consistent. We use this similarity to create both global and within-tissue rankings of variation, which we use to show that function, sequence variation, and gene regulatory signatures contribute to gene expression variance. Low-variance genes are associated with fundamental cell processes and have lower levels of genetic polymorphisms, have higher gene-gene connectivity, and tend to be associated with chromatin states associated with transcription. In contrast, high-variance genes are enriched for genes involved in immune response, environmentally responsive genes, immediate early genes, and are associated with higher levels of polymorphisms. These results show that the pattern of transcriptional variance is not noise. Instead, it is a consistent gene trait that seems to be functionally constrained in human populations. Furthermore, this commonly neglected aspect of molecular phenotypic variation harbors important information to understand complex traits and disease.
Variation in resource availability commonly exerts strong effects on fitness‐related traits in wild animals. However, we know little about the molecular mechanisms that mediate these effects, or ...about their persistence over time. To address these questions, we profiled genome‐wide whole‐blood DNA methylation levels in two sets of wild baboons: (i) ‘wild‐feeding’ baboons that foraged naturally in a savanna environment and (ii) ‘Lodge’ baboons that had ready access to spatially concentrated human food scraps, resulting in high feeding efficiency and low daily travel distances. We identified 1014 sites (0.20% of sites tested) that were differentially methylated between wild‐feeding and Lodge baboons, providing the first evidence that resource availability shapes the epigenome in a wild mammal. Differentially methylated sites tended to occur in contiguous stretches (i.e., in differentially methylated regions or DMRs), in promoters and enhancers, and near metabolism‐related genes, supporting their functional importance in gene regulation. In agreement, reporter assay experiments confirmed that methylation at the largest identified DMR, located in the promoter of a key glycolysis‐related gene, was sufficient to causally drive changes in gene expression. Intriguingly, all dispersing males carried a consistent epigenetic signature of their membership in a wild‐feeding group, regardless of whether males dispersed into or out of this group as adults. Together, our findings support a role for DNA methylation in mediating ecological effects on phenotypic traits in the wild and emphasize the dynamic environmental sensitivity of DNA methylation levels across the life course.
There is increasing appreciation that, in addition to being shaped by an individual's genotype and environment, most complex traits are also determined by poorly understood interactions between these ...two factors. So-called "genotype × environment" (G×E) interactions remain difficult to map at the organismal level but can be uncovered using molecular phenotypes. To do so at large scale, we used TM3'seq to profile transcriptomes across 12 cellular environments in 544 immortalized B cell lines from the 1000 Genomes Project. We mapped the genetic basis of gene expression levels across environments and revealed a context-dependent genetic architecture: The average heritability of gene expression levels increased in treatment relative to control conditions, and on average, each treatment revealed new expression quantitative trait loci (eQTLs) at 11% of genes. Across our experiments, 22% of all identified eQTLs were context-dependent, and this group was enriched for trait- and disease-associated loci. Further, evolutionary analyses suggested that positive selection has shaped G×E loci involved in responding to immune challenges and hormones but not to man-made chemicals. We hypothesize that this reflects a reduced opportunity for selection to act on responses to molecules recently introduced into human environments. Together, our work highlights the importance of considering an exposure's evolutionary history when studying and interpreting G×E interactions, and provides new insight into the evolutionary mechanisms that maintain G×E loci in human populations.
To elucidate the role of Tau isoforms and post-translational modification (PTM) stoichiometry in Alzheimer’s disease (AD), we generated a high-resolution quantitative proteomics map of 95 PTMs on ...multiple isoforms of Tau isolated from postmortem human tissue from 49 AD and 42 control subjects. Although Tau PTM maps reveal heterogeneity across subjects, a subset of PTMs display high occupancy and frequency for AD, suggesting importance in disease. Unsupervised analyses indicate that PTMs occur in an ordered manner, leading to Tau aggregation. The processive addition and minimal set of PTMs associated with seeding activity was further defined by analysis of size-fractionated Tau. To summarize, features in the Tau protein critical for disease intervention at different stages of disease are identified, including enrichment of 0N and 4R isoforms, underrepresentation of the C terminus, an increase in negative charge in the proline-rich region (PRR), and a decrease in positive charge in the microtubule binding domain (MBD).
Display omitted
•95 post-translational modifications (PTMs) were identified on Tau in human subjects•Frequency and PTM stoichiometry of pathological Tau identify patient heterogeneity•Modifications occur in a processive fashion and are reflective of disease progression•The study identifies critical therapeutic targets at each stage of disease
A high-resolution quantitative proteomics map of post-translational modifications on multiple isoforms of Tau shows heterogeneity across Alzheimer’s disease patients, reflective of disease progression, and identifies critical targets at each stage of disease.
Changes in DNA methylation are involved in development, disease, and the response to environmental conditions. However, not all regulatory elements are functionally methylation-dependent (MD). Here, ...we report a method, mSTARR-seq, that assesses the causal effects of DNA methylation on regulatory activity at hundreds of thousands of fragments (millions of CpG sites) simultaneously. Using mSTARR-seq, we identify thousands of MD regulatory elements in the human genome. MD activity is partially predictable using sequence and chromatin state information, and distinct transcription factors are associated with higher activity in unmethylated versus methylated DNA. Further, pioneer TFs linked to higher activity in the methylated state appear to drive demethylation of experimentally methylated sites. MD regulatory elements also predict methylation-gene expression relationships across individuals, where they are 1.6x enriched among sites with strong negative correlations. mSTARR-seq thus provides a map of MD regulatory activity in the human genome and facilitates interpretation of differential methylation studies.
Aging, for virtually all life, is inescapable. However, within populations, biological aging rates vary. Understanding sources of variation in this process is central to understanding the ...biodemography of natural populations. We constructed a DNA methylation-based age predictor for an intensively studied wild baboon population in Kenya. Consistent with findings in humans, the resulting 'epigenetic clock' closely tracks chronological age, but individuals are predicted to be somewhat older or younger than their known ages. Surprisingly, these deviations are not explained by the strongest predictors of lifespan in this population, early adversity and social integration. Instead, they are best predicted by male dominance rank: high-ranking males are predicted to be older than their true ages, and epigenetic age tracks changes in rank over time. Our results argue that achieving high rank for male baboons - the best predictor of reproductive success - imposes costs consistent with a 'live fast, die young' life-history strategy.
Evolutionary theory suggests that lifespan-reducing alleles should be purged from the gene pool, and yet decades of genome-wide association and model organism studies have shown that they persist. ...One potential explanation is that alleles that regulate lifespan do so only in certain environmental contexts. We exposed outbred Drosophila to control and high-sugar diets and genotyped more than 10,000 adult flies to track allele frequency changes over the course of a single adult lifespan. We identified thousands of lifespan-associated alleles associated with early versus late-life trade-offs, late-onset effects and genotype-by-environment interactions. Remarkably, a third of lifespan-associated genetic variation had environmentally dependent effects on lifespan. We find that lifespan-reducing alleles are often recently derived, have stronger effects on a high-sugar diet and show signatures of selection in wild Drosophila populations, consistent with the evolutionary mismatch hypothesis. Our results provide insight into the highly polygenic and context-dependent genetic architecture of lifespan variation and the evolutionary processes that shape this key trait.
Developmental Constraints in a Wild Primate Lea, Amanda J.; Altmann, Jeanne; Alberts, Susan C. ...
The American naturalist,
06/2015, Letnik:
185, Številka:
6
Journal Article
Recenzirano
Odprti dostop
Early-life experiences can dramatically affect adult traits. However, the evolutionary origins of such early-life effects are debated. The predictive adaptive response hypothesis argues that adverse ...early environments prompt adaptive phenotypic adjustments that prepare animals for similar challenges in adulthood. In contrast, the developmental constraints hypothesis argues that early adversity is generally costly. To differentiate between these hypotheses, we studied two sets of wild female baboons: those born during low-rainfall, low-quality years and those born during normal-rainfall, high-quality years. For each female, we measured fertility-related fitness components during years in adulthood that matched and mismatched her early conditions. We found support for the developmental constraints hypothesis: females born in low-quality environments showed greater decreases in fertility during drought years than females born in high-quality environments, even though drought years matched the early conditions of females born in low-quality environments. Additionally, we found that females born in low-quality years to high-status mothers did not experience reduced fertility during drought years. These results indicate that early ecological adversity did not prepare individuals to cope with ecological challenges in later life. Instead, individuals that experienced at least one high-quality early environment—either ecological or social—were more resilient to ecological stress in later life. Together, these data suggest that early adversity carries lifelong costs, which is consistent with the developmental constraints hypothesis.