We are in a phase of unprecedented progress in identifying genetic loci that cause variation in traits ranging from growth and fitness in simple organisms to disease in humans. However, a mechanistic understanding of how these loci influence traits is lacking for the majority of loci. Studies of the genetics of gene expression have emerged as a key tool for linking DNA sequence variation to phenotypes. Here, we review recent insights into the molecular nature of regulatory variants and describe their influence on the transcriptome and the proteome. We discuss conceptual advances from studies in model organisms and present examples of complete chains of causality that link individual polymorphisms to changes in gene expression, which in turn result in physiological changes and, ultimately, disease risk.
Protein degradation is an essential biological process that regulates protein abundance and removes misfolded and damaged proteins from cells. In eukaryotes, most protein degradation occurs through the stepwise actions of two functionally distinct entities, the ubiquitin system and the proteasome. Ubiquitin system enzymes attach ubiquitin to cellular proteins, targeting them for degradation. The proteasome then selectively binds and degrades ubiquitinated substrate proteins. Genetic variation in ubiquitin system genes creates heritable differences in the degradation of their substrates. However, the challenges of measuring the degradative activity of the proteasome independently of the ubiquitin system in large samples have limited our understanding of genetic influences on the proteasome. Here, using the yeast Saccharomyces cerevisiae, we built and characterized reporters that provide high-throughput, ubiquitin system-independent measurements of proteasome activity. Using single-cell measurements of proteasome activity from millions of genetically diverse yeast cells, we mapped 15 loci across the genome that influence proteasomal protein degradation. Twelve of these 15 loci exerted specific effects on the degradation of two distinct proteasome substrates, revealing a high degree of substrate-specificity in the genetics of proteasome activity. Using CRISPR-Cas9-based allelic engineering, we resolved a locus to a causal variant in the promoter of RPT6, a gene that encodes a subunit of the proteasome's 19S regulatory particle. The variant increases RPT6 expression, which we show results in increased proteasome activity. Our results reveal the complex genetic architecture of proteasome activity and suggest that genetic influences on the proteasome may be an important source of variation in the many cellular and organismal traits shaped by protein degradation.
Changes in gene expression are thought to underlie many of the phenotypic differences between species. However, large-scale analyses of gene expression evolution were until recently prevented by technological limitations. Here we report the sequencing of polyadenylated RNA from six organs across ten species that represent all major mammalian lineages (placentals, marsupials and monotremes) and birds (the evolutionary outgroup), with the goal of understanding the dynamics of mammalian transcriptome evolution. We show that the rate of gene expression evolution varies among organs, lineages and chromosomes, owing to differences in selective pressures: transcriptome change was slow in nervous tissues and rapid in testes, slower in rodents than in apes and monotremes, and rapid for the X chromosome right after its formation. Although gene expression evolution in mammals was strongly shaped by purifying selection, we identify numerous potentially selectively driven expression switches, which occurred at different rates across lineages and tissues and which probably contributed to the specific organ biology of various mammals.
DNA variants that alter gene expression contribute to variation in many phenotypic traits. In particular, trans-acting variants, which are often located on different chromosomes from the genes they affect, are an important source of heritable gene expression variation. However, our knowledge about the identity and mechanism of causal trans-acting variants remains limited. Here, we developed a fine-mapping strategy called CRISPR-Swap and dissected three expression quantitative trait locus (eQTL) hotspots known to alter the expression of numerous genes in trans in the yeast Saccharomyces cerevisiae. Causal variants were identified by engineering recombinant alleles and quantifying the effects of these alleles on the expression of a green fluorescent protein-tagged gene affected by the given locus in trans. We validated the effect of each variant on the expression of multiple genes by RNA-sequencing. The three variants differed in their molecular mechanism, the type of genes they reside in, and their distribution in natural populations. While a missense leucine-to-serine variant at position 63 in the transcription factor Oaf1 (L63S) was almost exclusively present in the reference laboratory strain, the two other variants were frequent among S. cerevisiae isolates. A causal missense variant in the glucose receptor Rgt2 (V539I) occurred at a poorly conserved amino acid residue and its effect was strongly dependent on the concentration of glucose in the culture medium. A noncoding variant in the conserved fatty acid regulated (FAR) element of the OLE1 promoter influenced the expression of the fatty acid desaturase Ole1 in cis and, by modulating the level of this essential enzyme, other genes in trans. The OAF1 and OLE1 variants showed a non-additive genetic interaction, and affected cellular lipid metabolism.
These results demonstrate that the molecular basis of trans-regulatory variation is diverse, highlighting the challenges in predicting which natural genetic variants affect gene expression.
Genetic mapping studies of quantitative traits typically focus on detecting loci that contribute additively to trait variation. Genetic interactions are often proposed as a contributing factor to trait variation, but the relative contribution of interactions to trait variation is a subject of debate. Here we use a very large cross between two yeast strains to accurately estimate the fraction of phenotypic variance due to pairwise QTL-QTL interactions for 20 quantitative traits. We find that this fraction is 9% on average, substantially less than the contribution of additive QTL (43%). Statistically significant QTL-QTL pairs typically have small individual effect sizes, but collectively explain 40% of the pairwise interaction variance. We show that pairwise interaction variance is largely explained by pairs of loci at least one of which has a significant additive effect. These results refine our understanding of the genetic architecture of quantitative traits and help guide future mapping studies.
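As a rough illustration of how additive and pairwise interaction variance can be separated, the sketch below decomposes the four genotype-class means of two biallelic loci in a haploid cross. It assumes unlinked loci, equal genotype frequencies, and a -1/+1 allele coding; the function name and the example means are hypothetical and are not taken from the study, which estimated these fractions genome-wide rather than for a single locus pair.

```python
def two_locus_variance_fractions(m00, m01, m10, m11):
    """Split genetic variance at two loci into additive and interaction parts.

    mXY is the mean trait value of segregants carrying allele X at locus 1
    and allele Y at locus 2 (0 is coded as -1, 1 as +1). Assumes the four
    genotype classes are equally frequent (hypothetical, idealized setting).
    Returns (additive_fraction, interaction_fraction).
    """
    # Effects in the model y = mu + a1*x1 + a2*x2 + i*x1*x2
    a1 = (m10 + m11 - m00 - m01) / 4.0
    a2 = (m01 + m11 - m00 - m10) / 4.0
    interaction = (m11 - m10 - m01 + m00) / 4.0

    # With balanced, independent +/-1 codings each term has variance 1,
    # so each squared effect is that term's variance contribution.
    total = a1 ** 2 + a2 ** 2 + interaction ** 2
    if total == 0:
        return (0.0, 0.0)
    return ((a1 ** 2 + a2 ** 2) / total, interaction ** 2 / total)


if __name__ == "__main__":
    # Purely additive example: the interaction fraction is zero.
    print(two_locus_variance_fractions(0.0, 1.0, 1.0, 2.0))
    # Example where only the double-variant class differs.
    print(two_locus_variance_fractions(0.0, 0.0, 0.0, 4.0))
```

In this simplified setting, a locus pair in which only one genotype class differs still assigns most of its variance to the additive terms.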
The analysis of introgression of genomic regions between divergent populations provides an excellent opportunity to determine the genetic basis of reproductive isolation during the early stages of speciation. However, hybridization and subsequent gene flow must be relatively common in order to localize individual loci that resist introgression. In this study, we used next-generation sequencing to study genome-wide patterns of genetic differentiation between two hybridizing subspecies of rabbits (Oryctolagus cuniculus algirus and O. c. cuniculus) that are known to undergo high rates of gene exchange. Our primary objective was to identify specific genes or genomic regions that have resisted introgression and are likely to confer reproductive barriers in natural conditions. On the basis of 326,000 polymorphisms, we found low to moderate overall levels of differentiation between subspecies, and fewer than 200 genomic regions dispersed throughout the genome showing high differentiation consistent with a signature of reduced gene flow. Most differentiated regions were smaller than 200 Kb and contained very few genes. Remarkably, 30 regions were each found to contain a single gene, facilitating the identification of candidate genes underlying reproductive isolation. This gene-level resolution yielded several insights into the genetic basis and architecture of reproductive isolation in rabbits. Regions of high differentiation were enriched on the X-chromosome and near centromeres. Genes lying within differentiated regions were often associated with transcription and epigenetic activities, including chromatin organization, regulation of transcription, and DNA binding. Overall, our results from a naturally hybridizing system share important commonalities with hybrid incompatibility genes identified using laboratory crosses in mice and flies, highlighting general mechanisms underlying the maintenance of reproductive barriers.
Domestication has led to similar changes in morphology and behavior in several animal species, raising the question of whether similarities between different domestication events also exist at the molecular level. We used mRNA sequencing to analyze genome-wide gene expression patterns in brain frontal cortex in three pairs of domesticated and wild species (dogs and wolves, pigs and wild boars, and domesticated and wild rabbits). We compared the expression differences with those between domesticated guinea pigs and a distant wild relative (Cavia aperea) as well as between two lines of rats selected for tameness or aggression towards humans. There were few gene expression differences between domesticated and wild dogs, pigs, and rabbits (30-75 genes, less than 1% of expressed genes, were differentially expressed), while guinea pigs and C. aperea differed more strongly. Almost no overlap was found between the genes with differential expression in the different domestication events. In addition, joint analyses of all domesticated and wild samples provided only suggestive evidence for the existence of a small group of genes that changed their expression in a similar fashion in different domesticated species. The most extreme of these shared expression changes include up-regulation in domesticates of SOX6 and PROM1, two modulators of brain development. There was almost no overlap between gene expression in domesticated animals and the tame and aggressive rats. However, two of the genes with the strongest expression differences between the rats (DLL3 and DHDH) were located in a genomic region associated with tameness and aggression, suggesting a role in influencing tameness. In summary, the majority of brain gene expression changes in domesticated animals are specific to the given domestication event, suggesting that the causative variants of behavioral domestication traits may likewise be different.
Variation among individuals arises in part from differences in DNA sequences, but the genetic basis for variation in most traits, including common diseases, remains only partly understood. Many DNA variants influence phenotypes by altering the expression level of one or several genes. The effects of such variants can be detected as expression quantitative trait loci (eQTL). Traditional eQTL mapping requires large-scale genotype and gene expression data for each individual in the study sample, which limits sample sizes to hundreds of individuals in both humans and model organisms and reduces statistical power. Consequently, many eQTL are probably missed, especially those with smaller effects. Furthermore, most studies use messenger RNA rather than protein abundance as the measure of gene expression. Studies that have used mass-spectrometry proteomics reported unexpected differences between eQTL and protein QTL (pQTL) for the same genes, but these studies have been even more limited in scope. Here we introduce a powerful method for identifying genetic loci that influence protein expression in the yeast Saccharomyces cerevisiae. We measure single-cell protein abundance through the use of green fluorescent protein tags in very large populations of genetically variable cells, and use pooled sequencing to compare allele frequencies across the genome in thousands of individuals with high versus low protein abundance. We applied this method to 160 genes and detected many more loci per gene than previous studies. We also observed closer correspondence between loci that influence protein abundance and loci that influence mRNA abundance of a given gene. Most loci that we detected were clustered in 'hotspots' that influence multiple proteins, and some hotspots were found to influence more than half of the proteins that we examined.
The variants that underlie these hotspots have profound effects on the gene regulatory network and provide insights into genetic variation in cell physiology between yeast strains.
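The pooled-sequencing comparison described in this abstract (allele frequencies in cells with high versus low protein abundance) can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the two-proportion z-score, the function name, and the read counts are all assumptions made for the example.

```python
import math


def allele_frequency_contrast(high_pool, low_pool):
    """Compare allele frequencies between two sequenced pools.

    Each pool is a list of (ref_reads, alt_reads) tuples, one per marker,
    in the same marker order. Returns a list of (delta, z) per marker,
    where delta is the reference-allele frequency difference (high - low)
    and z is a simple two-proportion z-score (an assumed test statistic).
    """
    results = []
    for (h_ref, h_alt), (l_ref, l_alt) in zip(high_pool, low_pool):
        n_high = h_ref + h_alt
        n_low = l_ref + l_alt
        p_high = h_ref / n_high
        p_low = l_ref / n_low
        # Pooled frequency under the null of no difference between pools
        p_pool = (h_ref + l_ref) / (n_high + n_low)
        se = math.sqrt(p_pool * (1 - p_pool) * (1 / n_high + 1 / n_low))
        z = (p_high - p_low) / se if se > 0 else 0.0
        results.append((p_high - p_low, z))
    return results


if __name__ == "__main__":
    # Hypothetical read counts at two markers
    high = [(50, 50), (80, 20)]
    low = [(50, 50), (20, 80)]
    for delta, z in allele_frequency_contrast(high, low):
        print(f"delta={delta:+.2f}  z={z:.2f}")
```

A marker linked to a locus that influences protein abundance would show a large frequency difference between the pools, while unlinked markers would stay near zero. Real analyses typically also smooth across neighbouring markers, which this sketch omits.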
Although the benefits of antithrombotic drugs in reducing thrombotic events are indisputable, these drugs carry a high risk of compromising patient safety. No previous studies have investigated the implementation and (cost-) effectiveness of a hospital-based multidisciplinary antithrombotic team on bleeding and thrombotic outcomes. The primary aim of this study was to compare the proportion of patients with a composite end point consisting of one or more bleeding episodes or one or more thrombotic events from hospitalization until three months after hospitalization.
A prospective, multicenter before-after intervention study was conducted in two Dutch hospitals. Adult patients hospitalized between October 2015 and December 2017 treated with anticoagulant therapy were included. The primary aim was to estimate the proportion of patients with a composite end point consisting of one or more bleeding episodes or one or more thrombotic events from hospitalization until three months after hospitalization. The intervention was the implementation of a multidisciplinary antithrombotic team focusing on education, medication reviews by pharmacists, implementation of local anticoagulant therapy guidelines based on national guidelines, patient counselling and medication reconciliation at admission and discharge. The primary endpoint was analysed using segmented linear regression. We obtained data for 1,886 patients: 941 patients were included in the usual care period and 945 patients in the intervention period. The S-team study showed that implementation of a multidisciplinary antithrombotic team over time significantly reduced the composite end point consisting of one or more bleeding episodes or one or more thrombotic events from hospitalization until three months after hospitalization in patients using anticoagulant drugs (-1.83% (-2.58% to -1.08%) per 2-month period).
This study shows that implementation of a multidisciplinary antithrombotic team over time significantly reduces the composite end point consisting of one or more bleeding episodes or one or more thrombotic events from hospitalization until three months after hospitalization in patients using anticoagulant drugs.
Trialregister.nl NTR4887.
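The segmented linear regression named in the methods above can be sketched as an ordinary least-squares fit with a level change and a slope change at the start of the intervention. The parameterization below (baseline level, baseline trend, level change, trend change) is a common textbook form and is an assumption here, as are the function names and the synthetic numbers; the study's exact model specification is not given in the abstract.

```python
def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_segmented_regression(times, outcomes, change_point):
    """Fit y = b0 + b1*t + b2*post + b3*post*(t - change_point) by OLS.

    post is 1 for periods at or after change_point and 0 before it.
    Returns [b0, b1, b2, b3]: baseline level, baseline trend,
    level change, and trend change after the intervention.
    """
    rows = []
    for t in times:
        post = 1.0 if t >= change_point else 0.0
        rows.append([1.0, float(t), post, post * (t - change_point)])
    k = 4
    # Normal equations: (X^T X) beta = X^T y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, outcomes)) for i in range(k)]
    return solve_linear_system(xtx, xty)


if __name__ == "__main__":
    # Synthetic, noise-free example: 12 periods, intervention from period 6
    true_beta = [20.0, 0.5, -2.0, -1.0]
    times = list(range(12))
    outcomes = [
        true_beta[0] + true_beta[1] * t
        + (true_beta[2] + true_beta[3] * (t - 6) if t >= 6 else 0.0)
        for t in times
    ]
    print(fit_segmented_regression(times, outcomes, 6))
```

In this parameterization the trend-change coefficient is the quantity analogous to a reported change "per period" after the intervention. Confidence intervals would require residual-based standard errors, which the sketch leaves out.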
Whether newer risk markers for coronary heart disease (CHD) improve CHD risk prediction remains unclear.
To assess whether newer risk markers for CHD risk prediction and stratification improve Framingham risk score (FRS) predictions.
Prospective population-based study.
The Rotterdam Study, Rotterdam, the Netherlands.
5933 asymptomatic, community-dwelling participants (mean age, 69.1 years [SD, 8.5]).
Traditional CHD risk factors used in the FRS (age, sex, systolic blood pressure, treatment of hypertension, total and high-density lipoprotein cholesterol levels, smoking, and diabetes) and newer CHD risk factors (N-terminal fragment of prohormone B-type natriuretic peptide levels, von Willebrand factor antigen levels, fibrinogen levels, chronic kidney disease, leukocyte count, C-reactive protein levels, homocysteine levels, uric acid levels, coronary artery calcium [CAC] scores, carotid intima-media thickness, peripheral arterial disease, and pulse wave velocity).
Adding CAC scores to the FRS improved the accuracy of risk predictions (c-statistic increase, 0.05 [95% CI, 0.02 to 0.06]; net reclassification index, 19.3% overall [39.3% in those at intermediate risk, by FRS]). Levels of N-terminal fragment of prohormone B-type natriuretic peptide also improved risk predictions but to a lesser extent (c-statistic increase, 0.02 [CI, 0.01 to 0.04]; net reclassification index, 7.6% overall [33.0% in those at intermediate risk, by FRS]). Improvements in predictions with other newer markers were marginal.
The findings may not be generalizable to younger or nonwhite populations.
Among 12 CHD risk markers, improvements in FRS predictions were most statistically and clinically significant with the addition of CAC scores. Further investigation is needed to assess whether risk refinements using CAC scores lead to a meaningful change in clinical outcome. Whether to use CAC score screening as a more routine test for risk prediction requires full consideration of the financial and clinical costs of performing versus not performing the test for both persons and health systems.
Netherlands Organization for Health Research and Development (ZonMw).
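The two performance measures reported in this abstract, the c-statistic and the net reclassification index, can be illustrated with a small sketch. It assumes binary outcomes without censoring and ordered integer risk categories; the function names and the toy data are hypothetical, and a time-to-event analysis such as the one a cohort study would need is not covered.

```python
def c_statistic(scores, events):
    """Fraction of (event, non-event) pairs ranked correctly by score.

    Ties count as half. Assumes binary outcomes with no censoring.
    """
    with_event = [s for s, e in zip(scores, events) if e]
    without_event = [s for s, e in zip(scores, events) if not e]
    concordant = 0.0
    for p in with_event:
        for n in without_event:
            if p > n:
                concordant += 1.0
            elif p == n:
                concordant += 0.5
    return concordant / (len(with_event) * len(without_event))


def net_reclassification_index(old_categories, new_categories, events):
    """Categorical NRI for a new model versus an old one.

    Categories are ordered integers (higher means higher predicted risk).
    NRI = (P(up | event) - P(down | event))
        + (P(down | non-event) - P(up | non-event)).
    """
    up_event = down_event = up_nonevent = down_nonevent = 0
    n_event = n_nonevent = 0
    for old, new, event in zip(old_categories, new_categories, events):
        if event:
            n_event += 1
            up_event += new > old
            down_event += new < old
        else:
            n_nonevent += 1
            up_nonevent += new > old
            down_nonevent += new < old
    return ((up_event - down_event) / n_event
            + (down_nonevent - up_nonevent) / n_nonevent)


if __name__ == "__main__":
    # Toy data: four participants, two of whom had an event
    events = [0, 0, 1, 1]
    print(c_statistic([0.1, 0.4, 0.35, 0.8], events))
    print(net_reclassification_index([0, 1, 1, 1], [0, 0, 2, 1], events))
```

A positive index means the added marker moved participants with events into higher risk categories and participants without events into lower ones more often than the reverse.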