Understanding the molecular mechanisms of congenital diseases is challenging due to their occurrence within specific developmental stages. Esophageal malformations are examples of such conditions, ...characterized by abnormalities in the development of esophagus during embryogenesis. These developmental malformations encompass a range of anomalies, including esophageal atresia, and tracheoesophageal fistula. Here, we investigated the preferential expression of 29 genes that are implicated in such malformations and their immediate interactome (a total of 67 genes). We conducted our analyses across several single-cell atlases of embryonic development, encompassing approximately 150,000 cells from the mouse foregut, 180,000 cells from human embryos, and 500,000 cells from 24 human organs. Our study, spanning diverse mesodermal and endodermal cell populations and early developmental stages, shows that the genes associated with esophageal malformations show their highest cell-type specific expression in lateral plate mesoderm cells and at the developmental stage of E8.75-E9.0 days. In human embryos, these genes show a significant cell-type specific expression among subpopulations of epithelial cells, fibroblasts and progenitor cells including basal cells. Notably, members of the forkhead-box family of transcription factors, namely FOXF1, FOXC1, and FOXD1, as well as the SRY-box transcription factor, SOX2, demonstrate the most significant preferential expression in both mouse and human embryos. Overall, our findings provide insights into the temporal and cellular contexts contributing to esophageal malformations.
Many organismal traits are genetically determined and covary in evolving populations. The resulting trait correlations can either help or hinder evolvability - the ability to bring forth new and ...adaptive phenotypes. The evolution of evolvability requires that trait correlations themselves must be able to evolve, but we know little about this ability. To learn more about it, we here study two evolvable systems, a yellow fluorescent protein and the antibiotic resistance protein VIM-2 metallo beta-lactamase. We consider two traits in the fluorescent protein, namely the ability to emit yellow and green light, and three traits in our enzyme, namely the resistance against ampicillin, cefotaxime, and meropenem. We show that correlations between these traits can evolve rapidly through both mutation and selection on short evolutionary time scales. In addition, we show that these correlations are driven by a protein's ability to fold, because single mutations that alter foldability can dramatically change trait correlations. Since foldability is important for most proteins and their traits, mutations affecting protein folding may alter trait correlations mediated by many other proteins. Thus, mutations that affect protein foldability may also help shape the correlations of complex traits that are affected by hundreds of proteins.
Confinement of the X chromosome to a territory for dosage compensation is a prime example of how subnuclear compartmentalization is used to regulate transcription at the megabase scale. In Drosophila ...melanogaster, two sex-specific non-coding RNAs (roX1 and roX2) are transcribed from the X chromosome. They associate with the male-specific lethal (MSL) complex
, which acetylates histone H4 lysine 16 and thereby induces an approximately twofold increase in expression of male X-linked genes
. Current models suggest that X-over-autosome specificity is achieved by the recognition of cis-regulatory DNA high-affinity sites (HAS) by the MSL2 subunit
. However, HAS motifs are also found on autosomes, indicating that additional factors must stabilize the association of the MSL complex with the X chromosome. Here we show that the low-complexity C-terminal domain (CTD) of MSL2 renders its recruitment to the X chromosome sensitive to roX non-coding RNAs. roX non-coding RNAs and the MSL2 CTD form a stably condensed state, and functional analyses in Drosophila and mammalian cells show that their interactions are crucial for dosage compensation in vivo. Replacing the CTD of mammalian MSL2 with that from Drosophila and expressing roX in cis is sufficient to nucleate ectopic dosage compensation in mammalian cells. Thus, the condensing nature of roX-MSL2
is the primary determinant for specific compartmentalization of the X chromosome in Drosophila.
A model of proteome-associated chemical energetic costs of cells is derived from protein-turnover kinetics and protein folding. Minimization of the proteostatic maintenance cost can explain a range ...of trends of proteomes and combines both protein function, stability, size, proteostatic cost, temperature, resource availability, and turnover rates in one simple framework. We then explore the ansatz that the chemical energy remaining after proteostatic maintenance is available for reproduction (or cell division) and thus, proportional to organism fitness. Selection for lower proteostatic costs is then shown to be significant vs. typical effective population sizes of yeast. The model explains and quantifies evolutionary conservation of highly abundant proteins as arising both from functional mutations and from changes in other properties such as stability, cost, or turnover rates. We show that typical hypomorphic mutations can be selected against due to increased cost of compensatory protein expression (both in the mutated gene and in related genes, i.e. epistasis) rather than compromised function itself, although this compensation depends on the protein's importance. Such mutations exhibit larger selective disadvantage in abundant, large, synthetically costly, and/or short-lived proteins. Selection against increased turnover costs of less stable proteins rather than misfolding toxicity per se can explain equilibrium protein stability distributions, in agreement with recent findings in E. coli. The proteostatic selection pressure is stronger at low metabolic rates (i.e. scarce environments) and in hot habitats, explaining proteome adaptations towards rough environments as a question of energy. The model may also explain several trade-offs observed in protein evolution and suggests how protein properties can coevolve to maintain low proteostatic cost.
This paper reports DFT-computed electronic ground states, Mössbauer isomer shifts, O–O and Fe–O vibration frequencies, and thermodynamics of O2 binding of heme models representing different distal ...(position E7) interactions, strictly validated against experimental data. Based on the results, the impact of specific types of distal interactions on oxyheme electronic structure can be systematized. Hydrogen bonding increases back-donation, O–O bond activation, and oxygen binding affinity. The heme side chains reduce isomer shifts by −0.06 mm/s due to electron withdrawal from iron, and distal hydrogen bonds can further reduce isomer shifts up to 0.07 mm/s. The O–O stretch vibration, the O–O distance, and the isomer shift possess substantial heuristic value in interpreting electronic structure, whereas other properties are less effective, based on computed correlation coefficients. Shorter Fe–O bond length does not correlate with O2 affinity, as hydrogen bonding elongates both Fe–O and O–O bonds by ∼0.01–0.02 Å, contrary to the situation absent from distal hydrogen bonds and of potential relevance to ligand activation where distal interactions are involved. An ionic (Weiss-type) model of Fe–O bonding combined with electron withdrawal by hydrogen bonds is shown to robustly explain the structural, spectroscopic, and thermodynamic properties of the hemes. The identified correlations may be useful, e.g., for designing O2-activating catalysts or for diagnosing heme protein variants.
Myoglobin (Mb) is a centrally important, widely studied mammalian protein. While much work has investigated multi-step unfolding of apoMb using acid or denaturant, holomyoglobin unfolding is poorly ...understood despite its biological relevance. We present here the first systematic unfolding simulations of holoMb and the first comparative study of unfolding of protein orthologs from different species (sperm whale, pig, horse, and harbor seal). We also provide new interpretations of experimental mean molecular ellipticities of myoglobin intermediates, notably correcting for random coil and number of helices in intermediates. The simulated holoproteins at 310 K displayed structures and dynamics in agreement with crystal structures (R g ~1.48-1.51 nm, helicity ~75%). At 400 K, heme was not lost, but some helix loss was observed in pig and horse, suggesting that these helices are less stable in terrestrial species. At 500 K, heme was lost within 1.0-3.7 ns. All four proteins displayed exponentially decaying helix structure within 20 ns. The C- and F-helices were lost quickly in all cases. Heme delayed helix loss, and sperm whale myoglobin exhibited highest retention of heme and D/E helices. Persistence of conformation (RMSD), secondary structure, and ellipticity between 2-11 ns was interpreted as intermediates of holoMb unfolding in all four species. The intermediates resemble those of apoMb notably in A and H helices, but differ substantially in the D-, E- and F-helices, which interact with heme. The identified mechanisms cast light on the role of metal/cofactor in poorly understood holoMb unfolding. We also observed β-sheet formation of several myoglobins at 500 K as seen experimentally, occurring after disruption of helices to a partially unfolded, globally disordered state; heme reduced this tendency and sperm-whale did not display any sheet propensity during the simulations.
Abstract
Protein phase separation can help explain the formation of many nonmembranous organelles. However, we know little about its ability to change in evolution. Here we studied the evolution of ...the mammalian RNA-binding protein Fused in Sarcoma (FUS), a protein whose prion-like domain (PLD) contributes to the formation of stress granules through liquid–liquid phase separation. Although the PLD evolves three times as rapidly as the remainder of FUS, it harbors absolutely conserved tyrosine residues that are crucial for phase separation. Ancestral reconstruction shows that the phosphorylation sites within the PLD are subject to stabilizing selection. They toggle among a small number of amino acid states. One exception to this pattern is primates, where the number of such phosphosites has increased through positive selection. In addition, we find frequent glutamine to proline changes that help maintain the unstructured state of FUS that is necessary for phase separation. Our work provides evidence that natural selection has stabilized the liquid forming potential of FUS and minimized the propensity of cytotoxic liquid-to-solid phase transitions during 160 My of mammalian evolution.
Display omitted
•We provide the first direct evidence that stress-induced phase separation is encoded in a protein’s sequence, requires contributions from both structured and disordered regions, and ...can be understood and predicted using the thermodynamics of amphiphilic aggregation.•We found that reversible phase-separation requires contributions from both disordered and structured regions.•The aggregation-prone regions of these proteins are significantly enriched in aliphatic residues and depleted in positively charged amino acids. These proteins resemble supra-molecular amphiphiles where the aggregation-prone regions are the hydrophobic parts, and the remainder of the proteins are the hydrophilic parts.•We discovered a novel association between the length of aggregation-prone regions in these proteins and their enrichment in the aggregated phase, which occurs in heat-stressed and nutrient-starved conditions.•We show that the sequence-encoded features we discovered can distinguish aggregation-prone regions from the remainder of the protein sequence with high accuracy (∼93%).
More than a hundred proteins in yeast reversibly aggregate and phase-separate in response to various stressors, such as nutrient depletion and heat shock. We know little about the protein sequence and structural features behind this ability, which has not been characterized on a proteome-wide level. To identify the distinctive features of aggregation-prone protein regions, we apply machine learning algorithms to genome-scale limited proteolysis-mass spectrometry (LiP-MS) data from yeast proteins. LiP-MS data reveals that 96 proteins show significant structural changes upon heat shock. We find that in these proteins the propensity to phase separate cannot be solely driven by disordered regions, because their aggregation-prone regions (APRs) are not significantly disordered. Instead, the phase separation of these proteins requires contributions from both disordered and structured regions. APRs are significantly enriched in aliphatic residues and depleted in positively charged amino acids. Aggregator proteins with longer APRs show a greater propensity to aggregate, a relationship that can be explained by equilibrium statistical thermodynamics. Altogether, our observations suggest that proteome-wide reversible protein aggregation is mediated by sequence-encoded properties. We propose that aggregating proteins resemble supra-molecular amphiphiles, where APRs are the hydrophobic parts, and non-APRs are the hydrophilic parts.
In heterogametic organisms, expression of unequal number of X chromosomes in males and females is balanced by a process called dosage compensation. In Drosophila and mammals, dosage compensation ...involves nearly two-fold up-regulation of the X chromosome mediated by dosage compensation complex (DCC). Experimental studies on the role of DCC on RNA polymerase II (Pol II) transcription in mammals disclosed a non-linear relationship between Pol II densities at different transcription steps and mRNA expression. An ∼20–30% increase in Pol II densities corresponds to a rough 200% increase in mRNA expression and two-fold up-regulation. Here, using a simple kinetic model of Pol II transcription calibrated by in vivo measured rate constants of different transcription steps in mammalian cells, we demonstrate how this non-linearity can be explained by multi-step transcriptional regulation. Moreover, we show how multi-step enhancement of Pol II transcription can increase mRNA production while leaving Pol II densities unaffected. Our theoretical analysis not only recapitulates experimentally observed Pol II densities upon two-fold up-regulation but also points to erroneous interpretations of Pol II profiles from chromatin immunoprecipitation sequencing (ChIP-seq) or global run-on assays.
Since divergence ∼50 Ma ago from their terrestrial ancestors, cetaceans underwent a series of adaptations such as a ∼10-20 fold increase in myoglobin (Mb) concentration in skeletal muscle, critical ...for increasing oxygen storage capacity and prolonging dive time. Whereas the O2-binding affinity of Mbs is not significantly different among mammals (with typical oxygenation constants of ∼0.8-1.2 µM(-1)), folding stabilities of cetacean Mbs are ∼2-4 kcal/mol higher than for terrestrial Mbs. Using ancestral sequence reconstruction, maximum likelihood and bayesian tests to describe the evolution of cetacean Mbs, and experimentally calibrated computation of stability effects of mutations, we observe accelerated evolution in cetaceans and identify seven positively selected sites in Mb. Overall, these sites contribute to Mb stabilization with a conditional probability of 0.8. We observe a correlation between Mb folding stability and protein abundance, suggesting that a selection pressure for stability acts proportionally to higher expression. We also identify a major divergence event leading to the common ancestor of whales, during which major stabilization occurred. Most of the positively selected sites that occur later act against other destabilizing mutations to maintain stability across the clade, except for the shallow divers, where late stability relaxation occurs, probably due to the shorter aerobic dive limits of these species. The three main positively selected sites 66, 5, and 35 undergo changes that favor hydrophobic folding, structural integrity, and intra-helical hydrogen bonds.