This work merges a large set of previously reported thermochemical data for myoglobin (Mb) mutants with a physiological model of O
2-transport and -storage. The model allows a quantification of the ...functional proficiency of myoglobin (Mb) mutants under various physiological conditions,
i.e. O
2-consumption rate resembling workload, O
2 partial pressure resembling hypoxic stress, muscle cell size, and Mb concentration, resembling different organism-specific and compensatory variables. We find that O
2-storage and -transport are distinct functions that rank mutants and wild type differently depending on O
2 partial pressure. Specifically, the wild type is near-optimal for storage at all conditions, but for transport only at severely hypoxic conditions. At normoxic conditions, low-affinity mutants are in fact better O
2-transporters because they still have empty sites for O
2, giving rise to a larger MbO
2 gradient (more varying saturation curve). The distributions of functionality reveal that many mutants are near-neutral with respect to function, whereas only a few are strongly affected, and the variation in functionality increases dramatically at lower O
2 pressure. These results together show that conserved residues in wild type (WT) Mb were fixated under a selection pressure of low P
O2.
Virtually all enzymes catalyse more than one reaction, a phenomenon known as enzyme promiscuity. It is unclear whether promiscuous enzymes are more often generalists that catalyse multiple reactions ...at similar rates or specialists that catalyse one reaction much more efficiently than other reactions. In addition, the factors that shape whether an enzyme evolves to be a generalist or a specialist are poorly understood. To address these questions, we follow a three-pronged approach. First, we examine the distribution of promiscuity in empirical enzymes reported in the BRENDA database. We find that the promiscuity distribution of empirical enzymes is bimodal. In other words, a large fraction of promiscuous enzymes are either generalists or specialists, with few intermediates. Second, we demonstrate that enzyme biophysics is not sufficient to explain this bimodal distribution. Third, we devise a constraint-based model of promiscuous enzymes undergoing duplication and facing selection pressures favouring subfunctionalization. The model posits the existence of constraints between the catalytic efficiencies of an enzyme for different reactions and is inspired by empirical case studies. The promiscuity distribution predicted by our constraint-based model is consistent with the empirical bimodal distribution. Our results suggest that subfunctionalization is possible and beneficial only in certain enzymes. Furthermore, the model predicts that conflicting constraints and selection pressures can cause promiscuous enzymes to enter a ‘frustrated’ state, in which competing interactions limit the specialisation of enzymes. We find that frustration can be both a driver and an inhibitor of enzyme evolution by duplication and subfunctionalization. In addition, our model predicts that frustration becomes more likely as enzymes catalyse more reactions, implying that natural selection may prefer catalytically simple enzymes. In sum, our results suggest that frustration may play an important role in enzyme evolution.
One key feature of proteins that form liquid droplets by phase separation inside a cell is multivalency-the presence of multiple sites that mediate interactions with other proteins. We know little ...about the variation of multivalency on evolutionary time scales. Here, we investigated the long-term evolution (∼600 million years) of multivalency in fungal mRNA decapping subunit 2 protein (Dcp2), and in the FET (FUS, EWS and TAF15) protein family. We found that multivalency varies substantially among the orthologs of these proteins. However, evolution has maintained the length scale at which sequence motifs that enable protein-protein interactions occur. That is, the total number of such motifs per hundred amino acids is higher and less variable than expected by neutral evolution. To help explain this evolutionary conservation, we developed a conformation classifier using machine-learning algorithms. This classifier demonstrates that disordered segments in Dcp2 and FET proteins tend to adopt compact conformations, which is necessary for phase separation. Thus, the evolutionary conservation we detected may help proteins preserve the ability to undergo phase separation. Altogether, our study reveals that the length scale of multivalent interactions is an evolutionarily conserved feature of two classes of phase-separating proteins in fungi and vertebrates.
Interferon (IFN) β and Tumor Necrosis Factor (TNF) are key players in immunity against viruses. Compelling evidence has shown that the antiviral and inflammatory transcriptional response induced by ...IFNβ is reprogrammed by crosstalk with TNF. IFNβ mainly induces interferon-stimulated genes by the Janus kinase (JAK)/signal transducer and activator of transcription (STAT) pathway involving the canonical ISGF3 transcriptional complex, composed of STAT1, STAT2, and IRF9. The signaling pathways engaged downstream of the combination of IFNβ and TNF remain elusive, but previous observations suggested the existence of a response independent of STAT1. Here, using genome-wide transcriptional analysis by RNASeq, we observed a broad antiviral and immunoregulatory response initiated in the absence of STAT1 upon IFNβ and TNF costimulation. Additional stratification of this transcriptional response revealed that STAT2 and IRF9 mediate the expression of a wide spectrum of genes. While a subset of genes was regulated by the concerted action of STAT2 and IRF9, other gene sets were independently regulated by STAT2 or IRF9. Collectively, our data supports a model in which STAT2 and IRF9 act through non-canonical parallel pathways to regulate distinct pool of antiviral and immunoregulatory genes in conditions with elevated levels of both IFNβ and TNF.
Assemblies of multivalent RNA-binding protein fused in sarcoma (FUS) can exist in the functional liquid-like state as well as less dynamic and potentially toxic amyloid- and hydrogel-like states. How ...could then cells form liquid-like condensates while avoiding their transformation to amyloids? Here, we show how posttranslational phosphorylation can provide a "handle" that prevents liquid-solid transition of intracellular condensates containing FUS. Using residue-specific coarse-grained simulations, for 85 different mammalian FUS sequences, we show how the number of phosphorylation sites and their spatial arrangement affect intracluster dynamics preventing conversion to amyloids. All atom simulations further confirm that phosphorylation can effectively reduce the β-sheet propensity in amyloid-prone fragments of FUS. A detailed evolutionary analysis shows that mammalian FUS PLDs are enriched in amyloid-prone stretches compared to control neutrally evolved sequences, suggesting that mammalian FUS proteins evolved to self-assemble. However, in stark contrast to proteins that do not phase-separate for their function, mammalian sequences have phosphosites in close proximity to these amyloid-prone regions. These results suggest that evolution uses amyloid-prone sequences in prion-like domains to enhance phase separation of condensate proteins while enriching phosphorylation sites in close proximity to safeguard against liquid-solid transitions.
Abstract Understanding the expression level and evolutionary rate of associated genes with human polygenic diseases provides crucial insights into their disease-contributing roles. In this work, we ...leveraged genome-wide association studies to investigate the relationship between the genetic association and both the evolutionary rate (dN/dS) and expression level of human genes associated with the two polygenic diseases of schizophrenia and coronary artery disease. Our findings highlight a distinct variation in these relationships between the two diseases. Genes associated with both diseases exhibit a significantly greater variance in evolutionary rate compared to those implicated in monogenic diseases. Expanding our analyses to 4,756 complex traits in the GWAS atlas database, we unraveled distinct trait categories with a unique interplay among the evolutionary rate, expression level, and genetic association of human genes. In most polygenic traits, highly expressed genes were more associated with the polygenic phenotypes compared to lowly expressed genes. About 69% of polygenic traits displayed a negative correlation between genetic association and evolutionary rate, while approximately 30% of these traits showed a positive correlation between genetic association and evolutionary rate. Our results demonstrate the presence of a spectrum among complex traits, shaped by natural selection. Notably, at opposite ends of this spectrum, we find metabolic traits being more likely influenced by purifying selection, and immunological traits that are more likely shaped by positive selection. We further established the polygenic evolution portal (evopolygen.de) as a resource for investigating relationships and generating hypotheses in the field of human polygenic trait evolution.
Positive (adaptive) selection has recently been implied in human superoxide dismutase 1 (SOD1), a highly abundant antioxidant protein with energy signaling and antiaging functions, one of very few ...examples of direct selection on a human protein product (exon); the molecular drivers of this selection are unknown. We mapped 30 extant SOD1 sequences to the recently established mammalian species tree and inferred ancestors, key substitutions, and signatures of selection during the protein’s evolution. We detected elevated substitution rates leading to great apes (Hominidae) at ~1 per 2 million years, significantly higher than in other primates and rodents, although these paradoxically generally evolve much faster. The high evolutionary rate was partly due to relaxation of some selection pressures and partly to distinct positive selection of SOD1 in great apes. We then show that higher stability and net charge and changes at the dimer interface were selectively introduced upon separation from old world monkeys and lesser apes (gibbons). Consequently, human, chimpanzee and gorilla SOD1s have a net charge of −6 at physiological pH, whereas the closely related gibbons and macaques have −3. These features consistently point towards selection against the malicious aggregation effects of elevated SOD1 levels in long-living great apes. The findings mirror the impact of human SOD1 mutations that reduce net charge and/or stability and cause ALS, a motor neuron disease characterized by oxidative stress and SOD1 aggregates and triggered by aging. Our study thus marks an example of direct selection for a particular chemical phenotype (high net charge and stability) in a single human protein with possible implications for the evolution of aging.
The extent of nonadditive interaction among mutations or epistasis reflects the ruggedness of the fitness landscape, the mapping of genotype to reproductive fitness. In protein evolution, there is ...strong support for the importance and prevalence of epistasis but the quantitative and relative contribution of various factors to epistasis are poorly known. Here, we determine the contribution of selection for folding stability to epistasis in protein evolution. By combining theoretical estimates of the rates of molecular evolution and the nonlinear mapping between protein folding thermodynamics and fitness, we show that the simple selection for folding stability imposes at least ~30% to ~40% epistasis in long‐term protein evolution. Estimating the contribution of governing factors in molecular evolution such as protein folding stability to epistasis will provide a better understanding of epistasis that could improve methods in molecular evolution.
Myoglobin (Mb) is a centrally important, widely studied mammalian protein. While much work has investigated multi-step unfolding of apoMb using acid or denaturant, holomyoglobin unfolding is poorly ...understood despite its biological relevance. We present here the first systematic unfolding simulations of holoMb and the first comparative study of unfolding of protein orthologs from different species (sperm whale, pig, horse, and harbor seal). We also provide new interpretations of experimental mean molecular ellipticities of myoglobin intermediates, notably correcting for random coil and number of helices in intermediates. The simulated holoproteins at 310 K displayed structures and dynamics in agreement with crystal structures (R.sub.g ~1.48-1.51 nm, helicity ~75%). At 400 K, heme was not lost, but some helix loss was observed in pig and horse, suggesting that these helices are less stable in terrestrial species. At 500 K, heme was lost within 1.0-3.7 ns. All four proteins displayed exponentially decaying helix structure within 20 ns. The C- and F-helices were lost quickly in all cases. Heme delayed helix loss, and sperm whale myoglobin exhibited highest retention of heme and D/E helices. Persistence of conformation (RMSD), secondary structure, and ellipticity between 2-11 ns was interpreted as intermediates of holoMb unfolding in all four species. The intermediates resemble those of apoMb notably in A and H helices, but differ substantially in the D-, E- and F-helices, which interact with heme. The identified mechanisms cast light on the role of metal/cofactor in poorly understood holoMb unfolding. We also observed beta-sheet formation of several myoglobins at 500 K as seen experimentally, occurring after disruption of helices to a partially unfolded, globally disordered state; heme reduced this tendency and sperm-whale did not display any sheet propensity during the simulations.