Abstract
The emergence of multicellularity is strongly correlated with the expansion of tyrosine kinases, a conserved family of signaling enzymes that regulates pathways essential for cell-to-cell ...communication. Although tyrosine kinases have been classified from several model organisms, a molecular-level understanding of tyrosine kinase evolution across all holozoans is currently lacking. Using a hierarchical sequence constraint-based classification of diverse holozoan tyrosine kinases, we construct a new phylogenetic tree that identifies two ancient clades of cytoplasmic and receptor tyrosine kinases separated by the presence of an extended insert segment in the kinase domain connecting the D and E-helices. Present in nearly all receptor tyrosine kinases, this fast-evolving insertion imparts diverse functionalities, such as post-translational modification sites and regulatory interactions. Eph and EGFR receptor tyrosine kinases are two exceptions which lack this insert, each forming an independent lineage characterized by unique functional features. We also identify common constraints shared across multiple tyrosine kinase families which warrant the designation of three new subgroups: Src module (SrcM), insulin receptor kinase-like (IRKL), and fibroblast, platelet-derived, vascular, and growth factor receptors (FPVR). Subgroup-specific constraints reflect shared autoinhibitory interactions involved in kinase conformational regulation. Conservation analyses describe how diverse tyrosine kinase signaling functions arose through the addition of family-specific motifs upon subgroup-specific features and coevolving protein domains. We propose the oldest tyrosine kinases, IRKL, SrcM, and Csk, originated from unicellular premetazoans and were coopted for complex multicellular functions. The increased frequency of oncogenic variants in more recent tyrosine kinases suggests that lineage-specific functionalities are selectively altered in human cancers.
We describe the application of T4 DNA ligase-catalyzed DNA templated oligonucleotide polymerization toward the evolution of a diversely functionalized nucleic acid aptamer for human α-thrombin. Using ...a 256-membered ANNNN comonomer library comprising 16 sublibraries modified with different functional groups, a highly functionalized aptamer for thrombin was raised with a dissociation constant of 1.6 nM. The aptamer was found to be selective for thrombin and required the modifications for binding affinity. This study demonstrates the most differentially functionalized nucleic acid aptamer discovered by in vitro selection and should enable the future exploration of functional group dependence during the evolution of nucleic acid polymer activity.
Glycosyltransferases (GTs) play fundamental roles in nearly all cellular processes through the biosynthesis of complex carbohydrates and glycosylation of diverse protein and small molecule ...substrates. The extensive structural and functional diversification of GTs presents a major challenge in mapping the relationships connecting sequence, structure, fold and function using traditional bioinformatics approaches. Here, we present a convolutional neural network with attention (CNN-attention) based deep learning model that leverages simple secondary structure representations generated from primary sequences to provide GT fold prediction with high accuracy. The model learns distinguishing secondary structure features free of primary sequence alignment constraints and is highly interpretable. It delineates sequence and structural features characteristic of individual fold types, while classifying them into distinct clusters that group evolutionarily divergent families based on shared secondary structural features. We further extend our model to classify GT families of unknown folds and variants of known folds. By identifying families that are likely to adopt novel folds such as GT91, GT96 and GT97, our studies expand the GT fold landscape and prioritize targets for future structural studies.
Phosphorylation of the MLKL pseudokinase by the RIPK3 kinase leads to MLKL oligomerization, translocation to, and permeabilization of, the plasma membrane to induce necroptotic cell death. The ...precise choreography of MLKL activation remains incompletely understood. Here, we report Monobodies, synthetic binding proteins, that bind the pseudokinase domain of MLKL within human cells and their crystal structures in complex with the human MLKL pseudokinase domain. While Monobody-32 constitutively binds the MLKL hinge region, Monobody-27 binds MLKL via an epitope that overlaps the RIPK3 binding site and is only exposed after phosphorylated MLKL disengages from RIPK3 following necroptotic stimulation. The crystal structures identified two distinct conformations of the MLKL pseudokinase domain, supporting the idea that a conformational transition accompanies MLKL disengagement from RIPK3. These studies provide further evidence that MLKL undergoes a large conformational change upon activation, and identify MLKL disengagement from RIPK3 as a key regulatory step in the necroptosis pathway.
A major challenge associated with biochemical and cellular analysis of pseudokinases is a lack of target-validated small-molecule compounds with which to probe function. Tribbles 2 (TRIB2) is a ...cancer-associated pseudokinase with a diverse interactome, including the canonical AKT signaling module. There is substantial evidence that human TRIB2 promotes survival and drug resistance in solid tumors and blood cancers and therefore is of interest as a therapeutic target. The unusual TRIB2 pseudokinase domain contains a unique cysteine-rich C-helix and interacts with a conserved peptide motif in its own carboxyl-terminal tail, which also supports its interaction with E3 ubiquitin ligases. We found that TRIB2 is a target of previously described small-molecule protein kinase inhibitors, which were originally designed to inhibit the canonical kinase domains of epidermal growth factor receptor tyrosine kinase family members. Using a thermal shift assay, we discovered TRIB2-binding compounds within the Published Kinase Inhibitor Set (PKIS) and used a drug repurposing approach to classify compounds that either stabilized or destabilized TRIB2 in vitro. TRIB2 destabilizing agents, including the covalent drug afatinib, led to rapid TRIB2 degradation in human AML cancer cells, eliciting tractable effects on signaling and survival. Our data reveal new drug leads for the development of TRIB2-degrading compounds, which will also be invaluable for unraveling the cellular mechanisms of TRIB2-based signaling. Our study highlights that small molecule-induced protein down-regulation through drug "off-targets" might be relevant for other inhibitors that serendipitously target pseudokinases.
Glycosyltransferases (GTs) are prevalent across the tree of life and regulate nearly all aspects of cellular functions. The evolutionary basis for their complex and diverse modes of catalytic ...functions remain enigmatic. Here, based on deep mining of over half million GT-A fold sequences, we define a minimal core component shared among functionally diverse enzymes. We find that variations in the common core and emergence of hypervariable loops extending from the core contributed to GT-A diversity. We provide a phylogenetic framework relating diverse GT-A fold families for the first time and show that inverting and retaining mechanisms emerged multiple times independently during evolution. Using evolutionary information encoded in primary sequences, we trained a machine learning classifier to predict donor specificity with nearly 90% accuracy and deployed it for the annotation of understudied GTs. Our studies provide an evolutionary framework for investigating complex relationships connecting GT-A fold sequence, structure, function and regulation.
The MLKL pseudokinase is the terminal effector in the necroptosis cell death pathway. Phosphorylation by its upstream regulator, RIPK3, triggers MLKL's conversion from a dormant cytoplasmic protein ...into oligomers that translocate to, and permeabilize, the plasma membrane to kill cells. The precise mechanisms underlying these processes are incompletely understood, and were proposed to differ between mouse and human cells. Here, we examine the divergence of activation mechanisms among nine vertebrate MLKL orthologues, revealing remarkable specificity of mouse and human RIPK3 for MLKL orthologues. Pig MLKL can restore necroptotic signaling in human cells; while horse and pig, but not rat, MLKL can reconstitute the mouse pathway. This selectivity can be rationalized from the distinct conformations observed in the crystal structures of horse and rat MLKL pseudokinase domains. These studies identify important differences in necroptotic signaling between species, and suggest that, more broadly, divergent regulatory mechanisms may exist among orthologous pseudoenzymes.
Catalytic signaling outputs of protein kinases are dynamically regulated by an array of structural mechanisms, including allosteric interactions mediated by intrinsically disordered segments flanking ...the conserved catalytic domain. The doublecortin-like kinases (DCLKs) are a family of microtubule-associated proteins characterized by a flexible C-terminal autoregulatory ‘tail’ segment that varies in length across the various human DCLK isoforms. However, the mechanism whereby these isoform-specific variations contribute to unique modes of autoregulation is not well understood. Here, we employ a combination of statistical sequence analysis, molecular dynamics simulations, and in vitro mutational analysis to define hallmarks of DCLK family evolutionary divergence, including analysis of splice variants within the DCLK1 sub-family, which arise through alternative codon usage and serve to ‘supercharge’ the inhibitory potential of the DCLK1 C-tail. We identify co-conserved motifs that readily distinguish DCLKs from all other calcium calmodulin kinases (CAMKs), and a ‘Swiss Army’ assembly of distinct motifs that tether the C-terminal tail to conserved ATP and substrate-binding regions of the catalytic domain to generate a scaffold for autoregulation through C-tail dynamics. Consistently, deletions and mutations that alter C-terminal tail length or interfere with co-conserved interactions within the catalytic domain alter intrinsic protein stability, nucleotide/inhibitor binding, and catalytic activity, suggesting isoform-specific regulation of activity through alternative splicing. Our studies provide a detailed framework for investigating kinome-wide regulation of catalytic output through cis-regulatory events mediated by intrinsically disordered segments, opening new avenues for the design of mechanistically divergent DCLK1 modulators, stabilizers, or degraders.
Protein prenylation by farnesyltransferase (FTase) is often described as the targeting of a cysteine-containing motif (CaaX) that is enriched for aliphatic amino acids at the a.sub.1 and a.sub.2 ...positions, while quite flexible at the X position. Prenylation prediction methods often rely on these features despite emerging evidence that FTase has broader target specificity than previously considered. Using a machine learning approach and training sets based on canonical (prenylated, proteolyzed, and carboxymethylated) and recently identified shunted motifs (prenylation only), this study aims to improve prenylation predictions with the goal of determining the full scope of prenylation potential among the 8000 possible Cxxx sequence combinations. Further, this study aims to subdivide the prenylated sequences as either shunted (i.e., uncleaved) or cleaved (i.e., canonical). Predictions were determined for Saccharomyces cerevisiae FTase and compared to results derived using currently available prenylation prediction methods. In silico predictions were further evaluated using in vivo methods coupled to two yeast reporters, the yeast mating pheromone a-factor and Hsp40 Ydj1p, that represent proteins with canonical and shunted CaaX motifs, respectively. Our machine learning-based approach expands the repertoire of predicted FTase targets and provides a framework for functional classification.