Coexpression networks and gene regulatory networks (GRNs) are emerging as important tools for predicting functional roles of individual genes at a system-wide scale. To enable network ...reconstructions, we built a large-scale gene expression atlas composed of 62,547 messenger RNAs (mRNAs), 17,862 nonmodified proteins, and 6227 phosphoproteins harboring 31,595 phosphorylation sites quantified across maize development. Networks in which nodes are genes connected on the basis of highly correlated expression patterns of mRNAs were very different from networks that were based on coexpression of proteins. Roughly 85% of highly interconnected hubs were not conserved in expression between RNA and protein networks. However, networks from either data type were enriched in similar ontological categories and were effective in predicting known regulatory relationships. Integration of mRNA, protein, and phosphoprotein data sets greatly improved the predictive power of GRNs.
Aphids are sap-feeding plant pests and harbor the endosymbiont Buchnera aphidicola , which is essential for their fecundity and survival. During plant penetration and feeding, aphids secrete saliva ...that contains proteins predicted to alter plant defenses and metabolism. Plants recognize microbe-associated molecular patterns and induce pattern-triggered immunity (PTI). No aphid-associated molecular pattern has yet been identified. By mass spectrometry, we identified in saliva from potato aphids (Macrosiphum euphorbiae) 105 proteins, some of which originated from Buchnera , including the chaperonin GroEL. Because GroEL is a widely conserved bacterial protein with an essential function, we tested its role in PTI. Applying or infiltrating GroEL onto Arabidopsis (Arabidopsis thaliana) leaves induced oxidative burst and expression of PTI early marker genes. These GroEL-induced defense responses required the known coreceptor BRASSINOSTEROID INSENSITIVE 1-ASSOCIATED RECEPTOR KINASE 1. In addition, in transgenic Arabidopsis plants, inducible expression of groEL activated PTI marker gene expression. Moreover, Arabidopsis plants expressing groEL displayed reduced fecundity of the green peach aphid (Myzus persicae), indicating enhanced resistance against aphids. Furthermore, delivery of GroEL into tomato (Solanum lycopersicum) or Arabidopsis through Pseudomonas fluorescens , engineered to express the type III secretion system, also reduced potato aphid and green peach aphid fecundity, respectively. Collectively our data indicate that GroEL is a molecular pattern that triggers PTI.
Ethylene gas is essential for many developmental processes and stress responses in plants. ETHYLENE INSENSITIVE2 (EIN2), an NRAMP-like integral membrane protein, plays an essential role in ethylene ...signaling, but its function remains enigmatic. Here we report that phosphorylation-regulated proteolytic processing of EIN2 triggers its endoplasmic reticulum (ER)–to–nucleus translocation. ER-tethered EIN2 shows CONSTITUTIVE TRIPLE RESP0NSE1 (CTR1) kinase-dependent phosphorylation. Ethylene triggers dephosphorylation at several sites and proteolytic cleavage at one of these sites, resulting in nuclear translocation of a carboxyl-terminal EIN2 fragment (EIN2-C'). Mutations that mimic EIN2 dephosphorylation, or inactivate CTR1, show constitutive cleavage and nuclear localization of EIN2-C' and EIN3 and EIN3-LIKE1-dependent activation of ethylene responses. These findings uncover a mechanism of subcellular communication whereby ethylene stimulates phosphorylation-dependent cleavage and nuclear movement of the EIN2-C' peptide, linking hormone perception and signaling components in the ER with nuclear-localized transcriptional regulators.
Many species possess an endogenous circadian clock to synchronize internal physiology with an oscillating external environment. In plants, the circadian clock coordinates growth, metabolism and ...development over daily and seasonal time scales. Many proteins in the circadian network form oscillating complexes that temporally regulate myriad processes, including signal transduction, transcription, protein degradation and post-translational modification. In Arabidopsis thaliana, a tripartite complex composed of EARLY FLOWERING 4 (ELF4), EARLY FLOWERING 3 (ELF3), and LUX ARRHYTHMO (LUX), named the evening complex, modulates daily rhythms in gene expression and growth through transcriptional regulation. However, little is known about the physical interactions that connect the circadian system to other pathways. We used affinity purification and mass spectrometry (AP-MS) methods to identify proteins that associate with the evening complex in A. thaliana. New connections within the circadian network as well as to light signaling pathways were identified, including linkages between the evening complex, TIMING OF CAB EXPRESSION1 (TOC1), TIME FOR COFFEE (TIC), all phytochromes and TANDEM ZINC KNUCKLE/PLUS3 (TZP). Coupling genetic mutation with affinity purifications tested the roles of phytochrome B (phyB), EARLY FLOWERING 4, and EARLY FLOWERING 3 as nodes connecting the evening complex to clock and light signaling pathways. These experiments establish a hierarchical association between pathways and indicate direct and indirect interactions. Specifically, the results suggested that EARLY FLOWERING 3 and phytochrome B act as hubs connecting the clock and red light signaling pathways. Finally, we characterized a clade of associated nuclear kinases that regulate circadian rhythms, growth, and flowering in A. thaliana. Coupling mass spectrometry and genetics is a powerful method to rapidly and directly identify novel components and connections within and between complex signaling pathways.
Microalgae have recently received attention as a potential low-cost host for the production of recombinant proteins and novel metabolites. However, a major obstacle to the development of algae as an ...industrial platform has been the poor expression of heterologous genes from the nuclear genome. Here we describe a nuclear expression strategy using the foot-and-mouth-disease-virus 2A self-cleavage peptide to transcriptionally fuse heterologous gene expression to antibiotic resistance in Chlamydomonas reinhardtii. We demonstrate that strains transformed with ble-2A-GFP are zeocin-resistant and accumulate high levels of GFP that is properly 'cleaved' at the FMDV 2A peptide resulting in monomeric, cytosolic GFP that is easily detectable by in-gel fluorescence analysis or fluorescent microscopy. Furthermore, we used our ble2A nuclear expression vector to engineer the heterologous expression of the industrial enzyme, xylanase. We demonstrate that linking xyn1 expression to ble2A expression on the same open reading frame led to a dramatic (~100-fold) increase in xylanase activity in cells lysates compared to the unlinked construct. Finally, by inserting an endogenous secretion signal between the ble2A and xyn1 coding regions, we were able to target monomeric xylanase for secretion. The novel microalgae nuclear expression strategy described here enables the selection of transgenic lines that are efficiently expressing the heterologous gene-of-interest and should prove valuable for basic research as well as algal biotechnology.
Terpenoids are a major component of maize (Zea mays) chemical defenses that mediate responses to herbivores, pathogens, and other environmental challenges. Here, we describe the biosynthesis and ...elicited production of a class of maize diterpenoids, named dolabralexins. Dolabralexin biosynthesis involves the sequential activity of two diterpene synthases, ENT-COPALYL DIPHOSPHATE SYNTHASE (ZmAN2) and KAURENE SYNTHASE-LIKE4 (ZmKSL4). Together, ZmAN2 and ZmKSL4 form the diterpene hydrocarbon dolabradiene. In addition, we biochemically characterized a cytochrome P450 monooxygenase, ZmCYP71Z16, which catalyzes the oxygenation of dolabradiene to yield the epoxides 15,16-epoxydolabrene (epoxydolabrene) and 3𝛽-hydroxy-15,16-epoxydolabrene (epoxydolabranol). The absence of dolabradiene and epoxydolabranol in Zman2 mutants under elicited conditions confirmed the in vivo biosynthetic requirement of ZmAN2. Combined mass spectrometry and NMR experiments demonstrated that much of the epoxydolabranol is further converted into 3𝛽,15,16-trihydroxydolabrene (trihydroxydolabrene). Metabolite profiling of field-grown maize root tissues indicated that dolabralexin biosynthesis is widespread across common maize cultivars, with trihydroxydolabrene as the predominant diterpenoid. Oxidative stress induced dolabralexin accumulation and transcript expression of ZmAN2 and ZmKSL4 in root tissues, and metabolite and transcript accumulation were up-regulated in response to elicitation with the fungal pathogens Fusarium verticillioides and Fusarium graminearum. Consistently, epoxydolabranol significantly inhibited the growth of both pathogens in vitro at 10 𝜇g mL⁻¹, while trihydroxydolabrene-mediated inhibition was specific to F. verticillioides. These findings suggest that dolabralexins have defense-related roles in maize stress interactions and expand the known chemical space of diterpenoid defenses as genetic targets for understanding and ultimately improving maize resilience.
Stop codons have been exploited for genetic incorporation of unnatural amino acids (Uaas) in live cells, but their low incorporation efficiency, which is possibly due to competition from release ...factors, limits the power and scope of this technology. Here we show that the reportedly essential release factor 1 (RF1) can be knocked out from Escherichia coli by 'fixing' release factor 2 (RF2). The resultant strain JX33 is stable and independent, and it allows UAG to be reassigned from a stop signal to an amino acid when a UAG-decoding tRNA-synthetase pair is introduced. Uaas were efficiently incorporated at multiple UAG sites in the same gene without translational termination in JX33. We also found that amino acid incorporation at endogenous UAG codons is dependent on RF1 and mRNA context, which explains why E. coli tolerates apparent global suppression of UAG. JX33 affords a unique autonomous host for synthesizing and evolving new protein functions by enabling Uaa incorporation at multiple sites.
Gene annotation underpins genome science. Most often protein coding sequence is inferred from the genome based on transcript evidence and computational predictions. While generally correct, gene ...models suffer from errors in reading frame, exon border definition, and exon identification. To ascertain the error rate of Arabidopsis thaliana gene models, we isolated proteins from a sample of Arabidopsis tissues and determined the amino acid sequences of 144,079 distinct peptides by tandem mass spectrometry. The peptides corresponded to 1 or more of 3 different translations of the genome: a 6-frame translation, an exon splice-graph, and the currently annotated proteome. The majority of the peptides (126,055) resided in existing gene models (12,769 confirmed proteins), comprising 40% of annotated genes. Surprisingly, 18,024 novel peptides were found that do not correspond to annotated genes. Using the gene finding program AUGUSTUS and 5,426 novel peptides that occurred in clusters, we discovered 778 new protein-coding genes and refined the annotation of an additional 695 gene models. The remaining 13,449 novel peptides provide high quality annotation (>99% correct) for thousands of additional genes. Our observation that 18,024 of 144,079 peptides did not match current gene models suggests that 13% of the Arabidopsis proteome was incomplete due to approximately equal numbers of missing and incorrect gene models.
A comprehensive knowledge of proteomic states is essential for understanding biological systems. Using mass spectrometry, we mapped an atlas of developing maize seed proteotypes comprising 14,165 ...proteins and 18,405 phosphopeptides (from 4,511 proteins), quantified across eight tissues. We found that many of the most abundant proteins are not associated with detectable levels of their mRNAs, and we provide evidence for three potential explanations: transport of proteins between tissues; diurnal, out-of-phase accumulation of mRNAs and cognate proteins; and differential lifetimes of mRNAs compared with proteins. Likewise, many of the most abundant mRNAs were not associated with detectable levels of their proteins. Across the entire dataset, protein abundance was poorly correlated with mRNA levels and was largely independent of phosphorylation status. Comparisons between proteotypes revealed the quantitative contribution of specific proteins and phosphorylation events to the spatially and temporally regulated starch and oil biosynthetic pathways. Reconstruction of signaling networks established associations of proteins and phosphoproteins with distinct biological processes acting during seed development. Additionally, a protein kinase substrate network was reconstructed, enabling the identification of 762 potential substrates of specific protein kinases. Finally, examination of 694 transcription factors revealed remarkable constraints on patterns of expression and phosphorylation within transcription factor families. These results provide a resource for understanding seed development in a crop that is the foundation of modern agriculture.
Significance A defining characteristic of living organisms is dynamic alignment of cellular responses to stress through activation of signal transduction pathways essential for fine-tuning of ...interorgannellar communication. Uncovering these communication signals is one of the prime challenges of biology. We have identified a chloroplast-produced retrograde signal, methylerythritol cyclodiphosphate (MEcPP), as a trigger of unfolded protein response (UPR) required for restoration of protein-folding homeostasis in the endoplasmic reticulum (ER). Increased levels of MEcPP via genetic manipulation or exogenous application potentiate expression of a sub-set of UPR genes, and alter plant’s resistance to the ER stress inducing agent. These findings provide a link between a plastidial retrograde signal and transcriptional reprogramming of ER genes critical for readjustment of protein-folding capacity in stressed cells.
Cellular homeostasis in response to internal and external stimuli requires a tightly coordinated interorgannellar communication network. We recently identified methylerythritol cyclodiphosphate (MEcPP) as a novel stress-specific retrograde signaling metabolite that accumulates in response to environmental perturbations to relay information from plastids to the nucleus. We now demonstrate, using a combination of transcriptome and proteome profiling approaches, that mutant plants ( ceh1 ) with high endogenous levels of MEcPP display increased transcript and protein levels for a subset of the core unfolded protein response (UPR) genes. The UPR is an adaptive cellular response conserved throughout eukaryotes to stress conditions that perturb the endoplasmic reticulum (ER) homeostasis. Our results suggest that MEcPP directly triggers the UPR. Exogenous treatment with MEcPP induces the rapid and transient induction of both the unspliced and spliced forms of the UPR gene bZIP60 . Moreover, compared with the parent background (P), ceh1 mutants are less sensitive to the ER-stress-inducing agent tunicamycin (Tm). P and ceh1 plants treated with Tm display similar UPR transcript profiles, suggesting that although MEcPP accumulation causes partial induction of selected UPR genes, full induction is triggered by accumulation of misfolded proteins. This finding refines our perspective of interorgannellar communication by providing a link between a plastidial retrograde signaling molecule and its targeted ensemble of UPR components in ER.