Protein O-GlcNAcylation, involving the β-attachment of single N-acetylglucosamine (GlcNAc) to the hydroxyl group of serine or threonine residues, is an O-linked glycosylation catalyzed by O-GlcNAc ...transferase (OGT). Molecular level investigation of the basis for OGT's substrate specificity should aid understanding how O-GlcNAc contributes to diverse cellular processes. Due to an increasing number of O-GlcNAcylated peptides with site-specific information identified by mass spectrometry (MS)-based proteomics, we were motivated to characterize substrate site motifs of O-GlcNAc transferases. In this investigation, a non-redundant dataset of 410 experimentally verified O-GlcNAcylation sites were manually extracted from dbOGAP, OGlycBase and UniProtKB. After detection of conserved motifs by using maximal dependence decomposition, profile hidden Markov model (profile HMM) was adopted to learn a first-layered model for each identified OGT substrate motif. Support Vector Machine (SVM) was then used to generate a second-layered model learned from the output values of profile HMMs in first layer. The two-layered predictive model was evaluated using a five-fold cross validation which yielded a sensitivity of 85.4%, a specificity of 84.1%, and an accuracy of 84.7%. Additionally, an independent testing set from PhosphoSitePlus, which was really non-homologous to the training data of predictive model, was used to demonstrate that the proposed method could provide a promising accuracy (84.05%) and outperform other O-GlcNAcylation site prediction tools. A case study indicated that the proposed method could be a feasible means of conducting preliminary analyses of protein O-GlcNAcylation and has been implemented as a web-based system, OGTSite, which is now freely available at http://csb.cse.yzu.edu.tw/OGTSite/.
The conjugation of ubiquitin to a substrate protein (protein ubiquitylation), which involves a sequential process--E1 activation, E2 conjugation and E3 ligation, is crucial to the regulation of ...protein function and activity in eukaryotes. This ubiquitin-conjugation process typically binds the last amino acid of ubiquitin (glycine 76) to a lysine residue of a target protein. The high-throughput of mass spectrometry-based proteomics has stimulated a large-scale identification of ubiquitin-conjugated peptides. Hence, a new web resource, UbiSite, was developed to identify ubiquitin-conjugation site on lysines based on large-scale proteome dataset.
Given a total of 37,647 ubiquitin-conjugated proteins, including 128,026 ubiquitylated peptides, obtained from various resources, this study carries out a large-scale investigation on ubiquitin-conjugation sites based on sequenced and structural characteristics. A TwoSampleLogo reveals that a significant depletion of histidine (H), arginine (R) and cysteine (C) residues around ubiquitylation sites may impact the conjugation of ubiquitins in closed three-dimensional environments. Based on the large-scale ubiquitylation dataset, a motif discovery tool, MDDLogo, has been adopted to characterize the potential substrate motifs for ubiquitin conjugation. Not only are single features such as amino acid composition (AAC), positional weighted matrix (PWM), position-specific scoring matrix (PSSM) and solvent-accessible surface area (SASA) considered, but also the effectiveness of incorporating MDDLogo-identified substrate motifs into a two-layered prediction model is taken into account. Evaluation by five-fold cross-validation showed that PSSM is the best feature in discriminating between ubiquitylation and non-ubiquitylation sites, based on support vector machine (SVM). Additionally, the two-layered SVM model integrating MDDLogo-identified substrate motifs could obtain a promising accuracy and the Matthews Correlation Coefficient (MCC) at 81.06% and 0.586, respectively. Furthermore, the independent testing showed that the two-layered SVM model could outperform other prediction tools, reaching at 85.10% sensitivity, 69.69% specificity, 73.69% accuracy and the 0.483 of MCC value.
The independent testing result indicated the effectiveness of incorporating MDDLogo-identified motifs into the prediction of ubiquitylation sites. In order to provide meaningful assistance to researchers interested in large-scale ubiquitinome data, the two-layered SVM model has been implemented onto a web-based system (UbiSite), which is freely available at http://csb.cse.yzu.edu.tw/UbiSite/ . Two cases given in the UbiSite provide a demonstration of effective identification of ubiquitylation sites with reference to substrate motifs.
Bidirectional integrin αIIbβ3 signaling is essential for platelet activation. The platelet adaptor protein Disabled-2 (Dab2) is a key regulator of integrin signaling and is phosphorylated at serine ...24 in eukaryotic cells. However, the mechanistic insight and function of Dab2-serine 24 phosphorylation (Dab2-pSer24) in platelet biology are barely understood. This study aimed to define whether and how Dab2 is phosphorylated at Ser24 during platelet activation and to investigate the effect of Dab2-pSer24 on platelet function.
An antibody with confirmed specificity for Dab2-pSer24 was generated. By using this antibody as a tool, we showed that protein kinase C (PKC)-mediated Dab2-pSer24 was a conservative signaling event when human platelets were activated by the platelet agonists such as thrombin, collagen, ADP, 12-O-tetradecanoylphorbol-13-acetate, and the thromboxane A2 activator U46619. The agonists-stimulated Dab2-pSer24 was attenuated by pretreatment of platelets with the RGDS peptide which inhibits integrin outside-in signaling by competitive binding of integrin αIIb with fibrinogen. Direct activation of platelet integrin outside-in signaling by combined treatment of platelets with manganese dichloride and fibrinogen or by spreading of platelets on fibrinogen also resulted in Dab2-pSer24. These findings implicate that Dab2-pSer24 was associated with the outside-in signaling of integrin. Further analysis revealed that Dab2-pSer24 was downstream of Src-PKC-axis and phospholipase D1 underlying the integrin αIIbβ3 outside-in signaling. A membrane penetrating peptide R11-Ser24 which contained 11 repeats of arginine linked to the Dab2-Ser24 phosphorylation site and its flanking sequences (RRRRRRRRRRR
APKAPSKKEKK
) and the R11-S24A peptide with Ser24Ala mutation were designed to elucidate the functions of Dab2-pSer24. R11-Ser24 but not R11-S24A inhibited agonists-stimulated Dab2-pSer24 and consequently suppressed platelet spreading on fibrinogen, with no effect on platelet aggregation and fibrinogen binding. Notably, Ser24 and the previously reported Ser723 phosphorylation (Dab2-pSer723) occurred exclusively in a single Dab2 molecule and resulted in distinctive subcellular distribution and function of Dab2. Dab2-pSer723 was mainly distributed in the cytosol of activated platelets and associated with integrin inside-out signaling, while Dab2-pSer24 was mainly distributed in the membrane fraction of activated platelets and associated with integrin outside-in signaling.
These findings demonstrate for the first time that Dab2-pSer24 is conservative in integrin αIIbβ3 outside-in signaling during platelet activation and plays a novel role in the control of cytoskeleton reorganization and platelet spreading on fibrinogen.
Protein O-GlcNAcylation, involving the attachment of single N-acetylglucosamine (GlcNAc) to the hydroxyl group of serine or threonine residues. Elucidation of O-GlcNAcylation sites on proteins is ...required in order to decipher its crucial roles in regulating cellular processes and aid in drug design. With an increasing number of O-GlcNAcylation sites identified by mass spectrometry (MS)-based proteomics, several methods have been proposed for the computational identification of O-GlcNAcylation sites. However, no development that focuses on the investigation of O-GlcNAcylated substrate motifs has existed. Thus, we were motivated to design a new method for the identification of protein O-GlcNAcylation sites with the consideration of substrate site specificity.
In this study, 375 experimentally verified O-GlcNAcylation sites were collected from dbOGAP, which is an integrated resource for protein O-GlcNAcylation. Due to the difficulty in characterizing the substrate motifs by conventional sequence logo analysis, a recursively statistical method has been applied to obtain significant conserved motifs. To construct the predictive models learned from the identified substrate motifs, we adopted Support Vector Machines (SVMs). A five-fold cross validation was used to evaluate the predictive model, achieving sensitivity, specificity, and accuracy of 0.76, 0.80, and 0.78, respectively. Additionally, an independent testing set, which was really blind to the training data of predictive model, was used to demonstrate that the proposed method could provide a promising accuracy (0.94) and outperform three other O-GlcNAcylation site prediction tools.
This work proposed a computational method to identify informative substrate motifs for O-GlcNAcylation sites. The evaluation of cross validation and independent testing indicated that the identified motifs were effective in the identification of O-GlcNAcylation sites. A case study demonstrated that the proposed method could be a feasible means of conducting preliminary analyses of protein O-GlcNAcylation. We also anticipated that the revealed substrate motif may facilitate the study of extensive crosstalk between O-GlcNAcylation and phosphorylation. This method may help unravel their mechanisms and roles in signaling, transcription, chronic disease, and cancer.
Some studies have shown associations of maternal age at delivery with asthma and food allergy in offspring. However, the relationship between maternal age at delivery and allergic rhinitis is largely ...unclear. This study aimed to investigate the association between maternal age at delivery and allergic rhinitis in a population sample of Asian children, and to explore potential effect modifiers.
A total of 1344 singleton-birth children (763 boys, 56.8%; mean age, 6.4 years) participating in the Longitudinal Investigation of Global Health in Taiwanese Schoolchildren (LIGHTS) cohort were evaluated by a modified International Study of Asthma and Allergies in Childhood (ISAAC) questionnaire and interviewed by pediatricians. Allergic sensitization was determined by using Phadiatop Infant. Multiple logistic regression models with covariates adjustment were performed to investigate the association of maternal age at delivery with allergic rhinitis and allergic sensitization in offspring.
Among 1344 study children, 793 (59%) had physician-diagnosed allergic rhinitis. Advanced maternal age at delivery (≥40 years) was significantly associated with increased odds of allergic rhinitis (adjusted odds ratio AOR = 4.58, 95% confidence interval CI: 1.90–11.03) and allergic sensitization (AOR = 2.86, 95% CI: 1.13–7.22) in offspring. A sex-stratified analysis revealed that the association of advanced maternal age with allergic rhinitis was statistically significant only in female offspring (AOR = 7.02, 95% CI: 1.89–26.14). Stratified analyses by birth order or environmental tobacco smoke exposure during pregnancy did not reveal any significant differences.
Advanced maternal age at delivery was associated with increased risk of allergic rhinitis in Asian children, probably more pronounced among girls.
Carbonylation, which takes place through oxidation of reactive oxygen species (ROS) on specific residues, is an irreversibly oxidative modification of proteins. It has been reported that the ...carbonylation is related to a number of metabolic or aging diseases including diabetes, chronic lung disease, Parkinson's disease, and Alzheimer's disease. Due to the lack of computational methods dedicated to exploring motif signatures of protein carbonylation sites, we were motivated to exploit an iterative statistical method to characterize and identify carbonylated sites with motif signatures.
By manually curating experimental data from research articles, we obtained 332, 144, 135, and 140 verified substrate sites for K (lysine), R (arginine), T (threonine), and P (proline) residues, respectively, from 241 carbonylated proteins. In order to examine the informative attributes for classifying between carbonylated and non-carbonylated sites, multifarious features including composition of twenty amino acids (AAC), composition of amino acid pairs (AAPC), position-specific scoring matrix (PSSM), and positional weighted matrix (PWM) were investigated in this study. Additionally, in an attempt to explore the motif signatures of carbonylation sites, an iterative statistical method was adopted to detect statistically significant dependencies of amino acid compositions between specific positions around substrate sites. Profile hidden Markov model (HMM) was then utilized to train a predictive model from each motif signature. Moreover, based on the method of support vector machine (SVM), we adopted it to construct an integrative model by combining the values of bit scores obtained from profile HMMs. The combinatorial model could provide an enhanced performance with evenly predictive sensitivity and specificity in the evaluation of cross-validation and independent testing.
This study provides a new scheme for exploring potential motif signatures at substrate sites of protein carbonylation. The usefulness of the revealed motifs in the identification of carbonylated sites is demonstrated by their effective performance in cross-validation and independent testing. Finally, these substrate motifs were adopted to build an available online resource (MDD-Carb, http://csb.cse.yzu.edu.tw/MDDCarb/ ) and are also anticipated to facilitate the study of large-scale carbonylated proteomes.
Alternative lengthening of telomeres (ALT) is known to use homologous recombination (HR) to replicate telomeric DNA in a telomerase-independent manner. However, the detailed process remains largely ...undefined. It was reported that nuclear receptors COUP-TFII and TR4 are recruited to the enriched GGGTCA variant repeats embedded within ALT telomeres, implicating nuclear receptors in regulating ALT activity. Here, we identified a function of nuclear receptors in ALT telomere maintenance that involves a direct interaction between COUP-TFII/TR4 and FANCD2, the key protein in the Fanconi anemia (FA) DNA repair pathway. The COUP-TFII/TR4-FANCD2 complex actively induces the DNA damage response by recruiting endonuclease MUS81 and promoting the loading of the PCNA-POLD3 replication complex in ALT telomeres. Furthermore, the COUP-TFII/TR4-mediated ALT telomere pathway does not require the FA core complex or the monoubiquitylation of FANCD2, key steps in the canonical FA pathway. Thus, our findings reveal that COUP-TFII/TR4 regulates ALT telomere maintenance through a novel noncanonical FANCD2 pathway.
IMPORTANCE: The prevalence of atopic dermatitis has substantially increased in recent decades, and atopic dermatitis could lead to allergic airway inflammation later in life. A previous study found ...that inorganic arsenic exposure was associated with allergic airway inflammation in children aged 8 to 14 years. However, the association between prenatal exposure to arsenic and other metals and the risk of atopic dermatitis among young children remains unknown. OBJECTIVE: To assess the association between prenatal exposure to arsenic and other metals and the occurrence of atopic dermatitis in children at age 4 years. DESIGN, SETTING, AND PARTICIPANTS: In total, 1152 pregnant women were enrolled in the original Taiwan Maternal and Infant Cohort Study (TMICS), a multicenter birth cohort study conducted at 9 hospitals in northern, central, southern, and eastern Taiwan from October 2012 to May 2015. Of those, 586 mothers and children aged 4 years participated in follow-up questionnaire interviews from August 2016 to January 2019. After excluding 216 participants with missing data, the final statistical analysis of follow-up data included 370 mother and child pairs from the central and eastern regions of Taiwan. Data were analyzed from February 2 to August 12, 2021. EXPOSURES: Arsenic, cadmium, lead, cobalt, copper, nickel, thallium, and zinc during pregnancy. MAIN OUTCOMES AND MEASURES: The outcome was parent-reported atopic dermatitis history among children aged 4 years. The presence of atopic dermatitis was defined as a positive response to the question, “Has your child ever had atopic dermatitis diagnosed by a physician?” During the initial TMICS study period, concentrations of arsenic, cadmium, lead, cobalt, copper, nickel, thallium, and zinc were measured in maternal urine during the third trimester of pregnancy using an inductively coupled plasma mass spectrometer. Estimated total inorganic arsenic exposure was calculated using a model that included data on both total arsenic and arsenic species (arsenite, arsenate, monomethylarsonate, and dimethylarsenate) obtained from a previous TMICS cohort. RESULTS: Among 370 children included in the analysis, the mean (SD) age was 3.94 (0.59) years; 208 children (56.2%) were male, and 267 children (72.2%) were from the central region of Taiwan. A total of 110 children (29.7%) had atopic dermatitis at age 4 years. Maternal estimated total inorganic arsenic exposure during pregnancy was associated with increased odds of atopic dermatitis among children at age 4 years (odds ratio OR, 2.42 95% CI, 1.33-4.39 for every doubled increase of total inorganic arsenic) after adjusting for parental allergies, child’s sex, geographic area, maternal educational level, and exposure to tobacco smoke. Every increased unit in the weighted quantile sum index of maternal metal exposure was significantly associated with atopic dermatitis (OR, 1.63; 95% CI, 1.28-2.07). Arsenic (40.1%) and cadmium (20.5%) accounted for most of the metal mixture index. CONCLUSIONS AND RELEVANCE: This cohort study found that prenatal exposure to inorganic arsenic and coexposure to inorganic arsenic and cadmium were associated with a higher risk of atopic dermatitis in young children. These findings suggest that prevention of exposure to inorganic arsenic and cadmium during pregnancy may be helpful for the control of atopic dermatitis and other potential allergies in children.
Transmembrane (TM) proteins have crucial roles in various cellular processes. The location of post-translational modifications (PTMs) on TM proteins is associated with their functional roles in ...various cellular processes. Given the importance of PTMs in the functioning of TM proteins, this study developed topPTM (available online at http://topPTM.cse.yzu.edu.tw), a new dbPTM module that provides a public resource for identifying the functional PTM sites on TM proteins with structural topology. Experimentally verified TM topology data were integrated from TMPad, TOPDB, PDBTM and OPM. In addition to the PTMs obtained from dbPTM, experimentally verified PTM sites were manually extracted from research articles by text mining. In an attempt to provide a full investigation of PTM sites on TM proteins, all UniProtKB protein entries containing annotations related to membrane localization and TM topology were considered potential TM proteins. Two effective tools were then used to annotate the structural topology of the potential TM proteins. The TM topology of TM proteins is represented by graphical visualization, as well as by the PTM sites. To delineate the structural correlation between the PTM sites and TM topologies, the tertiary structure of PTM sites on TM proteins was visualized by Jmol program. Given the support of research articles by manual curation and the investigation of domain-domain interactions in Protein Data Bank, 1347 PTM substrate sites are associated with protein-protein interactions for 773 TM proteins. The database content is regularly updated on publication of new data by continuous surveys of research articles and available resources.
The clinical association between dry eye syndrome (DES) and chronic fatigue syndrome (CFS) remain unclear with less evidences. We aimed to investigate the relationship between CFS and DES using a ...national insurance and prospective cohort study.
Data from the Longitudinal Health Insurance Database 2000 was applied to estimate the incidence of CFS among patients with DES, and their age- and sex-matched controls without DES over a long-term follow-up period. All participants were CFS free at baseline, before the interval (2005-2007), but were later diagnosed with CFS. DES patients and its relative matched controls were excluded prevalent CFS before the same interval.
We identified 884 patients with DES and 3,536 matched controls in baseline and estimated the hazard ratios for incident CFS in the follow-up period. Patients with DES had a 2.08-fold considerably increasing risk of developing CFS, compared to non-DES group. An elevated risk of developing CFS remained (1.61-fold risk) even after adjusting for age, sex, and comorbidities. There was a presence of increasing risk in DES-related CFS when CFS-related comorbidities existing (adjusted hazard ratio, 1.98, 95% confidence interval, 1.19-3.29;
< 0.01). The subsequent risk for CFS between DES and non-DES patients was significant increased with three or more annual medical visits, the adjusted risk for CFS was 4.88-fold risk (95% CI, 2.26-10.58,
< 0.001).
We recommended that physicians should be aware of the increased risk of CFS among DES patients and adequately assess the health impacts among these patients.