Glycosylation is the most abundant and diverse posttranslational modification of proteins. While several types of glycosylation can be predicted by the protein sequence context, and substantial ...knowledge of these glycoproteomes is available, our knowledge of the GalNAc‐type O‐glycosylation is highly limited. This type of glycosylation is unique in being regulated by 20 polypeptide GalNAc‐transferases attaching the initiating GalNAc monosaccharides to Ser and Thr (and likely some Tyr) residues. We have developed a genetic engineering approach using human cell lines to simplify O‐glycosylation (SimpleCells) that enables proteome‐wide discovery of O‐glycan sites using ‘bottom‐up’ ETD‐based mass spectrometric analysis. We implemented this on 12 human cell lines from different organs, and present a first map of the human O‐glycoproteome with almost 3000 glycosites in over 600 O‐glycoproteins as well as an improved NetOGlyc4.0 model for prediction of O‐glycosylation. The finding of unique subsets of O‐glycoproteins in each cell line provides evidence that the O‐glycoproteome is differentially regulated and dynamic. The greatly expanded view of the O‐glycoproteome should facilitate the exploration of how site‐specific O‐glycosylation regulates protein function.
Comprehensive proteomics survey in 12 human cell lines and development of an improved NetOGlyc4.0 prediction tool greatly expand the view on mucin‐type protein O‐glycosylation.
Zinc-finger nuclease (ZFN) gene targeting is emerging as a versatile tool for engineering of multiallelic gene deficiencies. A longstanding obstacle for detailed analysis of glycoproteomes has been ...the extensive heterogeneities in glycan structures and attachment sites. Here we applied ZFN targeting to truncate the O-glycan elongation pathway in human cells, generating stable 'SimpleCell' lines with homogenous O-glycosylation. Three SimpleCell lines expressing only truncated GalNAcα or NeuAcα2-6GalNAcα O-glycans were produced, allowing straightforward isolation and sequencing of GalNAc O-glycopeptides from total cell lysates using lectin chromatography and nanoflow liquid chromatography-mass spectrometry (nLC-MS/MS) with electron transfer dissociation fragmentation. We identified >100 O-glycoproteins with >350 O-glycan sites (the great majority previously unidentified), including a GalNAc O-glycan linkage to a tyrosine residue. The SimpleCell method should facilitate analyses of important functions of protein glycosylation. The strategy is also applicable to other O-glycoproteomes.
Previous studies demonstrated that certain glycosphingolipids (GSLs) are involved in various cell functions, such as cell growth and motility. Recent studies showed changes in GSL expression during ...differentiation of human embryonic stem cells; however, little is known about expression profiles of GSLs in cancer stem cells (CSCs). CSCs are a small subpopulation in cancer and are proposed as cancer-initiating cells, have been shown to be resistant to numerous chemotherapies, and may cause cancer recurrence. Here, we analyzed GSLs expressed in human breast CSCs by applying a CSC model induced through epithelial–mesenchymal transition, using mass spectrometry, TLC immunostaining, and cell staining. We found that (i) Fuc-(n)Lc4Cer and Gb3Cer were drastically reduced in CSCs, whereas GD2, GD3, GM2, and GD1a were greatly increased in CSCs; (ii) among various glycosyltransferases tested, mRNA levels for ST3GAL5, B4GALNT1, ST8SIA1, and ST3GAL2 were increased in CSCs, which could explain the increased expression of GD3, GD2, GM2, and GD1a in CSCs; (iii) the majority of GD2+ cells and GD3+ cells were detected in the CD44 ʰⁱ/CD24 ˡᵒ cell population; and (iv) knockdown of ST8SIA1 and B4GALNT1 significantly reduced the expression of GD2 and GD3 and caused a phenotype change from CSC to a non-CSC, which was detected by reduced mammosphere formation and cell motility. Our results provide insight into GSL profiles in human breast CSCs, indicate a functional role of GD2 and GD3 in CSCs, and suggest a possible novel approach in targeting human breast CSCs to interfere with cancer recurrence.
The metazoan O-mannose (O-Man) glycoproteome is largely unknown. It has been shown that up to 30% of brain O-glycans are of the O-Man type, but essentially only alpha-dystroglycan (α-DG) of the ...dystrophin–glycoprotein complex is well characterized as an O-Man glycoprotein. Defects in O-Man glycosylation underlie congenital muscular dystrophies and considerable efforts have been devoted to explore this O-glycoproteome without much success. Here, we used our SimpleCell strategy using nuclease-mediated gene editing of a human cell line (MDA-MB-231) to reduce the structural heterogeneity of O-Man glycans and to probe the O-Man glycoproteome. In this breast cancer cell line we found that O-Man glycosylation is primarily found on cadherins and plexins on β-strands in extracellular cadherin and Ig-like, plexin and transcription factor domains. The positions and evolutionary conservation of O-Man glycans in cadherins suggest that they play important functional roles for this large group of cell adhesion glycoproteins, which can now be addressed. The developed O-Man SimpleCell strategy is applicable to most types of cell lines and enables proteome-wide discovery of O-Man protein glycosylation.
Global analyses of proteins and their modifications by mass spectrometry are essential tools in cell biology and biomedical research. Analyses of glycoproteins represent particular challenges and we ...are only at the beginnings of the glycoproteomic era. Some of the challenges have been overcome with N-glycoproteins and proteome-wide analysis of N-glycosylation sites is accomplishable today but only by sacrificing information of structures at individual glycosites. More recently advances in analysis of O-glycoproteins have been made and proteome-wide analysis of O-glycosylation sites is becoming available as well.
Here we discuss the challenges of analysis of O-glycans and new O-glycoproteomics strategies focusing on O-GalNAc and O-Man glycoproteomes.
A variety of strategies are now available for proteome-wide analysis of O-glycosylation sites enabling functional studies. However, further developments are still needed for complete analysis of glycan structures at individual sites for both N- and O-glycoproteomics strategies.
The advances in O-glycoproteomics have led to identification of new biological functions of O-glycosylation and a new understanding of the importance of where O-glycans are positioned on proteins.
•What makes O-glycoproteomics particularly challenging?•ETD MS/MS based approaches had provided one solution.•Enrichment protocols for glycopeptides are essential for deep O-glycoproteomics.•Precise gene editing strategies complement functional glycoproteomics.
NKT cells represent a distinct lineage of T cells that coexpress a conserved αβ T cell receptor (TCR) and natural killer (NK) receptors. Although the TCR of NKT cells is characteristically ...autoreactive to CD1d, a lipid-presenting molecule, endogenous ligands for these cells have not been identified. We show that a lysosomal glycosphingolipid of previously unknown function, isoglobotrihexosylceramide (iGb3), is recognized both by mouse and human NKT cells. Impaired generation of lysosomal iGb3 in mice lacking β-hexosaminidase b results in severe NKT cell deficiency, suggesting that this lipid also mediates development of NKT cells in the mouse. We suggest that expression of iGb3 in peripheral tissues may be involved in controlling NKT cell responses to infections and malignancy and in autoimmunity.
Sphingosines, or sphingoids, are a family of naturally occurring long-chain hydrocarbon derivatives sharing a common 1,3-dihydroxy-2-amino-backbone motif. The majority of sphingolipids, as their ...derivatives are collectively known, can be found in cell membranes in the form of amphiphilic conjugates, each composed of a polar head group attached to an N-acylated sphingoid, or ceramide. Glycosphingolipids (GSLs), which are the glycosides of either ceramide or myo-inositol-(1-O)-phosphoryl-(O-1)-ceramide, are a structurally and functionally diverse sphingolipid subclass; GSLs are ubiquitously distributed among all eukaryotic species and are found in some bacteria. Since GSLs are secondary metabolites, direct and comprehensive analysis (metabolomics) must be considered an essential complement to genomic and proteomic approaches for establishing the structural repertoire within an organism and deducing its possible functional roles. The glycosphingolipidome clearly comprises an important and extensive subset of both the glycome and the lipidome, but the complexities of GSL structure, biosynthesis, and function form the outlines of a considerable analytical problem, especially since their structural diversity confers by extension an enormous variability with respect to physicochemical properties. This chapter covers selected developments and applications of techniques in mass spectrometric (MS) that have contributed to GSL structural analysis and glycosphingolipidomics since 1990. Sections are included on basic characteristics of ionization and fragmentation of permethylated GSLs and of lithium-adducted nonderivatized GSLs under positive-ion electrospray ionization mass spectrometry (ESI-MS) and collision-induced mass spectrometry (CID-MS) conditions; on the analysis of sulfatides, mainly using negative-ion techniques; and on selected applications of ESI-MS and matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS) to emerging GSL structural, functional, and analytical issues. The latter section includes a particular focus on evolving techniques for analysis of gangliosides, GSLs containing sialic acid, as well as on characterizations of GSLs from selected nonmammalian eukaryotes, such as dipterans, nematodes, cestodes, and fungi. Additional sections focus on the issue of whether it is better to leave GSLs intact or remove the ceramide; on development and uses of thin-layer chromatography (TLC) blotting and TLC-MS techniques; and on emerging issues of high-throughput analysis, including the use of flow injection, liquid chromatography mass spectrometry (LC-MS), and capillary electrophoresis mass spectrometry (CE-MS).
Our knowledge of the O-glycoproteome N-acetylgalactosamine (GalNAc) type is highly limited. The O-glycoproteome is differentially regulated in cells by dynamic expression of a subset of 20 ...polypeptide GalNAc-transferases (GalNAc-Ts), and methods to identify important functions of individual GalNAc-Ts are largely unavailable. We recently introduced SimpleCells, i.e., human cell lines made deficient in O-glycan extension by zinc finger nuclease targeting of a key gene in O-glycan elongation (Cosmc), which allows for proteome-wide discovery of O-glycoproteins. Here we have extended the SimpleCell concept to include proteome-wide discovery of unique functions of individual GalNAc-Ts. We used the GalNAc-T2 isoform implicated in dyslipidemia and the human HepG2 liver cell line to demonstrate unique functions of this isoform. We confirm that GalNAc-T2–directed site-specific O-glycosylation inhibits proprotein activation of the lipase inhibitor ANGPTL3 in HepG2 cells and further identify eight O-glycoproteins exclusively glycosylated by T2 of which one, ApoC-III, is implicated in dyslipidemia. Our study supports an essential role for GalNAc-T2 in lipid metabolism, provides serum biomarkers for GalNAc-T2 enzyme function, and validates the use of GALNT gene targeting with SimpleCells for broad discovery of disease-causing deficiencies in O-glycosylation. The presented glycoengineering strategy opens the way for proteome-wide discovery of functions of GalNAc-T isoforms and their role in congenital diseases and disorders.
N-acetylgalactosaminyltransferase (GalNAc)-type (mucin-type) O-glycosylation is an abundant and highly diverse modification of proteins. This type of O-glycosylation is initiated in the Golgi by a ...large family of up to 20 homologous polypeptide GalNAc-T isoenzymes that transfer GalNAc to Ser, Thr and possibly Tyr residues. These GalNAc residues are then further elongated by a large set of glycosyltransferases to build a variety of complex O-glycan structures. What determines O-glycan site occupancy is still poorly understood, although it is clear that the substrate specificities of individual isoenzymes and the repertoire of GalNAc-Ts in cells are key parameters. The GalNAc-T isoenzymes are differentially expressed in cells and tissues in principle allowing cells to produce unique O-glycoproteomes dependent on the specific subset of isoforms present. In vitro analysis of acceptor peptide substrate specificities using recombinant expressed GalNAc-Ts has been the method of choice for probing activities of individual isoforms, but these studies have been hampered by biological validation of actual O-glycosylation sites in proteins and number of substrate testable. Here, we present a systematic analysis of the activity of 10 human GalNAc-T isoenzymes with 195 peptide substrates covering known O-glycosylation sites and provide a comprehensive dataset for evaluating isoform-specific contributions to the O-glycoproteome.
The Chinese hamster ovary cell (CHO) is the major host cell factory for recombinant production of biological therapeutics primarily because of its “human-like” glycosylation features. CHO is used for ...production of several O-glycoprotein therapeutics including erythropoietin, coagulation factors, and chimeric receptor IgG1-Fc-fusion proteins, however, some O-glycoproteins are not produced efficiently in CHO. We have previously shown that the capacity for O-glycosylation of proteins can be one limiting parameter for production of active proteins in CHO. Although the capacity of CHO for biosynthesis of glycan structures (glycostructures) on glycoproteins are well established, our knowledge of the capacity of CHO cells for attaching GalNAc-type O-glycans to proteins (glycosites) is minimal. This type of O-glycosylation is one of the most abundant forms of glycosylation, and it is differentially regulated in cells by expression of a subset of homologous polypeptide GalNAc-transferases. Here, we have genetically engineered CHO cells to produce homogeneous truncated O-glycans, so-called SimpleCells, which enabled lectin enrichment of O-glycoproteins and characterization of the O-glycoproteome. We identified 738 O-glycoproteins (1548 O-glycosites) in cell lysates and secretomes providing the first comprehensive insight into the O-glycosylation capacity of CHO (http://glycomics.ku.dk/o-glycoproteome_db/).