Protein glycosylation is a ubiquitous post-translational modification found in all domains of life. Despite their significant complexity in animal systems, glycan structures have crucial biological ...and physiological roles, from contributions in protein folding and quality control to involvement in a large number of biological recognition events. As a result, they impart an additional level of 'information content' to underlying polypeptide structures. Improvements in analytical methodologies for dissecting glycan structural diversity, along with recent developments in biochemical and genetic approaches for studying glycan biosynthesis and catabolism, have provided a greater understanding of the biological contributions of these complex structures in vertebrates.
Glycans linked to proteins and lipids play key roles in biology; thus, accurate replication of cellular glycans is crucial for maintaining function following cell division. The fact that glycans are ...not copied from genomic templates suggests that fidelity is provided by the catalytic templates of glycosyltransferases that accurately add sugars to specific locations on growing oligosaccharides. To form new glycosidic bonds, glycosyltransferases bind acceptor substrates and orient a specific hydroxyl group, frequently one of many, for attack of the donor sugar anomeric carbon. Several recent crystal structures of glycosyltransferases with bound acceptor substrates reveal that these enzymes have common core structures that function as scaffolds upon which variable loops are inserted to confer substrate specificity and correctly orient the nucleophilic hydroxyl group. The varied approaches for acceptor binding site assembly suggest an ongoing evolution of these loop regions provides templates for assembly of the diverse glycan structures observed in biology.
An automated platform that can synthesize a wide range of complex carbohydrates will greatly increase their accessibility and should facilitate progress in glycoscience. Here we report a fully ...automated process for enzyme-mediated oligosaccharide synthesis that can give easy access to different classes of complex glycans including poly-N-acetyllactosamine derivatives, human milk oligosaccharides, gangliosides and N-glycans. Our automated platform uses a catch and release approach in which glycosyltransferase-catalysed reactions are performed in solution and product purification is accomplished by solid phase extraction. We developed a sulfonate tag that can easily be installed and enables highly efficient solid phase extraction and product release using a single set of washing conditions, regardless of the complexity of the glycan. Using this custom-built synthesizer, as many as 15 reaction cycles can be performed in an automated fashion without a need for lyophilization or buffer exchange steps.
Glycan microarrays provide a high-throughput means of profiling the interactions of glycan-binding proteins with their ligands. However, the construction of current glycan microarray platforms is ...time consuming and expensive. Here, we report a fast and cost-effective method for the assembly of cell-based glycan arrays to probe glycan-glycan-binding protein interactions directly on the cell surface. Chinese hamster ovary cell mutants with a narrow and relatively homogeneous repertoire of glycoforms serve as the foundation platforms to develop these arrays. Using recombinant glycosyltransferases, sialic acid, fucose, and analogs thereof are installed on cell-surface glycans to form cell-based arrays displaying diverse glycan epitopes that can be probed with glycan-binding proteins by flow cytometry. Using this platform, high-affinity glycan ligands are discovered for Siglec-15-a sialic acid-binding lectin involved in osteoclast differentiation. Incubating human osteoprogenitor cells with cells displaying a high-affinity Siglec-15 ligand impairs osteoclast differentiation, demonstrating the utility of this cell-based glycan array technology.
Despite mammalian glycans typically having highly complex asymmetrical multiantennary architectures, chemical and chemoenzymatic synthesis has almost exclusively focused on the preparation of simpler ...symmetrical structures. This deficiency hampers investigations into the biology of glycan-binding proteins, which in turn complicates the biomedical use of this class of biomolecules. Herein, we describe an enzymatic strategy, using a limited number of human glycosyltransferases, to access a collection of 60 asymmetric, multiantennary human milk oligosaccharides (HMOs), which were used to develop a glycan microarray. Probing the array with several glycan-binding proteins uncovered that not only terminal glycoepitopes but also complex architectures of glycans can influence binding selectivity in unanticipated manners. N- and O-linked glycans express structural elements of HMOs, and thus, the reported synthetic principles will find broad applicability.
Glycosyltransferases (GTs) play fundamental roles in nearly all cellular processes through the biosynthesis of complex carbohydrates and glycosylation of diverse protein and small molecule ...substrates. The extensive structural and functional diversification of GTs presents a major challenge in mapping the relationships connecting sequence, structure, fold and function using traditional bioinformatics approaches. Here, we present a convolutional neural network with attention (CNN-attention) based deep learning model that leverages simple secondary structure representations generated from primary sequences to provide GT fold prediction with high accuracy. The model learns distinguishing secondary structure features free of primary sequence alignment constraints and is highly interpretable. It delineates sequence and structural features characteristic of individual fold types, while classifying them into distinct clusters that group evolutionarily divergent families based on shared secondary structural features. We further extend our model to classify GT families of unknown folds and variants of known folds. By identifying families that are likely to adopt novel folds such as GT91, GT96 and GT97, our studies expand the GT fold landscape and prioritize targets for future structural studies.
Contemporary chemoenzymatic approaches can provide highly complex multi-antennary N-linked glycans. These procedures are, however, very demanding and typically involve as many as 100 chemical steps ...to prepare advanced intermediates that can be diversified by glycosyltransferases in a branch-selective manner to give asymmetrical structures commonly found in nature. Only highly specialized laboratories can perform such syntheses, which greatly hampers progress in glycoscience. Here we describe a biomimetic approach in which a readily available bi-antennary glycopeptide can be converted in ten or fewer chemical and enzymatic steps into multi-antennary N-glycans that at each arm can be uniquely extended by glycosyltransferases to give access to highly complex asymmetrically branched N-glycans. A key feature of our approach is the installation of additional branching points using recombinant MGAT4 and MGAT5 in combination with unnatural sugar donors. At an appropriate point in the enzymatic synthesis, the unnatural monosaccharides can be converted into their natural counterpart, allowing each arm to be elaborated into a unique appendage.
Vertebrate glycoproteins and glycolipids are synthesized in complex biosynthetic pathways localized predominantly within membrane compartments of the secretory pathway. The enzymes that catalyze ...these reactions are exquisitely specific, yet few have been extensively characterized because of challenges associated with their recombinant expression as functional products. We used a modular approach to create an expression vector library encoding all known human glycosyltransferases, glycoside hydrolases, and sulfotransferases, as well as other glycan-modifying enzymes. We then expressed the enzymes as secreted catalytic domain fusion proteins in mammalian and insect cell hosts, purified and characterized a subset of the enzymes, and determined the structure of one enzyme, the sialyltransferase ST6GalNAcII. Many enzymes were produced at high yields and at similar levels in both hosts, but individual protein expression levels varied widely. This expression vector library will be a transformative resource for recombinant enzyme production, broadly enabling structure-function studies and expanding applications of these enzymes in glycochemistry and glycobiology.
Heparan sulfate (HS) proteoglycans are extended (-GlcAβ1,4GlcNAcα1,4-)
co-polymers containing decorations of sulfation and epimerization that are linked to cell surface and extracellular matrix ...proteins. In mammals, HS repeat units are extended by an obligate heterocomplex of two exostosin family members, EXT1 and EXT2, where each protein monomer contains distinct GT47 (GT-B fold) and GT64 (GT-A fold) glycosyltransferase domains. In this study, we generated human EXT1-EXT2 (EXT1-2) as a functional heterocomplex and determined its structure in the presence of bound donor and acceptor substrates. Structural data and enzyme activity of catalytic site mutants demonstrate that only two of the four glycosyltransferase domains are major contributors to co-polymer syntheses: the EXT1 GT-B fold β1,4GlcA transferase domain and the EXT2 GT-A fold α1,4GlcNAc transferase domain. The two catalytic sites are over 90 Å apart, indicating that HS is synthesized by a dissociative process that involves a single catalytic site on each monomer.
Substrate side chain conformation impacts reactivity during glycosylation and glycoside hydrolysis and is restricted by many glycosidases and glycosyltransferases during catalysis. We show that the ...side chains of gluco and manno iminosugars can be restricted to predominant conformations by strategic installation of a methyl group. Glycosidase inhibition studies reveal that iminosugars with the gauche,gauche side chain conformations are 6‐ to 10‐fold more potent than isosteric compounds with the gauche,trans conformation; a manno‐configured iminosugar with the gauche,gauche conformation is a 27‐fold better inhibitor than 1‐deoxymannojirimycin. The results are discussed in terms of the energetic benefits of preorganization, particularly when in synergy with favorable hydrophobic interactions. The demonstration that inhibitor side chain preorganization can favorably impact glycosidase inhibition paves the way for improved inhibitor design through conformational preorganization.
The first experimental demonstration that preorganization of inhibitor side chain conformation can favorably impact glycosidase inhibition is reported. By strategic installation of a methyl group at C6, a manno‐configured iminosugar with the gauche,gauche side chain conformation was found to be a 27‐fold better inhibitor than 1‐deoxymannojirimycin.