Fundamental questions regarding collagen biosynthesis, especially with respect to the molecular origins of homotrimeric versus heterotrimeric assembly, remain unanswered. Here, we demonstrate that ...the presence or absence of a single cysteine in type-I collagen's C-propeptide domain is a key factor governing the ability of a given collagen polypeptide to stably homotrimerize. We also identify a critical role for Ca
in non-covalent collagen C-propeptide trimerization, thereby priming the protein for disulfide-mediated covalent immortalization. The resulting cysteine-based code for stable assembly provides a molecular model that can be used to predict, a priori, the identity of not just collagen homotrimers, but also naturally occurring 2:1 and 1:1:1 heterotrimers. Moreover, the code applies across all of the sequence-diverse fibrillar collagens. These results provide new insight into how evolution leverages disulfide networks to fine-tune protein assembly, and will inform the ongoing development of designer proteins that assemble into specific oligomeric forms.
Intracellular procollagen folding begins at the protein's C-terminal propeptide (C-Pro) domain, which initiates triple-helix assembly and defines the composition and chain register of fibrillar ...collagen trimers. The C-Pro domain is later proteolytically cleaved and excreted from the body, while the mature triple helix is incorporated into the extracellular matrix. The procollagen C-Pro domain possesses a single
-glycosylation site that is widely conserved in all the fibrillar procollagens across humans and diverse other species. Given that the C-Pro domain is removed once procollagen folding is complete, the
-glycan might be presumed to be important for folding. Surprisingly, however, there is no difference in the folding and secretion of
-glycosylated versus non-
-glycosylated collagen type-I, leaving the function of the
-glycan unclear. We hypothesized that the collagen
-glycan might have a context-dependent function, specifically, that it could be required to promote procollagen folding only when proteostasis is challenged. We show that removal of the
-glycan from misfolding-prone C-Pro domain variants does indeed cause serious procollagen and ER proteostasis defects. The
-glycan promotes folding and secretion of destabilized C-Pro variants by providing access to the ER's lectin-based chaperone machinery. Finally, we show that the C-Pro
-glycan is actually critical for the folding and secretion of even wild-type procollagen under ER stress conditions. Such stress is commonly incurred during development, wound healing, and other processes in which collagen production plays a key role. Collectively, these results establish an essential, context-dependent function for procollagen's previously enigmatic
-glycan, wherein the carbohydrate moiety buffers procollagen folding against proteostatic challenge.