Many mammalian proteins are glycoproteins, i.e. they contain sugars covalently linked to specific amino acids in their structure. There are two major types of sugar-containing proteins that occur in animal cells, generally referred to as glycoproteins and proteoglycans. Along with glycolipids, which are presented in the next chapter, all of these compounds are part of the group of sugar-containing macromolecules called glycoconjugates.
Glycoproteins (Fig. 27.1A) have short glycan chains; they can have up to 20 sugars but usually contain between three and 15 sugars. These oligosaccharides are highly branched, they do not have a repeating unit, and they usually contain amino sugars (N-acetylglucosamine or N-acetylgalactosamine), neutral sugars (D-galactose, D-mannose, L-fucose) and the acidic sugar sialic acid (N-acetylneuraminic acid). Glycoproteins generally do not contain uronic acids, acidic sugars that are a major part of the proteoglycans. Glycoproteins usually contain much smaller amounts of carbohydrate than of protein, typically from just a few percent carbohydrate to 10–15% sugar by weight.
Proteoglycans (see Fig. 27.1B and Chapter 29) contain as much as 50–60% carbohydrate. In these molecules, the sugar chains are long, unbranched polymers that may contain hundreds of monosaccharides. These saccharide chains have a repeating disaccharide unit, generally made up of a uronic acid and an amino sugar.
Most proteins in cell surface membranes that function as receptors for hormones or participate in other important membrane-associated processes such as cell–cell interactions are glycoproteins. Many of the membrane proteins of the endoplasmic reticulum or Golgi apparatus, as well as those proteins that are secreted from cells, including serum and mucous proteins, are also glycoproteins. In fact, glycosylation is the major enzymatic modification of proteins in the body. Addition of sugars to a protein can occur either at the same time, and location, as protein synthesis is occurring in the endoplasmic reticulum, i.e. co-translationally, or following the completion of protein synthesis and after the protein has been transported to the Golgi apparatus, i.e. posttranslationally. The functions of the carbohydrate chains of the resulting glycoproteins are diverse (Table 27.1).
Sugars can be attached to protein in either N-glycosidic linkages or O-glycosidic linkages. N-linked oligosaccharides (N-glycans) are widespread in nature and are characteristic of membrane and secretory proteins. The attachment of these oligosaccharides to protein involves a glycosylamine linkage between an N-acetylglucosamine (GlcNAc) residue and the amide nitrogen of an asparagine residue (Fig. 27.2A). The asparagine that serves as an acceptor of this oligosaccharide must be in the consensus sequence Asn-X-Ser (Thr) in order to be recognized as the acceptor by the oligosaccharide transferring enzyme (see below). However, not all asparagine residues, even those in this consensus sequence, become glycosylated, indicating that other factors such as protein conformation or other properties of the protein may be involved.
O-linked oligosaccharides (O-glycans) are most commonly found in proteins of mucous fluids, but also occur frequently on the same membrane and secretory proteins that contain the N-glycans. The O-glycans typically contain three or more sugars in linear or branched chains, attached to protein by a glycosidic linkage between an N-acetylgalactosamine (GalNAc) residue and the hydroxyl group of either a serine or threonine residue on the protein (Fig. 27.2B; see Fig. 27.4). There does not appear to be a consensus sequence of amino acids for O-linked glycosylation, so it is not known why some serines (or threonines) become glycosylated but others do not.
A glucosyl-galactose disaccharide is frequently linked to the hydroxyl group of hydroxylysine residues in the fibrous protein, collagen (Fig. 27.2C). Hydroxylysine is an uncommon amino acid, found only in collagens and proteins with collagenous domains. Hydroxylysine is not directly incorporated into protein as such, but is produced by posttranslational hydroxylation of lysine residues. Lysine hydroxylase requires vitamin C as a cofactor, which is why vitamin C is frequently used to expedite wound healing. Collagen is first synthesized in the cell as a precursor form called procollagen. Procollagens are usually synthesized as N-linked glycoproteins, but the N-glycan is removed as part of that peptide that is cleaved from procollagen during its maturation to collagen. Only the O-linked disaccharides remain on the mature collagen molecule. The less glycosylated collagens tend to form ordered, fibrous structures, such as occur in tendons, while the more heavily glycosylated collagens are found in meshwork structures, such as basement membranes in the vascular wall and renal glomerulus (see Chapter 29).
A single GlcNAc is attached to the hydroxyl group of serine or threonine residues on a number of cytoplasmic and nuclear proteins (Fig. 27.2D). This O-linked GlcNAc (O-GlcNAc) is linked to specific serine and threonine residues that become phosphorylated by protein kinases during hormonal stimulation or other signaling events. The enzyme that adds the GlcNAc is widespread, but how it is controlled is still not clear. There is a second enzyme that removes the GlcNAc from the serine and threonine residues, similar to the contrasting regulatory roles of protein kinases and phosphatases. The GlcNAc modification (O-GlcNAcylation) may represent a mechanism that allows the cells to block phosphorylation of specific serine and threonine residues on selected proteins, while allowing others to still be phosphorylated. Then, the GlcNAc can be removed under appropriate conditions to allow phosphorylation. The donor substrate of O-GlcNAcylation is UDP-GlcNAc (see below), which is derived from glucose. O-GlcNAcylation is dependent on the intracellular UDP-GlcNAc level, which is correlated with extracellular glucose concentration. O-GlcNAcylation of proteins involved in the insulin signaling pathway brings about insulin resistance in skeletal muscle, adipose tissue and pancreatic β-cells, which causes type 2 diabetes. O-GlcNAcylation also regulates transcription factors as well as the proteasome, which is involved in protein turnover (Chapter 34).
A novel class of O-linked glycans in which mannose is linked to serine or threonine residues is found on the muscle and nerve-specific protein dystroglycan. A typical O-mannose glycan consists of GlcNAc, galactose and sialic acid, which are attached to the core mannose in this order. O-mannose glycans serve as linkers between the intracellular cytoskeleton and the extracellular matrix to maintain myocyte function. In fact, deficiency in the biosynthesis of O-mannose glycans causes muscular dystrophy.
One other amino acid that can serve as a site for glycosylation is tyrosine. The only example of this linkage is in the protein glycogenin, found at the core of glycogen (see Chapter 13). Glycogenin is a self-glucosylating protein that initially attaches a glucose to the hydroxyl group of one of its tyrosine residues. The protein then adds a number of other glucoses to the protein-linked glucose to make an oligosaccharide which serves as the acceptor for glycogen synthase.
Although there are a large number of different carbohydrate structures produced by living cells, most of the oligosaccharides on glycoconjugates have many sugars and glycosidic linkages in common. All N-glycans have oligosaccharide chains that are branched structures, having a common core of three mannose residues and two GlcNAc residues (Fig. 27.3A and B), but differing considerably beyond the core region to give high-mannose and complex types of chains. The reason for this similarity in structure is that the high-mannose type of N-linked oligosaccharide is the biosynthetic precursor for all other N-glycans. As indicated below (see Fig. 27.11), the oligosaccharide is initially assembled on a carrier lipid in the endoplasmic reticulum as a high-mannose structure, then transferred to protein. It may remain as a high mannose structure, especially in lower organisms, but in animals the oligosaccharide undergoes a number of processing steps in the endoplasmic reticulum and Golgi apparatus that involve removal of some mannoses and addition of other sugars. As a result, beyond the core region, the oligosaccharides give rise to a vast array of structures that are referred to as high-mannose (Fig. 27.3A), and complex chains (Fig. 27.3B).
Fig. 27.3 Typical structures of high-mannose and complex, N-linked oligosaccharides.
The core structure (shaded area) is common to both structures. Asn, asparagine; Gal, galactose; GlcNAc, N-acetylglucosamine Man, mannose.
Complex oligosaccharides are so named because of their more complex sugar composition, including galactose, sialic acid and L-fucose. Complex chains have terminal trisaccharide sequences composed of sialic acid-galactose-GlcNAc attached to each of the branched core mannoses (Fig. 27.3A). L-Fucose may also be found attached to the core GlcNAc (Fig. 27.1A), the terminal galactose (Fig. 28.12) or the penultimate GlcNAc (Fig. 27.9). In common with sialic acid, fucose is usually a terminal sugar on oligosaccharides; that is, no other sugars are attached to it. Some of the complex oligosaccharides have two trisaccharide sequences, one attached to each of the branched core mannoses, and are therefore called biantennary complex chains, whereas others have three (triantennary) or four (tetra-antennary) of the trisaccharide structures (Fig. 27.3B). More than 100 different complex oligosaccharide structures have now been identified on various cell surface proteins, providing great diversity (microheterogeneity) as mediators of cellular recognition and chemical signaling events.
A glycoprotein may have a single N-glycan chain or it may have several of these types of oligosaccharides. Furthermore, the N-glycans may all have identical structures or they may be quite different in structure. For example, the influenza virus coat glycoproteins, hemagglutinin and neuraminidase, are both glycoproteins which usually have seven N-glycan chains, of which five are biantennary complex chains and two are high-mannose structures. Thus, a range of related structures is commonly found in a single glycoprotein and, in fact, multiple different structures may also be found at a single site on a glycoprotein. Microheterogeneity of oligosaccharide structures results from incomplete processing on some chains during their biosynthesis (see Fig. 27.12). As a result, some of the oligosaccharides on a glycoprotein may be complete complex chains whereas others may be only partially processed. Such diversity in the processing of N-glycans is controlled by many factors. The protein structure near the N-glycosylation site can have some influence on the processing. In general, N-glycans exposed on the surface of a folded protein are vulnerable to the processing enzymes, whereas those shielded within the structure of the protein are inaccessible to the enzymes.
Many N-linked glycoproteins also contain O-glycans of the type shown in Figure 27.4. The number of O-glycans varies considerably depending on the protein and its function. For example, the low-density lipoprotein (LDL) receptor is present in plasma membranes of smooth muscle cells and fibroblasts, and functions to bind and endocytose circulating LDL, as a source of dietary cholesterol. The LDL receptor has two N-glycans that are located near the LDL-binding domain, and a cluster of O-glycans near the membrane-spanning region. As shown in Figure 27.5, this receptor has a membrane-spanning region of hydrophobic amino acids, an extended region of amino acids on the external side of the plasma membrane that contains the cluster of O-glycans, and a functional domain that is involved in binding LDL (see Chapter 18). Although the two N-glycans are near the functional domain, they do not play a role in binding LDL. Instead, their function appears to be in helping the protein to fold into the proper conformation in the endoplasmic reticulum so that it can be translocated to the Golgi apparatus. The negatively charged O-glycans, each having a sialic acid, are believed to function to keep the protein in an extended state, and to prevent it from folding back on itself.
Fig. 27.5 Model of the low-density lipoprotein (LDL) receptor.
(See also Chapter 18 and compare Fig. 18.2.)
Mucins are glycoproteins that are secreted by epithelial cells lining the respiratory, gastrointestinal and genitourinary tracts. These proteins are very large in size with subunits having molecular weights of over one million daltons, and having as much as 80% of their weight as carbohydrate. Mucins are uniquely designed for their function, with about one-third of the amino acids being serines or threonines, and most of these being substituted with an O-glycan. Because most of these oligosaccharides carry a negatively charged sialic acid and these negative charges are in close proximity, they repel each other and prevent the protein from folding, causing it to remain in an extended state. Thus, the protein solution is highly viscous, forming a protective barrier on the epithelial surface, providing lubrication between surfaces and facilitating transport processes, such as the movement of food through the gastrointestinal system. There is a wide range of complex linear and branching oligosaccharide structures on mucins, including blood group antigens (see Chapter 28). Some oligosaccharides participate in interaction and binding to various bacterial cell surfaces. This property may play a significant role in bacterial sequestration and elimination, limiting colonization and infection.
Humans have a dietary requirement for some essential fatty acids and amino acids and vitamins (Chapter 11), but all the sugars that they need to make glycoconjugates can be synthesized from blood sugar, i.e. D-glucose. Figure 27.6 presents an overview of the sequence of reactions involved in interconversion of sugars in mammalian cells. All these sugar interconversion reactions involve sugar phosphates or sugar nucleotides.
Glucose is phosphorylated by hexokinase (or glucokinase in liver) as it enters the cell. Glucose-6-phosphate (Glc-6-P) can then be converted by a mutase (phosphoglucose mutase) to form Glc-1-P, which reacts with uridine triphosphate (UTP) to form UDP-Glc, catalyzed by the enzyme UDP-Glc pyrophosphorylase (Fig. 27.6). This enzyme is a pyrophosphorylase, named for the reverse reaction in which the phosphate of Glc-1-P acts to cleave the pyrophosphate bond of UTP to form UDP-Glc and pyrophosphate (PPi); cleavage of the high energy bond of PPi by pyrophosphatase provides the driving force for the reaction. This is the same pathway used in liver and muscle for incorporation of glucose into glycogen. UDP-Glc is epimerized to UDP-galactose (UDP-Gal) by UDP-Gal 4-epimerase, providing UDP-Gal for synthesis of glycoconjugates.
Glc-6-P may also be converted to fructose-6-phosphate (Fru-6-P) by the glycolytic enzyme phosphoglucose isomerase, and Fru-6-P may be isomerized to mannose-6-phosphate (Man-6-P) by phosphomannose isomerase. The Man-6-P is then converted by a mutase (phosphomannose mutase) to Man-1-P, which condenses with GTP to form GDP-Man. GDP-Man is the form of mannose that is used in the formation of N-linked oligosaccharides and is also the precursor for the formation of activated fucose, GDP-L-fucose. Mannose is present in our diet in small amounts but it is not an essential dietary component, since it can be readily produced from glucose. However, dietary mannose can be phosphorylated by hexokinase to Man-6-P, and then enters metabolism through phosphomannose isomerase.
Although normal animal cells can make all the galactose they need from glucose, galactose is still an important component of our diet because it is one of the sugars that make up the milk disaccharide, lactose. The pathway of galactose metabolism requires three enzymes (see Fig. 27.6). Dietary galactose is transported to the liver where it is phosphorylated by a specific kinase, galactokinase, that attaches phosphate to the hydroxyl group on carbon-1, rather than carbon-6, to form galactose-1-phosphate (Gal-1-P). Humans lack a UDP-Gal pyrophosphorylase, so the conversion of Gal-1-P to Glc-1-P involves the participation of a sugar nucleotide, UDP-Glc. The enzyme Gal-1-P uridyltransferase catalyzes an exchange between UDP-Glc and Gal-1-P to form UDP-Gal and Glc-1-P (see Fig. 27.6). The UDP-Gal is used for synthesis of glycoconjugates, while the Glc-1-P can be converted to Glc-6-P by phosphoglucomutase, and thus the original galactose molecule enters glycolysis.
UDP-Glc is present in only micromolar concentrations in cells so that its availability for galactose metabolism would be quickly exhausted were it not for the presence of a third enzyme, UDP-Gal-4-epimerase. This enzyme catalyzes the equilibrium between UDP-Glc and UDP-Gal, providing a constant source of UDP-Glc during galactose metabolism. The reactions catalyzed by (1) galactokinase, (2) Gal-1-P uridyltransferase, and (3) UDP-Gal 4-epimerase are summarized below, illustrating the roundabout way by which galactose enters mainstream metabolic pathways.
Fructose can be metabolized by two pathways in cells as shown in Figure 27.7. It can be phosphorylated by hexokinase, an enzyme that is present in all cells; however, hexokinase has a strong preference for glucose as a substrate, and glucose, which is present at a concentration of about 5 mmol/L in blood, is a strong competitive inhibitor of the phosphorylation of fructose. The primary pathway of fructose metabolism, which is especially important in liver after a meal, involves the enzyme fructokinase. This enzyme is a very specific kinase that phosphorylates fructose at carbon-1 (rather than the 6-position like hexokinase) to give fructose-1-phosphate (Fru-1-P). The liver aldolase is called aldolase B, and it is different in substrate specificity from the muscle aldolase A, because aldolase B can cleave both Fru-1-P and Fru-1,6-P2, whereas aldolase A will only cleave Fru-1,6-P2. Thus, in liver, the products of fructose cleavage by aldolase B are dihydroxyacetone phosphate and glyceraldehyde (not glyceraldehyde-3-P).The glyceraldehyde must then be phosphorylated by triose kinase in order to be metabolized in glycolysis.
It should be noted that in the liver, fructose enters glycolysis at the level of the triose phosphate intermediates, rather than as Fru-6-P as in muscle. Thus, in liver, the ingested fructose is not subject to regulation by the usual control points for regulatory enzymes, hexokinase and phosphofructokinase. By circumventing these two rate-limiting steps, fructose provides a rapid source of energy in both aerobic and anaerobic cells. This is part of the rationale behind the development of high-fructose drinks such as Gatorade®. The significance of the fructokinase, as opposed to the hexokinase, pathway of fructose metabolism is indicated by the pathology of hereditary fructose intolerance (see Clinical Box below).
UDP-Glc is the precursor of a number of other sugars, such as glucuronic acid, xylose and galactose, which are required for proteoglycan and/or glycoprotein synthesis. The reactions that lead to the formation of these other sugars are outlined in Figures 27.6 and 27.8. A two-step oxidation by the enzyme UDP-Glc dehydrogenase leads to the formation of the activated form of glucuronic acid (UDP-GlcUA) (Fig. 27.8). This sugar nucleotide is the donor of glucuronic acid, both for the formation of proteoglycans (see Chapter 29) and for the detoxification and conjugation reactions that occur in the liver to remove bilirubin, drugs and xenobiotics (see Chapter 30). UDP-GlcUA is also the precursor of UDP-xylose, a pentose sugar nucleotide (Fig. 27.8). UDP-GlcUA undergoes a decarboxylation reaction that removes carbon-6 to form UDP-xylose, the activated form of xylose. Xylose is the linkage sugar between protein and glycan in proteoglycans (Figure 27.1B and Chapter 29). Xylose is also present on many plant glycoproteins, as part of their N-linked oligosaccharides, and is partly responsible for allergic reactions to peanut and nut proteins.
Guanosine diphospate-mannose (GDP-Man) is the donor substrate for most mannosyltransferases. As shown in Figure 27.6, it is produced from Man-6-P and is also the precursor to GDP-L-fucose (GDP-Fuc), which is the donor substrate for all fucosyltranferases. Fucose is a 6-deoxyhexose that is an important sugar participating in many recognition reactions in biological events, such as inflammatory response (Fig. 27.9). The conversion of GDP-Man to GDP-Fuc involves a complex series of oxidative and reductive steps, as well as epimerizations. Deficiency in the GDP-Fuc transporter that translocates GDP-Fuc from the cytosol into the Golgi lumen is associated with a defective inflammatory response and increased susceptibility to infection (Leukocyte adhesion deficiency II: LADII). Resulting depletion of GDP-Fuc in the Golgi lumen blocks the biosynthesis of the leukocyte recognition signal, sialyl Lewis-X structure (see Fig. 27.9).
Fig. 27.9 Carbohydrate-dependent cell–cell interactions in inflammation.
Sialyl Lewis-X, a tetrasaccharide antigen that forms part of the membrane structure of leukocytes, is recognized by a carbohydrate-binding protein, selectin (Sel), on the surface of endothelial cells. The sialyl Lewis-X-selectin interaction mediates the initial, weak binding that results in rolling of the leukocytes along the endothelial monolayer. It facilitates the firm adhesion mediated by the protein–protein interactions leading to extravasation.
Figure 27.10 shows the pathway of formation of GlcNAc, GalNAc and sialic acid. The initial reaction involves the transfer of an amino group from the amide nitrogen of glutamine to Fru-6-P to produce glucosamine-6-P (GlcN-6-P). An acetyl group is then transferred from acetyl-CoA to the amino group of the GlcN-6-P to form GlcNAc-6-P, which is converted to its activated form, UDP-GlcNAc, by sequential mutase and pyrophosphorylase reactions. In addition to its role as a GlcNAc donor, UDP-GlcNAc can also be epimerized to UDP-GalNAc. With few exceptions, all amino sugars in glycoconjugates are acetylated; thus they are neutral and do not contribute any ionic charge to the glycoconjugates.
UDP-GlcNAc is the precursor of N-acetylneuraminic acid (NeuAc), also referred to as sialic acid. Sialic acid, a 9-carbon N-acetylamino-ketodeoxyglyconic acid, is produced by the condensation of an amino sugar with phosphoenolpyruvate (see Fig. 27.10). Cytidine monophosphate neuraminic acid (CMP-NeuAc) is the activated form of sialic acid and is the sialic acid donor in biosynthetic reactions. CMP-sialic acid is the only nucleoside monophosphate sugar donor in glycoconjugate metabolism.
The pathway of assembly of the N-glycans begins with the transfer of two GlcNAc residues to a membrane-bound lipid, called dolichyl phosphate. Mannose and glucose residues are added to build a lipid-linked oligosaccharide intermediate, which is transferred “en bloc” to protein in the lumen of the endoplasmic reticulum (Fig. 27.11). Dolichols are long-chain polyisoprenol derivatives usually having about 120 carbon atoms (about 22–26 isoprene units) with a phosphate group at one end. They are synthesized in membranes using the same machinery that is used to make cholesterol, but in contrast to cholesterol, the dolichols remain as long straight chains. The length of the chain requires it to snake through the phospholipid bilayer, providing a strong anchor for the growing oligosaccharide chain.
Fig. 27.11 Synthesis of N-linked oligosaccharides in the endoplasmic reticulum.
Tunicamycin is an inhibitor of the GlcNAc phosphotransferase that the catalyzes the first step in glycan synthesis. GlcNAc, N-acetylglucosamine; Dol, dolichol; Man, mannose.
The first sugar to be added to dolichyl-P from UDP-GlcNAc is GlcNAc-1-P by a GlcNAc-1-P transferase to produce dolichyl-P-P-GlcNAc. A second GlcNAc is linked to the first GlcNAc followed by addition of 4–5 mannose residues from GDP-Man. Dolichyl-P-Man and dolichyl-P-Glc serve as glycosyl donors for the remaining mannoses and the three glucoses residues. Each of the sugars is transferred by a specific glycosyltransferase located in or on the endoplasmic reticulum membrane. The glucoses are not found on any of the N-linked oligosaccharides on glycoproteins, but are removed by glucosidases in the endoplasmic reticulum. Why are they added in the first place? They serve two very important functions. First of all, the presence of glucoses on the lipid-linked oligosaccharide has been shown to expedite the transfer of oligosaccharide from lipid to protein – the transferring enzyme (oligosaccharide transferase) has a preference for oligosaccharides that contain three glucoses and transfers those oligosaccharides to protein much faster. Secondly, the glucoses are important in directing protein folding in the endoplasmic reticulum (below).
In a series of trimming or pruning reactions (Fig. 27.12), all three glucoses are removed in the ER. The oligosaccharide may then remain as a high-mannose oligosaccharide or it may be further processed to a complex oligosaccharide structure. One or more mannoses may be removed in the ER, and the folded protein is then translocated to the Golgi apparatus where three or four additional mannoses may be removed to leave the core structure of three mannose and two GlcNAc residues. This core oligosaccharide is then elongated in the Golgi apparatus.
After the pruning reactions in the ER and proper folding, the protein with one or more Man8GlcNAc2 oligosaccharides is transported to the Golgi apparatus where other modification reactions occur (see Fig. 27.12). Usually in the cis-Golgi, several other mannoses are removed by α-mannosidases to leave the core structure. Also in the cis-Golgi, GlcNAc residues are added to each of the mannoses. Then the protein enters the trans-Golgi fraction where the remaining sugars of the trisaccharide sequences, i.e. galactose, sialic acid and fucose, can be added to make a variety of different complex chains. The final structure of the oligosaccharide chains depends on the glycosyltransferases complement of the cell.
In contrast to the biosynthesis of N-glycans, the synthesis of the O-glycans occurs only in the Golgi apparatus by the stepwise addition of sugars from their sugar nucleotide derivatives to the protein. No lipid intermediates are involved in O-glycan formation. Figure 27.13 outlines the stepwise sequence of reactions that are involved in the assembly of an oligosaccharide chain on salivary mucin. In this sequence, GalNAc is first transferred from UDP-GalNAc to serine or threonine residues on the protein by a GalNAc transferase in the Golgi apparatus. The resulting GalNAc-serine-protein serves as the acceptor for galactose and then sialic acid, transferred from their sugar nucleotides (UDP-Gal and CMP-sialic acid) by Golgi galactosyltransferases and sialyltransferases. Other Golgi glycosyltransferases are involved in the stepwise biosynthesis of more complex mucin oligosaccharides and in the synthesis of O-glycans in proteoglycans and collagens (see Chapter 29). There are more than 100 glycosyltransferases involved in glycoconjugate synthesis in a typical cell.
Resident proteins in the endoplasmic reticulum, known as chaperones, assist newly synthesized proteins to fold into their proper conformations. Two of these chaperones, calnexin and calreticulin, bind to unfolded glycoproteins by recognition of high-mannose oligosaccharides that still contain a single glucose remaining on their structure, after the glucosidases have removed two of the three glucoses. Not all of the glycoproteins synthesized in the cell require assistance in folding but for those that do, the rate of folding is greatly accelerated by the chaperones. Incorrectly folded or unfolded proteins do not undergo normal transport to the Golgi apparatus and if they do not fold properly, they precipitate in the endoplasmic reticulum and are subsequently degraded by the ubiquitin–proteasome system in the cytoplasm.
Lysosomes are subcellular organelles involved in the hydrolysis and turnover of many cellular organelles and proteins. They contain a variety of hydrolytic enzymes with acidic pH optima. Most of these lysosomal enzymes are N-linked glycoproteins that are synthesized and glycosylated in the endoplasmic reticulum and Golgi apparatus. The sorting of lysosomal enzymes occurs in the cis-Golgi. Proteins destined to be transported to the lysosomes contain a cluster of lysine residues that come together as a result of the protein folding into its proper conformation. As shown in Figure 27.14, this cluster of lysine residues serves as a docking site for an enzyme, GlcNAc-1-P transferase, that transfers a GlcNAc-1-P from UDP-GlcNAc to terminal mannose residues on the high-mannose chains of the lysosomal enzymes. A second enzyme, called an uncovering enzyme, then removes the GlcNAc, leaving the phosphate residues still attached to the mannoses on the high-mannose chains. The resulting Man-6-P residues on the high-mannose structure are now recognized by a Golgi protein called a Man-6-P receptor that directs the enzyme to the lysosomes. Thus, the Man-6-P residues are a targeting signal used by the cell to sort out those proteins that are destined to go to the lysosomes, and separate them from other proteins being synthesized in the Golgi apparatus. The Man-6-P receptor is also present on the cell surface, so that even extracellular enzymes that have this signal are endocytosed and transported to the lysosomes.
Because oligosaccharides are hydrophilic, they increase the solubility of proteins in the aqueous environment. Thus, most of the proteins that are secreted from cells are glycoproteins, including plasma proteins, excepting plasma albumin. These glycoproteins and enzymes generally also have high stability to heat, chemical denaturants, detergents, acids and bases. Enzymatic removal of the carbohydrate from many of these proteins greatly reduces their stability to stress. Indeed, when glycoproteins are synthesized in cells in the presence of glycosylation inhibitors, such as tunicamycin, which prevents the production and therefore the attachment of the N-glycan chain, many of these proteins become insoluble and form inclusion bodies in the cells as a result of incorrect folding and/or decreased hydrophilicity.
N-glycans on the mammalian cell surface play critical roles in cell–cell interactions and other recognition processes. One cell may contain on its cell surface a carbohydrate-recognizing protein, known as a lectin, that binds to a specific oligosaccharide structure on the surface of the complementary cell. The interaction between these two chemical interfaces mediates a specific chemical recognition between the cells, and such a process is a key factor in fertilization, inflammation, infection, development and differentiation.
Carbohydrate–protein interactions are also important in non-self interactions. Many pathogens use this mechanism to recognize their target cells. E. coli, for example, and some other Gram-negative enteric bacteria have short hair-like projections called pili on their surfaces. These pili have mannose-binding lectins at their tip that can recognize and bind to high-mannose oligosaccharides on the brush border membranes of intestinal epithelial cells. This interaction allows the bacteria to be retained in the intestine. The influenza virus uses a hemagglutinin protein on its surface to bind to sialic acid residues on glycoproteins and glycolipids on the surfaces of target cells.
Variations in mucin structure appear to have a role in the specificity of fertilization, cell differentiation, development of the immune response, and virus infectivity. Glycoprotein ZP3, which is present on the zona pellucida of the mouse egg, functions as a receptor for sperm during fertilization. Enzymatic removal of O-glycans from ZP3 results in loss of sperm receptor activity, whereas removal of the N-glycans has no effect on sperm binding. The isolated O-glycans obtained from ZP3 also have sperm-binding activity and inhibit sperm–egg interaction and fertilization in vitro. Differences between the O-glycan structures of cytotoxic lymphocytes and helper cells involved in the immune response are also believed to be important in mediating cellular interactions during the immune response.
Glycosylation is the major posttranslational modification of tissue proteins.
Glycosylation is a multicompartment activity, involving sugar interconversions and activation in the cytosolic compartment, building of complex structures on lipid intermediates in the ER, and glycosylation and pruning reactions in the ER and Golgi apparatus. The outcome is an amazingly diverse range of oligosaccharide structures on proteins.
Sugars on glycoconjugates can serve a number of different functions, including:
modification of the physical properties of the protein (solubility, stability and/or viscosity);
aiding in the folding of the protein;
participating in the targeting of the protein to its proper location in the cell;
mediating cell–protein and cell–cell recognition during fertilization, development, inflammation and other processes.
A number of human diseases involve defects in sugar metabolism, including galactosemia and hereditary fructose intolerance, leukocyte adhesion deficiency, congenital disorders of glycosylation (CDG) and lysosomal storage diseases.
Asano, N. Sugar-mimicking glycosidase inhibitors: bioactivity and application. Cell Mol Life Sci. 2009; 66:1479–1492.
Etzioni, A. Genetic etiologies of leukocyte adhesion defects. Curr Opin Immunol. 2009; 21:481–486.
Kato, K, Kamiya, Y. Structural views of glycoprotein-fate determination in cells. Glycobiology. 2007; 17:1031–1044.
Martin, PT. Congenital muscular dystrophies involving the O-mannose pathway. Curr Mol Med. 2007; 7:417–425.
Schachter, H, Freeze, HH. Glycosylation diseases: Quo vadis? Biochim Biophys Acta. 2009; 1792:925–930.
Taylor, ME, Drickmer, K. Introduction to glycobiology, ed 2. New York: Oxford University Press; 2006.
Varki, A, Cummings, RD, Esko, JD, et al. Essentials of glycobiology, ed 2. New York: Cold Spring Harbor Press; 2009.
Zeidan, Q, Hart, GW. The intersections between O-GlcNAcylation and phosphorylation: implications for multiple signaling pathways. J Cell Sci. 2010; 123:13–22.
Congenital disorders of glycosylation. www.familyvillage.wisc.edu/lib_cdgs.htm.
Galactosemia. http://galactosemia.org/.
Glycoprotein biosynthesis. http://www.ccrc.uga.edu/~lwang/bcmb8020/N-glycans-A.pdf.
Glycoprotein topics. www.glycoforum.gr.jp/science/word/glycoprotein/GP_E.html.
Hereditary fructose intolerance. www.bu.edu/aldolase/HFI/.
I-cell disease. http://emedicine.medscape.com/article/945460-overview.
Plant lectins. www.ansci.cornell.edu/plants/toxicagents/lectins.html.