2. Dynamic O-GlcNAcylation Cycle and Hexosamine Biosynthesis Pathway
2.1. OGT and OGA Are the Only Antagonistic Enzymes for Precisely Regulating the O-GlcNAcylation Cycle in Space-Time Specificity
OGT has been found to exist in many organisms with high homology. Here, three subtypes of human OGT are taken as an example to demonstrate its gene and protein structure characteristics. OGT has only three isoforms: ncOGT, mimOGT and sOGT
[21]. The genomic structure of human OGT is shown in
Figure 3A.
Ogt is a single gene residing on the chromosome X (Xq13.1)
[22]. X inactivation regulates
Ogt transcription in female mammals, and Xq13.1 is a region involved in Parkinson’s pathology
[23].
Ogt contains 23 exons with alternatively spliced variants
[24]. Exon 1 and promoter 1 produce ncOGT. Promoter 2 and the alternative start codon in exon 5 produce mOGT. The alternative start codon in exon 10 generates sOGT. All subtypes of OGT are mainly composed of the amino-terminal TPR domain, the same central linker domain and carboxyl-terminal catalytic domain
[25]. All subtypes of OGT are shown in
Figure 3B. The three subtypes of OGT contain different numbers of TPRs: 13.5 TPRs in nOGT, 9.5 TPRs in mOGT and 2.5 TPRs in sOGT
[26]. Each TPR unit is comprised of 34 amino acid motifs. These TRP units are α-helical and clustered to form a repeating antiparallel helix-turn-helix super spiral
[27]. TPRs acts like a “gatekeeper” to identify and interact with the target substrates by contacting their side chains
[28]. The mutation of conserved aspartate residues in TPRs has resulted in significant changes in the selectivity, preference and
O-GlcNAcylated rate of OGT for target substrates
[29]. The influence of TPRs on substrate preference may also be the potential reason for their different cellular localization: mOGT exists in the mitochondria, while ncOGT and sOGT exist in the cytoplasm and nucleus, respectively
[30]. In addition, TPRs also has a certain effect on the activity of OGT, and this effect can be exerted only by partial TPRs
[31]. TPRs deleting the front 3 or 6 units do not affect the glycosylation of substrates by OGT; however, TPRs deleting the front 9 or 11 units will deactivate OGT
[26][32]. OGT belongs to the GT41 gene family of GT-B glycosyltransferase superfamily
[33]. Therefore, the carboxyl-terminal catalytic region in OGT has high homology, structural characteristics and catalytic activity of GT-B glycosyltransferases superfamily. Consistent with the structural characteristics of GT-B, OGT contains two similar Rossman folds separated by a deep fissure, which are called CDⅠ and CD Ⅱ, respectively, forming the catalytic core of OGT without the assistance of divalent metal ions
[34]. CD Ⅰ mainly consists of a UDP identification pocket and some catalytic groups composed of acidic amino acids that stabilize the pyrophosphate bond through the synergistic action of divalent cations
[35]. CD Ⅱ is a lectin-like domain that can be used to recognize and bind the glycoside of UDP-GlcNAc due to its strong affinity for carbohydrates caused by its own abundant trimer structure
[36]. However, an Int-D exists in OGT and packs against the carboxyl-terminal catalytic lobe, which is distinct from the structural characteristics of GT-B superfamily
[37]. It is preliminarily inferred that Int-D may interact with negatively charged membranes or nucleic acids based on the structural characteristics that contain an unusually large number of surface-exposed basic residues
[25]. Meanwhile, PPO is located at the carboxyl terminal of OGT and strongly interacts with PIP3, which promotes the recruitment of OGT to the membrane under insulin induction for catalyzing the dynamic
O-GlcNAcylation of the insulin signaling pathway
[38]. Targeting special localization of this interaction led to the alteration in the phosphorylation of pivotal insulin signal molecules and weakening of the insulin signal transduction
[39]. This suggests the indispensable role of OGT in diabetic pathology. In addition, there is a central flexible linker domain composed of about 120 amino acids
[40]. The linker domain seems to exist only in metazoans without high conservatism and its function is unknown, which is the root cause of the difficulty in crystallization of OGT in higher metazoans
[41]. The activity and substrate recognition of OGT are also regulated by phosphorylation
[42]. OGT is phosphorylated by CHK1 on Ser
20, which changes its stabilization and required for cytokinesis
[43]. OGT is also phosphorylated by CaMKII on Ser
20, which increases its activity
[44]. In addition, OGT is also
O-GlcNAcylated by itself in Ser
3/4 and Thr
1045, and their role is unknown
[9]. There are multiple acetylation sites in OGT
[45]. The presence of these acetylation sites in the catalytic domain suggests that they may modulate OGT activity
[14]. The advanced structure of OGT and the sites modified by various PTMs are shown in
Figure 3C.
Figure 3. Genomic and proteomic structures of various subtypes of human OGT and OGA. (
A) Human
Ogt gene mapping and structure. (
B) Primary protein structure of three subforms of human OGT. Subtype. (
C) Advanced structures of human OGT in 3D and its PTMs. The advanced structure of OGA is displayed in cartoon and surface form with 5M7R in the protein data bank by PyMOL Molecular Graphics System, v2.5.2 (Schrödinger, LLC, New York, NY, USA). The various post-translational modification sites of OGT are present. These predicted modification sites are derived from the PhosphoSitePlus database (
https://www.phosphosite.org, (accessed on 10 March 2022). (
D) Human
Oga gene mapping and structure. (
E) Primary protein structure of three subforms of human OGA. Subtype. (
F) Advanced structures of human OGA in 3D and its PTMs.
Oga is highly conserved in eukaryotic species, especially in mammals, but absent in prokaryotes and yeast
[46].
Oga is mapped to chromosome 10 (10q24.32) as a single gene copy
[47]. It is selectively spliced to produce ncOGA and sOGA, which are different at different carboxyl terminals
[30]. Gene and protein structures of OGA are shown in
Figure 3D,E. Cell fractionation analysis showed that ncOGA was mainly located in the cytoplasm, while the sOGA subtype existed in the nucleus
[48]. ncOGA contains the amino-terminal catalytic domain and the central stalk domain and the carboxyl-terminal pseudo-HAT domain linked through two highly disordered (or low complexity) regions
[49]. The amino-terminal catalytic domain of OGA is the GlcNAc hydrolysis domain with sequence homology to GH84
[50]. The stalk domain is a hinged region containing multiple alpha helices
[51]. It is not conserved between species, which makes it a flexible region that facilitates the folding of the entire protein
[52]. Although it has been reported that the HAT-like domain of ncOGA in mice has histone acetyltransferase activity in vitro, it has not been supported by more studies in vivo for lacking the critical residues for the binding of acetyl-coenzyme A
[53]. However, the HAT-like domain is evolutionarily conserved, indicating that the pseudo-HAT domain may play an important role in the deglycosylation-associated functions
[54]. sOGA lacks the HAT-like domain but contains 15 unique amino acid residues at the carboxyl terminal
[55]. Interestingly, it has been reported that sOGA has higher hydrolytic activity in vitro. OGA preferentially removes GlcNAc from some sites, indicating that it has an equal cooperative relationship with OGT in regulating the replacement of
O-GlcNAcylation
[56]. The active form of OGA appears as homodimer
[57]. OGA forms a homodimer in the form of arm to arm, in which the glycoside hydrolase domain of each monomer is covered by the stalk domain of another monomer, thus forming a potential substrate-binding cleft comprising conserved hydrophobic residues
[40]. The glycopeptide of the
O-GlcNAcylated protein is tightly bound in the substrate-binding cleft through the abundant GlcNAc contacts of the catalytic pocket in OGA, which involves the peptide side chain and the backbone interactions with cleft surface residues
[58]. Meanwhile, OGA recognizes the specific characteristics of substrate peptides and hydrolyzes GlcNAc from a wide range of peptide sequences
[59]. In addition, some specific residues on OGA contribute to its interaction with different peptide substrates, which means the differential regulation of
O-GlcNAcylation on various proteins
[60]. OGA is also affected by PTMs such as phosphorylation and
O-GlcNAcylation
[61]. There are abundant phosphorylation and ubiquitination sites in the domains of glycoside hydrolase and the HAT-like domain
[9], but the effect of these modifications at corresponding sites on OGA activity remains to be further determined. The advanced structure of OGA and the sites modified by various PTMs are shown in
Figure 3F. The
O-GlcNAcylation of OGA at Ser
405 is located in the central highly disordered region, suggesting a role in the regulation of OGA-OGT interactions because this is the binding region of OGA-OGT
[62]. OGA is also SUMOylated at Lys
358 and acetylated at Lys
599, respectively
[9][63].
2.2. Nutrient Availability Drives Global O-GlcNAcylation through HBP
Extracellular glucose is transported into the intracellular via GLUT-4
[64]. Only 2~3% of the intracellular glucose enters the HBP, while most of the remaining intracellular glucose enters the glycolysis, pentose phosphate pathway (PPP), glycogen synthesis and even polyol pathways, respectively
[65]. Therefore, the
O-GlcNAcylation cycle is strictly controlled by the flow of glucose through the HBP
[66]. Initially, in a study, intracellular glucose was phosphorylated to Glc-6-P by HK, and then Glc-6-P was further isomerized to Fru-6-P by GPI
[67]. Subsequently, 3~5% of Fru-6-P was added with an amino group from glutamine to synthesize GlcN-6-P and glutamate by GFAT, while the other 95% of Fru-6-P was used for glycolysis
[68]. The enzymatic reaction is the rate-limiting step of HBP, and GFAT is also the key rate-limiting enzyme of HBP
[69]. The activity of GFAT is still regulated by multiple pathways
[70]. Firstly, the activity of GFAT is regulated by substrate availability, which is positively activated by the concentration of glucose and glutamine, and the negative feedback is inhibited by the concentration of UDP-GlcNAc and GlcN-6-P
[71]. The activity of GFAT is also closely related to some PTMs. The Ser
243 of GFAT is phosphorylated and its activity is reduced by AMPK, mTORC2 and CaMKII, and a similar effect is also caused by 2-Deoxy-D-glucose
[72][73]. PKA also promotes the phosphorylation of GFAT at Ser
205/235 [74]. Succinylation of GFAT at Lys
529, acetylation of GFAT at Lys
114, 547, 650 and multiple ubiquitination Lys sites of GFAT are predicted by PhosphoSitePlus
® v6.6.0.2 (
https://www.phosphosite.org, accessed on 10 March 2022). Meanwhile, it has been reported that specificity protein 1, activating transcription factor 4 and X-box-binding protein 1 regulate GFAT at the transcriptional level
[75][76]. Glutamine is necessary for this enzymatic reaction, but this restriction can be bypassed by glucosamine as an extended supplement
[77]. Therefore, incubating cells with glucosamine or high concentration glucose or glutamine can bypass the rate-limiting step catalyzed by GFAT, thereby increasing global
O-GlcNAcylation. GNA converts GlcNAc-6-P using acetyl-CoA
[78]. Then, GlcNAc-6-P is catalytically translocated to GlcNAc-1-P by AGM
[79]. It is worth noting that the only difference of HBP in prokaryotes is that GlcN-6-P is isomerized to GlcN-1-P and then GlcN-1-P is acetylated to form GlcNAc-1-P
[80]. The HBP process in eukaryotes is as shown above. Finally, UTP is then utilized by UAP to convert GlcNAc-1-P into UDP-GlcNAc and release iPPi
[81]. The HBP process involves the participation of glucose, glutamine, uridine, acetyl-CoA and ATP
[82]. Therefore, UDP-GlcNAc, as the end-product of HBP, integrates the metabolisms of carbohydrates, amino acids, fats and nucleotides
[83]. UDP-GlcNAc is a unique donor of
O-GlcNAcylation, which provides GlcNAc, which is necessary and irreplaceable for
O-GlcNAcylation
[84]. GlcNAc provided by UDP-GlcNAc is used and transferred by OGT to the oxygen atom of the hydroxyl group of serine or threonine residues of the target protein
[85]. On the contrary, the GlcNAc moiety is removed from
O-GlcNAcylated proteins by OGA
[86]. These hydrolyzed GlcNAc or other free GlcNAc obtained by lysosomal or nutrient degradation are converted to GlcNAc-6-P through N-Acetylglucosamine kinase (NAGK) and then used again for the synthesis of UDP-GlcNAc
[87]. Therefore, GlcNAc can also bypass the rate-limiting step of HBP and GFAT, which is also effective for salvage pathways such as glucosamine and glutamine
[88]. In addition, UDP-GlcNAc is also used as a substrate for the synthesis of proteoglycans, hyaluronic acid, glycolipids, GPI anchor,
N-glycosylation and other
O-glycosylation
[89]. The activated UDP-GlcNAc is utilized by concentration-sensitive enzymes in the nucleus, cytoplasm and membrane to glycosylate the substrate or generate glucose conjugates
[89]. UDP-GlcNAc is actively transported by nucleotide sugar transporters to cellular organelles, such as the ER and Golgi apparatus
[90]. The differences in UDP-GlcNA permeability and relative cell volume of these organelles complicate the estimation of the cytoplasmic and nuclear concentrations of UDP-GlcNAc
[91]. The relative abundance of
O-GlcNAcylation is roughly negatively correlated with the more complex glycans
[92]. These characteristics make UDP-GlcNAc and its derivatives extremely sensitive to the variations in cellular nutrients, so that the dynamic
O-GlcNAcylaion can be used as a reporter of the functional status of multiple pathways and regarded as a metabolic sensor
[20]. Meanwhile, the mutual conversion and complex relationship of the intermediate products in the HBP, polyol pathway, PPP, glycogen, glycolysis and TCA cycle intermediates greatly enlarge the nutritional sensitivity of
O-GlcNAcylation
[93] and also suggest the potential mechanism of
O-GlcNAcylaion’s negative feedback regulation of these glucose metabolism branches. Indeed,
O-GlcNAcylation is involved in multiple modes of metabolic regulation. Almost all the enzymes involved in glycolysis were identified to have been modified by
O-GlcNAcylation
[94]. The
O-GlcNAcylated enzymes exist in every step of glycolysis, including GLUT4, HK, GPI, PFK, FBA, GAPDH, PGK, PDM, ENO, PK and PDC
[95][96][97][98]. Glycogen synthesis is also regulated by
O-GlcNAcylated GSK3β, and PPP activity is affected by
O-GlcNAcylated G6PD
[99][100]. In addition, increased HBP flux and
O-GlcNAcylation also promotes fatty acid oxidation in the heart and adipose tissue
[101]. The
O-GlcNAylation of several transcription factors, such as PGC1α, FoxO3, NF-κB and CREB, also indirectly participates in transcriptional regulation of metabolism
[102][103][104]. Although only briefly shown in
Figure 4, it is worth noting that almost all enzymes in the TCA cycle are also modified by
O-GlcNAcylation, such as AH, IDH, KGD, SL, SDH, MDH and the several subunits of respiratory chain complexes
[105][106]. CS and FH may be potentially
O-GlcNAcylatied, but there is still a lack of supporting evidence
[94].
Figure 4. Nutrient availability drives the dynamic cycle of protein O-GlcNAcylation via the hexosamine biosynthesis pathway. The GlcNAc provided by UDP-GlcNAc is necessary and unique for O-GlcNAcylation, and the only source of UDP-GlcNAc is HBP. Changes in nutritional availability, such as carbohydrates, lipids, amino acids, nucleotides and ATP, fluctuate HBP flux. Therefore, UDP-GlcNAc and HBP link carbohydrate metabolism, lipid metabolism, amino acid metabolism and nucleotide metabolism. These physiological characteristics make OGT extremely sensitive to nutrient fluctuations.
3. O-GlcNAcylation, Energy Metabolism and Insulin Sensitivity in Skeletal Muscle
Skeletal muscle is a repository of nutrients, enabling it to serve as the consumer and producer of energy during exercise, stress and starvation
[107]. The energy requirements of skeletal muscle are enormous, and its energy expenditure increases 300 times from the base state to the full contraction state
[108][109]. This directly affects glucose homeostasis in skeletal muscle, where CaMKII or GLUT4 is activated to increase glucose uptake
[110]. In fact, skeletal muscle processes more than 80% of insulin to stimulate glucose uptake and is considered as one of the most critical insulin-sensitive tissues
[111]. The metabolic flexibility in skeletal muscle ensures an adequate supply of energy for its work
[112].
O-GlcNAcylation as a cellular nutrient sensor plays a key role in glucose metabolism during this physiological process
[113]. In addition, skeletal muscle is the main target organ of insulin, and nutrient-driven
O-GlcNAcylation is a key regulator of insulin signaling in skeletal muscle
[114].
O-GlcNAcylation is considered to be critical in the dysregulation of the insulin signaling cascade and the molecular mechanism of insulin resistance
[115].
4. O-GlcNAcylation Is an Emerging Mediator of Contractile and Structural Properties in Skeletal Muscle
4.1. O-GlcNAcylation Is an Essential Regulator of Contractile Properties in Skeletal Muscle
To date, many
O-GlcNAcylated contractile proteins and contractile-related regulatory proteins in sarcomere have been identified
[116]. These key contractile proteins include actin, myosin, MLC and MHC, tropomyosin and troponin, etc.
[117][118][119]. In view of this, the physiological role of
O-GlcNAcylation on skeletal muscle contractile activity has been concerned. Many data emphasize that
O-GlcNAcylation mediates calcium activation properties to regulate the contractile activity of skeletal muscle. Increased
O-GlcNAcylation of MHC, α-tropomyosin and α-sarcomeric actin in myocardium of diabetic mice resulted in the decrease of sarcomere calcium sensitivity
[120]. Reversible reductions in calcium affinity and sensitivity of muscular fibers occur when exposed to GlcNAc, and
O-GlcNAcylation of some critical contractile proteins increased, such as MHC, MLC and actin
[121]. Moreover, similar results appeared in skinned fibers and cardiac trabeculae. Phosphorylation of Tn I at the Ser
23 and Ser
24 by PKA improves the calcium sensitivity of cardiomyocytes and alter of myofilament properties
[122]. Interestingly, Tn I, Tn T and Tn C of the troponin complex are
O-GlcNAcylated in skeletal muscle and myocardial tissue
[123]. The
O-GlcNAcylation levels of Tn I in fiber cells exposed to GlcNAc or OGA inhibitors increased, and the calcium activation properties reduced
[124]. However, this treatment did not affect the phosphorylation levels of Tn I at Ser
23/24 [125]. One of the possible mechanisms is that the GlcNAc part destroys the protein–protein interaction
[119].
O-GlcNAcyation of MLC2 in myocardial tissue occurred at the site of Ser
15. It is worth noting that the only phosphorylation site overlaps with the only
O-GlcNAcyation site on MLC2. This suggests that the close potential interaction between
O-GlcNAcyation and phosphorylation of MLC2 is in the calcium activation properties of sarcomere. This interaction has been proved to be mutually exclusive, and the dynamics of this interaction vary according to the pattern of skeletal muscle activity
[126]. The precise regulation of phosphorylation and
O-GlcNAcyation on MLC2 involves a multienzyme cluster. This multienzyme cluster contains MLCK/MYPT2/PP1 and OGT/OGA, which are involved in the phosphorylation and
O-GlcNAcyation of MLC2
[127]. This multienzyme cluster is preferentially located at the Z disk of the sarcomere and responds to the physiological signals of skeletal muscle to strengthen the interaction between the enzymes contained in itself
[128]. For example, the partial recombination of this multienzyme clusters in skeletal muscle dysfunction, such as increased co-localization between MLCK and OGA. The location of OGT and OGA in the diabetic heart is redistributed in the sarcomere
[129]. This is due to enhanced OGA activity and increased the interaction with α-actin, tropomyosin and MLC1, while OGT was the opposite. This results in the removal of abnormal
O-GlcNAcylation to restore myofilaments to Ca
2+ response
[130]. Nevertheless, the exact effect and mechanism of
O-GlcNAcylation on MLC2 need to be further confirmed. One credible assumption is that
O-GlcNAcylation causes steric hindration between MLC2 and MHC due to its stokes radius being many times larger than phosphate
[15].
In general,
O-GlcNAcylation is considered as a new mechanism to regulate the contractile properties of skeletal muscle by modifying critical contractile proteins to regulate the interaction between other proteins or with itself, and phosphorylation to mediate calcium activation. In addition, the regulatory effect of
O-GlcNAcylation on the contractile activity of skeletal muscle is also related to its effect on sarcomere structure
[131], which isdiscussed in the next section.
4.2. O-GlcNAcylation Is an Emerging Maintainer of the Structural Properties in Skeletal Muscle
O-GlcNAcylation also regulates the polymerization of desmin, just as it regulates the polymerization of tubulin and cytokeratin filaments 8/18
[132][133][134]. α/β-crystallin serve as molecular chaperones to facilitate the localization, aggregation, and assembly of desmin
[135].
O-GlcNAcylation of α/β-crystallin at Thr
170 and Thr
162 regulates its localization and its interaction with desmin, respectively
[136][137]. The multiple
O-GlcNAcylation sites of mouse titin are located on the kelch-12 domain, and the absence of titin leads to the change of muscle structure and the decrease of muscle performance
[138][139]. These
O-GlcNAcylation sites are located in the key regions of sarcomere assembly and myosin polymerization and its interaction with MyBP-C and MHC
[140][141]. Interestingly,
O-GlcNAcylation sites of MHC have also been found. These sites are adjacent to its domain of polymerization and interaction with myosin and titin
[142].
O-GlcNAcylation of MHC is located at Ser
1708, and it is further considered to be involved in Laing early-onset distal myopathy due to its proximity to mutant Leu
1706 residue. Meanwhile,
O-GlcNAcylation sites in the PxxP domain of BAG3 and the plakin domain repeat B5 of plectin have been identified, which are related to the interaction with SH3-containing protein and intermediate filament proteins, respectively
[138]. This evidence suggests that
O-GlcNAcylation occurs in specific domains of certain structural proteins to regulate interactions other proteins for maintaining sarcomere structure and function. This view was confirmed by a recent study that
O-GlcNAcylated milton binds to FHL2 to anchor mitochondria to F-actin
[143][144]. The main targets and pathways of
O-GlcNAcylation in skeletal muscle physiology are shown in
Figure 5.
Figure 5. The major targets and pathways shown to be altered by O-GlcNAcylation in in the skeletal muscle physiology. Thousands of O-GlcNAcylated proteins have been identified in skeletal muscle cells. These O-GlcNAcylated proteins are classified into contractile proteins, sarcolemma proteins, structural proteins and cytoskeletal proteins, as well as mitochondrial proteins, metabolic enzymes, transcription factors and signal proteins. Therefore, the effects of O-GlcNAcylation on various physiological processes of skeletal muscle may be realized from the following four aspects: (1) its regulation of carbohydrate metabolism with sensing nutritional availability; (2) its maintenance of structural protein synthesis/degradation balance; (3) its improvement of sarcomere contractile activity by modulating the calcium activation properties; (4) its promotion of adaptation and protection under exercise and certain adverse circumstances.