2. The Sumoylation Pathway
The sumoylation pathway is quite similar to the ubiquitination pathway, but there is its own set of enzymes for modification by SUMO (
Figure 1). This involves several steps: (i) initial maturation of the SUMO precursor, by proteolysis of several C-terminal amino acids to expose a Gly-Gly (GG) motif; (ii) activation of mature SUMO by the heterodimeric SAE1/UBA2 E1 enzyme, through ATP hydrolysis; and (iii) transfer to UBC9, the unique conjugating E2 enzyme in the sumoylation system. Both activation and transfer to UBC9 involves the formation of a thioester bond between the C-terminus of mature SUMO and a Cys (C) residue in the catalytic sites of E1 or E2, (iv) transfer from UBC9 to targets, frequently assisted by a SUMO ligase or E3. Besides, specific SUMO proteases are in charge of SUMO maturation and scission from targets.
Figure 1. The sumoylation pathway. E1 enzyme (SAE1-UBA2 heterodimer), by using ATP, activates mature SUMO and transfers it to the E2 enzyme UBC9, which ultimately transfer SUMO to targets, either directly, or more frequently, assisted by an E3 SUMO ligase. SUMO maturation and scission from targets are performed by specific SUMO proteases. Maturation involves the exposition of two tandem Gly (G) residues at the C-terminus of mature SUMO. Activation and transfer to UBC9 involve the formation of thioester bonds between C-terminus of mature SUMO and specific Cys (C) residues at catalytic sites of E1 and E2, respectively. Defined Lys (K) residues at targets are the final SUMO acceptors for covalent attachment. Proteases and ligases are linked to target selection-associated steps in the sumoylation pathway.
2.1. SUMO Proteases
Proteases involved in SUMO maturation and recycling are SUMO specific, and the most studied are those of the SENP family (
Figure 1 and
Figure 2 and
Table 1). This family includes SENP1–3 and SENP5–7
[21]. Besides, additional proteases also with specific activity on SUMO have been described more recently. They include a new family of desumoylating isopeptidases (DeSI)
[22], which comprises DESI1 and DESI2, and USPL1
[23] and HINT1
[24] (
Table 1). All SENP proteins display a desumoylation catalytic domain at the C-terminus, which appears split in the case of SENP6 and SENP7 (
Figure 2A). Different sequences in the N-terminal region of SENPs have been related to cellular localization, and, in the case of SENP7, to interaction with the heterochromatin protein 1 (HP1) through two tandem PxVxL motifs
[25]. Moreover, a SENP7 splice variant, lacking one of these motifs, has been reported to predict for good prognosis in breast cancer patients
[26]. Additional variants have been described for other SENPs (
https://www.ncbi.nlm.nih.gov/gene/?term=SENP (accessed on 14 July 2022)). SENP1 and SENP2 have been indicated to be able to maturate SUMO1–3, although SENP1 shows preference for SUMO1 and SENP2 for SUMO2
[21] (
Figure 2B). SENP5 has been also shown to efficiently maturate SUMO2. Regarding SUMO recycling, SENP1 and SENP2 have been demonstrated to efficiently detach the three SUMO paralogs from targets, while SENP3 and SENP5 are more selective on SUMO2/3 and SENP6 and SENP7 display poly SUMO2/3 chains editing activity
[21] (
Figure 2B). In the cell, SENP1 and SENP2 have been associated with PML NBs in interphase (as SENP6) and with the nuclear pore complex (NPC); SENP3 and SENP5 with both the nucleolus and the mitochondria; and SENP6, SENP7 and SENP3 with chromatin
[27] (
Figure 2C).
Figure 2. The SENP family. (A) Schematic representation of human SENP proteins with the C-terminal catalytic domain and the key His (H) and Cys (C) residues. HP1 interacting domains are also shown on SENP7, for which two isoforms have been described: large (SENP7L) and short (SENP7S), lacking the latter of one of the HP1 interacting motifs. The number of amino acids is also shown for each protein. (B) Endopeptidase (maturation) and isopeptidase (deconjugation and editing) activities for each SENP protein are shown. Strength of activity in relation to paralog preference is indicated for maturation activity. (C) Localization of SENP proteins to different cellular compartments is schematically represented. NPC, nuclear pore complex.
2.2. SUMO Ligases
A key difference between SUMO proteases and ligases is that most SUMO ligases display additional functions independent of the SUMO ligase activity. This complicates in many cases the functional analysis in relation to the role of sumoylation. PIAS proteins (
Figure 3), probably the most studied SUMO ligases, were among the first identified proteins displaying SUMO ligase activity, together with the NPC-associated protein RanBP2 and the Polycomb Repressive Complex 2 protein CBX4 (or PC2)
[28]. Among these, PIASs and RanBP2 have been well characterized at the biochemical level (reviewed in
[29]). The PIAS family comprises members 1–4. Two variants due to alternative splice have been classically described for PIAS2 (
Figure 3A), although additional variants have been described for the different PIAS coding genes (
https://www.ncbi.nlm.nih.gov/gene/?term=PIAS (accessed on 14 July 2022)). Structurally, PIAS proteins present an N-terminal SAP domain, involved in DNA binding as well as in interaction with transcriptional co-regulators, a PINIT region involved in nuclear localization, a RING-like SP-RING domain, an acidic stretch, and a C-terminal region rich in Ser/Thr (S/T)
[30]. Different SIMs, relevant for PIAS function, have been described for the different members, both inside and outside the acidic stretch (reviewed in
[31]). Most ligases are not required for
in vitro modification of targets, which seems to depend exclusively on the presence of mature SUMO, E1 and E2. However, ligase requirement has been reported
in vivo for a great variety of physiological processes. It was initially indicated that ligase activity in many cases relies in the ability of ligases to recruit at the same time, through different domains, SUMO-loaded UBC9 and the SUMO target to facilitate SUMO transfer, being the SP-RING domain the responsible for UBC9 recruitment in the case of PIAS proteins (
Figure 3B). However, for RanBP2 ligase activity, target interaction seems to be dispensable, as minimal ligase appears to only require the simultaneous binding of SUMO1 and UBC9 to RanBP2 to optimally position the thioester bond for efficient transfer to its main target, RanGAP1
[32][33][32,33] (
Figure 3B). Initial
in vitro studies using the yeast PIAS1 ortholog Siz1 mapped the minimal E3 ligase domain to the region comprising the PINIT and the SP-RING motifs
[34] (
Figure 3B). Interestingly, another RING-related motif, the PHD domain, has been also implicated in UBC9 binding and sumoylation; for instance, in KAP1 and AtSIZ1
[35][36][35,36]. First studies on PIAS proteins also showed a significant level of promiscuity in target selection, although subsequent
in vivo approaches have demonstrated more specific effects (reviewed in
[18]). In contrast to ubiquitination that requires hundreds of E3 ligases for specific target selection, only a few dozens of ligases have been described for SUMO. Besides PIAS, RanBP2 and CBX4, enhanced sumoylation of specific targets has been associated with additional proteins, including: TOPORS, RSUME, MUL1, RHES, some proteins of the tripartite motif family (TRIM), ARF, SF2, class IIa histone deacetylases (HDACs), the PIAS-related proteins NSMCE2 and ZMIZ1-2, SLX4, KROX20, RNF212, UHRF2, TRAF7, ZBED1, MDM2 and ZNF451 (
Table 1). Among TRIM proteins, SUMO ligase activity has been attributed to TRIM1, 11, 19, 22, 27, 28, 32, 33, 36, 38, 39 and L2 (
Table 1). Interestingly, in some cases, as for TRIM27, and in relation to TP53, ligase activity has been reported for both SUMO and ubiquitin
[37]. Additional proteins also show a dual function as E3 ubiquitin and SUMO ligases, such as UHRF2, TOPORS, TRAF7 or MDM2, and in some cases, distinct domains contribute to one or another activity
[31]. Nevertheless, for some proteins dual function is controversial. For instance, TRIM25 has been described to stimulate TP53 sumoylation, but mechanistically this has been explained on the basis of TRIM25 requirement for recruitment of the RanBP2/G3BP2 complex, which ultimately mediates TP53 sumoylation
[38]. Of note, ZNF451 has also been well characterized biochemically
[39], and it has been proposed to display elongase E4 activity besides E3 ligase activity, due to its ability to assemble poly-SUMO2/3 chains through the action of tandem SIMs at the N-terminus
[40]. Another open reading frame (ORF): KIAA 1586, close to ZNF451, encodes a protein with similar N-terminal tandem SIMs, which has also proved to display elongase activity. Known SUMO ligases are quite divergent phylogenetically and structurally and, as indicated, there are not defined families of proteins exclusively devoted to this function, which seems to appear in a high variety of proteins with additional functions. Thus, it seems likely that well-known proteins will reveal unexpected SUMO ligase activity in the near future.
Table 1.
SUMO proteases and ligases.
Figure 3. The PIAS family. (A) Schematic representation of human PIAS proteins. The SP-RING and SAP domains, together with the PINIT region, the acidic stretch (AC) and the C-terminal Ser/Thr (S/T) rich region, are shown. Two alternative isoforms, α and β, have been classically described for PIAS2. The number of amino acids is also shown for each protein. (B) Proposed ligase mechanisms using PIAS1 and RanBP2 as models. In the case of PIAS1, simultaneous binding of target and SUMO-loaded UBC9 through different domains may facilitate SUMO transfer to the target. The SP-RING is important for UBC9 binding but may also mediate other protein interactions for additional functions. Other domains, like the SAP domain, may recruit SUMO targets, but it is also involved in additional functions. The minimal protein region required for ligase activity is indicated with brackets. In the case of RanBP2, interaction of SUMO and UBC9 with a SIM and the 50-amino acid internal repeat (IR) 1, respectively, should position the thioester bond in the appropriate conformation for efficient transfer of SUMO to the target, which is not required to directly interact with the ligase. RB, Ran binding domain.
Pathway components can be influenced by the physiological environment, which may have an impact in their activities. In this sense, several SENPs have been shown to be sensible to hypoxic inactivation
[50]. In addition, SENP3 has been defined as a redox sensor, and, for instance, reactive oxygen species (ROS) may modulate E1 activity (reviewed in
[51]). Besides, a number of PTMs, including phosphorylation, acetylation, ubiquitination and sumoylation itself, have been shown to affect several pathway components. Otherwise, a patent crosstalk between sumoylation and other PTMs like ubiquitination, phosphorylation and methylation operates on target modification
[4][28][52][4,28,52]. As indicated, vertebrate viability depends on sumoylation, and acting on E1 or E2, which will globally affect sumoylation, should certainly have general and strong effects. However, the function of regulating modification of specific targets mostly relies on SUMO proteases and ligases. Target modification output depends on localization, expression, activity and paralog preference of these enzymes. Thus, their unbalanced expression, localization or function, directly impact on sumoylation of defined proteins, which in many cases is in the basis of physiological alterations leading to tumorigenesis. Depending on the specific association of particular ligases and proteases with precise targets, and, thereby, on the nature of these and the affected tissue, alterations will lead to the appearance of a great variety of cancer types.