The histidine–aspartate (HD)-domain protein superfamily contains metalloproteins that share common structural features but catalyze vastly different reactions ranging from oxygenation to hydrolysis. This chemical diversity is afforded by (i) their ability to coordinate most biologically relevant transition metals in mono-, di-, and trinuclear configurations, (ii) sequence insertions or the addition of supernumerary ligands to their active sites, (iii) auxiliary substrate specificity residues vicinal to the catalytic site, (iv) additional protein domains that allosterically regulate their activities or have catalytic and sensory roles, and (v) their ability to work with protein partners. More than 500 structures of HD-domain proteins are available to date, revealing structural features that may be indicative of function. In this respect, we describe the three known classes of HD-domain proteins (hydrolases, oxygenases, and lyases) and identify their apparent traits, with the aim of portraying differences in the molecular details responsible for their functional divergence and reconciling existing notions that will help assign functions to yet-to-be characterized proteins.
The histidine–aspartate (HD)-domain superfamily [1] (IPR003607) contains more than 318,000 metalloproteins that are involved in a wide array of functions including immune response [2], nucleic acid metabolism [3][4][5], inflammation [6], virulence [6][7][8], stress response [9][10], and small molecule activation [11][12][13]. They are found across all domains of life and are typified by a tandem histidine–aspartate (HD) dyad that coordinates at least one (often two or three) metal ions. Although there are many uncharacterized HD-domain proteins, their chemical diversity appears to be linked to details in the local protein environment, extra ligands, and genomic co-occurrence with partner proteins, all of which may serve as blueprints for their functional assignment.
The main traits that functionally differentiate HD-domain proteins are (i) the chemical nature of the metal ion/cofactor, (ii) cofactor nuclearity, (iii) supernumerary ligands, (iv) conserved amino acid sequence motif insertions or residues vicinal to the active site, (v) auxiliary domains with catalytic or regulatory roles, and (vi) interaction with protein partners. These six features appear to distinctly influence the range of chemistries these proteins perform. Despite variations in primary sequence, all HD-domain proteins share the HD residue dyad that coordinates metal ions such as Fe, Mn, Co, Mg, Cu, Zn, and Ni (Table 1), as well as a helical fold (Figure 1a), and they cluster functionally based on sequence similarities (Figure 1b) [14].
Table 1. List of representative histidine–aspartate (HD)-domain proteins from the three known subclasses, oxygenases, phosphatases and phosphodiesterases (PDEs), that are crystallographically and biochemically characterized [15,20].
Subclass | Protein | Nuclearity | Active metal | Chemistry | Substrate | PDB ID | Origin | References
oxygenases | MIOX | dinuclear | Fe | Oxygenase | myo-inositol | 2HUO | Mus musculus | [15]
oxygenases | PhnZ | dinuclear | Fe | Oxygenase | OH-AEP | 4MLM | bacterium HF130_AEPn_1 | [12,15]
oxygenases | TmpB | dinuclear | Fe | Oxygenase | OH-TMAEP | 6NPA | Leisingera caerulea | [13]
phosphatases | YfbR | mononuclear | Co | Monophosphatase | dAMP | 2PAQ | Escherichia coli K-12 | [3]
phosphatases | YGK1 | mononuclear | Mn | Monophosphatase | dNMP | 5YOX | Saccharomyces cerevisiae | [5]
phosphatases | YqeK | dinuclear | Fe | Diphosphatase | Ap4A | 2O08 | Bacillus halodurans | [10]
phosphatases | YpgQ | mononuclear | Mn | Diphosphatase | dNTP | 5DQV | Bacillus subtilis | [16]
phosphatases | SpoT | mononuclear | Mn | Diphosphatase | (p)ppGpp | 1VJ7 | Streptococcus dysgalactiae | [17]
phosphatases | SAMHD1 | mononuclear | Mg | Triphosphatase | dNTP | 3U1N | Homo sapiens | [18,19]
phosphatases | EF1143 | mononuclear | Mg | Triphosphatase | dNTP | 4LRL | Enterococcus faecalis V583 | [20,21]
phosphatases | OxsA | Mono/dinuclear* | Co | Mono/Di/Triphosphatase | Oxetanocin-A | 5TK8 | Bacillus megaterium | [22]
phosphodiesterases | Cas3 | dinuclear | Co | PDE | ssDNA | 4QQW | Thermobifida fusca YX | [23]
phosphodiesterases | Cas3 | dinuclear | Ni | PDE | ssDNA | 4Q2C | Thermobaculum terrenum | [24]
phosphodiesterases | Cas3″ | dinuclear | Ca | PDE | ssDNA | 3S4L | Methanocaldococcus jannaschii | [25]
phosphodiesterases | Cas10 | dinuclear | Ni, Mn | PDE | ssDNA | 4W8Y | Pyrococcus furiosus | [26]
phosphodiesterases | PDE1-3 | dinuclear# | Mg, Mn | PDE | cAMP, cGMP | 1TAZ, 3ITU, 1SO2 | Homo sapiens | [27,28]
phosphodiesterases | PDE4 | dinuclear# | Mg, Mn | PDE | cAMP | 1F0J | Homo sapiens | [29]
phosphodiesterases | PDE5 | dinuclear# | Mg | PDE | cGMP | 1TBF | Homo sapiens | [30]
phosphodiesterases | PDE7-8 | dinuclear# | Mn | PDE | cAMP | 4PM0, 3ECM | Homo sapiens | [31,32]
phosphodiesterases | PDE9 | dinuclear# | Mn, Mg | PDE | cGMP | 3DY8 | Homo sapiens | [33]
phosphodiesterases | PDE10 | dinuclear# | Mg | PDE | cAMP, cGMP | 2OUN | Homo sapiens | [34]
phosphodiesterases | PgpH | dinuclear | Mn | PDE | c-di-AMP | 4S1B | Listeria monocytogenes | [6]
phosphodiesterases | Bd1817 | dinuclear | Fe* | PDE | c-di-GMP | 3TM8 | Bdellovibrio bacteriovorus | [35]
phosphodiesterases | PmGH | trinuclear | Fe, Mn | PDE | c-di-GMP | 4MCW | Persephonella marina EX-H1 | [36]
phosphodiesterases | PA4781 | trinuclear& | Mg | PDE | c-di-GMP | 4R8Z | Pseudomonas aeruginosa PAO1 | [14]
lyases | Ddi2 | mononuclear | Zn | Hydratase | Cyanamide | 6DKA | Saccharomyces cerevisiae | [37]
# denotes proteins for which the crystal structure shows two active site metal ions at an average interatomic distance of ≈3.8 Å, although the primary sequence suggests a mononuclear binding site. In phosphodiesterases (PDEs), the second metal ion is stabilized by the aspartate of the HD motif, a bridging hydroxide, and four terminally ligated water molecules. & Although no experimental evidence currently exists, PA4781 belongs to the trinuclear clade of HD-GYP proteins as inferred from its primary amino acid sequence. * Bd1817 is inactive toward bis-(3′-5′)-cyclic guanosine monophosphate (c-di-GMP); therefore, the active metal ion refers to the metal ion observed in the crystal structure. Abbreviations: OH-AEP, 1-hydroxy-2-aminoethylphosphonate; OH-TMAEP, 1-hydroxy-2-(trimethylammonio)ethylphosphonate; dNMP, deoxynucleoside monophosphate, in which N can be A, G, U, or C; Ap4A, diadenosine tetraphosphate; cGMP, guanosine 3′,5′-cyclic monophosphate; cAMP, adenosine 3′,5′-cyclic monophosphate; c-di-AMP, bis-(3′-5′)-cyclic adenosine monophosphate.
Figure 1. (A) Helical structure of three HD-domain proteins. YqeK (PDB: 2O08) is a phosphatase, PhnZ (PDB: 4N6W) is an oxygenase, and Ddi2 (PDB: 6DKA) is a lyase. All exhibit a helical fold characteristic of HD-domain proteins despite their diverse functions. (B) Sequence similarity network (SSN) of the HD-domain superfamily depicting its size and functional clustering. The SSN was generated via the Enzyme Function Initiative-Enzyme Similarity Tool (EFI-EST), using the IPR003607 family, and visualized in Cytoscape. It was tailored so that nodes represent sequences with ≥ 50% identity and an e-value of 5, and was further refined to contain the major protein clusters (size-wise), which amount to 183,015 unique protein sequences. Edges between nodes represent an alignment score of 70. HD-domain phosphohydrolases (SpoT/RelA, SAMHD1, deoxyguanosine phosphatases (dGTPases), nucleotidyltransferases) are represented in green and blue, while hydratases are shown in yellow. PDEs are shown in red (HD-GYP proteins), light pink (exoribonucleases), orange (Cas proteins), and pink (PDEases). Oxygenases are shown in purple and their cluster, which consists of four nodes (372 sequences), is enlarged for visualization. Gray clusters contain proteins of unidentified function.
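The clustering idea behind the SSN in Figure 1b can be illustrated with a minimal Python sketch. It assumes that pairwise alignment scores are already available as input and that an edge is drawn when the score is at least 70; the function name `ssn_clusters` and the toy node labels are hypothetical, and the actual EFI-EST workflow involves additional steps (sequence retrieval, all-by-all BLAST, representative-node collapsing) that are not reproduced here.

```python
from collections import defaultdict


def ssn_clusters(nodes, scored_pairs, min_score=70):
    """Group nodes into clusters, i.e. connected components of the graph
    whose edges are the pairs with alignment score >= min_score.

    nodes        -- iterable of node identifiers
    scored_pairs -- iterable of (node_a, node_b, alignment_score) tuples
    min_score    -- assumed edge threshold (70 in the figure caption)
    """
    parent = {n: n for n in nodes}

    def find(x):
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b, score in scored_pairs:
        if score >= min_score:
            parent[find(a)] = find(b)  # merge the two components

    clusters = defaultdict(set)
    for n in nodes:
        clusters[find(n)].add(n)
    # Largest clusters first, mirroring the size-based refinement of the SSN.
    return sorted(clusters.values(), key=len, reverse=True)
```

With toy nodes `A` to `D` and scores 85 (A, B), 70 (B, C), and 40 (C, D), the sketch groups A, B, and C into one cluster and leaves D as a singleton.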
The HD-domain superfamily comprises three enzymatic classes: hydrolases, oxygenases, and lyases, with the hydrolases being the largest and best-characterized group. Hydrolases are further subdivided into (i) phosphatases, including dGTPases [38], RelA/SpoT [9,17], SAMHD1 [18,19], and EF1143 [20,21], and (ii) phosphodiesterases (PDEs), including exoribonucleases [39], PDEases [27–34], Cas proteins [23–26], and HD-GYPs [14,35,36,40] (Figure 1b). Phosphatases hydrolyze a multitude of (deoxy)nucleotide-based substrates that vary in the identity of their base(s) and the extent of phosphorylation (Figure 2) [3,5,16,22]. PDEs, on the other hand, degrade a variety of cyclic signaling molecules and single-stranded nucleic acids (Figure 2) [23–25,41]. Oxygenases and lyases are relatively recent additions to the HD-domain superfamily, with only a handful of representatives having been biochemically characterized. All identified oxygenases catalyze the oxidative cleavage of C-C/P bonds [11–13,15] (Figure 2), while the single known lyase is a cyanamide hydratase [37]. This functional plurality widens the chemical repertoire of the HD-domain superfamily, which is likely to harbor more oxygenases, lyases, or enzymes with novel chemistries.
Figure 2. Known substrates of HD-domain proteins. Phosphatases can remove one to three terminal phosphate groups from (deoxy)ribonucleotides or cleave polyphosphate-containing nucleotides (a)symmetrically (represented in gray). The position of cleavage is highlighted in red for substrates with four phosphates. PDEs hydrolyze phosphodiester bonds of cyclic (di)nucleotide substrates via either one-step hydrolysis (cleavage of one side of the diester bond), releasing a linearized product, or two-step hydrolysis, releasing individual nucleotides. PDEs can also act on RNA and DNA substrates. HD-domain oxygenases oxidatively cleave a C-X bond (indicated in red).
HD-domain phosphatases play essential roles in regulating the cellular pool of (deoxy)ribonucleotides and signaling molecules involved in bacterial stress responses [3,4,10,16,42], such as (p)ppGpp and Ap4A (Figure 2). Phosphatases are further subdivided into mono-, di-, and triphosphatases with distinct structural features that may provide clues for the future classification of unknown HD-domain phosphatases. Interestingly, all have a strictly conserved arginine residue prior to the first histidine of the metal binding motif (Figure 3), which is pivotal for activity [16,18,22]. The exact chemical role of this arginine in catalysis has not yet been delineated. Its importance in hydrolysis most likely stems from the ability of this residue to ensure proper substrate positioning and/or to form intermolecular interactions with the substrate phosphate groups. Most biochemically characterized HD-domain mono- and diphosphatases are dimers in solution [3,5,16], which appears to be related to enzymatic function and may represent a regulatory mechanism for tuning activity. However, it is currently unknown whether both sites are catalytically active or whether one site allosterically activates the other. In contrast, triphosphatases are allosterically regulated by nucleotide binding to secondary sites, and most are active as tetramers [4,20] or hexamers [38].
Figure 3. Active sites of HD-domain phosphohydrolases. Mononuclear HD-domain phosphohydrolases utilize a conserved motif “H…HD…D” to bind a variety of metals including cobalt (pink), zinc (purple), magnesium (light green), nickel (dark green), or iron (orange). Small red spheres represent water molecules. The dinuclear phosphatase YqeK harbors two extra histidines between the HD and D residues to stabilize the second metal ion. Phosphatases are classified into mono-, di-, or triphosphohydrolases, labeled in light green, dark green, and blue, respectively. All phosphohydrolases have a conserved arginine (shown in teal), which is typically located three residues prior to the first histidine of the HD motif and in the vicinity of the oxygens of the substrate phosphate group. Other important residues are shown in pale blue and are described in the text.
In most cases, phosphate hydrolysis is supported by non-redox metal ions, with the most active cofactors being Co or Mn for monophosphatases, Mn for diphosphatases, and Mg or Mn for triphosphatases (Table 1). With a few notable exceptions (YqeK, SAMHD1, and OxsA), these metals are bound in a mononuclear configuration by a conserved “H…HD…D” motif (Figure 3).
PDEs can harbor both mononuclear and polynuclear cofactors. A common feature of all known polynuclear (di- or trinuclear) PDEs is an extra histidine residue in the active site such that their characteristic metal binding sequence is “H…HD…H…HH…D” (Figure 4). The role of this histidine in activity or structure has not been explicitly established. However, structural studies show that this residue makes additional hydrogen bond contacts to the substrate phosphate groups, suggesting a possible role in substrate binding.
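The two motif shorthands introduced so far, “H…HD…D” for mononuclear phosphohydrolases and “H…HD…H…HH…D” for polynuclear PDEs, can be turned into a rough sequence filter. The following Python sketch is illustrative only: the text does not specify the lengths of the “…” gaps, so the bound used in `GAP` is an assumption, and the function name `classify_hd_motif` is hypothetical. A pattern hit is a hint at best and does not establish metal binding or nuclearity.

```python
import re

# Assumption: every "…" in the motif shorthand is modelled as a gap of
# 1 to 80 residues; the text gives no actual gap lengths.
GAP = r"[A-Z]{1,80}?"

# Polynuclear PDE-type shorthand: H…HD…H…HH…D
POLY = re.compile(f"H{GAP}HD{GAP}H{GAP}HH{GAP}D")
# Mononuclear phosphohydrolase-type shorthand: H…HD…D
MONO = re.compile(f"H{GAP}HD{GAP}D")


def classify_hd_motif(seq: str) -> str:
    """Return a coarse motif label for a one-letter protein sequence."""
    seq = seq.upper()
    # Test the longer motif first: sequences that contain it will usually
    # also match the shorter mononuclear pattern.
    if POLY.search(seq):
        return "polynuclear-like"
    if MONO.search(seq):
        return "mononuclear-like"
    return "none"
```

For example, the toy string `MHAAHDAAHAAHHAAD` is labeled `polynuclear-like`, whereas `MKRAAHAAHDAAAAD` is labeled `mononuclear-like`. A more careful filter would add the conserved arginine upstream of the first histidine and use alignment-derived gap lengths.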
Figure 4. Active sites of HD-domain PDEs. HD-domain PDEs utilize the conserved HD motif “H…HD…H…HH…D” to bind a di- or trinuclear metal center. The metal ions coordinated in their active sites are zinc (purple), magnesium (light green), nickel (dark green), or iron (orange). Small red spheres represent water molecules.
Clustered regularly interspaced short palindromic repeats (CRISPR)-associated systems (Cas) are major players in prokaryotic adaptive immunity and RNA-based defense [50,51]. Type I CRISPR–Cas systems utilize a multicomponent complex and recruit a single nuclease, Cas3, for the degradation of invader nucleic acids. The Cas3-associated gene can encode a protein that has only the HD-domain (Cas3″, I-A subtype) or, more commonly, an N-terminal HD-domain fused to a Superfamily 2 helicase (Cas3). Type III-B CRISPR–Cas systems utilize Cas10 (Cmr2) for RNA-activated ssDNA cleavage [26].
The crystal structure of the Thermobifida fusca Cas3 shows a diiron active site (Figure 4), yet no activity with this cofactor has been demonstrated [23]. Cas3 proteins can be promiscuously activated by various metal ions, with most being activated by Ni or Co and, to a lesser extent, by other divalent metal ions (with the exception of Mg and Ca) [23,25]. In contrast, the Pyrococcus furiosus Cmr2 (dinuclear) and the Cas3″ proteins (mononuclear in the published structures) exhibit an Mg-dependent PDE activity [23,25]. The apparent mononuclearity of the Cas3″ proteins may be an artifact of the greater flexibility of the polypeptide, which would make binding of the second metal ion more transient. Of note, no correlation between helicase and PDE activities has been demonstrated to date.
cAMP- and cGMP-specific PDEs (also referred to as PDEases) are essential regulators in cyclic nucleotide-dependent signal transduction in diverse physiological processes including immune response, neuronal activity, hypertension, and inflammatory response [30,31,52]. Currently, 21 genes encoding human PDEases have been identified and classified into 12 families according to their substrate specificities, pharmacological properties, and tissue localization [31,53]. These are further divided into three groups: cAMP-specific (PDE4, PDE7, PDE8, and PDE12), cGMP-specific (PDE5, PDE6, and PDE9) [31,41], and dual-specific (PDE1, PDE2, PDE3, PDE10, and PDE11) [31].
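For readers who want to use this grouping programmatically, it can be captured as a simple lookup. The Python sketch below only restates the three groups listed above; the names `PDE_SPECIFICITY` and `specificity_of` are hypothetical and introduced purely for illustration.

```python
# Substrate-specificity groups of human PDEase families, as listed in the text.
PDE_SPECIFICITY = {
    "cAMP-specific": ["PDE4", "PDE7", "PDE8", "PDE12"],
    "cGMP-specific": ["PDE5", "PDE6", "PDE9"],
    "dual-specific": ["PDE1", "PDE2", "PDE3", "PDE10", "PDE11"],
}


def specificity_of(family: str) -> str:
    """Return the specificity group for a PDE family name such as 'PDE5'."""
    for group, families in PDE_SPECIFICITY.items():
        if family in families:
            return group
    raise KeyError(f"unknown PDE family: {family}")
```

Calling `specificity_of("PDE10")` returns `"dual-specific"`, and the three lists together account for the 12 families mentioned above.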
All PDEases are dimeric and have a conserved catalytic carboxy-terminal domain as well as a variable regulatory amino-terminal domain [30,54]. However, PDEases are also active as monomers; therefore, the functional significance of dimerization remains unknown [54]. On the basis of their binding affinity for divalent metal ions, PDEases are divided into two classes: Class I in mammals and flies and Class II in yeast and protozoans [55]. The most extensively studied are Class I PDEases, in which two metal ions (e.g., Zn and Mg) are octahedrally coordinated, forming a somewhat unconventional bimetallic site [30]. Although the canonical motif suggests the binding of a single metal ion (M1) in the HD motif, the second metal ion (M2) is stabilized by the aspartate of the HD motif and five water molecules, one of which bridges M1 and M2. The bridging water molecule has been suggested to be a hydroxide, which can nucleophilically attack the phosphodiester bond [30]. In the crystal structures, M1 and M2 are often found to be Zn2+ and Mg2+, respectively (Figure 4) [30]. Class I PDEs are active with either Mn or Mg but not with Zn. Therefore, although Zn can bind in the M1 position with high affinity, it cannot stimulate activity by itself and may even be inhibitory [31].
The substrate specificity of PDEases is afforded by a so-called “glutamine-switch” mechanism, in which a conserved glutamine in the vicinity of the active site can adopt two different orientations [30]. Depending on its orientation, it forms either a bidentate hydrogen bond with the adenine ring (cAMP-specific) or two hydrogen bonds with the guanine ring together with two hydrogen bonds to neighboring alanine and tryptophan residues (cGMP-specific) [30,31]. In the dual-specific PDEases, the conserved glutamine has higher rotational flexibility and no orientation constraints, allowing it to adopt orientations for both substrates.
Cyclic-di-AMP is a second messenger essential in bacterial signaling and a critical player in bacterial physiology and pathogenesis [56,57]. PgpH performs the one-step hydrolysis of c-di-AMP to 5′-pApA in an Mn-dependent fashion but cannot hydrolyze other cyclic dinucleotides (e.g., c-di-GMP) [6]. The active site Mn2+ ions are octahedrally coordinated by seven residues, H514, H543, D544, H580, H604, H605, and D648, as well as the two terminal oxygen atoms of the c-di-AMP phosphate group (Figure 4). The metal ions activate a water molecule opposite the scissile phosphate for nucleophilic attack on the phosphorus. Protonation of the resulting oxyanion terminates the reaction [6].
HD-GYPs are a special subclass of the PDE subfamily and are functionally homologous to EAL proteins (typified by the glutamate-alanine-leucine residue triad) [1,58]. They can be single-domain proteins or fusions to extra regulatory, sensory, or catalytic protein domains [8,59,60].
While cyclic-di-GMP is their most common substrate, the recently discovered hybrid dinucleotide, 3′3′-cGAMP, is also hydrolyzed by some HD-GYPs [59]. Out of the nine HD-GYPs encoded in Vibrio cholerae, VCA0681, VCA0931, and VCA0210 are the only HD-GYPs to hydrolyze both c-di-GMP and 3′3′-cGAMP. More recently, PmxA from Myxococcus xanthus was identified as a 3′3′-cGAMP-specific PDE that is hardly active toward c-di-GMP or c-di-AMP [60]. Selectivity for 3′3′-cGAMP is attributed to a glutamine near the active site, although this residue is not conserved in VCA0681, VCA0931, and VCA0210, suggesting that the molecular origins of 3′3′-cGAMP specificity may vary among HD-GYPs.
In addition to the seventh ligand added to their active site (i.e., an extra histidine adjacent to the last histidine of the motif), all active HD-GYPs have a glycine-tyrosine-proline (GYP) residue triad in a loop close to the active site (Figure 5) [14,35]. However, because alanine substitution of any single GYP residue hardly affects PDE activity [36], the role of the triad in catalysis and protein stability remains poorly understood. The GYP motif is considered important for interaction with the GGDEF cyclase (named after its highly conserved Gly-Gly-Asp-Glu-Phe sequence motif) [8] and serves as a substrate specificity element for the recognition of c-di-GMP and its hybrid 3′3′-cGAMP analog [40].
Figure 5. Active site of HD-GYPs. HD-GYP proteins utilize an “H…HD…H…HH…D” motif that typically binds a dinuclear metal center. The third metal ion in the PmGH active site is stabilized by crystallization molecules shown in gray. In addition, these enzymes contain a GYP residue triad vicinal to the active site (shown in blue), the importance of which is currently unclear. Bd1817, which is inactive toward c-di-GMP, lacks the GYP tyrosine and the terminal aspartate is an asparagine.
HD-GYPs differ on the basis of their active metal cofactor and possible catalytic outcomes. While most harbor a dimetal cofactor, some incorporate a trinuclear cofactor by involving a glutamate residue as an eighth ligand to the site (Figure 5) [36]. The assembly of a dinuclear or trinuclear cofactor is presumed to afford different reaction outcomes. Dinuclear HD-GYPs can only perform a one-step hydrolysis, whereas trinuclear ones can perform a two-step hydrolysis, leading to the respective monophosphates. Metal ions that can stimulate hydrolysis are Fe, Mn, and, to a lesser extent, Co and Ni [40].
PmGH from Persephonella marina is the prototypical trinuclear HD-GYP and the first to be crystallographically characterized [36]. PmGH harbors a triiron cofactor with the third iron coordinated by the glutamate E185. The trimetal cofactor is additionally stabilized by three other crystallization molecules, invoking the possibility that other solvent molecules may be incorporated under physiological conditions (Figure 5). It is active with both Fe2+ and Mn2+. On the basis of primary amino acid sequence, PA4781 from Pseudomonas aeruginosa is also a putative trinuclear PDE; however, the available crystal structure shows two Ni ions in the active site at an elongated distance. PA4781 is unselective in its metal ion incorporation, has limited activity, and exhibits a preference for 5′-pGpG over c-di-GMP to form GMPs [14].
Only one structure of a dinuclear HD-GYP exists: Bd1817 from Bdellovibrio bacteriovorus. It harbors a diiron cofactor, but the presence of an asparagine instead of the last aspartate of the binding motif, a degraded GYP motif (Figure 5), and its complete inactivity toward c-di-GMP [35] do not allow for the inference of substrate positioning and specificity in dinuclear HD-GYPs.
Most of the known HD-domain proteins are phosphohydrolases, but three members, namely myo-inositol oxygenase (MIOX), PhnZ, and TmpB, are monooxygenases and perform the oxidative cleavage of a C-X bond [11–13,15]. The discovery of this chemistry expands the catalytic repertoire of the HD-domain superfamily, and their conserved protein features may provide insight into the identification of yet-to-be-characterized HD-domain proteins as oxygenases.
The first discovered HD-domain oxygenase, MIOX, catalyzes the oxidative cleavage of a C-C bond of myo-inositol to form D-glucuronic acid (Figure 6) [11]. Myo-inositol is a precursor for inositol phosphoglycans, which act as insulin mediators, and altered inositol metabolism has been associated with diabetes. Therefore, the activity of MIOX is of increasing interest, as it presents a potential therapeutic target for treating both type-1 and type-2 diabetes.
Figure 6. Active sites of HD-domain oxygenases. Oxygenases utilize the “H…HD…H…H…D” motif to bind a diiron metal center. The substrate scissile bond is positioned above one of the iron sites, leaving the second site open for oxygen binding. PhnZ and TmpB contain a YxxE loop (green) in their primary sequence that is located vicinal to the active site and that undergoes a conformational change upon substrate binding to allow for oxygen binding and catalysis.
PhnZ and TmpB were later established as oxygenases, demonstrating that MIOX is not a functional outlier [12,13]. Both PhnZ and TmpB are involved in the degradation of organophosphonates, which are compounds that serve as sources of inorganic phosphate for bacteria that occupy phosphate-limited environments (e.g., marine ecosystems) [12]. Unlike MIOX, PhnZ and TmpB act in tandem with the non-heme α-ketoglutarate (KG)-dependent hydroxylases PhnY and TmpA, respectively, to cleave the C–P bond of their substrates (Figure 2) [12,13]. PhnY initiates the degradation of 2-aminoethylphosphonate (2-AEP) via the stereospecific addition of a hydroxyl group to the C1 carbon, producing (R)-2-amino-1-hydroxyethyl phosphonate (OH-AEP) [12]. PhnZ performs the subsequent oxidative cleavage of the C-P bond of OH-AEP, forming inorganic phosphate and glycine (Figures 2 and 6) [12]. The TmpA/TmpB pathway is mechanistically similar, with the only difference being the nature of the substrate, i.e., 2-(trimethylammonio)ethyl phosphonate (TMAEP) for TmpA [13].
MIOX, PhnZ, and TmpB bind a catalytically essential diiron cofactor via the HD-domain sequence “H…HD…H…H…D” (Figure 6). Each iron is coordinated in an octahedral geometry, and the two irons are bridged by the carboxylate group of the first aspartate residue in the HD-domain sequence as well as a μ-oxo/hydroxo bridge (Figure 6) [15]. Unlike other dinuclear nonheme–iron oxygenases, which utilize the fully reduced FeII/FeII form of their cofactors, HD-domain oxygenases stabilize a mixed-valent FeII/FeIII state for the four-electron oxidation of a C-C/P bond and do not require an external reducing system for reactivation of the cofactor (i.e., after one substrate turnover the site returns to the FeII/FeIII form) [15]. Stabilization under the same redox conditions of the FeII/FeIII cofactor in oxygenases and the FeII/FeII cofactor in HD-domain hydrolases suggests that the HD-domain sites may tune activity through the modulation of cofactor reduction potentials [40].
The iron ion in the Fe1 site serves as a Lewis acid and binds the substrate such that the C-X bond is opposite the iron (Figure 6) [15]. The second iron site (Fe2) is then available to bind molecular oxygen, forming an FeIII/FeIII superoxo species that initiates oxidative cleavage by abstracting a hydrogen atom from the substrate. Following turnover, the active mixed-valent FeII/FeIII form is regenerated [12], and thus, there is no need for an external reducing system to reactivate the enzyme, which is a feature unique to HD-domain-containing oxygenases.
Unlike MIOX, PhnZ and TmpB sequences contain a transient YxxE loop involved in substrate binding. Prior to substrate binding, the tyrosine (Y24 in PhnZ and Y30 in TmpB) is oriented toward the active site and binds to the Fe2 site, while the glutamate (E27 in PhnZ and E33 in TmpB) faces away from the active site [13,15]. Substrate binding induces a conformational change, positioning the glutamate within hydrogen bonding distance of the amino group of the substrate and causing the tyrosine–iron bond to break [13,15]. Dissociation of the tyrosine frees the Fe2 site for O2 binding and subsequent turnover and most likely serves as a protective mechanism to prevent oxidative inactivation of the active site (Figure 6).
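The residue numbering above (Y24/E27 in PhnZ, Y30/E33 in TmpB) reflects the Y-x-x-E spacing of the loop, which makes it straightforward to scan a sequence for candidate positions. The Python sketch below is a minimal illustration; the function name `find_yxxe` is hypothetical, and because a four-residue pattern occurs frequently by chance, a hit is only a starting point that would need support from alignment to PhnZ or TmpB and from the presence of the diiron-binding motif.

```python
import re


def find_yxxe(seq: str) -> list[tuple[int, int]]:
    """Return 1-based (Tyr, Glu) positions for every YxxE occurrence.

    A zero-width lookahead is used so that overlapping occurrences
    are all reported.
    """
    return [
        (m.start() + 1, m.start() + 4)
        for m in re.finditer(r"(?=Y..E)", seq.upper())
    ]
```

For the toy sequence `MKLAYGGEAA`, the function reports a tyrosine at position 5 and a glutamate at position 8, the same three-residue offset seen for Y24/E27 and Y30/E33.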
Collectively, HD-domain oxygenases have catalytic and structural features that differ significantly not only from those of other nonheme–iron oxygenases but also from those of HD-domain hydrolases. This divergence is useful, as it provides descriptors for distinguishing oxygenases from hydrolases within the HD-domain family. It is likely that these characteristics are conserved among all HD-domain oxygenases and may provide a critical first step toward the characterization of other HD-domain proteins of unknown function.