Protease inhibitors (PIs) are ubiquitous regulatory proteins present in all kingdoms. They play crucial tasks in controlling biological processes directed by proteases which, if not tightly regulated, can damage the host organism. PIs can be classified according to their targeted proteases or their mechanism of action.
1. Introduction
Proteases are ubiquitous enzymes in plants, animals, and microorganisms that play key roles in the majority of physiological processes [1][2][1,2]. Proteases are involved in several reversible and irreversible reaction cascades including hormone production/signaling pathways, apoptosis, inflammatory reactions, and homeostasis [3]. Depending on the active amino acid in the enzyme’s active site, proteases are classified into cysteine, serine, aspartic, glutamic, and threonine proteases [3], and metalloproteases represent another protease class containing a divalent metal ion linked to the active site residues [4]. Serine proteases are the most abundant proteolytic enzymes, followed by metallo- and cysteine proteases and finally the aspartic and threonine proteases [5]. In addition to their roles in vital biochemical processes, proteases are also implicated in various diseases such as viral diseases, cancer, inflammation, and bleeding disorders [6]. Given their important roles in diverse physiological processes, protease activity must be rigorously controlled and regulated to avoid any enzyme dysregulation that might be pathogenic to the host organism [7]. This tight regulation is usually conducted by blocking the zymogen, the inactive enzyme precursor, or through the action of protease inhibitors (PIs) [8], which partially or totally inhibit enzymes by forming a complex with their target proteases [4].
While the majority of PIs are proteins or peptides, some low molecular weight non-proteinaceous compounds such as polysaccharides, glycerolipids, triterpenes, and polyphenols are also considered PIs [9]; these are not specific to particular proteases and inhibit a broad spectrum of enzymes [10]. In contrast, proteinaceous PIs are usually more specific and can even target unique proteases. As a result, PIs can be classified according to their target enzyme, although exceptions are frequently encountered, such as the α2-macroglobulin that inhibits proteases of different classes [6]. A particularly impressive role for PIs has been observed in parasite–host crosstalk; PIs found in tick saliva have been shown to modulate host immune cells [11], mediating local immunosuppression and modulating blood clotting at the site of infection, thereby exerting a beneficial effect to the tick (allowing attachment and feeding) at the expense of the host [12].
Given their fundamental roles and potential translational application, there have been significant efforts to identify new PIs from various sources and study those already identified in detail using novel technologies and methods. Indeed, with technological advances, the study of PIs has substantially improved over recent years, not least the availability of three-dimensional (3D) structural information for several PIs and their targeted proteases, permitting receptor-based design.
Here, we review the biochemistry and fundamental mechanisms of action of PIs. We enumerate and discuss the different classes of PIs based on the proteases they inhibit and their mechanism of action. Moreover, we discuss their applications in critical fields like agriculture and medicine. In the final section, we focus on an interesting natural source of PIs, tick salivary glands, and their potential pharmacological applications.
2. Classification of Protease Inhibitors
The catalytic activity of proteases is regulated by different inhibition mechanisms and different PI families [13]. Despite similarities in the 3D structure of PIs, they can be classified into over 107 families and divided into 40 clans according to their structural similarities (secondary and tertiary) and their different functions [13]. Laskowski and Kato first developed a classification scheme for PIs in 1980 [14] [14] according to their reactive center, disulfide bond number, and amino acid sequence [14]. With advances in biotechnology and increasing knowledge about PIs, Rawlings et al. [15] established a classification of PIs in 2004 based on amino acid sequence homology that subclassified PIs to 48 families and 26 clans [5].
2.1. Target-Based Classification
PIs can be classified according to their target protease into six groups [4]: serine, cysteine, aspartyl, glutamate, and threonine protease inhibitors. Metalloprotease inhibitors are also contained within this classification, as they inhibit proteases with a divalent metal ion in their active site [16]. A non-exhaustive list of the most common PI families, their principal features, and their properties according to the target-based classification system is illustrated in Figure 1 and discussed below.
Figure 1.
The target-based classification of protease inhibitors.
Serpins
The serpin superfamily is the largest serine PI family [20]. Serpins typically weigh ~45 kDa and are relatively large molecules (~350–400 amino acids) compared to other PIs [21]. The 3D structure of serpins includes three β-sheets (A, B, and C) and up to nine α-helices that fold to form a specific spatial structure including a reactive center loop (RCL) near the C-terminus [22]. Serpins are categorized as “suicide inhibitors” due to the inactive covalent complex formed with their target protease [23]. This suicide inhibition is often referred to as a “mousetrap”, since the RCL interacts with the target protease active site, and its scissile bond (P1-P1’) is cleaved to generate a stable complex [24]. The resulting bond between the protease and the RCL leads to the insertion of the cleaved RCL into the β-sheet A and relocation of the protease to the opposite pole of the serpin, forming a suicide covalent complex [24]. Serpins are involved in the regulation of different physiological processes such as blood coagulation, fibrinolysis, signal transduction, the complement cascade, and immune responses [25].
Kunitz-Type PIs
PIs in the Kunitz superfamily are characterized by the presence of one or many Kunitz inhibitory domains. They are generally small proteins with molecular weights ranging from 18 to 24 kDa [26]. The Kunitz domain is characterized by anti-parallel β-sheets and α-helices maintained in a compact 3D structure by three disulfide bonds [27]. Most Kunitz-type PIs are competitive inhibitors, acting in a substrate-like manner and binding reversibly to the protease [28]. Active site blocking is mediated by the RCL attachment to the catalytic zone through a non-covalent bond. The highly exposed RCL loop of Kunitz-domain inhibitors is suitable for a wide variety of proteases, so these inhibitors are relatively non-specific and therefore potentially useful across a range of applications [29]. Indeed, Kunitz-type PIs are known to regulate inflammation and coagulation factors and have also been implicated in tumor biology [3].
Bowman–Birk Inhibitors (BBIs)
This superfamily of PIs is characterized by small molecular weight peptides ranging from 5 to 16 kDa and a structure with a single or two inhibitory regions [30]. BBIs are competitive inhibitors and follow the classical mechanism of substrate binding to the protease active site [31]. A single BBI inhibitor protein can act on two different target proteases simultaneously by virtue of two opposed loops formed by antiparallel β-sheets and stabilized by seven disulfide bonds [23][32]. Given their specific mechanism, several researchers have focused on BBIs for specific applications such as to inhibit cancer [33].
Cysteine protease inhibitors (CPIs), or cystatins, are the second largest group of PIs after serine PIs. They are divided into three main families: family-1 cystatins or stefins, family-2 cystatins or cystatins, and family-3 cystatins or kininogens. Stefins are mostly intracellular and the smallest cystatin family in terms of molecular weight (10–11 kDa) [
3]. Stefins inhibit cathepsins B, L, and H and also papain. In several therapeutic investigations, they have been identified as potential diagnostic tools for cancer [34]. Like stefins, cystatins inhibit papain and cathepsins B, L, and H, but, they are larger (11–14 kDa) and are transported out of the cell to exert their action [35]. Kininogens are divided into three categories: high molecular weight kininogens (120 kDa), T-kininogens (68 kDa), and low molecular weight kininogens (60–80 kDa) [3][36][37]. They play important roles in the modulation of inflammatory responses and are used as biomarkers of kidney disorders [37].
There are numerous documented functions of CPIs, and some have been shown to be critical for the proper functioning of important physiological pathways such as cathepsin regulation [38]. The structural features of CPIs include 4 to 5 antiparallel β-sheets surrounding an α-helix. Their highly conserved inhibitory domain is mainly composed of two hairpin-like loops formed by the β-sheets and the N-terminal region [39]. Cystatins follow the competitive inhibition model with slight modifications, as they do not bind in a substrate-like manner. In fact, the two hairpin loops bind to the protease active site and block the access of any substrate, while the N-terminal region maintains effective attachment of the inhibitor to the enzyme [40][41].
Despite their low molecular weight (only 3–4 kDa), metalloprotease inhibitors (MPIs) effectively inhibit a wide range of metalloproteases [42]. MPIs are classified as competitive inhibitors, since they act in a substrate-like manner [43]. From a structural point of view, MPIs do not possess an inhibitory loop or a specific secondary structure for inhibition. Instead, their inhibitory fragment is located near the C-terminus and contains a scissile bond. The cleavage of this latter bond allows the fixation of the new C-terminal side to the active site of the protease with the help of its metallic ion [23][28]. The efficiency of MPI inhibition has been reported to be enhanced by secondary interactions outside the active site of the protease [44].
The natural aspartyl PIs are proteins of about 20 to 22 kDa and conformational stability ensured by two disulfide bonds [45]. Despite aspartyl proteases including important members such as cathepsins D and E, renin, pepsin A and C, and, most importantly, the HIV-1 protease, their natural inhibitors remain poorly described for several reasons. One main reason is probably related to the low representation of aspartyl proteases in the human genome, with only 15 members described [46]. Regardless of their low bioavailability, the presence of a scissile bond and their short half-life have meant that strategies to inhibit aspartyl proteases involve the development of synthetic peptides or mimics with a non-cleavable bond to replace the scissile bond [47].
2.2. Mechanism-Based Classification
The target-based classification is limited, as numerous PIs are active against two or more enzymes. Indeed, in humans, there are vastly more proteases than PIs; despite the continuous discovery of human proteases and their respective inhibitors, the ratio of one PI for every five proteases has remained constant [41][48][41,48]. However, in addition to the target-based classification, it is possible to classify PIs according to their mechanism of inhibition, in particular the steric or catalytic inhibition of the enzyme active site or its neighboring regions [49]. Enzyme inhibition mechanisms can be divided into two general categories, reversible and irreversible. Reversible inhibition can be further subdivided into competitive, uncompetitive, and non-competitive inhibition [41] [41]. The mechanism based-classification can be divided into three major classes (Figure 2): competitive protease inhibition (also called canonical inhibition), exosite-assisted competitive inhibition or non-canonical inhibition, and finally irreversible inhibition or trapping inhibition [41].
Figure 2.
a
b
c): Irreversible inhibition: the inhibitor binds irreversibly to the enzyme inducing its inactivation
Figure 2a). The β-sheet conformation allows the binding of the inhibitor to the active site in a substrate-like manner. Consequently, the RCL scissile bond is slowly hydrolyzed by the protease without any product release, as the amide bond is later reconnected [15][41]. Numerous PIs belong to this family, mainly the BBIs and Kazal and Kunitz domain-containing inhibitors.
Figure 2b). Access to this active site is blocked in a non-catalytic manner [51][52]. This inhibition mechanism is classified as competitive but differs from the standard Laskowski mechanism [50]. Inhibitor binding to the exosite is crucial for inhibition, as it maintains the inhibitor–enzyme interaction and enhances inhibitor specificity [51]. As mentioned above, CPIs are non-canonical PIs that follow a slightly modified competitive inhibition mechanism, since the N-terminal region does not interact with the active site in a substrate-like manner but switches to the side of the active site to ensure sufficient binding energy for the enzyme–inhibitor complex [41][53].
2.2.3. Irreversible Inhibition
This class of inhibition is triggered by the protease, which catalyzes the activation of its respective inhibitor (also referred to as a suicide substrate). The cleavage of the inhibitor reactive loops (Figure 2c) triggers a major conformational change, resulting in the irreversible cross-linking of the protease to its inhibitor [23][51][23,51]. α-2-macroglobulin is a 600 kDa inhibitor with four reactive loops on its surface, which plays a major role in the elimination of excessive proteases in the blood [54]. Serpins are also well-known suicide inhibitors, as described above [55].