Covalent inhibitors are a class of small molecule compounds that can covalently bind to specific target proteins, thereby inhibiting their biological functions. Cysteine is one of the least abundant amino acids in proteins of many organisms, which plays a crucial role in catalysis, signal transduction, and redox regulation of gene expression. The thiol group of cysteine possesses the ability to perform nucleophilic and redox-active functions that are not feasible for other natural amino acids. Cysteine is the most common covalent amino acid residue and has been shown to react with a variety of warheads, especially Michael receptors.
1. Introduction
Covalent inhibitors are a class of small molecule compounds that can covalently bind to specific target proteins, thereby inhibiting their biological functions. For a long time, the off-target side effects of covalently bound drugs have been a substantial problem, which limits their potential for drug development
[1]. Nonetheless, it is beginning to be recognized that focusing on a specific protein and the mechanism of action, not just the molecular target, but also often the specific binding site and desired mode of action must be specified at the beginning of the project. More and more covalent drugs have been reported and even successfully applied in clinical, such as the proton pump inhibitor omeprazole, the anticoagulant clopidogrel, and the non-small-cell lung cancer inhibitor afatinib, which made people re-recognize the potential of covalent drugs.
Compared with non-covalent inhibitors, one of the most important advantages of covalent inhibitors is their high binding affinity to target proteins, which may result in a relatively long duration of action
[2]. In addition, covalent bond inhibitors can also reduce drug resistance caused by target mutations. For example, MRTX849 (adagrasib), developed by Mirati Therapeutics, has been identified as a highly selective covalent inhibitor of KRAS (G12C) and is currently in phase I/II clinical studies. It is an oral selective inhibitor of small molecule Kirsten rat sarcoma viral oncogene (KRAS) (G12C) mutations. It not only inhibits KRAS mutation almost completely in vivo, but also exhibits favorable drug-like properties
[3].
2. Covalent Inhibitors
Recently, various covalent drugs have been emerging. The development of covalent drugs contains many fields such as anticancer, antiviral, and diabetes. In general, covalent binding is much more stable than non-covalent binding. Therefore, compared with classical non-covalent inhibitors, covalent inhibitors have many potential benefits such as prolonging the duration of action, improving ligand efficiency, avoiding drug resistance when targeting amino acids required for enzyme catalysis, and targeting the non-conserved amino acids in high selectivity
[4]. Moreover, small-molecule modulators that selectively bind to target proteins can be served not only as tools for understanding protein function but also as clues for drug discovery. While most small molecules function by interacting with their biological targets under equilibrium binding conditions, ligands that chemically modify proteins through the covalent bond formation can offer potential advantages, including improving ligand potency, prolonging the duration of action, enhancing the ability to overcome resistance mutations, and facilitating the identification of target proteins
[5]. It has been reported that the covalent reaction-related residues in proteins can be cysteine, lysine
[6][7], glutamic acid
[8], serine
[9][10], threonine
[11] and tyrosine
[12], etc. (
Figure 1a). However, the most common covalently reactive residue is the poorly conserved non-catalytic cysteine
[13].
Figure 1. (a) The covalent reaction-related residues; (b) mechanism action of irreversible covalent inhibitors and reversible covalent inhibitors.
Unlike non-covalent inhibitors, covalent inhibitors block their target proteins in two steps. Covalent binding of covalent inhibitors to target proteins first requires the formation of target protein–inhibitor complexes through non-covalent binding, which is similar to the binding mechanism of non-covalent inhibitors via an equilibrium process. In the second rate-determining step, the electrophilic warhead of the covalent inhibitor is appropriately positioned near amino acid residues at the binding site. Irreversible covalent inhibitors and reversible covalent inhibitors are also different. Compared with the first step, the rate of covalent bond formation is relatively slow, and there is a reaction equilibrium constant k* if the rate constant of the reverse reaction for the dissociation of the inhibitor–protein complex (k−2) is equal to or close to zero, that is to say, when k2 is much larger than k−2, k* tends to infinity, then the inhibitor is called an irreversible covalent inhibitor, such as clopidogrel (Figure 1b). When the difference between k2 and k−2 is not very large, that is, when k* is in a reasonable range, it can be called reversible covalent binding. The electrophilic warheads of reversible covalent inhibitors are mostly reversible nucleophilic addition reaction receptors such as cyano- and keto-carbonyl groups, such as sitagliptin (Figure 1b).
3. Covalent Inhibitors Covalently Bound to Cysteine
3.1. Cysteine Profile
Cysteine is one of the least abundant amino acids in proteins of many organisms, which plays a crucial role in the catalysis, signal transduction, and redox regulation of gene expression
[14]. Due to the large atomic radius of sulfur and the low dissociation energy of the S-H bond, the thiol group of cysteine possesses the ability to perform nucleophilic and redox-active functions that are not feasible for other natural amino acids. The thioether group in methionine and the sulfhydryl group in neutral amino acids are only moderately nucleophilic; however, once it exists in the cysteine side chain as a thiolate, its nucleophilicity increases by several orders of magnitude, thus becoming the most nucleophilic nucleophile among the 20 common amino acids. Therefore, cysteine has become the most common covalent amino acid residue in covalent drug development
[15]. cysteines are very rare, and they have been shown to react with a variety of warheads, especially Michael receptors
[16][17]. These unique properties have led to widespread interest in the nucleophile, leading to the development of a variety of cysteine-targeting warheads with different chemical compositions (
Figure 2)
[18].
Figure 2. The general structure of the electrophilic library sorted by warhead chemistries.
3.2. Cysteine-Directed Covalent Drugs
The potency and selectivity of small-molecule drugs can be improved via introducing cysteine-targeting elements that can covalently bind to their targets
[13][19]. Most covalent inhibitors have been designed to target the highly nucleophilic thiol groups of cysteine residues
[20]. However, the irreversible nature of these covalent inhibitors might increase the severity of off-target effects, and further optimization of covalent drugs is always required
[21]. Recently, there has been mounting interest in developing irreversible inhibitors that can form covalent bonds with cysteines or other nucleophilic residues in the ATP-binding pocket. Gray et al. described the distribution of accessible cysteines and divided cysteines into five groups spanning the entire kinase to facilitate the design of covalent inhibitors. Cysteine residues located in different parts of the binding sites of various targets provided potential opportunities to develop specific irreversible inhibitors using different cysteine positions
[22]. Much work in the field of covalent drug development has focused on targeting cysteines with Michael receptors. In particular, acrylamide-based inhibitors have achieved great success. For example, the U.S. Food and Drug Administration (FDA) has approved the marketing of the anticancer drug ibrutinib which covalently inhibited Bruton’s tyrosine kinase (BTK)
[23]. Ibrutinib formed irreversible covalent bonds with cysteines near the active site of BTK and has been successfully used to treat B-cell carcinomas
[24]. In addition, Gray et al. also reported a selective covalent fibroblast growth factor receptor (FGFR) inhibitor targeting covalent cysteines located at various positions within the ATP-binding pocket, which was of great interest for overcoming the most common resistance to kinase inhibitors
[25]. The cyclin-dependent kinase 7 (CDK7) inhibitor THZ1 covalently binds irreversibly to Cys312 of CDK7 located near the kinase domain via the acrylamide moiety
[26]. Therefore, both reversible and irreversible inhibitors have the potential to target cysteine. Reversible covalent cysteine-directed drugs have the potential advantage that they are less likely to exhibit off-target effects and instead form covalent adducts with cysteines, thereby increasing specificity and possibly reducing toxicity
[27].