Mammalian Zona Pellucida Glycoproteins: Comparison
Please note this is a comparison between Version 1 by Pavla Postlerova and Version 2 by Peter Tang.

Zona pellucida (ZP) plays an important role in the oocyte lifespan providing mechanical protection and defense against polyspermic fertilization by directly modulating sperm function.

  • zona pellucida
  • glycoproteins
  • mammals
  • gamete interaction

1. Introduction

Mammalian fertilization is a species-specific event that involves a series of interactions between sperm protein molecules and zona pellucida (ZP) glycoproteins of the oocyte. The initial gamete interaction, also known as the primary binding of the spermatozoa to the ZP of the oocytes, is facilitated by the complementary sperm and zona surface molecules.

To gain the ability to bind to the ZP of an oocyte, spermatozoa undergo a sequence of post-testicular maturation events resulting in changes in the sperm protein composition, especially those localized to the sperm plasma membrane. Ejaculated spermatozoa have a fully differentiated morphology with a myriad of different protein molecules present on their surface [1][2][3][1,2,3]. During sperm transit through the female reproductive tract, the protein composition of the sperm plasma membrane changes dramatically, adapting spermatozoa to survival in the uterine environment [4] [4] with the final step of capacitation leading to exposure of the receptors on the sperm surface responsible for ZP binding [5][6] [5,6]. The sperm surface proteins are complementary to the oligosaccharide chains that decorate the ZP of the oocyte. Spermatozoa bind the ZP carbohydrate moieties via their membrane protein receptors resulting in, for most, part species-specific gamete recognition (reviewed by Clark [7]).

The differences in ZP carbohydrate moieties and sperm surface proteins are considered the main factor in the species specificity of sperm-ZP recognition and binding. While the concept of strict species-specificity applies to mice [8] [8] and humans [9], this does not hold true for domestic animals such as pigs and cattle [10][11][12][10,11,12].

The initial interaction between the spermatozoa and oocyte takes place at the level of ZP. Therefore, receptors on the surface of capacitated spermatozoa are key to the fertilization process. The species-specificity of the sperm-ZP interaction can be ensured on the one hand by the presence of a certain receptor and, on the other hand, by a particular glycosylation pattern of the ZP.

2. Zona Pellucida Glycoproteins

Zona pellucida (ZP) plays an important role in the oocyte lifespan providing mechanical protection [13] [13] and defense against polyspermic fertilization by directly modulating sperm function [14][15][14,15]. The mammalian ZP is composed of three to four glycoproteins most commonly designated ZP1, ZP2, ZP3, and ZP4, with inter-species differences addressed below (Table 1). Four mammalian ZP glycoproteins are the products of three genes: ZPA, ZPB, and ZPC [16]. Phylogenic studies revealed that ZP2, encoded by ZPA and ZP3, coded ZPC is common in all the mammalian species so far investigated; meanwhile, ZP1 and ZP4 are products of the common progenitor ZPB gene, a duplication event that occurred during the evolution of the amniotes [17][18][17,18], see Table 1. Some authors differentiate ZPB paralogues into (ZP1/ZPB1) coding ZP1 and (ZPB/ZPB2) coding ZP4 [19]. In newer literature, genes encoding four ZP glycoproteins are termed ZP1-4 to avoid nomenclature confusion [20], which is in accordance with HUGO nomenclature. From here on, we will use the HUGO nomenclature of ZP glycoproteins. Depending on species, either ZP1 or ZP4, or both are present. Synthesis of ZP glycoproteins was attributed to the growing oocyte in mice [13] whereas, in humans and other species (e.g., domestic pig, cattle, rabbit, and dog), granulosa/cumulus oophorus cells contribute to the synthesis and deposition of ZP as well [20]. ZP glycoproteins are conserved throughout the mammalian species sharing a high amino acid sequence identity between individual ZP1-4 homologs.

Table 1. Summary of zona pellucida (ZP) glycoproteins in different mammalian species. ZP protein AA sequences were taken from the UniProtKB database, uniport.org and the sequence alignment was performed using BLAST® software blast.ncbi.nlm.nih.gov/BlastAlign.cgi.

2.1. ZP Glycoproteins in the Mouse Model

In the best-studied animal model, a mouse, ZP is composed of three glycoproteins: mZP1 (200 kDa, dimer), mZP2 (120 kDa, monomer), and mZP3 (83 kDa, monomer) [23]. mZP1 shares the domain architecture with ZP4 that is expressed in other mammals such as human, pig, bovine, and dog (see relevant references in Fahrenkamp et al. [15]), and their genes are considered paralogous [22][24][22,24]. ZP4 (ZPB/ZPB1) is a pseudogene in mice and therefore not expressed. The basic structural elements of murine ZP are repeating fibers formed by a pair of glycoproteins mZP2 and mZP3 (heterodimers) linked together by a dimer of mZP1 glycoprotein [23][25][23,25]. The estimated molar ratio of ZP1/ZP2/ZP3 is 1:4:4 [41]. Functional ZP glycoproteins consist of domains, including the signal peptide, ZP “domain” modules responsible for ZP polymerization, the consensus protease cleavage site, and a GPI-anchor [21]. ZP1 and ZP4, on top of the aforementioned domains, also contain the trefoil domain.

2.2. ZP Glycoproteins in the Humans

Contrary to the mouse, humans express all four ZP genes resulting in four ZP glycoproteins termed hZP1, hZP2, hZP3, and hZP4 [28]. hZP1 and hZP4 are paralogs, and their amino acids sequences share 47% identity. Human hZP1, hZP2, hZP3 amino acid sequences show 68%, 58%, and 68% homology with mouse mZP1, mZP2, and mZP3 glycoproteins, respectively (https://blast.ncbi.nlm.nih.gov/). Comparing the amino acid sequences between human ZP2, ZP3, and ZP4 and porcine glycoprotein homologs, there is 64%, 74%, and 68% sequence identity [27]. SDS-PAGE analysis revealed hZP2 as a 120 kDa band, hZP3 as a 58 kDa band, and the 65 kDa band contained both hZP4 and hZP1 [26]. The assembly of ZP glycoproteins into a matrix has been studied in a mouse model and was discussed above. It was reported recently that a frameshift mutation in the human ZP1 gene caused primary female infertility as a result of the absence of the ZP2-ZP3 filament crosslinking and the inability to form a stable ZP matrix [29].

2.3. ZP Glycoproteins in the Pig Model

Porcine ZP is composed of three ZP glycoproteins, pZP2-4. ZP1 is a pseudogene in the pig, and therefore ZP1 is not expressed. SDS-PAGE analysis revealed pZP2 (ZPA/PZPL) as a 90 kDa band that splits under reducing conditions into two smaller bands of 65 kDa and 25 kDa [31][32][33][36][31,32,33,36]. Both pZP3 (ZPC/ZP3-β) and pZP4 (ZPB/ZP3-α) migrated as 55 kDa protein bands [38]. pZP3 and pZP4 make about 80% of total porcine ZP glycoproteins [30][32][30,32]. The pZP2 and mouse mZP2 homologs share a 55% amino acid sequence identity, while pZP3 and mouse mZP3 share a 66% amino acid sequence identity (https://blast.ncbi.nlm.nih.gov/). The pZP4 was implied to have the same function as the mZP1 paralogue [35][37][35,37]. It was later predicted that similar to mice, pig ZP filaments are formed by pZP3 and pZP4 heterodimers, crosslinked with pZP2 based on their estimated molar ratio of 1:6:6 (pZP2:pZP3:pZP4) [34].

2.4. ZP Glycoproteins in the Bovine Model

Similarly, as in the pig, three glycoproteins were identified in bovine ZP, termed bZP2 (ZPA), bZP3 (ZPC), and bZP4 (ZPB) [39], and the ZP1 is a pseudogene. Furthermore, SDS-PAGE analysis of deglycosylated ZP glycoproteins showed that bZP2 migrated at 76 kDa, bZP3 at 47 kDa, and bZP4 at 68 kDa. Similar to the domestic pig, bZP2, under reducing conditions, split into two smaller bands of 63 kDa and 21 kDa [39]. Amino acid sequences of bovine ZP glycoproteins show high similarity to their pig counterparts, i.e., 78%, 84%, and 76% for ZP2, ZP3, and ZP4, respectively (https://blast.ncbi.nlm.nih.gov/). bZP4 was found to have the strongest sperm-binding activity among the components, while bZP3 had about one-sixth that of bZP4 [40]. The estimated molar ratio of bZP2/bZP3/bZP4 in bovine is 1:2:1 [41].

3. Carbohydrate Structure and Glycosylation of ZP Glycoproteins

All ZP glycoproteins are highly heterogeneous due to post-translational modification by glycosylation of serine/threonine (O-linked glycosylation) and asparagine (N-linked glycosylation) residues, which are mostly sulfated and sialylated. Structures of the glycan portion of ZP proteins have been characterized by in-depth and reviewed in-detail [7][42][43][44][7,42,43,44]. The carbohydrate content of ZP is estimated at 15–54% (w/w), and its heterogeneity is reflected as sets of trailing spots on 2-DE electrophoretograms. The glycosylation sites of individual oligosaccharides and cognate carbohydrate-binding proteins are involved in the sperm-ZP binding in many species in a species-specific manner [45][46][47][45,46,47].

In the 1990s, the sugar structures of ZP have deducted from lectin-binding studies. Some conserved carbohydrate structures were found in almost all species investigated, such as mannose and N-acetylglucosamine that are common components of the core of N-linked oligosaccharides [48][49][50][48,49,50]. On the other hand, β-galactose was found in mouse and bovine but not in porcine ZP [51]. Terminal N-acetylgalactosamine and α-galactose residues constitute minor components in murine and bovine ZP, whereas porcine N-glycans are lacking these N-acetylgalactosamine and α-galactose residues [45]. Human ZP also contains mannosyl, N-acetylglucosaminyl, and β-galactosyl residues and βGal-(1–3)GalNAc sugar sequences that are exposed only after removing terminal sialic acid residues [49]. Sialyl-Lewisx structures are uniquely present in human ZP [52].

3.1. Glycosylation in the Mouse Model

The basic structure of N-linked oligosaccharides (complex-type) in mice is similar to porcine ZP [53][54][53,54]. Also, bovine N-linked glycans show practically the same structure as their murine and porcine homologs [55]. Species-specific differences are most obvious in the structure of neutral N-linked carbohydrates [56]. In the pig and cattle, neutral oligosaccharides represent about 25% of the total carbohydrate portion, whereas in the mouse they are present at less than 5%. Variations in other species are in di-, tri-, tetra-antennary chains, sulfation, and sialylation. The number of sulfated lactosamine repeats and degree of sialylation in both N- and O-glycans are the causes of enormous heterogeneity of the ZP glycoproteins in all species [45][57][45,57].

Mouse ZP contains N-linked oligosaccharides with high-mannose and complex-type structures (such as di-, tri-, and tetra-antennary branched N-glycans) as well as O-linked oligosaccharides [58]. The mZP oligosaccharides are complexes containing fucose residues [51] [51] and form mainly acidic tri- and tetra-antennary chains containing lower amounts of sulfates and sialic acids in the N-linked chains [51][58][59][51,58,59]. N-glycans are fucosylated and elongated by non-branched N-acetyllactosamine chains. Acidic glycans contain sialic acids at the nonreducing end or sulfates in the C-6 position of the N-acetylglucosamine residues of the lactosamine repeats [45][55][45,55]. N-acetyl-D-lactosamine (LacNAc), sialized LacNAc, and terminal N-acetylglucosamine (GlcNAc) were found as terminal units of N-linked oligosaccharides. In O-linked oligosaccharides, the majority were core-2 type O-N-acetylgalactosamine [58], with mainly sialic acid found as a terminal unit [60]. Mouse ZP glycoproteins are composed of 16 potential N-glycosylation sites, with 15 of them being actually occupied [61]. The mZP1 contains four, mZP2 six and mZP3 six N-glycosylation sites. Mouse ZP has many additional potential O-glycosylation sites that are less utilized. There are as many as 82 potential O-linkage sites in mZP1, 84 in mZP2 and 58 in mZP3 [61]. mZP1 is more O-glycosylated than N-glycosylated, whereas mZP2 is predominantly N-glycosylated, with low or no O-glycosylation, and mZP3 is more N-glycosylated with relatively low O-glycosylation [61].

3.2. Glycosylation in the Humans

The glycan profile of human ZP is unique compared to other mammalian species [62]. Even though the lectin studies initially indicated a high content of D-mannose in human ZP [49], ultrasensitive mass spectrometric analyses revealed the absence of the high-mannose type chain [63]. Human N-linked ZP glycans have bi-, tri, and tetra- antennary fucosylated complex-type structures, and are terminated with sialyl-Lewisx (SLEX) and sialyl-Lewisx-Lewisx. O-linked glycans in human ZP are core-1, and -2 type O-N-acetylgalactosamine, but only core-2 type possess terminal SLEX [63]. Sialyl-Lewisx sequences on O- and N-glycans are important for sperm-oocyte binding. Human sperm-egg binding depends primarily on the recognition of terminal SLEX that is expressed on about 85% of all N-glycans [52][63][52,63]. SLEX was found to be expressed more densely in the outer region of ZP than in the inner layer [52]. In human hZP2, hZP3 and hZP4 glycoproteins, the N-linked glycosylation is predominant. Although N-linked glycosylation occupies 37%, 27% and 18% of the molecular mass of hZP2, hZP3, and hZP4, respectively, the percentages of O-linked glycosylation are only 8% for hZP2, 9% for hZP3 and hZP4 seems to be without O-linked glycosylation [26].

3.3. Glycosylation in the Pig Model

As in the other species previously discussed, porcine ZP glycoproteins are highly heterogeneous due to varied amounts of sialylated and/or sulfated poly-N-acetyllactosamine [64] [64]. N-linked chains are composed of neutral and acidic chains at a molar ratio of about 1:3 that constitute di-, tri- and tetra-antennary N-glycans complex with α-fucosyl residue in the innermost N-acetylglucosamine [65]. The main neutral N-glycans of porcine ZP glycoproteins belong to the di-antennary fucosylated glycans containing N-acetyllactosamine chains [45] [45] and are implicated in sperm-oocyte recognition [34]. Highly sulfated acidic N-glycans consist of poly-N-acetyllactosamine sequences of different lengths, sulfated at the C-6 position of GlcNAc [54]. In contrast to the N-glycans of ZP in cyclic sows, a lower degree of glycan sulfation in the prepuberal zona pellucida has been reported [66]. N-linked glycans contain fucose residues but no high mannose chains [51] [51]. The largest ZP glycoprotein in the pig, pZP2 has six, pZP3 three, and pZP4 five potential N-glycosylation sites. In addition, pZP4 contains three and pZP3 six potential O-glycosylation sites [37]. Sugar-mapping of pZP4 glycopeptides has revealed that all three potential N-glycosylation sites Asn203, Asn220, and Asn333 of the mature pZP4 carry neutral bi-antennary N-glycans, whereas only Asn220 is also glycosylated with neutral tri- and tetra-antennary chains. At least one disulfide bond between the neighboring cysteine residues Cys224 and Cys243 has been localized in the N-terminal part of pZP4 [45][57][45,57]. O-linked glycans comprise 9 neutral and 26 acidic unbranched chains of core-1 O-N-acetylgalactosamine type [67]. Similar to N-linked glycans, the O-linked glycans are sulfated at the C-6 position of GlcNAc and/or sialylated. The N-glycosylation of porcine ZP glycoproteins, which occurs during meiotic maturation is crucial in sperm-ZP interactions, including sperm binding to ZP and induction of AE in ZP-bound sperm [68]. Nevertheless, the binding and induction of AE in boar spermatozoa do not require the participation of terminal Galα1-3Gal sequences [69].

3.4. Glycosylation in the Bovine Model

Thus far, only N-linked glycans have been reported in bovine ZP [51]. Bovine ZP glycoproteins are contained with 23% of neutral carbohydrate chains, of which the main constituent is high-mannose-type oligosaccharide structure, and 77% of acidic chains with a high content of sialic acid as opposed to the high content of sulfation that is typical for the pig [59] [59]. Bovine ZP glycans are therefore more similar to those of the mouse than the pig and human. The acidic N-linked glycans of bovine ZP contain di-, tri- and tetra-antennary sialylated complex-type structures with a fucose residue at their reducing ends [51]. Molecular cloning of bovine ZP revealed five potential N-glycosylation sites in bZP4 (ZPB), three potential glycosylation sites in bZP3 (ZPC), and four potential N-glycosylation sites in bZP2 (ZPA) [40][70][40,70]. Further studies confirmed bZP2 being N-glycosylated at Asn83, Asn191, and Asn527 [71], and bZP2 being N-glycosylated at Asn124, and Asn146 [70].