2.2. SARS-CoV-2 Attaches to RBCs, Other Blood Cells and Endothelial Cells
Several of the glycans at the 22 N-glycosylation sites and both O-linked glycans on SARS-CoV-2 spike protein (as depicted in Figure 3) are capped with SA monosaccharides of the same type that are densely distributed on human RBCs at the tips of GPA molecules, as will be considered in some detail below. These matching glycans enable SARS-CoV-2 to hemagglutinate human RBCs when mixed with them, as demonstrated using the hemadsorption assay [35] (similar to the hemagglutination assay [106,107]). Hemagglutination occurs more generally in eight families of viruses, including other coronaviruses [35,37,38,39,40,41,42,43,44,45,46,47,48]. Attachment of SARS-CoV-2 spike protein to RBCs was demonstrated directly through immunofluorescence analysis of RBCs from the blood of nine hospitalized COVID-19 patients [36]. The mean percentage of RBCs having SARS-CoV-2 spike protein puncta was 41% at day 0 of hospital admission, with individual values ranging from 0% for one patient and 18% for two patients up to 79% for another patient. This mean percentage increased to 44% at day 7 after hospital admission.
Like RBCs, several other cell types express surface SA glycoconjugates and can thus also attach to SARS-CoV-2 virus particles. SA and SA-tipped CD147 are expressed on endothelial cells of blood vessel linings (luminal surface), platelets, lymphocytes, macrophages, and other types of white blood cells [22]. The potential for pathogen attachments to SA and CD147 to impede vascular blood flow is illustrated by another disease, severe malaria, in which the malaria parasite attaches to SA-binding sites on an RBC [73,108,109] and penetrates the RBC through the latter’s CD147 receptors [110,111]. Clumps develop between infected and uninfected RBCs, often including platelets, which, along with endothelial cytoadhesion by infected RBCs, cause vascular occlusion, the key morbidity of severe malaria [22].
The interlaced attachments of RBCs with SARS-CoV-2 virions as observed in vitro in the hemadsorption assay [35] may well define the mechanism by which clumps (rouleaux) of RBCs form in the blood of COVID-19 patients [112,113,114], as shown in Figure 2. These clumps would present vascular obstructive potential, given that an RBC of average disk diameter 8 µm [115,116] passes through an alveolar capillary of smaller average cross-sectional diameter [117] only by distorting its shape to press against the capillary wall [116,118]. Such RBC clumps could be a prime cause of the microvascular occlusion which, as noted above, is characteristic of COVID-19. These clumps could contribute as well to microvascular occlusion in larger capillaries, of cross-sectional diameter up to 20 µm [115,119], elsewhere in the body. Vascular occlusion would be promoted even though such stacked RBC clumps (rouleaux) typically aggregate and disaggregate dynamically [120,121,122], as do RBC aggregates that can form in the absence of pathogens, promoted by macromolecules in plasma under conditions of low blood shear rates [123,124]. Formation of RBC rouleaux would increase blood viscosity [121,122], impeding blood flow, especially in the small-diameter pulmonary capillaries. This effect would cascade, as the reduction in both flow velocity and associated shear forces would in turn favor aggregation over disaggregation and further occlude flow [121].
Figure 2. Images of RBC rouleaux (clumps) from the blood of COVID-19 patients, obtained using light microscopy ((A) [112], (B) [113]) and electron microscopy ((C) [114]). The first study (A) found huge rouleaux formation by RBCs in 85% of COVID-19 patients studied [112]; the second (B) found these in 33% of patients [113]; and the third (C) found these prevalent in its series of 31 patients, all with mild COVID-19 [114]. Reproduced with permission from (A) SIMTIPRO Srl; (B) CC-BY 4.0; (C) Georg Thieme Verlag KG.
The abundant distribution of SA-tipped CD147 on endothelial cells of blood vessel linings, with 28,000 CD147 receptors (vs. 175 ACE2 receptors) per endothelial cell [125], may be key to the attachments of SARS-CoV-2 to endothelium and the ensuing damage that has been widely observed in COVID-19 patients [15,18,19,126,127,128]. Damage to endothelium caused by SARS-CoV-2 spike protein in the absence of whole virus was demonstrated both in vitro and in vivo in three studies [129,130,131], and the presence of isolated SARS-CoV-2 spike protein on endothelial cells was also observed clinically [130,132,133,134,135]. Additionally, one study of 31 hospitalized patients with mild to moderate COVID-19 found that serum levels of circulating endothelial cells (CECs), as determined by different measures, were up to 100-fold the levels for matched controls, and that these CECs from the COVID-19 patients typically each had several holes in their membranes approximately the size of the SARS-CoV-2 viral capsid (the viral envelope) [114].
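As a rough illustration of the receptor imbalance quoted above, the following Python sketch computes the CD147-to-ACE2 ratio from the two per-cell counts cited in the text [125]. The constant and function names are illustrative assumptions, and the calculation addresses relative abundance only, not binding affinity or accessibility.

```python
# Per-endothelial-cell receptor counts as quoted in the text [125].
CD147_PER_CELL = 28_000
ACE2_PER_CELL = 175


def receptor_ratio(cd147: int = CD147_PER_CELL, ace2: int = ACE2_PER_CELL) -> float:
    """Number of CD147 receptors per ACE2 receptor (hypothetical helper)."""
    return cd147 / ace2


def ace2_fraction(cd147: int = CD147_PER_CELL, ace2: int = ACE2_PER_CELL) -> float:
    """ACE2 share of the combined CD147 + ACE2 receptor pool."""
    return ace2 / (cd147 + ace2)


if __name__ == "__main__":
    print(f"CD147 receptors per ACE2 receptor: {receptor_ratio():.0f}")
    print(f"ACE2 share of combined pool: {ace2_fraction():.2%}")
```

On these counts, CD147 outnumbers ACE2 by 160 to 1 on an endothelial cell.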
2.3. The Glycan Distribution and Composition at the 22 N-Glycosylation Sites of SARS-CoV-2 Spike Protein
To explore key characteristics of glycan-mediated SARS-CoV-2 spike protein binding to host cells, it is helpful to consider the specific distribution and composition of the glycans at its glycosylation sites. SARS-CoV-2 spike glycoprotein is a trimer with a central helical stalk embedded in the viral envelope at its C-terminal end. The stalk consists of three joined S2 subunits, each capped with an S1 subunit head, spreading out in a mushroom-like shape [136,137,138,139]. An atomistic model of a full-length trimeric SARS-CoV-2 spike protein with its attached glycans, its C-terminal (stalk) end embedded in the viral envelope, is shown in Figure 3.
Figure 3. Atomistic model of the full-length trimeric S protein of SARS-CoV-2 shown in cartoon representation. Reproduced from Sikora et al., 2021 [140] (CC-BY 4.0). The three monomeric chains are differentiated by color. Palmitoylated cysteine residues are shown in pink licorice (only one chain shown for clarity), anchored into the viral envelope. Glycans are shown in green licorice representation. A 20 s movie showing a 600 ns atomistic molecular dynamics simulation trajectory of four S proteins embedded in a viral membrane is also provided at this source [140].
A map of the 22 N-linked glycans on each of a spike’s three monomers is shown in Figure 4. In addition, two O-linked glycosylation sites, S325 and T323, were identified for each spike monomer, both on S1 RBD [86], and both containing SA terminal monosaccharides [141]. Each SARS-CoV-2 virion has a diameter, excluding spikes, of approximately 100 nm, with the number of spikes estimated at up to 65 per virion, these spikes having a length of approximately 20 nm [142,143,144,145].
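The virion dimensions quoted above can be turned into a back-of-the-envelope geometry estimate. The Python sketch below is illustrative only: it assumes a perfectly spherical envelope and evenly spread spikes, neither of which is stated in the cited sources, and it uses the upper estimate of 65 spikes. All function names are hypothetical.

```python
import math

# Approximate dimensions quoted in the text [142-145].
VIRION_DIAMETER_NM = 100.0   # envelope diameter, excluding spikes
SPIKE_LENGTH_NM = 20.0
MAX_SPIKES_PER_VIRION = 65   # upper estimate


def envelope_area_nm2(diameter_nm: float = VIRION_DIAMETER_NM) -> float:
    """Envelope surface area, ASSUMING a perfect sphere (4*pi*r^2 == pi*d^2)."""
    return math.pi * diameter_nm ** 2


def mean_spike_spacing_nm(n_spikes: int = MAX_SPIKES_PER_VIRION,
                          diameter_nm: float = VIRION_DIAMETER_NM) -> float:
    """Typical centre-to-centre spike spacing, ASSUMING spikes are evenly
    spread and each occupies a square patch of equal area."""
    return math.sqrt(envelope_area_nm2(diameter_nm) / n_spikes)


def overall_diameter_nm(diameter_nm: float = VIRION_DIAMETER_NM,
                        spike_length_nm: float = SPIKE_LENGTH_NM) -> float:
    """Tip-to-tip diameter including one spike length on each side."""
    return diameter_nm + 2 * spike_length_nm


if __name__ == "__main__":
    print(f"Envelope area: {envelope_area_nm2():.0f} nm^2")
    print(f"Mean spike spacing at 65 spikes: {mean_spike_spacing_nm():.1f} nm")
    print(f"Diameter including spikes: {overall_diameter_nm():.0f} nm")
```

Under these assumptions the spikes sit roughly 22 nm apart at the maximum estimated density, and the virion spans about 140 nm tip to tip; fewer spikes per virion would give a wider spacing.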
Figure 4. Spike domains and glycosylation. Reproduced from Sikora et al., 2021 [140] (CC-BY 4.0). (A) Domains of S (SARS-CoV-2 spike protein). (B) Glycosylation pattern of S. Sequons are indicated with the respective glycans in a schematic representation for a fully glycosylated system (“full”). A key to the monosaccharides represented is shown at the bottom.
The NTD on SARS-CoV-2 spike protein S1, at its N-terminal end, is a focal region for the spike’s glycans: eight of the spike’s 22 N-glycosylation sites are located there [84,85,86]. The NTD is accordingly the typical point of initial viral attachments to glycoconjugate binding sites on host cells [44,81,82,83,84,85,86,87]. After initial attachment, viral fusion to a host cell begins with linkage of the spike’s receptor-binding domain (RBD), situated just below the NTD on spike S1, to an ACE2 receptor on the host cell membrane. The S2 stalk then becomes engaged and viral replication proceeds [47,138,146]. The RBD, one on each of a spike’s three monomers, constantly switches between open (“up”) and closed (“down”) configurations, the former enabling both ACE2 binding and immune surveillance, the latter blocking both of those functions [136,147].
Our focus now shifts to specifics of virally mediated clumping of RBCs, notwithstanding the importance of inflammatory processes and endothelial damage in triggering and exacerbating the morbidities of COVID-19, especially in its critical phase. Platelets, the second most copious blood component [147], also serve a pathogen clearance role like that of RBCs [148,149,150], and like RBCs have abundant CD147 and SA surface molecules [151,152,153,154,155,156,157,158]. Platelets can adhere to viruses, RBCs and endothelial cells, especially under inflammatory conditions [16,22,149], and are often enmeshed in clumps that develop in severe malaria between infected and healthy RBCs [159,160,161,162]. Virally induced clumping of RBCs, however, is of particular interest for three reasons. First, as noted above, this clumping alone could limit the efficiency of blood oxygenation in the pulmonary capillaries. Second, such clump formation is directly testable both by examination of the blood of COVID-19 patients and by mixing spike protein with RBCs in vitro. Third, as will be detailed, if such virally induced RBC clumping is confirmed, an existing drug that has been found in silico to bind to glycan sites on both spike protein and host cells can be tested in vitro and clinically for inhibition of virally mediated RBC clumping, in conjunction with an anti-COVID-19 therapeutic benefit.
Of the 22 N-linked glycosylation sites at each of the three monomers of SARS-CoV-2 spike protein, eight are located on NTD as noted above, two on RBD, three others elsewhere on S1, and the other nine on S2 [84,85,86,87]. N1194 is the N-glycosylation site closest to the C-terminal domain, the end of the spike attached to the virion. In different studies, glycans have been identified as populating between 17 and 21 of these 22 N-glycosylation sites [86,87,163,164]. One study found that ten of these 22 sites have terminal SA moieties, in particular of α5-N-acetylneuraminic acid (Neu5Ac), the predominant type of SA found in human cells [25,30]. The terminal monosaccharides on SARS-CoV-2 spike N-glycans other than SA are galactose, mannose, fucose, N-acetylglucosamine (GlcNAc) and/or N-acetylgalactosamine (GalNAc) [84,87]. As noted above, two SA-tipped O-linked glycan sites have been identified as well, both on S1 RBD [86,141]. The NTD of spike protein S1 was found to bind strongly to Neu5Ac in particular [94], the type of SA at SARS-CoV-2 N-glycosylation sites and predominant in human cells.
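The site counts given in this paragraph can be checked for internal consistency with a short tally. The Python sketch below uses only the per-region numbers stated in the text; the dictionary keys and function names are illustrative assumptions rather than established nomenclature.

```python
# N-glycosylation sites per spike monomer, by region, as given in the text.
N_SITES_PER_MONOMER = {"NTD": 8, "RBD": 2, "S1 (other)": 3, "S2": 9}
SA_TERMINATED_N_SITES = 10   # sites with terminal SA, as reported by one study
O_SITES_PER_MONOMER = 2      # both on the S1 RBD
MONOMERS_PER_SPIKE = 3


def total_n_sites(sites: dict = N_SITES_PER_MONOMER) -> int:
    """Total N-glycosylation sites on one monomer."""
    return sum(sites.values())


def s1_n_sites(sites: dict = N_SITES_PER_MONOMER) -> int:
    """N-glycosylation sites on the S1 subunit (everything except S2)."""
    return sum(count for region, count in sites.items() if region != "S2")


def n_sites_per_trimer(sites: dict = N_SITES_PER_MONOMER) -> int:
    """N-glycosylation sites on a full trimeric spike."""
    return MONOMERS_PER_SPIKE * total_n_sites(sites)


def sa_terminated_fraction(sites: dict = N_SITES_PER_MONOMER) -> float:
    """Fraction of N-glycosylation sites reported to carry terminal SA."""
    return SA_TERMINATED_N_SITES / total_n_sites(sites)


if __name__ == "__main__":
    print(f"N-sites per monomer: {total_n_sites()} (S1: {s1_n_sites()})")
    print(f"N-sites per trimer: {n_sites_per_trimer()}")
    print(f"O-sites per trimer: {MONOMERS_PER_SPIKE * O_SITES_PER_MONOMER}")
    print(f"SA-terminated fraction of N-sites: {sa_terminated_fraction():.0%}")
```

The regional counts sum to the stated 22 sites per monomer, giving 66 N-glycosylation sites per trimeric spike if every site is counted, although the text notes that only 17 to 21 were found occupied in individual studies.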
GPA, the major sialoglycoprotein in human RBCs, is of central interest in the attachment of SARS-CoV-2 to RBCs, as observed in vitro [35] and on RBCs of COVID-19 patients [36], as noted above. GPA populates human RBCs at approximately one million molecules per cell and contains most of the SA (of type Neu5Ac) on them [31,33,74,165,166]. GPA molecules have the shapes of strands that are anchored approximately 14 nm apart on the RBC plasma membrane, each extending outwards 5 nm [75]. GPA constitutes the bulk of the RBC’s sialoglycoprotein coating, thus determining its 5 nm thickness [75,167,168], and accounts for most of its negative charge [31,169]. Electrostatic repulsion imposes a minimum distance of approximately 8 nm between the outer boundaries of those sialoglycoprotein coatings of adjoining RBCs in a static suspension, but a smaller separation can be achieved when additional forces are pushing these RBCs together [168,170]. SA in its predominant human form, Neu5Ac, is the most common terminal residue of GPA, with its other terminal monosaccharides matching those on SARS-CoV-2 spike N-glycans as noted above: galactose, mannose, fucose, N-acetylglucosamine (GlcNAc) and N-acetylgalactosamine (GalNAc) [74,75,166]. A representation of a portion of an RBC membrane with strands of GPA and with other glycoproteins interspersed is shown in Figure 5.
Figure 5. A representation of a 350 × 350 Angstrom area of the RBC surface depicting its sialoglycoprotein coating, consisting of GPA molecules, extending approximately 5 nm from the RBC cell membrane, plus other smaller glycoprotein molecules interspersed. Reproduced with permission from Elsevier (Viitala, 1985 [75]).
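The GPA dimensions given above lend themselves to two simple estimates: how many GPA strands would fall within the 350 × 350 Angstrom patch of Figure 5, and the minimum membrane-to-membrane gap between two RBCs in static suspension. The Python sketch below is a rough illustration only. It assumes GPA anchors sit on a regular square lattice, which the source does not state, and the function names are hypothetical.

```python
# Dimensions quoted in the text for the RBC sialoglycoprotein coating.
GPA_SPACING_NM = 14.0          # approximate distance between GPA anchors
COATING_THICKNESS_NM = 5.0     # GPA extension from the membrane
MIN_COATING_GAP_NM = 8.0       # electrostatic minimum, static suspension
PATCH_SIDE_ANGSTROM = 350.0    # side of the area depicted in Figure 5
ANGSTROM_PER_NM = 10.0


def gpa_per_patch(side_angstrom: float = PATCH_SIDE_ANGSTROM,
                  spacing_nm: float = GPA_SPACING_NM) -> float:
    """Expected GPA strands in a square patch, ASSUMING a square lattice
    in which each strand occupies spacing_nm x spacing_nm of membrane."""
    side_nm = side_angstrom / ANGSTROM_PER_NM
    return side_nm ** 2 / spacing_nm ** 2


def min_membrane_gap_nm(coating_nm: float = COATING_THICKNESS_NM,
                        coating_gap_nm: float = MIN_COATING_GAP_NM) -> float:
    """Minimum membrane-to-membrane distance between two RBCs at rest:
    one coating, the electrostatic gap, then the other coating."""
    return 2 * coating_nm + coating_gap_nm


if __name__ == "__main__":
    print(f"GPA strands per Figure 5 patch: {gpa_per_patch():.2f}")
    print(f"Minimum membrane-to-membrane gap: {min_membrane_gap_nm():.0f} nm")
```

Under the lattice assumption, a patch of this size would hold about six GPA strands, and two resting RBC membranes would be kept at least 18 nm apart unless additional forces push the cells together.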