Human Immunodeficiency Virus type 1 (HIV-1) is a lentivirus containing two RNA genomes that must be reverse transcribed into double-stranded DNA and then integrated into the host genome to ensure a productive infection. HIV-1 can infect host CD4+ T cells and macrophages, resulting in CD4+ T cell loss and immune dysfunction that leads to the acquired immune deficiency syndrome (AIDS).
HIV-1 has evolved to usurp the host machinery to replicate and spread, as well as to counteract host immune defenses. These processes are achieved mainly from the physical interactions between viral and host cellular proteins, which contribute to the uniqueness of HIV-1 infection [1][2].
HIV-1 infection relies on the viral envelope to bind to the host CD4 receptor and CXCR4 or CCR5 coreceptors for attachment and fusion, resulting in the release of the viral core, capsid, into the cellular cytoplasm [3]. Capsid is comprised of ~250 hexamers and exactly 12 pentamers, assembled from ~1500 copies of monomeric capsid proteins (monomer is defined as CA herein) [4]. This conical capsid houses the viral RNA genome, and the replicative enzymes reverse transcriptase (RT) and integrase (IN), which are required for reverse transcription and integration [5]. Once capsid is present in the cytoplasm, it traffics along the microtubule network to the nuclear envelope, and then is imported into the nucleus in an intact or nearly intact state via the nuclear pore complex (NPC) [6][7]. Reverse transcription appears to be completed in the nucleus, followed by complete uncoating of the capsid and integration of the viral DNA into the host cell genome [8]. However, the exact mechanisms of capsid nuclear entry and when uncoating starts remain to be determined. These events are the early stages of the HIV-1 replication cycle (see Figure 1 for an overview). The successfully integrated viral genome is then transcribed and translated into viral proteins by host machinery to generate new infectious virions, defined as late stages of the HIV-1 replication cycle [9].
Figure 1. Schematic overview of the early stages of Human Immunodeficiency Virus type 1 (HIV-1) replication, which is spatiotemporally regulated by diverse capsid-interacting host factors.
Capsid performs a protective role by shielding the viral genome from host innate immune detection and provides a micro-environment for reverse transcription [10][11]. Moreover, the exterior surface of the capsid has functions beyond the simple encapsidation by providing an expansive area for recognition by diverse cellular proteins acting as either pro-viral or anti-viral factors [12][13][14][15]. Noticeably, these host factors specifically bind to unique high-order interfaces only present in the assembled capsid lattice, and they have no or low affinity to CA monomers, underscoring the fact that capsid structural integrity is critical for the interplay with host factors [16].
Antiretroviral therapy (ART) has turned AIDS from a ‘virtual death sentence’ to a ‘chronic manageable disease’ by targeting and inhibiting the function of viral enzymes (reverse transcriptase, integrase, and protease) [17]. However, the emergence of ART resistance, the absence of an effective vaccine, and the need for long-lasting anti-retroviral compounds to enhance treatment adherence, are reasons for continued research to develop new therapeutics [18][19][20]. A number of non-viral enzymatic targets have been investigated as possible antiretroviral targets. One successful example is the development of the co-receptor antagonist maraviroc which blocks the cellular entry of HIV-1 by preventing the interaction between viral envelope and CCR5 [21][22]. This is the first and only Food and Drug Administration (FDA)-approved anti-HIV drug targeting a host cellular protein. Due to the multiple roles capsid plays during infection, it has been exploited as a target for anti-retroviral compound screening [23][24]. A handful of compounds have been reported to bind capsid and disrupt infection, of which, PF-3450074 (PF74) and the Gilead compounds, GS-CA1 and GS-6207, are potent inhibitors [25]. GS-6207 has entered clinical trials [26]. These potent capsid inhibitors have similar scaffolds and share the same capsid binding pocket with the host proteins CPSF6 and Nup153, which both mediate capsid nuclear entry [26]. These findings underscore the value of investigating whether additional host factor-binding sites on capsid can be identified and exploited for novel therapeutic interventions.
Further to the point of identifying and investigating capsid–host factor binding, which may allow small molecule targeting, HIV–host factor identification studies have benefited significantly from the application of high-throughput “-omic” screening techniques [27][28].
The viral capsid shell consists of roughly 1200–1500 monomeric capsid proteins (named CA) assembled into approximately 250 CA hexamers and exactly 12 CA pentamers producing fullerene cone geometry [4][29][30] (Figure 2A–D). Five and seven CA pentamers incorporated at the narrow and broad end of the cone respectively, close the capsid and induce the characteristic curvature. This cone-like capsid measures ~60 nm at its broad end and tapers to 20–30 nm (Figure 2D). However, capsid shapes can vary from the cone-like geometry to tube-like structures with capped ends [31]. As an α-helical protein, each CA has two structurally distinct domains, an N-terminal domain (CA-NTD) and a C-terminal domain (CA-CTD), connected by a flexible linker (Figure 2A). The CA-NTD contains seven α-helices (α1–α7), a characteristic extended cyclophilin A (CypA)-binding loop and a β-hairpin, whereas CA-CTD consists of four α-helices (α8–α11), a short 310-helix, and the major homology region (MHR). The assembly and stability of the capsid shell is driven by three sets of intermolecular protein–protein interactions between CA subunits: (1) intra-hexameric NTD-NTD contacts between individual CA molecules stabilize the hexamers and pentamers that function as the building blocks of the capsid, as well as form a central pore gated by the β-hairpin, (2) intra-hexameric NTD-CTD contacts between adjacent CA molecules further stabilize the individual hexamers and pentamers, and (3) inter-hexameric CTD-CTD contacts participate in dimeric and trimeric interactions to link the individual hexamers and pentamers [31][32][33] (Figure 2D). Therefore, the capsid core not only shields the viral genome but also provides expansive and diverse interfaces for cellular protein interactions. The unique capsid architecture underlies intrinsic core stability and provides for capsid–host factor interactions.
Figure 2. HIV-1 capsid architecture and summary of binding interfaces of capsid with diverse host factors. During the maturation of HIV-1 virion, ~1500 CA monomers assemble into ~250 hexamers and exactly 12 pentamers in alignment produce fullerene cone geometry, forming capsid (Figure 2A–D). The interfaces within the capsid which bind diverse host factors are highlighted in different colors, as detailed in the Figure 2E legend. (A) Structure of the CA monomer (PDB ID: 4XZF). The left structure is shown in surface representation and the right structure as ribbon representation. CA consists of two α-helical domains, an N-terminal domain (NTD) labeled in gold and a C-terminal domain (CTD) labeled in light blue, connected by a flexible linker (grey). All α-helices (H1-H11) and key structural elements are indicated by arrows with names. (B) Structure of a CA hexamer with top view (left) and side view (right) (PDB ID: 4XZF). (C). Structure of high-order (hexagonal) CA hexamers with top view (left) and side view (right) (PDB ID: 4XZF). (D). Architecture of the capsid (PDB ID: 3J3Y). The incorporation of pentamers (orange) at either end of capsid provides the curvature necessary to close the conical structure. (E) The interfaces within capsid that can bind diverse host factors are highlighted in different colors, which are shown in Figure 2A–D. Host proteins CypA, Nup358, and TRN-1 bind to the flexible CypA-binding loop (labeled in red) exposed on the top of CA-NTD that protrudes from the capsid outer surface. Host proteins CPSF6 and Nup153 share the same phenylalanine-glycine binding pocket (labeled in green) created by the NTD-CTD interface between two neighboring CA monomers in a hexamer. The restriction factor Mx2 specifically binds to the three-fold inter-hexamer interface (labeled in purple) of capsid; moreover, it is unable to bind to a CA monomer or a single hexamer. Host protein FEZ1 and metabolite IP6 can bind to the positively charged central pore of hexamers formed by R18 (labeled in dark blue) of CA. TRIM5α can form a hexagonal network (labeled in pink, shown in Figure 2D, also called TRIM5α cage) that avidly binds the capsid shell. The extent to which the TRIM5α cage can cover the capsid and how TRIM5α directly contacts the capsid surface have not been fully established.
This entry is adapted from the peer-reviewed paper 10.3390/v13030417