2.1. HIV-1 Env
The HIV-1 Env is a heavily glycosylated type I membrane protein with five conserved regions 1–5 (C1-C5) and five variable regions 1–5 (V1-V5) in gp120; and various segments in gp41 including a hydrophobic fusion peptide (FP), two heptad-repeat regions (HR1 and HR2), a disulfide-bridged loop (C-C loop), a tryptophan-rich membrane proximal external region (MPER), a transmembrane segment (TM) and a cytoplasmic tail (CT) (A). The protein has been historically a very challenging target for structural analysis due to technical difficulties associated with large membrane-bound glycoproteins. Nevertheless, a truncated version of gp120, named ‘gp120 core’, with V1-V3 and terminal segments deleted, was crystallized in two forms: a deglycosylated one in complex with CD4 and a CD4-induced antibody for HIV-1 [
17] and an unliganded and fully glycosylated one for closely related simian immunodeficiency virus (SIV) [
18], producing structures that gave us the first glimpse of gp120 folding and its interaction with CD4. Likewise, the structure of a gp41 fragment of HR1 and HR2 has been solved by X-ray crystallography and nuclear magnetic resonance (NMR) [
7,
8,
19,
20,
21], revealing the postfusion conformation of gp41 as a six-helix bundle, in which the HR1 and HR2 helices are arranged into a trimer of hairpins.
The first breakthrough on high-resolution structures of the Env trimer only came more than a decade later from a designed soluble construct, termed ‘SOSIP’, with stabilizing modifications (i.e., a disulfide bond between gp120 and gp41, an I559P substitution in gp41, and a truncation at residue 664 deleting the MPER; [
22]) by both cryogenic electron microscopy (cryo-EM) and X-ray crystallography [
23,
24,
25] (B). Subsequently, the structure of a detergent-solubilized Env trimer without the CT and SOSIP modifications was determined in complex with neutralizing antibodies by cryo-EM [
26]. More recently, the cryo-EM structures of two full-length HIV-1 Env constructs purified in detergent have also been reported [
27,
28]. These trimer structures have shown that the prefusion gp41 adopts a drastically different conformation from the postfusion six-helix bundle structure and provided much-needed insights on Env structure and its conformational changes. The MPER, TMD, and CT are all disordered in these structures, however, highlighting the important role of the lipid bilayer in stabilizing the structure of these regions. An attempt to determine the structure of the missing regions using a full-length Env reconstituted in lipid nanodiscs did not yield much additional high-resolution information [
29]. In addition, cryo-electron tomography (cryo-ET) has been used to study the structures of Env trimer on the surface of both HIV and SIV chemically inactivated virions, leading to reconstructions at a low resolution (~20 Å) during early days [
30,
31,
32,
33], and a more recent one at ~10Å resolution (B; [
34]), but the regions near the membrane remain difficult to resolve.
Recent data indicate that the membrane-related components of HIV-1 Env, including the MPER, TM domain (TMD), and CT, influence the stability and antigenicity of the Env ectodomain, as well as cell–cell fusion and viral infection [
12,
35,
36,
37,
38,
39], in agreement with their conserved features. For example, the MPER has been studied extensively because it contains epitopes recognized by a group of broadly neutralizing antibodies [
40,
41,
42,
43,
44]. The TMD has a ‘GXXXG’ motif and a highly conserved positively charged residue (Lys or Arg). The CT includes the Kennedy sequence, three conserved amphipathic α-helices segments referred to as a lentiviral lytic peptide (LLPs: LLP1, LLP2, and LLP3) [
45,
46,
47]. Truncation of the CT of the full-length HIV-1 Envs has minimal impact on their fusogenic activity, but it has an unexpectedly large impact on the antigenic structure of the ectodomain [
35]. Some other studies showed that the CT modifications had little effect on the Env antigenicity for certain HIV-1 isolates [
48,
49,
50]. Nevertheless, structural studies in the context of a lipid bilayer appear to support crosstalk between the CT and the ectodomain.
NMR spectroscopy and isotopic labeling techniques enable the structural determination of peptides and small proteins at high resolutions, particularly for those with some flexibility. 2D Transverse relaxation-optimized spectroscopy (2D-TROSY) combined with 3D spectroscopies and selective deuterium-labeling methods allows structural characterization of oligomeric membrane proteins in micelles or bicelles. Moreover, functional dynamics and protein–protein/ligand interactions can be quantitatively measured by NMR techniques at the residue level. Thus, NMR spectroscopy is a powerful approach suitable for structural analysis of the MPER, TMD, and CT regions of gp41 [
51]. For instance, the structure of the TMD reconstituted in bicelles mimicking a lipid bilayer was first determined by NMR [
36] (B). The TMD forms a well-ordered trimer, and that mutational changes disrupting the TMD trimer alter antibody sensitivity of the ectodomain, suggesting that the TMD contributes to Env stability and antigenicity. Moreover, although previous studies reported that the MPER might be buried in the viral membrane [
52,
53,
54], the NMR structure that contains both the MPER and TMD in the bicelle system has shown that the MPER forms a well-ordered trimeric assembly, not buried in the membrane [
12] (B). The MPER mutations can destabilize the Env ectodomain and shift it towards an open conformation, suggesting that the MPER is a control relay that modulates open and closed states of the Env trimer. Furthermore, a third NMR structure containing the TMD and part of CT in bicelles has been reported recently [
13] (B), revealing that the CT folds into amphipathic helices, attached to the bilayer surface, wrapping around the C-terminal end of the TMD and thereby forming a support baseplate for the entire Env. Mutational data have also confirmed that altering the CT-TMD packing interface affects the antigenicity of the full-length Env trimer. These results support the model of a dynamic coupling across the TMD between the CT and ectodomain. Nevertheless, a high-resolution structure of the native, full-length HIV-1 Env in the membrane is still required for a full understanding of its structure and function.
2.3. Coreceptor
CD4 alone was not sufficient to support HIV-1 infection, leading to intensive search and subsequent identification of CXCR4 and CCR5, the seven-transmembrane (7TM) chemokine receptors, as the coreceptor for the virus [
61,
62,
63,
64,
65,
66]. Coreceptor usage is the primary determinant for viral tropism [
67], as those that use CCR5 (R5 viruses) are the dominant form during viral transmission, and others using CXCR4 (X4 viruses) or both (dual-tropic; R5/X4 viruses) emerge mainly during disease progression [
68,
69,
70,
71]. Both CCR5 and CXCR4 have a core structure formed by 7TM helices, decorated by an N-terminal segment and three extracellular loops (ECL) exposed outside of the cell, as well as three intracellular loops (ICL), and a cytoplasmic C-terminal tail on the opposite side of the membrane (A). A C-terminally truncated CXCR4 construct with stabilizing mutations, and a T4 lysozyme fusion in complex with different ligands, and a similarly modified CCR5 construct containing a rubredoxin fusion in complex with the anti-HIV drug, Maraviroc, have been crystallized, yielding high-resolution structures [
15,
16,
72] (D), with a typical 7TM helical bundle topology typically seen for other GPCRs (C; [
73]). A two-site model has been proposed for their ligand interactions [
74], as the N-terminal segment of CXCR4 or CCR5 forms the chemokine recognition site 1 (CRS1) to bind the globular core domain of chemokine, while their TM helices make up the chemokine recognition site 2 (CRS2) to interact with the N-terminus of the chemokine. These structures have revealed the general architecture of these chemokine receptors and their interactions with the ligands [
15,
16,
72,
75], but they did not provide many of the molecular details of how they function as HIV-1 coreceptors.