1000/1000
Hot
Most Recent
| Version | Summary | Created by | Modification | Content Size | Created at | Operation |
|---|---|---|---|---|---|---|
| 1 | Tatsuhito Matsuo | + 2707 word(s) | 2707 | 2021-05-26 10:35:24 | | | |
| 2 | Vicky Zhou | Meta information modification | 2707 | 2021-05-27 03:47:21 | | |
SARS-CoV-2 nucleocapsid protein (NCoV2) plays a key role in various processes related to the viral replication cycle such as the RNA genome packaging, interaction with other viral proteins. It has thus been a target for new drug development.
Figure 1. Structural features of the nucleocapsid protein of SARS-CoV-2 (NCoV2). (a) Schematic of NCoV2, which consists of two domains, i.e., the N-terminal domain (NTD) and the C-terminal domain (CTD); (b) atomic structure of the NTD in the RNA-unbounded form (left; PDB ID: 6M3M [12]) and the single-stranded RNA-bounded form (right; PDB ID: 7ACT [13]) are shown in light sea green and magenta, respectively. In the right panel of (b), RNA is denoted in orange and the RNA-unbounded form of the NTD is superimposed for structural comparison. The direction of the movement of the basic finger upon RNA-binding is shown by a black arrow; (c) CTD dimer interface. The CTD monomers (PDB ID: 6WZO [15]) are shown in orange and marine blue, respectively. In (b,c), the residue numbers at both termini of the structures that are visible in the crystal structures are denoted. All the models are depicted using UCSF Chimera [16].
Figure 2. Comparison of the NTD and CTD structures of the nucleocapsid protein of different coronaviruses. (a) (left) Comparison of the NTD structures of the nucleocapsid protein of SARS-CoV-2 (NCoV2; PDB ID: 6M3M [12]) and SARS-CoV (NSARS; PDB ID: 2OFZ [21]), which are shown in marine blue and magenta, respectively. (right) Comparison of NCoV2 NTD and the NTD of the nucleocapsid protein of MERS-CoV (NMERS; PDB ID: 4UD1 [22]) shown in orange. Note that NCoV2 NTD in the left and right panels are displayed at slightly different angles to focus on the structural differences from the NSARS and NMERS counterparts; (b) (left) Comparison of the CTD structures of NCoV2 (PDB ID: 7C22 [18]) and of NSARS (PDB ID: 2CJR [23]), which are shown in marine blue and magenta, respectively. (right) Comparison of NCoV2 CTD and the CTD structure of NMERS (PDB ID: 6G13 [24]) shown in orange. In both (a,b), the major structural differences are shown by grey arrows. All the models are depicted using UCSF Chimera [16].A powerful method to characterize the molecular structure of NCoV2 in droplets would be small-angle neutron scattering (SANS). This technique has been used to characterize the structure of viral proteins in isolated state [32][33] and in complex with other viral components [34][35][36]. However, to the author’s knowledge, the structural and dynamical analysis of droplets formed by virus-related proteins has not been carried out so far. Even though there are excellent reviews and books on neutron scattering [37][38][39][40], some basic information on neutron scattering is given below in order to describe a possible method to reveal the structure and dynamics of NCoV2 droplets. In neutron scattering, the strength of scattering (coherent scattering length) depends on the type of atoms bombarded by neutron. The major feature of coherent neutron scattering is that the scattering length of hydrogen and its isotope deuterium is largely different and thus deuteration technique plays a significant role in the structure analysis of a complex system that consists of different kinds of components [41][42][43]. In the case of the solution samples such as protein solutions, the scattering signal in SANS measurements derives from the difference (contrast) in the scattering length density (SLD: the total scattering length of a molecule divided by its volume) between solutes and solvents. Figure 3a shows the variations of the SLD of major biomolecules (proteins, lipids, DNA and RNA) as a function of the fraction of D2O in the solvent. As the D2O content in the solvent increases, the labile hydrogen atoms in the solutes are exchanged for deuterium atoms, which modifies the SLD of the solutes. The SLD of perdeuterated proteins (denoted as “D-protein” in Figure 3a) is much larger than that of the hydrogenated counterparts because all the hydrogen atoms including those in methyl and ethyl groups are replaced with deuterium atoms. As seen in Figure 3a, the SLD of the solvent containing 40% D2O is equal to that of the hydrogenated protein (H-protein), meaning that the scattering contrast of the H-protein is zero. In this condition, the contribution of the H-protein to the scattering signal is effectively zero and hence it is “invisible” to neutron. Even though techniques to perdeuterate or partially deuterate RNA molecules by chemical synthesis, in vitro transcription and in vivo production have been established for NMR studies [44][45], production of the sufficient amount of perdeuterated RNA for neutron experiments is more expensive and/or more difficult than production of perdeuterated protein and thus there have been only a very few SANS studies using perdeuterated RNA [38][46]. Therefore, in the following, only RNA in the hydrogenated state (H-RNA) is considered.
Figure 3c illustrates a promising method to characterize the NCoV2 structure in droplets. In the case where droplets are formed by NCoV2, RNA and another protein (Figure 3c), three kinds of samples would be required to obtain the scattering signal from each protein: (1) droplets formed by H-NCoV2, H-RNA and the hydrogenated another protein (H-AP), (2) those formed by H-NCoV2, H-RNA and the perdeuterated another protein (D-AP) and (3) those formed by the perdeuterated NCoV2 (D-NCoV2), H-RNA and D-AP. The SANS measurements in these samples in the ~65% D2O solvent, the SLD of which matches that of H-RNA, provide three kinds of scattering data where each protein contribution is different. From these data, the scattering data arising from each protein can be extracted, leading to the elucidation of its conformation together with how each protein is arranged inside the droplets.
The structural fluctuation of the molecules in the droplets are obtained by incoherent neutron scattering (iNS). This technique measures the intensity of neutron scattered by the samples as a function of the momentum and energy change of the neutron, from which the amplitude, the frequency and their distribution of atomic motions at pico- to nanosecond timescale at ångström length scale are estimated. The values of incoherent scattering cross-section of atoms found in biomolecules are shown in Figure 3b. It is found that the scattering cross-section of hydrogen atoms is much larger than any other atom and deuterium, meaning that iNS provides information of the motions of hydrogen atoms. Furthermore, the measurements in 100% D2O buffer can minimize the solvent contribution to the iNS spectra. Since the hydrogen atoms are quasi-uniformly distributed throughout the molecules in the biological systems, the dynamical information averaged over the whole molecule is obtained. In the case of proteins, the observed motions reflect the fluctuations of amino acid side chains and backbones because hydrogen atoms are bound to these chemical groups. The schematic illustration of the dynamical analysis of the NCoV2 droplets is depicted in Figure 3d. For droplets generated from NCoV2, RNA and another protein, two types of samples are required: (1) droplets formed by H-NCoV2, H-RNA and D-AP, (2) those formed by D-NCoV2, H-RNA and D-AP. By subtracting the iNS spectra of the latter from those of the former measured in 100% D2O, the remaining spectra arise from the structural fluctuation of the NCoV2 molecules, thereby providing dynamical information on NCoV2 inside the droplets. Instead, the dynamics of another protein in the droplets can be extracted in a similar manner.
Figure 3. A promising method for physical characterization of the NCoV2 droplets using neutron scattering. (a) Variation of the neutron scattering length density (SLD) of biomacromolecules as a function of the heavy water concentration in the solvent. H-protein and D-protein denote the hydrogenated and perdeuterated proteins, respectively. The values of the SLD of H-protein, D-protein, lipids and solvent are taken from [37], and those of DNA and RNA are taken from [39]; (b) values of incoherent neutron scattering cross-section of atoms found in biomolecules and an isotope of hydrogen atom, deuterium. Note that 1 barn = 10−24 cm2. The values are taken from [47]; (c) schematic illustration of the structural analysis of the NCoV2 droplets using contrast-matching small-angle neutron scattering (SANS). The components that are “invisible” to neutron are shown in the same color as solvent. The prefixes H- and D- denote “hydrogenated” and “perdeuterated”, respectively, which are shown in different colors. AP denotes another protein; (d) schematic illustration of the molecular dynamics analysis of NCoV2 droplets using incoherent neutron scattering (iNS) combined with deuteration technique. For detailed explanations of (c,d), refer to the main text.