2.1. Intermediate States Characterized under In Vitro Conditions
The intermediate states have been exerting a pull in recent times on scientists who put forth research on protein folding mechanisms to present hints for understanding the classical two-state and/or three-state unfolding methods. The first report on the MG state was observed in 1981, where the heat capacity function in the MG state of apo-α-lactalbumin was examined by a scanning microcalorimeter under physiological pH
[24]. Observations of this study showed an enthalpy variation between the MG state and unfolded state (assumed) at neutral pH, which was observed to be almost zero, signifying that the MG state does not show sign any co-operative transition upon heating
[24]. Preceding two decades, another third state, measured as a new thermodynamic state of the hierarchical protein folding process called the PMG, has drawn interest among scientists in the protein folding research field because it presents intimations to comprehend the classical three-phase mechanism in unfolding, observed in many proteins
[15][38][39][40][41][44][46][47][48]. In vitro experiments have revealed that proteins can be guided to the MG state at acidic pH or high temperatures or in moderate doses of chemical denaturants
[54][55]. Many proteins belonging to the structural class of all α or α + β have the majority of MG states characterized and classified in their folding/unfolding routes
[56]. Interestingly, only a few studies of proteins found to be fit into all β-sheet categories, which were recognized as an MG state(s)
[57][58][59]. Downhill folding mechanisms were intended to exist effectively for proteins with highly optimized native interactions under extremely stable conditions
[60][61] or when constructive mutations take place
[62]. Their study concluded that at least in metalloproteins, downhill folding can occur under a much greater choice of conditions and can be associated with a variety of other transitions
[13]. According to the study, the bacterial zinc finger protein Ros87 has a bipartite folding/unfolding process in which a metal-binding intermediate converts to the native structure via a sensitive barrierless downhill transition. These intermediates were examined using DSC, CD, and NMR in a range of pH, temperature, and ionic strength parameters, showing that the downhill mechanism can be discovered under a considerably broader range of conditions and can be related to a variety of other transitions
[13].
2.2. Significance of Intermediary States under In Vivo Conditions
Taking benefits from the protein folding (new wing) (i.e., intermediate states in the cellular conditions) upholds cellular protein homeostasis (proteostasis), which is critical for cell function and development
[63][64]. Besides, the folding process—these intermediates assist in many genetic illnesses
[8][65][66]. Proteostasis is governed in cellular conditions by networks of protein complexes that include the translation machinery
[67][68], proteases
[69][70], ubiquitin–proteasome system (UPS)
[71], secretory pathways
[72][73], autophagic machinery
[74], and molecular chaperones
[3], which have a significant role in protein homeostasis. To illustrate, a non-native compact type of cyt c is implicated in programmed cell death (induces apoptosis), after which the protein is released from the mitochondrion; non-native forms of the protein are also associated in several of the amyloid-related illnesses
[8]. Characterizing the heterogeneity present within the process of folding and unfolding proteins, intermediate states are vital to understanding intermediates and defining their boundaries. The cell intermediate states can be defined as attractors on a potential landscape
[1][75][76].
The intermediates not only help to decipher the enormously complex troubles in protein folding, although this also reveals new insight into the importance of structural changes in proteins within cells, whereas protein intermediates can be imported and exported more easily through membranes than native proteins
[2][3][77]. The native ⇔ molten globule transition is also considered because the conversion of a protein’s native state to a condensed intermediate structure might occasionally allow it to perform different physiological activities inside the cell
[8]. A non-native compact conformation of cyt c, for example, is linked to programmed cell death (apoptosis), whereupon the protein is released from the mitochondrion; non-native forms of the protein are also linked to various amyloid-related diseases
[8]. Nuclear genes code for the majority of mitochondrial proteins, which are formed on cytoplasmic ribosomes and transferred into mitochondrial subcompartments
[2][78]. To preserve the integrity of protein function in cellular compartments, protein sorting and transport through the lipid membrane of the mitochondrion is desired without intervening with the organelle’s integrity or functions. To understand this to a greater extent, molecular specificity and targeting of mitochondrial preprotein mechanisms and postproteins after import–export via an inner membrane and outer membrane facilitates recognition or identification and is characterized by cellular signaling
[2][78]. The presence of these intermediate structures of protein has a significant role in transport via membranes in cellular conditions. The purpose of a set of proteins identified as heat-shock proteins or molecular chaperones located both outside and within the mitochondrion are intimately connected to the unfolding and folding of proteins during transmembrane movement. Investigating the folding of polypeptides in the mitochondrial matrix has provided new and unique findings into general protein folding pathways supported by folding factors
[2][78]. Folding and misfolding of proteins in the human membrane help in the resolution of problems related to health and diseases
[79]. The new perspective that links membrane protein folding energetics with the degree of complexity of biological systems is recognized via intermediates that play an essential role in the import–export of native protein via membranes and can easily interact with the drug to cure diseases. These advancements in the production of therapeutics and precision medicine are influenced by these intermediate structures in cells
[79][80]. From Anfinsen’s experiments
[81], which provided how proteins choose their structural elements from denatured conformations and each fraction competes for renaturation to native state
[79][82]. The complexity of protein folding makes it difficult to comprehend and even describe the process. Much of this heterogeneity can be described and understood using a statistical approach to the energetics of protein structure (i.e., the energy landscape)
[60]. The statistical energy landscape strategy describes why and when particular folding pathways emerge in some proteins, and also how to spot the difference between folding mechanisms that are universal to all sequences and those that are specific to individual sequences. This method also provides fresh quantitative ideas in understanding protein folding thermodynamics and kinetic studies and simulations
[60][83].
The ribulose-bis-phosphate carboxylase/oxygenase, Rubisco (an abundant protein on earth), has greater kinetic facets in plants to enhance photosynthesis quality, resulting in species with high nitrogen and water-use efficiencies. This protein improves crop improvement and can provide relief from the CO2 increase caused by anthropogenic activities that lead to global climate changes. Type I Rubisco is a highly conserved hexa-decameric complex found in cyanobacteria, algae, and plants. It consists of eight large subunits with ~50 kDa molecular mass and eight small subunits with ~15 kDa molecular mass. Another kind of bacterial Rubisco (Type II) is a dimer of large subunits that folds and assembles spontaneously in a GroEL-mediated reaction
[84][85]. Whether GroEL/GroES was co-overexpressed or not, the expression of Type I Rubisco (from cyanobacteria) in E. coli did not result in the formation of soluble protein, in comparison with the bacterial process. The investigations noticed that in cyanobacteria, the Rubisco operon holds an ORF (open reading frame) for a protein called RbcX. Apart from the genes for the Rubisco subunits, there are genes for the small and large Rubisco subunits (RbcS and RbcL, correspondingly). The existence of the protein (RbcX) was very less renowned before, but researchers have developed methods in E. coli and express Rubisco upon coexpression of RbcX. A Rubisco-specific chaperone could thus be a crucial step in allowing efficient folding of imported Rubiscos in both prokaryotic and plant systems
[65][86].
Figure 2 shows the significant role of intermediate assembly in the folding and assembly of L8S8 Rubisco mediated by GroEL/GroES and RbcX. The study showed that folding process includes steps, (i) the substrate bound (primary state) to the chaperonin complex, (ii) structural characterization of intermediate states kinetically trapped and accumulated throughout the folding route, and (iii) kinetic measurements during the process (unfolded ⇔ intermediate ⇔ native state conversion)
[84].
Figure 2. Schematic representation showing the importance of intermediates in folding and assembly of L8S8 Rubisco (cyanobacterial) mediated by GroEL/GroES and RbcX.
In addition, the researchers observed that folding intermediates provide approaches to differences in immunoglobulin amyloidogenicity and thus can shape the folding landscape positively to favor either folding or misfolding
[3][71][87][88]. The researchers used an antibody domain’s intrinsically slow folding process to define its essential folding intermediate
[87]. They were able to trap the intermediate in equilibrium and identify it at atomic resolution using a single-point mutation. It is also worth noting that intermediate has the simple β-barrel topology; however, a few strands were observed to be distorted
[87]. Unexpectedly, the presence of two short-strand-connecting helices in the constant region of antibody domains suggests that a native structure is fully present in the intermediate, which was then used as a framework for subsequent strands
[87]. Transplanting these conserved stands of helices into β 2-microglobulin (homologous member of the same superfamily) considerably showed a reduction in its amyloidogenicity
[87]. As a result, a high level of local structuring intermediates through protein folding can have a considerable effect on the folding landscape which favored vigorous folding against negative misfolding. In addition, throughout evolution, the small differences acquired amid members of the identical protein superfamily can evade pathogenic misfolding reaction and identical protein topology conservation
[87].
As it is a known fact that transferable agents called prions cause spongiform encephalopathies (TSEs) in animals as well as humans. They are made up of PrPSc, the infectious isomer of PrPC, and the cellular prion protein
[89]. The conversion and propensity of the protein commence alternative folds, which are liable for the species-specific transmission of the disease. Kachel et al. defined and confirmed the structural stages of the human prion protein (hu PrP)
[89] by using a hydrostatic pressure (up to 200 MPa) and two-dimensional NMR spectroscopy in combination. They recognized folding intermediates that were stabilized by pressure of the human prion protein. They observed that the β1/α1-loopand the solvent-exposed side of α3 are the strongest regions reflecting the transition to the intermediate states
[89]. Their findings showed that the loop between β-strand 1 and α-helix 1 (residues 139–141) was the most pressure-sensitive region (intermediate I1), and may be the first gateway for the infectious moiety to transform the cellular protein
[89].
Therefore, folding intermediates are essential in determining protein folding parameters, understanding protein folding mechanisms, conservation of protein topology, cellular transport regulation, structural maintenance, and avoidance of protein misfolding. These elements are also better for understanding the biologically significant mechanism of conformational changes, such as structural distributions, harmonic vibrations, and structural fluctuations
[83][90]. Therefore, it may not be wrong to say that intermediates are vibrant and vigorous elements of cellular architecture.