Human FoxP Transcription Factors

Human FoxP Transcription Factors: Comparison

Please note this is a comparison between Version 1 by Exequiel Medina and Version 2 by Nora Tang.

Although the Fox family of transcription factors has been described as monomers even in the presence of their cognate DNA, suggesting their full functionality without requiring oligomerization, members of the FoxP subfamily show both monomers and 3D-DS dimers (B). This novel ability in a well-known monomeric family has been largely attributed to a single replacement of a conserved proline by alanine (Pro39Ala) in the hinge region that connects helices H2 and H4 (B). Additionally, the ability to adopt intertwined dimers has been a focus of interest in terms of the possibility to bind different DNA loci within a given chromosome or even in physically mediating interchromosomal contacts, suggesting that the emergence of the 3D-DS could impact their mechanism of action and the complexity of the gene regulation networks in which they participate.

domain swapping
protein flexibility
FoxP subfamily
protein evolution
local unfolding
high-resolution biophysics

1. Introduction

An evolutionary advantage of oligomerization is that it allows the formation of large macromolecular complexes with additional functional sites in comparison to isolated monomers, all without increasing genome size ^[1][2][4,5]. Concomitantly, several protein structure databases convey that homooligomers are dominant over monomers ^[3][4][1,6], emphasizing the relevance of understanding the evolution of protein–protein interactions and their role in both structural and functional diversification in all organisms.

The emergence of oligomers in nature starting from ancestral monomers has been largely explained by the accumulation of random mutations on the surface of the monomers throughout evolution, which generates an optimized interface that enables protein–protein interactions ^[5][6][7,8]. These mutations allow the stabilization of hydrophobic interactions, salt bridges, and hydrogens bonds, which are the main interactions characterized as fundamental in enabling the stabilization of protein–protein interfaces during oligomer evolution ^[5][6][7,8]. However, three-dimensional domain swapping (3D-DS) has been proposed as an alternative and ancient mechanism for the evolution of protein oligomerization that does not require an optimized interaction surface ^[7][9]. In this mechanism, two (or more) protein chains exchange identical elements of their structures to form an intertwined oligomer ^[8][10] ( Figure 1 A).

Figure 1. Three-dimensional domain swapping (3D-DS) in FoxP proteins. (A) Human FoxP proteins are constituted by a trans repressor domain (TRD), a zinc finger domain (ZFD), a leucine zipper domain (LZD), and a DNA-binding domain (DBD). Evolutionary changes in the latter enable FoxP proteins to associate into domain-swapped dimers. The secondary structure topology of the DBD is shown for both monomeric and dimeric FoxP. All secondary structure elements are colored and highlighted in the three-dimensional structure of a monomeric mutant of the DBD of FoxP1 (PDB 2KIU). (B) Sequence comparison of the DBD from different Fox subfamilies, showing the secondary structure topology for the monomeric members in gray and for the dimeric FoxP proteins in orange. Note that helices H2 and H4 in the monomer merge into a single helix H2 in the domain-swapped dimer. FoxP sequences are highlighted in light orange.

In order to reach a 3D-DS dimer, different intramolecular contacts that stabilize the monomeric structure must be broken and then recruited in an intermolecular fashion in the dimer. Since the residue pairs that participate in these quaternary contacts resemble those constituting monomeric contacts within a single polypeptide chain, there is no need to optimize a specific surface in an ancestral monomer to contact the partner subunit in the context of 3D-DS ^[7][9]. As a consequence, the only structural difference between a subunit in the 3D-DS dimer and its monomeric counterpart is the conformation of the hinge region, which connects the exchanging part in the intertwined dimer with the rest of its corresponding polypeptide chain ^[9][11] ( Figure 1 A).

Evidence of the evolutionary emergence of quaternary structures via 3D-DS in extant proteins is exemplary illustrated by histones. These are dimeric interleaved helical protein bundles where each monomer adopts a histone fold, common to all core eukaryotic histones and also present in archaeal histones and composed by two contiguous helix–strand–helix motifs connected as a result of a tandem duplication ^[10][12]. A sequence-based search for distant homologs employing Hidden Markov Models led to the identification of other protein folds also containing tandem duplications of these helix–strand–helix motifs, such as the C-domain of AAA+ ATPase proteins ^[11][13]. However, the C-domain is topologically different to the histone dimer in that it involves the association of the two consecutive motifs in a single polypeptide chain to constitute a four-helix bundle instead of protein–protein interactions between two monomers ^[11][13], which is a description that is consistent with the differences between isolated monomers and 3D-DS dimers.

2. Molecular Evolution towards 3D-DS in Human FoxP Transcription Factors

While 3D-DS offers a simple evolutionary pathway to the emergence of oligomerization that does not require a specialized protein–protein interface, it is accompanied by the biophysical intricacy of requiring the breakage of many intramolecular contacts through at least partial protein unfolding to form an intertwined dimer ^[12][15]. Since the first description of 3D-DS in diphtheria toxin ^[7][9], seminal cases such as cyanovirin ^[13][16] and the yeast cell cycle controller p13suc1 ^[14][17] have shown that dimerization via 3D-DS is dramatically accelerated when these proteins are exposed to structural perturbations, i.e., pH and temperature changes or the addition of chemical denaturing agents, which is followed by restoring physiological conditions while employing high protein concentrations ^[15][18]. These observations demonstrate that reaching the unfolded state is a main limiting step to adopting the dimer for most studied cases. Moreover, the need of protein unfolding to speed up 3D-DS obscures the elucidation of the physiological significance of these intertwined oligomers ^[12][15].

Strikingly, a recent example of 3D-DS under physiological conditions has been observed in the FoxP subfamily of transcription factors. Fox proteins are present in yeast and metazoans, and several evolutionary analyses have strongly suggested that these proteins were present in the ancestor of all eukaryotes ^[16][17][19,20]. Their absence in plants suggests that their origin is linked to a clade of unicellular organisms that gave rise to both the fungal and animal lineages ^[18][21]. Moreover, evolutionary analyses combined with gene expression studies in the ancient invertebrate chordate amphioxus have demonstrated that a cluster of four Fox genes with sequential and coordinated endo-mesodermal tissue expression has been present since basal Bilateria and maintained in several lineages during animal evolution for more than 500 million years ^[19][22].

According to most solved Fox structures to date, their DBDs exist as monomers with helix H 3 contacting the major groove of DNA ^{[20][21][22][23][24]}[25,28,29,30,31]. While a canonical sequence RYAAAYA located on promoter regions has been defined as the target for Fox DBDs, phylogenetic and functional DNA-binding studies among the family found that some DBDs can bind to alternative DNA sequence motifs in addition to the canonical ones ^[25][32]. These results showed that changes in DNA-binding specificity across the Fox family were not explained by changes in DNA-contacting amino acids that define the specificity for canonical DNA binding sites. Moreover, some Fox proteins can specifically bind two sequence motifs. To explain these results, Nakagawa et al. ^[25][32] proposed that the DBD could adopt an alternative conformation with respect to the one observed in solved structures, allowing the recognition of additional DNA binding motifs. These insights suggest that structural heterogeneity may play a crucial role in the function of Fox proteins.

Regardless of the sequence and structure similarity between Fox proteins, they are differentially involved in several regulatory networks related to proliferation, differentiation, angiogenesis, apoptosis, and cell cycle progression ^[26][33]. Moreover, Fox subfamilies such as FoxC, FoxM, FoxP, and FoxA have been closely related to cancer ^[26][27][33,34]. Indeed, FoxP members have been described as oncogenes or tumor suppressors depending on cellular contexts ^[28][35], revealing the complexity of transcriptional networks in which these proteins are involved.

3. Biophysically Dissecting the Evolutionary Strategies of FoxP Proteins to Overcome the Thermodynamic Limitations of 3D-DS

As mentioned above, 3D-DS emerged in the FoxP subfamily as an evolutionary novelty to promote protein association. However, most of the studied 3D-DS models to date conclude that the acquisition of the intertwined dimer (or oligomer) is kinetically limited by protein unfolding ^{[13][14][15][29]}[16,17,18,48]. Although this behavior has been widely observed in canonical examples of 3D-DS ^[29][30][31][48,49,50], the human FoxP subfamily has shown different kinetic and thermodynamic properties when compared with such models ^{[32][33][34][35][36]}[40,41,42,43,47].

One of the main hypotheses that explains the properties shown by FoxP1 is a decrease in protein stability. Specifically, the decrease in the energy barrier that imposes the loss of native contacts in FoxP1 could be compensated by the intermolecular stabilization of the structure once the dimer is adopted ^{[37][38][39][40]}[53,54,55,56]. In this line, NMR analysis of the monomeric and dimeric states of FoxP1 indicated that helices H 2 and H 3, and the wing W 1 are notably flexible compared with the other secondary structures in the chain ^[34][42]. Thus, stability changes could promote at least local unfolding. In addition, these analyses determined that the dimer is more flexible throughout its backbone when compared with the monomer, giving preliminary clues about the role of structural flexibility in the stability and propensity of 3D-DS.

In order to gain more insights regarding local structural changes in FoxP1 upon 3D-DS, hydrogen–deuterium exchange coupled with mass spectrometry (HDXMS) was used. This experimental strategy ascertains the solvent accessibility and/or flexibility of local regions of a given protein through its incubation in a deuterated buffer, such that the extent of deuteron incorporation from the solvent into the backbone amides of a protein acts as a mass probe, which is followed by its proteolytic digestion and final analysis by mass spectrometry to monitor these changes over local protein regions ( Figure 2 ) ^[41][42][61,62]. In the case of FoxP1, HDXMS experiments on the wild-type protein under physiological and mild-denaturing conditions described regions with high flexibility (as determined by their higher deuteron incorporation when compared to other local regions within the protein), such as helices H 1 and H 5, and beta strand S 1 ( Figure 2 ), suggesting the relevance of these localized regions for both the monomeric intermediate acquisition as well as favoring the dimerization of FoxP1 ^[35][43]. Moreover, the comparison between FoxP1 and the mutant Ala39Pro showed a stabilization of the region H 1- S 1 that could be a relevant factor to modulate 3D-DS in FoxP proteins, further strengthening that local rather than global structural perturbations suffice to facilitate their dimerization.

Figure 2. Insights into the 3D-DS pathway in FoxP1. Using high-resolution experimental and computational approaches, the mechanism by which FoxP1 dimerizes via 3D-DS was proposed. By using HDXMS, differences in flexibility between secondary structure elements of the protein suggested that helices H1 and H5 and strand S1 are prone to unfold in the monomer (in red), facilitating the accumulation of the monomeric intermediate described in equilibrium unfolding experiments. To detect these differences, the protein is first incubated in deuterated buffer (D₂O) at different times (from seconds to few minutes); then, the reactions are quenched, and lastly, the sample is pepsin-digested, and all identified peptides are analyzed by mass spectrometry. In addition, a smMFS toolbox was used to describe the local dynamics of the double-labeled 3D-DS dimer at the single-molecule level under pulsed interleaved excitation (PIE) in a confocal volume, finding the presence of the predicted folded dimer (High-FRET state, HF) and the accumulation of a dimeric intermediate ensemble that is locally unfolded (Low FRET, LF). As an example, we show the results from two different FRET pairs (C57–C57 and C78–C78), which monitor the dynamics between helices H3 of the dimer, or between helices H3 and H5. The differences between FRET populations in both sets of FRET pairs indicates the differences in structural behavior of different regions of the same protein. By comparing the measured FRET distances with those intermediate structures observed in folding molecular dynamics simulations, a structural view of the mechanism of 3D-DS in FoxP can be inferred. The time scales for HDXMS and smMFS are shown to describe their temporal resolution.

Altogether, these experiments represent the first indications of an evolutive strategy in this subfamily to overcome the thermodynamic limitations of 3D-DS. A decrease in protein stability and an increase in local structural flexibility seemed to be correlated with the dimerization (and dissociation) properties of FoxP proteins. However, none of those approaches explain how these structural features relate with the mechanism by which FoxP associates via 3D-DS.

4. Evolution Pathway inside FoxP Subfamily and Their Impact in Functionality: From Homodimers to Heterodimers and Beyond

While FoxP1 is the most biophysically characterized member in the subfamily, these observations could be described as general features for all FoxP members given that the DBDs of these transcription factors exhibit a high degree of sequence identity (75–92%).

An evolutionary pathway forming a secondary interface to enhance intersubunit contacts in a 3D-DS dimer was suggested as the primitive mechanism to form stable intertwined dimers in IFN-y, IL-5, and βb2-crystallin ^[7][9]. Since those proteins do not have a monomeric structure, comparing their intertwined dimer with closely monomeric homologs structures showed that the interdomain interface of the dimer resembles the monomeric interdomain interface. Therefore, domain swapping could be the first step in the evolution of these dimers, which is followed by the formation of a secondary interface ^[7][9]. Considering the aforementioned higher dissociation constants and higher sequence identity between FoxP1, FoxP2, and FoxP4 when compared with FoxP3, the FoxP family constitutes an excellent model to dissect the mutations accumulated throughout evolution that provide primary and secondary interfaces for the emergence of 3D-DS and its specialization into obligated dimers, and to examine the different dimerization properties exhibited by these transcription factors in the context of their direct impacts on functional diversification.

Several computational studies have indicated that 3D-DS depends primarily on the monomer’s topology ^[37][39][43][53,55,73] and the structural characteristics of the hinge loop ^[14][44][45][17,52,74]. Experimental studies on covalently fused immunoglobulin domains with varying degrees of sequence identity from the muscle protein titin showed that their propensity of aggregation into dimers is high when the sequence identity is above 70% ^[46][75], such that low sequence identity is necessary to avoid misfolding. Similar studies at the single-molecule level and accompanied by native-centric molecular dynamics demonstrated that the interactions underlying misfolding in these domains are the result of sequence-specific 3D-DS ^[47][76]. Building upon these results, molecular dynamics simulations using force fields that take into account the sequence-dependence of protein interactions demonstrated that the reduction of the sequence identity between covalently linked multidomains decreased the formation of 3D-DS contacts, and that these contacts compete with other strong hydrophobic self-recognition contacts, leading to non-3D-DS misfolding in a balance that can be modulated by point mutations ^[48][77].

These insights suggest the possibility of heteroassociation via 3D-DS in the FoxP subfamily due to their high sequence identity. Although some studies showed the ability to heterodimerize between FoxP members ^[49][50][71,78], this process has not been biophysically explored, opening an interesting question about the functional and structural role of FoxP1 in increasing the association ability of the other FoxP members.