TPPP-like Proteins in Myzozoa: Comparison
Please note this is a comparison between Version 1 by Ferenc Orosz and Version 3 by Camila Xu.

TPPP-like proteins contain one or more p25alpha domains. They obtained their name after tubulin polymerization promoting protein (TPPP1), the first identified member of this protein family. Originally, it was named p25alpha protein, which became the eponym of the domain. Myzozoans are a monophyletic clade, and a sister clade to the Ciliata, within Alveolata.

  • apicortin
  • TPPP
  • Myzozoa
  • chrompodellids
  • dinoflagellates

1. Introduction

TPPP-like proteins contain one or more p25alpha domains [1]. They obtained their name after tubulin polymerization promoting protein (TPPP1), the first identified member of this protein family [2][3][2,3]. Originally, it was named p25alpha protein, which became the eponym of the domain [4]. P25alpha (Pfam05517; IPR008907) is not a structural domain but is derived from a sequence alignment (https://bioinf.umbc.edu/DMDM/generatelogo.php?accession=pfam05517 (accessed on 5 March 2023)). The p25alpha domain is exclusive to eukaryotes and there is a strong correlation between its presence and the presence of the eukaryotic flagellum/cilium [5][6][5,6]. TPPP-like proteins occur in different types, such as long-, short- and truncated (the C-terminal third is completely missing) TPPPs, depending on the length of the p25alpha domain (about 160, 140, and 120 amino acids, respectively) (Figure 1). The main difference between long- and short-type TPPPs is that the C-terminal end of short-type TPPPs is incomplete; long-type TPPPs but not short-type ones contain a very conservative sequence of 31–32 amino acids, the most typical part of which is the GXGXGXXGR ‘Rossmann-like’ motif (Figure 1). Some TPPP-like proteins also possess another domain/region such as EF-hand (CDD:428504) or doublecortin (DCX; Pfam 03607, IPR003533) in addition to the p25alpha [1]. The latter one is named apicortin, which unifies partial p25alpha and DCX domains [7] (Figure 1).
Figure 1. Schematic structure of some TPPP-like proteins. Highly conservative sequence motives are denoted with black boxes (GxGxGxxGR), vertical-striped boxes (L(V)xxxF(Y)xxF), and diagonal-striped boxes (GGP). The dashed line represents a disordered region unique to some apicortins.
The first TPPP-like protein, TPPP/p25 or TPPP1, was identified in mammalian brain [2][3][4][2,3,4] and its physiological significance is connected to the nervous system [8][9][8,9] as well as having a role in neurodegenerative disorders, as Parkinson’s disease and multiple system atrophy [10][11][12][10,11,12]. Obviously, the role of TPPP-like proteins will be different in eukaryotic microbes without a nervous system. However, their interaction with tubulin and the microtubular system is a conserved property [13][14][15][13,14,15]. 
The various members of the TPPP-like protein family are characteristic for the phylogenomic supergroups [1]. For example, animals contain only long-type TPPPs, except for the placozoan Trichoplax adhaerens, which contains apicortin instead [7]. Truncated- and fungal-type TPPPs occur only in Endopterygota (Holometabola) [16] and in fungi [6], respectively. In this researchticle, the occurrence, and the possible role of TPPP-like proteins in Myzozoa, which include apicomplexans and related taxa, chrompodellids (chromerids plus colpodellids), dinoflagellates, and perkinsids, is reviewed.

2. TPPP-like Proteins in Myzozoa

2.1. Apicortin

Apicortins unite two conserved domains, a DCX motif and a partial p25alpha sequence, that are separately found in other proteins, in doublecortins and TPPPs, respectively [7] (Figure 1). The DCX domain is named after the brain-specific X-linked gene doublecortin [17][28]. Both the p25alpha and the DCX domains play an important role in the stabilization of microtubules [2][3][17][18][2,3,28,29]; thus, a similar role was suggested for apicortin [7]. Apicortin was originally thought to occur only in apicomplexans and in the placozoan animal T. adhaerens [7]. Later, it was found that its occurrence is broader than thought earlier: it is present in chromerids (Chromera velia and Vitrella brassicaformis) [19][30] and in flagellated fungi [6][20][21][6,31,32]. The presence of apicortin in chromerids was not surprising, given the phylogenetic proximity and the structural similarity of chromerids and apicomplexans.  BLASTP analyses [22] were performed on myzozoan protein and nucleotide sequences available at the NCBI webpage. Since apicortins contain two different domains, various domain databases were also checked for proteins with both the DCX and the partial p25alpha domains that BLAST may not have been able to detect. The results of the search, i.e., the new apicortins not known before, are listed in Table 1. They were found in all myzozoan phyla, in Apicomplexa, chrompodellids, dinoflagellates, and Perkinsozoa. The newly re-interpreted [23][24][36,37] squirmids [25][38] (Digyalum oweni [26][39]) also possess it (the list of apicortins identified earlier is published in the references [6][7][20][6,7,31]).
Table 1.
Newly identified apicortins.
Apicortin still seems to be a characteristic protein of apicomplexans; it occurs in all but one apicomplexan genome fully sequenced so far; the exception is the apicomplexan, with the smallest genome, Babesia microti [27][40]. In previous years, mainly medically important genera (e.g., Plasmodium, Toxoplasma, etc.) were sequenced; however, in recent years, transcriptomes and genomes of an increasing number of apicomplexans have been partially or completely established. They include gregarines as Ancora sagittata, Cephaloidophora cf. communis, Polyrhabdina sp., Siedleckia nematoides, Selenidium pygospionis [28][20], and Porospora gigantea [29][41], as well as coccidiomorphs as Cardiosporidium cionae [30][42], Eleutheroschizon duboscqi, Rhytidocystis sp. [28][20], and Nephromyces ex Molgula occidentalis [31][43]. All species listed here contain apicortin. The photosynthetic chromerids, C. velia and V. brassicaformis, belong to a monophyletic group, called the chrompodellids, with heterotrophic colpodellids, namely, Alphamonas edax, Voromonas pontica, and Colpodella angusta [32][33][17,18]. It is clear that although apicortin has not been known in them until now, it was worth examining them in this regard (cf. Table 1). There are another two myzozoan phyla, Dinoflagellata and Perkinsozoa, which are sisters to each other, and together, are sisters to Apicomplexa and chrompodellids. So far, apicortin has not been detected in dinoflagellates, while there were traces of its presence in perkinsids [34][35][44,45]. For evolutionary reasons, it is obvious that if, on the one hand, apicomplexans and chromerids contain apicortins, and on the other hand, their presence in perkinsids can be counted on, then their occurrence in dinoflagellates is not surprising (Table 1). A few sequences are incomplete; this may be due to the fact that in the case of some species, a significant percentage of the genome/transcriptome data are missing, e.g., 48% of Lankesteria abbotti data are missing [28][20], and the N-terminus of the HBHB01002866 (Table 1) is not present in the available TSA sequence. The missing transcriptome data may also explain that while apicortin was identifiable in C. angusta, it was not found in the closely related (same family) V. pontica, in which 22% of the data are missing [28][20].

2.2. Short-Type TPPP

Short-type TPPPs have not been previously systematically investigated. They occur in various protists, mostly in Ciliata, Euglenozoa, and Chlorophyta [1], and they are present in Apicomlexa and Perkinsozoa [7]. Based on BLAST search, new short-type TPPPs were found in many myzozoans, not only in Apicomplexa and Perkinsozoa, but also in chrompodellids and dinoflagellates (Table 2). Within the apicomplexans, short-type TPPPs were found in Haemosporida, Piroplasmida, Eimeriidae, and Sarcocystidae, but they were absent in Cryptosporidiidae [7]. This analysis showed that they also occur in Nephromycida, gregarines (Eu-, Blasto-, and Archigregarinorida) and the recently defined class, Marosporida [23][36]. There are some characteristic sequences in short-type TPPPs. For example, one of the sequences, which is common with long-type TPPPs, is the L(V)xxxF(Y)xxF at the very beginning of the p25alpha domain. Another one is a GGP sequence in the C-terminal half of the protein (Figure 1). The length of the proteins varies between 120 and 170 amino acids.
Table 2.
Newly identified short-type TPPPs.