One of the most exciting discoveries of the recent years has been the recurring somatic mutations in SF3B1, in particular in hematopoietic and lymphoid malignancies
[24]. SF3B1 is an essential component of the U2 snRNP, which is critical to BPS recognition and essential for 3′ splice site selection
[1][2]. SF3B1 mutations are highly prevalent in myelodysplasia syndromes (MDS, ~30%), reaching mutational rates up to 83% in the refractory anemia with ring sideroblasts MDS subtype (RARS)
[25]. SF3B1 is also the most frequently mutated gene in CLL (~14%)
[26][27], and one of the few genes found to be frequently mutated in uveal melanoma (UVM) (~18%)
[28]. Interestingly, SF3B1 mutations correlate with good prognosis in MDS and UVM patients
[28][29], while a reduced overall survival was observed in CLL and luminal B and progesterone receptor-negative breast cancer patients
[30][31][32].
Alterations of the SF3B1 gene, such as non-silent mutations, could alter both splice site recognition and splicing decision. Indeed, SF3B1 harbors unique characteristics that support its function. On the one hand, SF3B1 acts on splice site recognition, crosslinking with RNA at both flanking sites of the BPS
[7]. On the other hand, SF3B1 can directly interact, via its N-terminal region, with the BPS binding protein and U2 snRNP subunit p14
[33] and with the PPT binding protein U2AF2
[7].
Remarkably, mutations in SF3B1 are heterozygous and the vast majority are single nucleotide missense mutations located at the C-terminal of the protein inside the HEAT-repeat domain (HD) (
Figure 3A). The most frequent SF3B1 mutations are K700E, K666N, and R625H. The K700E mutation is the prevalent mutation in MDS and CLL
[25], whereas mutations in R625 are highly frequent in UVM
[28]. Hereafter, these hotspot mutations will be referred to as SF3B1
MUT.
3.1. Molecular Basis of Cryptic 3′ Splice Site Usage by SF3B1MUT Spliceosomes
Transcriptome analysis of SF3B1
WT and SR3B1
MUT tumors from CLL, UVM, and breast cancer patients allowed the identification of splicing alterations associated with SF3B1 hotspot mutations and revealed that SF3B1
MUT stimulates the usage of cryptic 3′ splice sites by recognizing a cryptic BPS
[34][35][36]. Indeed, the most frequently altered splicing events in SF3B1
MUT tumors are cryptic 3′ splice site usage (up to 60–76%)
[34][35][36], with 3′ cryptic splicing sites located both close to (proximal, ~55%) or far away from (distal, ~45%) the canonical 3′ splice site
[34].
Figure 3. Mutant SF3B1 binds cryptic BPS and promotes the use of cryptic 3′ splice sites. (
A) Schematic representation of the SF3B1 protein domains and its mutation profile. SF3B1 domains are indicated: U2AF-homology ligand motif (ULM), p14 binding domain (p14), and HEAT-repeat domain (HEAT). The mutation profile was obtained from the COSMIC database (
https://cancer.sanger.ac.uk/cosmic, accessed on 25 October 2021) and visualized using ProteinPaint (
https://pecan.stjude.cloud/proteinpaint, accessed on 25 October 2021). The number of samples harboring each mutation is indicated inside each circle and their relative abundance is represented by the disc size. (
B) Sequence requirements for cryptic BPS usage by mutant SF3B1. A schematic exon-intron-exon region is shown together with the characteristic sequence features present in the splicing events altered by SF3B1
MUT: the 5′ splice site (5′ ss), the canonical and cryptic 3′ splice sites (3′ ss), the canonical and cryptic BPS, and the PPT. Red indicates the SF3B1 mutant, the adenine at the cryptic BPS, and the AG dinucleotide at the 3′ cryptic splice site. (
C) Mutant SF3B1 promotes the inclusion of a poison exon in the BRD9 transcript. Schematic representation of the BRD9 pre-mRNA containing the exon 14-intron-exon 15 region. The required splicing sequences are indicated as in (
B). ESE, exon splicing enhancer. Poison exon inclusion targets the BRD9 transcript to NMD, promoting BRD9 protein down-regulation and impaired formation of the ncBAF complex.
Analysis of specific splicing events in model cell lines showed that neither SF3B1 knock-down (KD) nor SF3B1 over-expression recapitulate the splicing alterations observed in SF3B1
MUT cells, indicating that hotspot mutations in SF3B1 are likely change-of-function mutations
[36]. The main consequence of the cryptic 3′ splice site usage by SF3B1
MUT is the loss of the open reading frame, with ~50% of the affected transcripts being targeted for degradation by non-sense mediated decay (NMD), leading to reduced expression of the codified protein
[35] (
Figure 3B).
Proximal cryptic 3′ splice sites recognized by SF3B1
MUT-U2 snRNPs are characterized by a sequence signature that includes a cryptic AG dinucleotide (cryptic 3′ spice site) localized upstream the canonical 3′ splice site and downstream the canonical BPS
[35][36] (
Figure 3B). In addition, a hidden BPS, rich in adenines, is present upstream the cryptic AG 3′ splice site. SF3B1
MUT-U2 snRNP recognizes an adenine present in this cryptic BPS, thus stimulating the usage of the cryptic 3′ splice site
[35][36].
Interestingly, SF3B1
MUT likely stimulates the usage of distal and proximal 3′ splice sites by the very same mechanism
[34]. Indeed, inspection of the distal cryptic 3′ sites exploited by SF3B1
MUT led to the identification of an upstream adenosine-rich sequence, very similar to the cryptic BPS sequence used by SF3B1
MUT at proximal 3′ splice sites. As not all introns contain such cryptic BPS required for SF3B1
MUT recognition, only a subset of transcripts is actually aberrantly spliced in SF3B1
MUT tumors.
Why the SF3B1
MUT-U2 snRNP binds preferentially to a cryptic BPS instead of the canonical one remains unclear. It has been proposed that SF3B1
MUT may possess an enhanced affinity towards the specific nucleotides at flanking regions of the cryptic BPS
[35]. Another hypothesis suggests that SF3B1
MUT could induce a conformational change in the U2 snRNP, which in turn promotes binding to the cryptic BPS
[36]. Unfortunately, at present, no data validated these models. Using purified SF3b complexes and crosslinking experiments, SF3B1
WT and SF3B1
MUT were shown to bind equally efficiently to a synthetic BPS-PPT RNA
[37]. Moreover, SF3B1
MUT did not seem to affect the structure and molecular interaction of the SF3b complex
[37], including the interaction with U2AF2 in vivo
[36] and in vitro
[37]. Thus, SF3B1
MUT shows no major effects on the topological architecture of the splicing complex which, however, does not exclude minor but critical conformational changes in the context of a full U2 snRNP complex where the stability of the U2 snRNP:BPS interaction could be affected by SF3B1 mutations.
A third possibility is that SF3B1 may affect binding with interactors outside the U2 snRNP, such as the intron itself or additional splicing factors. Indeed, the hotspot residue K700 has been found to be exposed to the solvent in the SF3b complex structure
[37]. By purifying and analyzing the composition of U2 snRNP complex wild-type and mutant from K562 myelogenous leukemia cells, Zhang and colleagues found that the SUGP1 protein is lost in SF3B1
MUT-containing complexes
[38]. Strikingly, SUGP1 KD in SF3B1
WT cell lines recapitulates the usage of cryptic BPS and cryptic 3′ splice sites observed in SF3B1
MUT cell lines
[38]. Furthermore, the over-expression of SUGP1 in SF3B1
MUT cells rescues the interaction between SUGP1 and SF3B1
MUT-containing complexes and partially rescue 3′ splice site usage alterations
[38]. Both observations suggest that the loss of SUGP1 from SF3B1
MUT spliceosomes could dictate the usage of the cryptic 3′ splicing site observed in SF3B1
MUT tumors.
How SUGP1 achieves such a critical role remains an open question. Interestingly, point mutations in the G-patch domain at the C-terminal of SUGP1 is sufficient to induce the usage of cryptic 3′ splice sites in cell lines
[38]. Based on the role of G-patch proteins as activators of DEAH-box RNA helicases
[39], it has been proposed that SUGP1 could act by modulating RNA helicase activity during splicing
[38]. By doing so, SUGP1 could mediate the remodeling and exchange of splicing factors at the conventional BPS, such as the displacement of SF1 by the U2 snRNP. Thus, in the context of the SF3B1
MUT, SUGP1 would not be recruited at the spliceosome, forcing U2 snRNP deposition at the cryptic BPS
[38]. Again, this intriguing model lacks experimental validation, as the hypothetical RNA helicase modulated by SUGP1 remains unsettled.
3.2. Mechanism of SF3B1MUT-Driven Tumorigenesis
SF3B1 mutations have multiple consequences in tumorigenesis. It has been suggested that SF3B1 mutations arise later in leukemia development and contribute to disease progression
[27]. Accordingly, mutations in SF3B1 correlate with a rapid disease progression in CLL
[40]. Mice expressing heterozygous and hematopoietic-restricted SF3B1 K700E mutation do not develop leukemia but exhibit macrocytic anemia due to a block in terminal erythroid maturation, erythroid dysplasia, and long-term hematopoietic stem cell expansion, all of which are characteristic features of MDS
[15]. Transcriptome analysis of CLL samples from SF3B1
MUT tumors, identified expression alterations in genes belonging to biological pathways associated with poor prognosis
[41]. An example is represented by the Notch pathway, whose aberrant signaling has been linked to CLL with reduced overall survival
[42]. Myeloid and leukemic cell lines expressing SFR3B1
MUT exhibit a higher Notch pathway activity
[26], possibly due to the expression of a DVL2 exon 11-aberrant isoform unable to repress Notch signaling
[41]. Correlative evidence was also found for a few altered splice variants induced by SF3B1
MUT (CHD1L, GAK, RAD9A, JMY) involved in the DNA damage response
[41][43].
Recently, Inoue and colleagues identified one key oncogenic effector of the SF3B1
MUT spliceosome, the non-canonical chromatin remodeling BAF complex (ncBAF) subunit BRD9
[17]. By using a distal cryptic 3′ splice site, a cryptic BPS, and an exon splicing enhancer (ESE), SF3B1
MUT induces the inclusion of a new exon upstream the BRD9 exon 15 (
Figure 3C). This poison exon (14a) interrupts the open reading frame and targets the aberrant BRD9 transcript to NMD, thereby down-regulating BRD9 expression. Remarkably, the inclusion of exon14a was observed in all cohorts of CLL, UVM, and MDS patients harboring SF3B1 hotspot mutations
[17]. At the functional level, SF3B1
MUT, by inducing BRD9 loss, disrupts the ncBAF complex and, as a consequence, its loss from chromatin affects the expression of genes involved in apoptosis, cell growth, adhesion, and migration
[17]. Validating experiments showed that BRD9 KD in non-tumorigenic Melan-a melanocytes results in tumor growth and the induction of melanomagenesis in vivo while BRD9 depletion in melanoma cells increases the number of pulmonary metastases
[17]. Finally, the inhibition of BRD9 poison exon inclusion by using specific antisense oligonucleotides (ASOs) rescues BRD9 expression in SF3B1
MUT melanoma cells in vitro and inhibits tumor growth by inducing tumor necrosis in vivo
[17]. Thus, the loss of BRD9 represents the first SF3B1
MUT-induced pro-oncogenic event driving tumorigenesis. While more in-depth functional studies are needed to uncover further critical oncogenic effectors of SF3B1
MUT, these studies have validated ASOs as a valuable therapeutic option to treat SF3B1
MUT cancers.