Cap-Independent Circular mRNA Translation: Comparison
Please note this is a comparison between Version 1 by Andrei A Deviatkin and Version 2 by Rita Xu.

The mRNA platform has become the method of choice in vaccine development to find new ways to fight infectious diseases.  This instability is due to the fact that the five-prime and three-prime ends of the mRNA are a substrate for the ubiquitous exoribonucleases. To address the problem, circular mRNAs have been proposed for transgene delivery as they lack these ends. Notably, circular RNAs do not have a capped five-prime end, which makes it impossible to initiate translation canonically.

  • circRNA
  • cap-independent translation
  • mRNA vaccines

1. Introduction

The more frequent and severe the infection, the more resources are invested into research on the interaction of the pathogen with the human organism. As a result, there is a need to develop effective approaches to treat and prevent disease. The most effective and safest method to prevent the spread of an infectious disease is vaccination. At the beginning of the COVID-19 pandemic, many types of vaccines were effectively used for controlling various diseases [1]. The development of alternative approaches was hampered to some extent, as there did not appear to be a problem that needed solving.
The emergence of a new challenge became a turning point in vaccine development. Indeed, there were virtually no clinical trials of mRNA vaccines to prevent infections before the COVID-19 pandemic (Figure 1, magenta line). At the same time, the number of mRNA-based vaccine trials against COVID-19 in the eleven months of 2022 does not exceed the same indicator for 2021 (Figure 1, cyan line). In contrast, the development of such drugs to prevent other infections almost doubled in one year (Figure 1, blue line). In other words, the mRNA vaccine platform that has been promoted to combat COVID-19 is increasingly being used as an approach to prevent infections in general.
Figure 1. Dynamics of mRNA vaccine clinical trials registered on clinicaltrials.gov. The number of trials was counted by keyword ([condition “Infections”, title “mrna vaccine”] (magenta), [condition “Infections”, other terms “NOT coronavirus NOT COVID19 NOT BNT162b2”, title “mrna vaccine”] (blue), [condition “Infections”, other terms “coronavirus OR COVID19 OR BNT162b2”, title “mrna vaccine”] (cyan)) occurrence, as of 28 November 2022.
However, mRNA technology has some weaknesses. Firstly, widely used mRNA vaccines do not maintain their stability at room temperature for more than 24 h [2]. Refrigeration at +4 degrees Celsius extends the shelf life of the vaccine by several weeks, while −20 degrees Celsius extends it by several months. Therefore, there is a clear need for a second-generation mRNA-based platform for vaccine development with increased stability to ensure adequate protection against future pandemics [2].
Secondly, the half-life of mRNA in the mammalian cell is estimated to be several minutes to several days, with a median value of 7 h [3]. Surprisingly, to theour knowledge, the half-life of Moderna and Pfizer/BioNTech mRNA vaccines in human cells is unclear. This parameter influences the vaccine dose required to elicit a sufficient immune response.
The main reason for the instability of mRNA is its sensitivity to RNases. Exoribonucleases actively degrade the 5′ and 3′ ends of mRNA and play an important role in mRNA degradation [4]. Circular RNAs (circRNA) do not have 5′ and 3′ ends, which increases their stability compared to the linear form, which is one of its advantages. They are generally stable in serum, with a 24-h incubation at room temperature having minimal effect on titer [5]. In vivo, circRNA can remain in the cell for more than 48 h as they are resistant to degradation by exonucleases and localize mainly in the cytoplasm [6]. As such, they are a promising alternative to linear RNA for use in vaccines, as they do not require special equipment for storage. At the same time, to the best of theour knowledge, currently, there are no registered clinical trials of vaccines against infectious diseases based on the circular mRNA platform.
There are several ways to synthesize circRNAs. These molecules can be produced directly in cells by transcription and back-splicing when delivered as vector DNA (e.g., AAV [7]). On the other hand, circRNAs can be synthesized in vitro [8][9][8,9]. Currently, there are three methods for circRNA vaccines in in vitro production: chemical-, enzyme-, and ribozyme-based ligations. At the same time, these approaches have shortcomings (reviewed in [10]). Therefore, new protocols for large-scale production are needed to enable the cost-effective design and production of circRNA vaccines.
Much of genetic construction is tailored to canonical translation, which begins with cap-dependent initiation. Endless circular RNAs have no cap necessary for canonical translation initiation (Figure 2, left upper panel). As far as rwesearchers know, only one of the possible methods of cap-independent translation initiation is used in the development of modern circular mRNA vaccines, although there are several approaches to increase the effectiveness of transgene production. The methods of translation initiation applicable to the engineering of circular mRNAs (Figure 2).
Figure 2. Simplified scheme of the mechanisms for cap-independent initiation of circRNA translation. IRES—internal ribosome entry site; m6A—N6-methyladenosine; YTHDC1—the reader of m6A modification; TEE—translation enhancing element; CITE—cap-independent translation enhancer.

2. Translation Initiation Mechanisms

For eukaryotic translation (Figure 3), the 80S ribosome should be assembled at the mRNA. The canonical cap-dependent process begins with the interaction of a small 40S ribosomal subunit with a ternary complex consisting of trimeric eukaryotic initiation factor 2 (eIF2), GTP and initiator transport RNA (Met-tRNAi) to form the 43S pre-initiation complex (PIC) [11][12][11,12]. 43S PIC binds to the 5′ end of the mRNA, interacting with the eIF4F complex (cap-binding eIF4E, the scaffolding eIF4G, and the helicase eIF4A). It should be noted that in the absence of the cap, the above complex does not bind to the 5′ end of the mRNA and translation would not be initiated in this way. 43S PIC and eIF4F complexes, together with poly(A)-binding proteins (PABPs) at the poly(A) tail, form the 48S PIC, which then scans a 5′-untranslated region in the 5′-3′ direction for the AUG start codon within the Kozak sequence context. Start codon recognition by methionine anticodon triggers GTP hydrolysis and eIF2-GDP release, as well as 60S ribosomal subunit interaction mediated by eIF5B to form the 80S initiation complex [13]. eIF1, eIF1A, eIF2, eIF3, eIF4B, eIF4F, eIF5, eIF5B, and PABP are the key factors involved in cap-dependent translation initiation.
Figure 3. A simplified scheme of cap-dependent translation initiation mechanism.

2.1. Efficiency of IRES-Dependent Translation Initiation

The 5′UTR of mRNA has the potential to control translation independently of the cap via an internal ribosome entry site (IRES). Remarkably, cap-independent translation initiation can also be caused by other factors, which are discussed in the following sections. IRESs are present in both viral and cellular mRNAs. Cellular IRESs are very diverse, which makes them difficult to classify. In other words, there appear to be many different, as of yet uncharacterized, mechanisms that control cap-independent translation using endogenous IRESs [14].
Based on the differences in the way viral IRESs interact with ribosomes, they can be classified into four groups [15]. Group 1 IRESs bind to the ribosome without accessory factors. Group 2 IRESs bind to the ribosome with the help of eIFs and Met-tRNA. Group 3 IRESs bind to the ribosome using eIFs, Met-tRNA, as well as IRES transactivating factors (ITAFs), and function in rabbit reticulocyte lysate (RRL) without extracts from other cell types. Group 4 IRESs bind to the ribosome using eIFs, Met-tRNA, as well as ITAFs, and function in RRL only after addition of extracts from other cell types.
Originally, IRESs were studied as part of the viral replication system. Initial comparative studies showed low efficiency of IRES-associated translation initiation compared to cap-dependent initiation. In these studies, a bicitron system was used in which the first protein was synthesized in a classical cap-dependent manner and the second protein was translated by IRES. The efficiency of translation of the second protein was significantly lower [16][17][16,17]. The low protein expression induced by IRES was explained by the inefficient re-initiation of translation at an internal AUG codon downstream of IRES. IRES therefore gained a reputation as a less efficient translation initiation method as early as the 1980s, prompting researchers to modify IRES to improve the efficiency of translation initiation.
In the case of bicistronic cassettes, the efficiency of IRES-dependent translation initiation can be improved by a tailored intercistronic sequence [18]. The size of the intercistronic spacer sequence in the 5′ region of the IRES sequence has a significant impact on downstream protein expression. By choosing the spacer size, it was possible to achieve even stronger expression of proteins whose initiation was dependent on IRES and not on the cap. Another comparative study showed that IRES did not perform worse than cap under certain conditions [19]. Translation efficiency using EMCV IRES was similar compared to cap-dependent gene expression in the Lcl1D cell line, although efficiency remained low in many other cases [19]. This study compared the transcriptional efficiency of a gene in a bicistronic vector in which the initiation of the first protein was cap-dependent and that of the second protein was initiated by an EMCV IRES (type 2 IRES). Furthermore, it was shown that the efficiency of IRES strongly depends on the expressing cell line [19].
For more efficient use of IRES, it is worth choosing the correct combination of IRES type and express cell line. For example, group 1 IRESs (from PV, ECHO and HRV viruses) showed high translational efficiency in the HeLa, HepG2 and FRhK4 cell lines, while they were extremely inefficient in the BHK21 and Neuro-2A non-primate cell lines and in the SKNBE human neuronal cell line. At the same time, group 2 IRESs (from EMCV, FMDV and HCV) appear to be excellent candidates for the development of expression vectors, as they functioned efficiently in all cell lines tested by the authors [20].
Currently, there are several examples of successful developments based on IRES-mediated translation initiation. A circular mRNA prototype encoding the modified spike protein of SARS-CoV-2 induced neutralizing antibody titers for up to 7 weeks after secondary inoculation and had broad neutralizing activity against different viral strains [21]. In another study, circRNA was found to confer higher and more durable efficiency than analogous mRNA. The induced immunity persisted for at least 8 weeks after secondary inoculation [22].

2.2. m6A and Translation Initiation

The most common internal chemical RNA modification, N6-methyladenosine (m6A), is involved in the initiation of translation [23][28]. Even a single m6A modification within the 5′UTR of mRNA has the potential to recruit a 40S ribosomal subunit through direct eIF3 binding. Ribonucleotide methylation is a flexible process that depends on the activity of epitranscriptional “writer” and “eraser” factors [24][29].
The “readers” of the m6A modification bind mainly to a modified nucleotide in the RR(m6A)CH motif (R stands for purines, H for pyrimidines and C for the cytosine base). There are several representatives of the YTH domain-containing protein family: YTHDF1/2/3, localized in the cytoplasm of cells, and YTHDC1/2/3, localized in the nucleus [25][30].
Recently, a firefly luciferase assay was used to demonstrate that m6A-mediated translation initiation is less efficient compared to cap-dependent initiation [26][31]. On the other hand, this approach demonstrated comparable results in comparison to IRES-mediated translation initiation [27][32]. This technique can be applied in circular mRNA engineering, but the efficiency of such constructs should be carefully assessed.

2.3. Endogenous IRES-like Elements in Eukaryotic Genome

Several endogenous circRNAs are known to translate proteins in eukaryotic cells. Their 5′UTRs contain IRES-like sequences that drive translation independently of cap [28][33]. Fan et al. [29][23] found 97 short random hexamer sequences enriched in endogenous translating circRNAs (compared to linear mRNAs) that drive translation independently of cap. These IRES-like structures interact with various RNA-binding factors that initiate cap-independent protein expression. At the same time, RBPs are differentially expressed in various tissues [30][34]. This means that IRES-like sequences have different efficiencies in different tissues, which should be taken into consideration during implementation of IRES-like structures into genetic constructs. Mutation of these sequences significantly reduced the translation rate. In addition, some hexamers contain specific context-dependent m6A modifications of such RNAs, further complicating the system. Remarkably, virtually every random sequence longer than 50 nucleotides contains such a short IRES-like element. Indeed, the authors demonstrated that around 2% of all hexamers (97 out of 4096) may initiate translation. A sequence of 50 nucleotides can be divided into 45 possible hexamers, while a sequence of 100 nucleotides can be divided into 95 hexamers. Assuming that nucleotide sequences have some degree of randomness, reswearchers can conclude that any RNA sequence of any length can have translational potential if they contain an ORF. However, the identified elements showed great similarity with previously described IRES-like sequences of some endogenous genes, such as Hsp70 [31][35] and Gtx [32][36].
In addition, screenings have shown that almost 10% of eukaryotic mRNAs contain IRES-like sequences that can drive translation independently of the cap [33][34][37,38]. To date, it is not clear how effective such short sequences are in initiating translation compared to viral IRESs that have been previously used. At the same time, a comprehensive paper [29][23], published only half a year ago, showed that the efficiency of translation initiated by some hexamers is at least as good as initiation using the m6A modification.

2.4. Translation Enhancing Elements

Translation enhancing elements, TEE, are genomic sequences that facilitate cap-independent translation. They were first identified using a method based on the mRNA display principle [35][39]. AIn this study, a genomic library consisting of approximately 1013 randomly selected 150-nucleotide fragments of total genomic DNA was created. These sequences were placed in the 5′-UTR of the starter DNA construct encoding a His6 affinity tag.
After in vitro transcription, the resulting set of uncapped single-stranded RNAs was conjugated by photoligation with a puromycin residue at the 3′-end and subjected to translation. Remarkably, during translation, a chemical bond was formed between the newly translated peptides and their coding mRNA by the natural peptidyl transferase activity of the ribosome, which recognized puromycin as a tyrosyl-tRNA analogue. As a result, molecules capable of cap-independent translation consisted of uncapped mRNAs linked to His-tags. In addition, these substances were selected by immobilized metal ion affinity chromatography. The functional RNAs were then isolated, reverse transcribed and amplified by PCR to create the novel pool of DNA for another selection cycle. This procedure was repeated six times until the sequences capable of cap-independent translation enhancement became predominant in the library. This approach revealed more than 12 thousand human genome elements mediating translation in a cap-independent manner in vitro and in mammalian cells.
Several TEEs have shown extremely high efficiency in protein production. For example, the addition of hTEE-658 to the capped construct increased the expression of the target protein 100-fold compared to a construct without TEE [36][40]. According to a further study, this 37-nucleotide motif placed upstream of the certain protein coding region of transfection plasmid was capable of increasing the protein expression level in experiments with transfection and subsequent infection using the vaccinia virus (VACV) [37][41]. Notably, neither the length of the 5′-leader sequence, nor the distance between the motif and the coding region affects the ability of hTEE-658 to increase transcription and translation rates. To the best of our knowledge, the mechanism of TEE cap-dependent translation enhancement or cap-independent translation initiation is unclear.
ScholarVision Creations