1. Introduction
One topic of particular interest in human health is epigenetic regulation by non-coding RNAs. Non-coding RNAs (ncRNAs) encompass all RNAs that are not translated into proteins. They are classified by features such as length and structure, and include microRNAs (miRs), long non-coding RNAs (lncRNAs), and circular RNAs (circRNAs). These different classes of ncRNAs participate in complex interaction networks with each other, mRNAs, and various signaling pathways to enact changes on cells, surrounding tissue, and whole organ systems.
2. The Major Types of Non-Coding RNAs
MicroRNAs (miRs) are the smallest of the non-coding RNAs, with an average size of 22 nucleotides [
11]. A primary miR (pri-miR), which may be kilobases long, is transcribed from its genomic coding region [
12]. The Microprocessor complex trims and processes the pri-miR to create a 60–90 nucleotide precursor miR (pre-miR) [
12]. The pre-miR often exists as a double-stranded hairpin loop and is exported into the cytoplasm by Exportin 5 bound to Ran-GTP (see
Figure 1A) [
13]. Two mature miRs are created when the Dicer nuclease splits and processes the strands [
13]. These mature miRs are then referred to as the “5p” or “3p” species, coming from the 5′ and 3′ ends of the pre-miR transcript, respectively [
14]. However, not all mature miRs are created equal. Often, either the 5p or 3p strand is expressed more highly and has a stronger effect on epigenetic gene regulation (and is thus referred to as the “guide strand”), while the other “passenger strand” is frequently (but not always) degraded [
13]. Advancing technology has recently established that stably expressed passenger strands are more common than previously thought, and further studies are warranted to uncover unknown functions of these miR species [
14].
Figure 1. The biogenesis of non-coding RNAs. (A) An immature miR transcript is transcribed and exported into the cytoplasm by Exportin 5 bound to Ran-GTP. The “guide” and “passenger” strands are created when the Dicer nuclease splits and processes these precursors. (B) The sequence of a lncRNA is transcribed by RNA polymerase II (PolII) and either spliced to its mature form or inefficiently spliced and retained in the nucleus. Alternatively, the sequence is transcribed by phosphorylation-dysregulated PolII, remains tethered to the chromatin, and either accumulates over time or is degraded. (C) CircRNAs are created through back-splicing mediated by RNA binding proteins, inverted repeat elements, or lariats created during mRNA splicing. The mature circRNA may contain only exons, only introns, or a combination of both. RBP: RNA-binding protein.
Following processing, the mature miR associates with Argonaute protein (Ago), transactivation response element RNA-binding protein (TRBP), protein kinase RNA activator (PACT), and the still-bound Dicer to create the microRNA-induced silencing complex (miRISC), which carries out the epigenetic suppression of target mRNAs (
Figure 2A) [
13]. The identification of target mRNAs is not trivial and currently cannot be done reliably using only computational tools. This is because only 6–8 consecutive nucleotides in the miR’s 5′ “core” sequence are required to bind the microRNA recognition element (MRE) in a target mRNA [
13]. In addition, imperfect base pairing can occur between the core sequence and MREs [
13]. MiR to MRE interactions are also able to make G:U pairs, providing additional flexibility in target binding [
13]. Due to the variable nature of this process, a given miR can have hundreds of potential mRNA targets [
13]. Fortunately, biochemical methods such as CLIP-Seq (cross-linking immunoprecipitation followed by high throughput sequencing) have been developed that greatly facilitate the identification of miR-mRNA targeting relationships, and these have been applied to the study of liver regeneration [
15]. Depending on the miR/mRNA pair, miRs can either cause degradation of the mRNA or lead to the inhibition of its translation [
11].
Figure 2. Epigenetic regulation by ncRNAs. (A) The microRNA-induced silencing complex downregulates mRNAs through translational inhibition or degradation. TRBP: transactivation response element RNA-binding protein; PACT: protein kinase RNA activator; Ago: Argonaute (B) lncRNAs alter gene expression by binding DNA, binding mRNA, or by sequestering chromatin modifiers. DNA binding by lncRNAs can result in chromatin modifier or transcription factor recruitment, gene silencing, or the formation of DNA-DNA-RNA structures. Binding of a lncRNA to mRNA often leads to alternative splicing of the mRNA. For instance, the presence of a lncRNA on an exon can lead to that exon being skipped. Alternatively, recruitment of splicing factors by a bound lncRNA can lead to the retention of an exon that would otherwise be skipped. SF: splicing factor (C) CircRNAs function as sponges for RNA-binding proteins or microRNAs. Some can also colocalize enzymes with their substrates. A few are proposed to enhance protein activity or be translated into novel peptides, although these functions have yet to be experimentally validated. RBP: RNA-binding protein.
As the name implies, long non-coding RNAs (lncRNAs) are more sizable than miRs, operationally defined as being at least 200 nucleotides in length [
16]. Some estimates for the number of lncRNAs encoded in humans exceed 100,000, although the number that are actually transcribed or functional is unknown [
16]. LncRNAs resemble protein-coding mRNAs in both their length and nuclear processing (many possess a 5′ cap and 3′ polyadenylation and have introns); however, they are processed less efficiently [
16]. LncRNAs are more likely than mRNAs to be transcribed by phosphorylation-dysregulated Pol II (
Figure 1B). This can lead to lncRNA transcripts becoming tethered to chromatin, where they are either degraded or accumulate over time [
16]. The splicing signals of lncRNAs are also weaker, and therefore lncRNAs are frequently retained in the nucleus due to inefficient splicing [
16].
Importantly, nuclear localization does not mean that a given lncRNA is biologically inert. LncRNAs impact gene regulation through a wide array of physical interactions (
Figure 2B). LncRNA-chromatin interactions are capable of recruiting or decoying chromatin modifiers, and of interacting directly with the DNA to form DNA-DNA-RNA triplexes and R-loops, both of which have unique effects on gene regulation [
16]. LncRNAs are also well known to silence gene expression. This can serve to balance gene dosage, most famously in the silencing of the second mammalian female X chromosome through lncRNA
XIST in a process termed “X-inactivation” [
16,
17]. LncRNAs can also act on gene expression through alterations to gene accessibility or recruitment of transcription factors [
16]. Finally, lncRNAs have been shown to alter pre-mRNA splicing through direct association with mRNAs or recruitment or decoying of splicing factors [
16,
18].
Circular RNAs (circRNAs), self-joined RNA molecules, have proven difficult to isolate and characterize due to their lack of the canonical features of mature RNA, such as a polyadenylated 3′ end [
19]. However, with continuing advances in sequencing technology, the role of these unique molecules in disease pathology and homeostasis is beginning to be appreciated. There are multiple ways circRNAs can arise in a cell (
Figure 1C). Inverted repeat elements or RNA-binding proteins that flank one or more exons can lead to back-splicing. This results in a circular RNA molecule containing either solely exons or both exons and introns [
19]. Back-splicing can also occur in lariats created during exon-skipping, leading to a circRNA containing the skipped exon(s) [
19]. Lastly, lariats that escape debranching in canonical linear splicing can self-join to become circRNAs containing only introns [
19]. The majority of exonic circRNAs are exported to the cytoplasm, while circRNAs containing introns tend to remain in the nucleus [
19]. Since these molecules do not have exposed ends like most RNA molecules, they are extremely stable and tend to accumulate with age [
19].
Although most circRNAs remain uncharacterized, those that have been studied in-depth have revealed multiple potential mechanisms of action (
Figure 2C). The majority of those analyzed thus far have been proposed to function as sponges for miRs, reducing their effective concentration [
19]. CircRNAs are also able to decoy RNA-binding proteins, and a few have been shown to bind enzymes and their substrates, promoting colocalization and activity [
19]. Although not yet experimentally confirmed, circRNAs may also functionally enhance or recruit proteins [
19]. Some circRNAs contain internal ribosome entry sites and AUG codons, making it possible to be translated, which could give rise to unique peptides that would not be possible through canonical RNA processing [
19].
In addition to biological functions in their cell of origin, ncRNAs are sometimes secreted from the cell, typically in exosomes. The protection of the exosomal membrane allows ncRNAs to remain relatively stable in the extracellular space, and exosomal ncRNAs have been detected in both the blood and urine of human patients [
20]. Given this stability, and the ability of cells to uptake exosomes produced by distal tissues, it has been proposed that exosomal ncRNAs are able to exert biological effects far from their cell of origin [
21,
22]. The ability to monitor ncRNA levels through the non-invasive collection of body fluids has made exosomal ncRNAs prime candidates to act as biomarkers for various disease states.
Numerous studies have found changes in non-coding RNAs in various liver pathologies, several of which are outlined in Table 1. Emphasis has been placed on RNA species that are highly expressed in the liver, whether at baseline or in proliferative states, or that have experimentally proven impacts on liver cell biology reflected in multiple sources. Examined pathologies and injury states mainly affect hepatocyte function.
Table 1. Summary of major ncRNA-associated liver pathologies.
ncRNA |
Disease Association(s) |
Other Notable Function(s) |
Citations |
miR-122 |
Hepatitis C |
|
[23,24,25,26,27,28] |
NASH |
|
miR-21 |
NASH |
Limits regenerative ability |
[29,30,31,32,33] |
Hepatocellular carcinoma |
|
let-7a |
Hepatocellular carcinoma |
Altered following partial hepatectomy |
[34,35,36] |
Cholangiocarcinoma |
|
let-7b |
Hepatitis C |
Downregulated in small volume liver grafts |
[36,37,38,39] |
Acetaminophen overdose |
Altered following partial hepatectomy |
Hepatocellular carcinoma |
|
let-7c |
Acetaminophen overdose |
Altered following partial hepatectomy |
[36,37] |
let-7d |
Acetaminophen overdose |
Altered following partial hepatectomy |
[36,37] |
let-7e |
|
Altered following partial hepatectomy |
[36] |
let-7f |
|
Downregulated in small volume liver grafts |
[36] |
|
Altered following partial hepatectomy |
|
let-7g |
Hepatitis C |
Altered following partial hepatectomy |
[36,40,41,42,43] |
Hepatocellular carcinoma |
|
|
let-7i |
|
Altered following partial hepatectomy |
[36] |
miR-451a |
Chronic hepatitis B |
|
[44,45] |
Hepatocellular carcinoma |
|
|
miR-144 |
Chronic hepatitis B |
|
[44,45] |
Hepatocellular carcinoma |
|
|
ZSCAN16-AS1 (lncRNA) |
Hepatocellular carcinoma |
|
[46] |
MALAT1 (lncRNA) |
Hepatocellular carcinoma NAFLD/NASH Fibrosis |
Liver regeneration |
[47,48] |
This entry is adapted from the peer-reviewed paper 10.3390/cells12030359