Retrotransposons | Encyclopedia MDPI

Retrotransposons: History

Please note this is an old version of this entry, which may differ significantly from the current revision.

Contributor: Roberto Ferrari , Nicole Grandi , Enzo Tramontano , Giorgio Dieci

Retrotransposons, a large and diverse class of transposable elements that are still active in humans, represent a remarkable force of genomic innovation underlying mammalian evolution.

SINE
ERV
LINE-1
transposable elements
enhancer
exaptation

1. Introduction

A large fraction of most eukaryotic genomes is constituted by transposable elements (TEs), interspersed repeats of which the high copy number reflects mobile DNA integration events that occurred countless times throughout evolutionary history ^[1]. Although they represent a constant challenge for genome stability, TEs have at the same time introduced potentially fruitful changes into genomes both by driving genomic rearrangements (resulting, for example, in gene duplication) and by exaptation of TE-derived sequences ^[2]^[3]^[4]. Since TEs replicate as genomic parasites, eukaryotic host organisms have co-evolved TE silencing systems largely based on the deposition of repressive epigenetic marks, of which the effect on the epigenome accompanied the effect of TE on the genome over the course of evolution ^[5]. By allowing TE retention, TE silencing mechanisms have provided genomes with large pools of latent functional elements poised for exaptation ^[6]. While the so-called DNA transposons employ a mechanism directly moving DNA segments from one genomic location to another, the heterogeneous class of TEs referred to as retrotransposons (or retroelements) do so through reverse transcription of an RNA copy of the original element, thereby affecting genome composition through the constant introduction of new DNA material. The peculiar ability of retrotransposon systems to support conversion from RNA to DNA incessantly contributes new sequences, potentially encoding new protein/RNA molecules or providing new cis-regulatory functions that selection can act on to produce genomic and organismal innovations ^[7]^[8].

The last 15 years have seen a remarkable spurt of studies exploring the idea that retrotransposon activity affects brain development and function in mammals, through the promotion of somatic mosaicism in the brain ^[9]^[10], the generation of novel transcripts and proteins playing diverse roles in neuron biology ^[11]^[12], as well as the seeding of cis-regulatory elements, affecting transcription factor-dependent gene regulation, and of boundary elements, participating in three-dimensional (3D) genome architecture ^[13]^[14]. The deep involvement of retrotransposons in brain biology also makes them a source of vulnerability and disease ^[15]^[16]. All these indications of the pervasive influence of retrotransposons on brain biology consolidate the idea that the evolution of the nervous system, of which the results include the uniquely evolved human brain, has retrotransposons among its driving factors ^[13]^[17]^[18].

2. Retrotransposons and Their Impact on Mammalian Genome Evolution

2.1. Retrotransposons in Mammalian Genomes

TEs are repetitive DNA sequences, typically ranging in length from 100 to 10,000 bp, capable of colonizing new genomic locations with copies of themselves. Based on their mode of transposition, with or without an RNA intermediate, TEs are split into two major classes: the eukaryote-specific class I TEs, or retrotransposons, which mobilize via a “copy-and-paste” mechanism involving reverse transcription of an RNA copy of a source element, and the class II TEs or DNA transposons, mobilizing without reverse transcription mainly via a “cut-and-paste” mechanism ^[4]. The increase in TE copy number in genomes is primarily due to their capacity for vertical inheritance through the germline, a property giving TEs great potential for the generation of evolutionary novelties. Indeed, an enormous amount of studies since their discovery has mitigated the emphasis on TEs as a useless form of parasitic DNA, by revealing their irreplaceable contribution to genome structure, information content and regulation ^[19]^[20], thus substantially corroborating early, far-seeing hypotheses about their role ^[21]^[22] (see ^[23] for a historical perspective). The other side of the coin is that TE mobility has the potential to disrupt functional genetic elements in both germinal and somatic cells, thus leading to disease ^[24]^[25]. It is thus not surprising that retrotransposons coevolve with mechanisms counteracting retrotransposition ^[26]^[27]^[28].

Each class of TEs comprises different clades/superfamilies and families, the diversity and complexity of which has prompted decades-long identification and classification efforts ^[29]^[30]. In particular, class I TEs are typically subdivided into long terminal repeat (LTR) and non-LTR retrotransposons, with the former displaying a close relationship with retroviruses and other reverse-transcribing viruses (Figure 1). TEs of both class I and II can further be classified as either autonomous or non-autonomous, the former having the ability to self-mobilize, the latter relying on the enzymatic machinery of other TEs for mobilization. Such a distinction is especially relevant in the case of non-LTR retrotransposons, which can be broadly divided into the autonomous elements referred to as long interspersed elements (LINEs) and the non-autonomous short interspersed elements (SINEs) ^[4]. SINEs are known to exploit the LINE retrotransposition machinery for mobilization, often facilitated by 3′ end sequence similarity between LINE and SINE partners ^[31]^[32]. Although LTR and LINE retrotransposons are transcribed by the RNA polymerase II machinery, assisted by a few sequence-specific transcription factors (TFs), most SINEs are transcribed by RNA polymerase III due to the presence of internal control regions (A box and B box) recognized by the Pol III-specific basal transcription factor TFIIIC ^[33]^[34]. Indeed, the evolutionary origin of most SINEs has been traced back to Pol III-transcribed genes coding for abundant small RNAs, such as tRNA, 5S rRNA and 7SL RNA, all employing TFIIIC as a sequence-specific DNA binding protein essential for transcription complex assembly ^[35]. Accordingly, SINEs are generally divided into SINE1/7SL, SINE2/tRNA and SINE3/5S (Figure 1), to which the more recently identified SINEU (derived from U1 and U2 snRNAs) have been added ^[29].

Figure 1. Schematic view of retrotransposons. Retrotransposons are divided into two main classes depending on the presence of long terminal repeat (LTR) regions. (A) LTR retrotransposons are also generally referred to as endogenous retroviruses (ERVs). Their full-length sequence (top) is schematically composed of a 5’-LTR (orange arrow) containing the RNA polymerase II (Pol II) promoter, from which the entire unit is transcribed. The coding region of the transcribed ERV spans three main genes, such as Gag, Pol and Env, represented by the blue, purple and yellow boxes, respectively. The coding region is followed by the 3’-LTR sequence (orange arrow); 5’ and 3’ LTRs are formed from viral RNA ends during reverse transcription and are identical at the time of integration. Intact LTR retrotransposons are autonomous, as they encode for the protein machinery required for their reverse transcription and integration. Many LTR retrotransposons are incomplete, however. In extreme cases (bottom), the recombination between 5’ and 3’ LTRs of the same provirus can reduce ERV sequences to a solitary LTR only, from which transcripts can originate by virtue of the Pol II promoter embedded within the LTR (orange arrow). (B) Non-LTR retrotransposons include both autonomous (LINE) and non-autonomous (SINE, SVA) classes, illustrated in the upper and lower parts of the panel, respectively. LINEs, predominantly represented by the LINE-1 (L1) group in humans, harbor a 5’-UTR, containing the Pol II promoter from which they are transcribed (curved arrow). The coding region of the transcribed LINE-1 is composed of two main open-reading frames, ORF1 and ORF2, coding for the homonymous proteins (green and pink boxes). The 3’-UTR of the LINE-1 contains a poly(A) tract (A_n). SINEs are further divided into three main groups or clades: SINE1/7SL, SINE2/tRNA and SINE3/5S, depending on the type of ancestral sequence from which they originated, specifically the 7SL RNA, the tRNA and the 5S RNA sequence. Consistent with their origin, SINEs contain internal Pol III promoters. Alu elements, the most numerous SINEs in humans, belong to the SINE1/7SL group as they contain two 7SL-related moieties (gray boxes). The upstream 7SL-related moiety harbors A- and B-box internal control elements, recognized by the Pol III-specific transcription factor TFIIIC (yellow boxes) in their left 7SL-related moiety. The two moieties are also separated by an A-rich tract (A_n). Another poly(A) tract is found at the end of the SINE. SINEs of the SINE2/tRNA group harbor A- and B-boxes in their tRNA-related upstream moiety (yellow boxes), followed by sequences of diverse origins. A noteworthy example of this group is represented by the mammalian-wide interspersed repeat (MIR) elements. SINE3/5S elements are exemplified in the Figure by AmnSINE1, formed by an upstream 5S-derived moiety (red box), containing the 5S-specific A- and C-box internal promoter elements, followed by a tRNA-related fragment (yellow). Represented in the bottom part of the panel is the structure of an SVA element, consisting of (from the 5’ to 3’ end) a hexameric repeat region, an Alu-related region, a variable number tandem repeat (VNTR) region, and a SINE-R sequence sharing homology with human endogenous retrovirus HERV-K10.

In the last decade, mainly due to high-throughput sequencing and TE annotation advancements, TEs have been disclosed as a major component of vertebrate genomes, strongly contributing to their diversity ^[36]. In mammals, which are the best-studied vertebrates in terms of TE biology, the mobilome (defined as the whole set of TEs in the genome) generally display distinguishing features among vertebrates. TEs account for more than 50% of the size of many mammalian genomes, with a preponderance of retrotransposons, which are present in extremely large copy numbers, a minimized content of DNA transposons and a low subfamily diversity compared to other vertebrates ^[36]. For example, in the two most intensively studied mammals, humans and mice, DNA transposons represent approximately 1.2–3% of the genome sequence, to be compared with the 40–45% of retrotransposons. Among the latter, LINEs contribute 20–22% of the genome sequence in both humans and mice, with LINE-1 (or L1) being the most abundant subfamily, contributing ~17% of the genome sequence. The relative abundance of SINEs and LTR retrotransposons differs markedly between the two mammals, however. In humans, LTR retrotransposons and SINEs represent ~8% and ~13% of the genome, respectively, whereas in mice the corresponding values are 12% and 8%, respectively ^[37]^[38].

The L1 family, perhaps the most evolutionarily successful retrotransposon family in mammals, has been a resident of their genomes since early in mammalian radiation, and is likely to have undergone recurrent cycles of adaptation and innovation, leading to the persistence of a single successful lineage ^[39]. In contrast, SINEs, whose expansion is free from the need to code for a retrotransposition machinery, did not expand continuously from a single, evolutionarily successful family. Instead, novel SINEs have arisen multiple times in the evolution of mammals and, more broadly, of vertebrates ^[36]. The diversity of lineage-specific SINE families that emerged during mammalian evolution is evident when looking at the distribution of SINE families across major mammal groups (see, for example, ^[13]). In particular, the most numerous human SINEs, represented by the SINE1/7SL Alu elements (~1.1 × 10⁶ copies), are primate-specific. A comparative study of mobile element insertions in human and great ape genomes revealed that, during recent human/great ape evolution, the most variable form of genetic variation is represented by Alu retrotransposition, with remarkable increases and decreases occurring over very short evolutionary times ^[40]. The recently discovered, composite SVA elements—evolutionarily young, SINE-derived retrotransposons which include subfamilies restricted to the human lineage—are also specific to primates ^[1]^[41]^[42]. In mice, the most numerous SINE family is represented by the 7SL-derived, monomeric B1 elements (~5.6 × 10⁵ copies), closely followed by the tRNA-derived B2 elements (~3.5 × 10⁵ copies) and by the B1-and tRNA-derived B4 elements (3.9 × 10⁵ copies) ^[37]^[43]^[44]. More generally, drastically different retrotransposon landscapes characterize the genomes of even closely related taxa, and speciation events and the expansion of new retrotransposon families are often correlated, all pointing to the mobilome as a driver of organism diversification ^[45]. This does not exclude the existence of conserved retrotransposon subfamilies that appeared at very early stages of mammalian evolution, such as the tRNA-derived SINEs referred to as mammalian-wide interspersed repeats (MIRs), which were actively propagating prior to the radiation of mammals and before placental mammals separated. Although they are retropositionally inactive, MIRs still represent the second most numerous SINE subfamily in humans ^[46]^[47]^[48].

Concerning LTR retrotransposons, as mentioned above, they derive from ancestral retroviral infections sustained by exogenous retroviruses that have now gone extinct, except for a few exceptional examples of ongoing endogenization ^[49]. Having originated from proviral integrations, these elements display a typical retroviral structure—presenting two LTRs that flank the three main genes gag, pro-pol and env—and are hence also named endogenous retroviruses (ERVs) (Figure 1). ERVs are present in all vertebrate genomes, constituting around 10% of the diverse species’ DNA, and have provided important contributions to their hosts over the course of evolution ^[50]. As suggested by comparative studies, the numerous ERV lineages found in modern mammal genomes arose from multiple independent events of genome invasion, also affecting the host germline, followed by the vertical inheritance of ERVs as host alleles. As a relevant number of such events occurred after the divergence of mammalian orders, each mammalian order tends to have its own distinct ERV content, composition and history, with some ERVs being unique even to individual genera or species, and the same diversification trend also applies to vertebrates as a whole ^[51]^[52]. In the case of primates, for example, lineage-specific ERV insertions have been observed in the genomes of African great apes that are absent from human and Asian ape genomes ^[53]. Even though the retrotransposition activity of human ERVs (HERVs) is presently very limited or absent ^[54], there is growing evidence that HERVs are widely expressed in human tissues, even in the absence of protein production, which has led to an intense study of their possible roles in human pathologies, including cancer, autoimmune disorders and infectious diseases ^[55]^[56]^[57].

ERVs are usually divided into three classes based on their affinity to exogenous animal viruses: class I (gammaretrovirus- and epsilonretrovirus-like), class II (betaretrovirus-like) and class III (spumaretrovirus-like). Concerning individual ERV group classification, it is still incomplete—also due to the relatively recent availability of assembled genome sequences for many vertebrates—and sometimes controversial, given that ERVs are not always named based on phylogenetic and taxonomical criteria. A recent work performed on the human genome with the software RetroTector employed a multi-step classification approach, identifying ~3300 reasonably intact HERV loci that were divided in 31 taxonomical groups, plus 39 “non-canonical” clades showing high degrees of mosaicism and recombination events ^[58]. Such a comprehensive genomic analysis, complemented by the available detailed characterizations of individual HERV groups (see, for instance, ^[59]^[60]^[61]^[62]), represent an ideal background to evaluate HERV expression in human tissues and its variation in diseased contexts ^[63].

The high proportion of retrotransposons in mammalian genomes, exceeding 90% of all TEs in humans and 95% in mice and rats, together with the presence of at least one family of currently accumulating retrotransposons in most mammals ^[38], has attracted the greatest attention onto this TE class. An impressive body of studies in the last two decades have addressed the role that retrotransposons played in mammalian, and particularly in human, evolution by facilitating the appearance of genomic novelties. As new evidence accumulated, authoritative reviews were published at various times, covering in great detail the genomic impact and the different emerging aspects of retrotransposons ^[6]^[8]^[13]^[37]^[38]^[52]^[54]^[64]^[65]^[66]^[67]^[68]^[69]^[70]^[71]^[72]^[73]^[74]^[75]^[76]^[77]^[78]. In a nutshell, it is thought that retrotransposons contributed to the generation of genomic novelties in two main ways: (i) indirectly, through the promotion of genomic rearrangements; and (ii) directly, through exaptation of retrotransposon-derived sequences.

2.2. Retrotransposons as Drivers of Genomic Rearrangements

Their indirect contributions to genomic novelties derive from the fact that retrotransposons, due to their high copy number and high sequence homology within families, are a relatively frequent substrate of unequal recombination events producing gene and/or exon duplication, shuffling or deletion. As an additional mechanism, retrotransposons sometimes carry flanking genomic sequences with them (a process referred to as 5’ or 3’ transduction) thus potentially introducing new copies of genes/exons into new locations ^[54]. The generation of retrogenes is a further indirect consequence of the presence of autonomous retrotransposons, the machinery of which may be exploited by mRNAs or other RNAs to generate new copies of their coding sequences ^[7]. Independently from the mechanisms of their generation, new gene copies have great potential for neofunctionalization favoring phenotypic evolution ^[79], a property that, contrary to what has been prevailingly thought, is likely to also apply to pseudogenes and retropseudogenes ^[80].

2.3. Retrotransposon Exaptation as a Source of Genomic Novelties

The exaptation of retrotransposed sequences, consisting in their cooption for a current function out of a hitherto neutral evolution mode, is a well-documented phenomenon ^[2]^[74]. In general, large-scale DNA editing of retrotransposons, by simultaneously generating large numbers of mutations, may have accelerated their exaptation during mammalian evolution ^[81]. In a similar vein, inverted SINE repeats being part of longer RNAs may have promoted RNA editing by adenosine to inosine deamination, thus generating potential novelties in both coding and regulatory sequences ^[82].

For simplicity, two major exaptation modes can be distinguished. According to the first mode, retrotransposon-derived sequences become physical and functional parts of transcription products, even being eventually translated into protein sequences. The second mode consists in the co-optation of retrotransposon-derived sequences as transcription regulatory elements or 3D genome boundary elements. This exaptation mode, allowing retrotransposon sequences to exert their influence without becoming incorporated into gene products, might have had an even wider influence on genome evolution ^[83]^[84].

2.3.1. Retrotransposon-Derived Sequences within Gene Products

As to the first mode of action, there is solid evidence that SINE (in particular, primate-specific Alu) exonization contributes to both untranslated and protein-coding regions of mRNAs ^[85], as well as portions of long noncoding RNAs ^[86], to which the embedded Alu can confer new regulatory functions ^[87]. Alu-derived exons are often the site of alternative splicing, due to the presence in the Alu body of multiple cryptic splice sites ^[88]. SVA-mediated transduction events, involving alternative mRNA splicing at cryptic splice sites, have been found to promote exon shuffling and thus genomic novelty ^[89]. At the same time, cells have evolved precise mechanisms to control Alu and the incorporation of other retroelements within mRNA sequences via their cryptic splice sites, as their incorrect presence might induce devastating physiological responses ^[90]. Moreover, exonic SINE sequences embedded into the 3’ UTR of mRNAs participate in different layers of post-transcriptional gene regulation, which may also involve intermolecular base-pairing with SINE sequences embedded in lncRNAs ^[43]^[88]^[91]^[92]. Alu SINEs embedded into precursor transcripts were also found to promote the formation of circRNAs ^[93], a complex family of eukaryotic regulatory transcripts under intense study ^[94]^[95]. There is also abundant evidence for TE-derived microRNAs, some of which are potentially involved in human evolution and disease ^[96]^[97]^[98]^[99]^[100].

In the case of autonomous retrotransposons, which contain protein-coding sequences in their body, several striking cases of exaptation of retrotransposon-encoded proteins as new host proteins have been documented. A remarkable example is represented by syncytins, an ensemble of Env proteins coded by different ERVs in the genome of various vertebrates, that through a process of convergent evolution led to the development of the placenta in eutherian mammals ^[55]^[101]. In fact, the union between maternal and fetal cells to constitute the placental syncytiotrophoblast—the main site of trophic exchanges during pregnancy—is mediated by the fusogenic activity of syncytins, which changed from being mechanisms of viral entry to exerting physiological activity domesticated to serving the host biology ^[55]. Some syncytins are indeed thought to have a role in other placenta-associated functions, such as the establishment of maternal immune-tolerance against the fetal allograft through their natural immune-suppressive properties, which in ancestral infections likely guaranteed their immune escape ^[102]^[103].

2.3.2. Retrotransposons as a Source of cis-Regulatory Sequences

Given the centrality of cis-regulatory elements, and particularly of enhancers, in orchestrating organ-, tissue- and cell type-specific gene expression both during development and in adult organisms ^[104], it has been argued that the “vast majority of the genetic changes responsible for the evolution of morphology occur at pre-existing cis-regulatory elements” ^[105], and that TE-mediated cis-regulatory network rewiring has been one of the key mechanisms for the appearance of such changes ^[6]. In the last 10–15 years, the exaptation of TE-derived sequences (especially retrotransposon-derived) as cis-regulatory elements has been well documented by a rapidly growing body of studies, the majority of which have focused on mammalian genomes, characterized by the overwhelming prevalence, in terms of both amount and activity, of retrotransposons over DNA transposons. Retrotransposon-derived cis-regulatory sequences have been reported to play several roles in gene regulation as promoters, enhancers, silencers and boundary elements ^[2]^[83]. In general, due to their own replicative needs, retrotransposons have evolved cis-acting sequences mimicking those of the host, a fact that predisposes them to cis-regulatory activity ^[76]. Although researchers are still far from a comprehensive picture of the multiple layers of TE-derived regulatory novelties and their integration with the whole genomic background of mammalian evolution, various cis-regulatory modes of TE exaptation have begun to be clearly portrayed (Figure 2).

Figure 2. Transcriptional and 3D genome effects of retrotransposons. Retrotransposons (or retroelements, RE) are responsible for a wide range of possible effects on both transcription control and 3D genome organization. (A) Schematic representation of some of the predominant effects of REs on transcription. REs (purple box) can be inserted within the coding region between two exons (gray boxes), providing new transcription start sites (TSS, dark gray arrow) for both sense and antisense transcription. REs can also provide new cis-regulatory sequences (such as enhancers or insulators) which can in turn activate (green arrow) and repress (red arrow) transcription of the associated gene. REs could also alter the epigenetic state of a given gene, leading to its transcriptional repression, by increasing the DNA methylation (yellow circles) within the promoter region of the transcription unit and directly or indirectly recruiting repressive complexes (red box). (B) REs can impact the 3D genome organization of the chromatin within the nuclei. REs (especially Alu elements) are found to be enriched at topologically associating domain (TAD) boundaries. Represented in the Figure is a putative case in which two TADs, one active (blue) and one inactive (red), are separated by a TAD boundary. This boundary limits the action of a brain enhancer region (yellow box) within the active TAD towards a gene (white box) within the inactive TAD, thereby impeding the ectopic brain expression of the gene. As a result of an RE insertion event within the inactive TAD, the 3D genome organization is altered, and a new active TAD is formed due to the boundary effect of the RE. This leads to the spreading of the active TAD over the gene, which allows the brain enhancer region (yellow) to now induce gene activation and therefore its ectopic expression within the brain (yellow area).

First of all, many binding sites for diverse TFs are contributed by retrotransposons, as mainly revealed by genome-wide TF occupancy mapping by chromatin immunoprecipitation coupled with high throughput sequencing (ChIP-seq) ^[106]. Although some of the TF binding sites carried by TEs are justified by their need to employ host TFs for their own life cycle, others may have been acquired independently through TE propagation mechanisms ^[34]. Molecular evolution studies have revealed waves of expansion of the TF target repertoire over the course of vertebrate evolution, with TEs majorly contributing to such expansions ^[107]. TFs tend to bind to TE-provided cognate sites in a species-specific manner, in line with the expansion of different TE subfamilies at different evolutionary timepoints ^[83]. A striking example of how the evolutionary recruitment of TE-derived TF binding contributed to mammalian evolution is provided by the TE-dependent transformation of the uterine regulatory landscape in the evolution of mammalian pregnancy ^[108]. An emerging topic that is potentially highly relevant to the exaptation of TE-binding TFs, is that of Krüppel-associated box domain zinc finger proteins (KRAB-ZFPs). The great expansion and diversification in mammals of these TFs has been correlated with the invasion of new endogenous retroelements, which require specialized mechanisms of repression via the binding of specific KRAB-ZPs and subsequent recruitment of the KAP1 corepressor ^[28]. It is thought that the arms race between KRAB-ZFPs and their target retroelements, facilitated by the evolutionary plasticity conferred on both contenders by the repetitive organization of their genes, favored retroelement domestication, allowing them to develop cis-regulatory functions, to which KRAB-ZFPs have the potential to directly contribute as enhancers or promoter-binding TFs ^[28]^[71]^[109]^[110].

A second, more complex mode of TE exaptation for cis-regulatory purposes is represented by TE-derived clusters of TF binding sites, exemplified by the contribution of species-specific, composite enhancers to mouse placental development by rodent endogenous retroviruses ^[111]. In addition, mouse-specific LTRs have been found to carry multiple pluripotency TF-binding sites (specifically, ESRRB-, KLF4- and SOX2-binding motifs) regulating gene expression in a mouse embryonic stem cell (ESC)-specific manner, thereby distinguishing ESCs in mice from ESCs in other species ^[112]. In a similar vein, recent hominoid-specific LTR and SVA retrotransposons were shown to host enhancers that were active in human naive ESCs and embryonic genome activation ^[110]. Systematic studies of TEs’ contribution to enhancer function have benefited greatly from high-resolution profiling of the regulatory epigenome, such as the profiling of DNase hypersensitivity, histone H3-lysine 4 mono-methylation (H3K4me1) and histone H3-lysine 27 acetylation (H3K27ac) as typical enhancer chromatin signatures ^[113] and by the use of a chromatin characterization software such as ChromHMM ^[114]. A recent comprehensive quantification of the epigenomic status of TEs across many human tissues and cell types revealed that approximately one quarter of the human regulatory epigenome is composed of retrotransposed sequences, with motif-enriched LTRs being particularly favorable substrates for the evolution of new host regulatory elements ^[115]. In other studies, based on epigenomic profiling, evolutionary novelties in primate gene regulation were similarly found to have TEs as the primary source, with a major contribution from ERV-derived sequences ^[116]^[117]. Accordingly, a subset of ERV sequences were found to be significantly enriched in cis-regulatory elements, having a critical role in primate liver gene regulation ^[117]. A fascinating example of ERV contribution in the shaping of entire regulatory pathways is represented by the interferon (IFN) transcriptional network, a crucial innate antiviral system which also serves as a fundamental effector to initiate and maintain adaptive immunity. Chuong and coauthors showed that ERV insertions had a central role in its evolution and amplification, accounting for the independent dissemination of a wide number of IFN-inducible enhancers in many mammalian genomes, which are required for the correct functioning of different immune responses ^[118]. A similar scenario is found for p53 tumor suppressor factor, of which the genomic binding sites in humans overlap in more than one-third of cases with ERV elements ^[119]. Of note, these binding sites are primate-specific and not present in other mammals, further demonstrating that TEs are able to shape important regulatory networks in a species-specific manner. An intriguing observation, consistent with the previous ones, is that of the pervasive function of an ape-specific class of ERV-derived LTRs, LTR5HS, as early embryonic enhancers, regulating hundreds of human genes ^[120], and the strong contribution of ERV and L1 retrotransposon families to species-specific differences in enhancer activity between chimpanzee and human cranial neural crest cells ^[83]^[121]. Epigenome profiling also allowed researchers to distinguish between older retrotransposon copies displaying most of the features of de facto enhancers and younger copies that seem instead to be configured as proto-enhancers, serving as a repertoire for the de novo evolutionary birth of enhancers ^[122]. Despite the scarcity of studies, an intriguing retrotransposon feature favoring their exaptation as enhancers is their intrinsic capability of generating functional non-protein-coding RNAs (ncRNAs) that could overlap with the so-called enhancer RNAs (eRNAs) ^[123], thereby raising the possibility that many eRNAs could be generated through TE-derived ncRNAs.

2.3.3. Involvement of Retrotransposons in Three-Dimensional Genome Architecture

Chromosome contacts within the nuclear space, recently revealed at unprecedented resolution by HiC and complementary approaches ^[124], exert a wide and still largely unexplored influence on gene regulation by demarcating regulatory districts in a highly dynamic way. At a large scale within nuclei, chromosomes segregate into regions of preferential long-range interactions that form two mutually excluded types of chromatin, referred to as “A” and “B” compartments ^[125], the formation of which has been recently linked to homotypic clustering of L1 and B1/Alu, respectively ^[126]. At a scale of tens to hundreds of kilobases, chromosomes fold into domains with preferential intradomain interactions known as topologically associating domains (TADs), which harbor the potential to influence enhancer function and thus gene regulatory networks ^[127]^[128]^[129]^[130]^[131]^[132]. TAD demarcation is achieved by specific regions called TAD boundaries, which are enriched for the occupancy of CCCTC-binding factor (CTCF), a zinc finger DNA binding protein also known to mediate the formation of chromatin loops ^[133]. SINE retrotransposons have also been found to be enriched at TAD boundaries ^[134]^[135]. Curiously, in rodents (but not in humans) B2 SINE retrotransposons have been shown to carry CTCF binding motifs, and therefore rodent B2 SINEs can contribute to clustered CTCF sites at TAD boundaries, thus helping in the maintenance of genome organization ^[136]. However, the rapid expansion of rodent SINEs might provide excessive CTCF sites throughout the genome, therefore critically increasing the possibility of genome mis-folding due to the creation of aberrant CTCF sites. In this context, a complex formed by CHD4, ADNP and HP1 chromatin proteins (ChAHP complex) has been shown to play a role in the maintenance of evolutionarily conserved spatial chromatin organization via the buffering of novel CTCF binding sites that emerge through SINE expansion ^[137]. Moreover, SINE and other retrotransposons have been proposed to participate in the establishment of species-specific chromatin loops by introducing novel binding sites for architectural proteins, including CTCF ^[138]. CTCF might also participate, together with other proteins, in the DNA methylation and histone modification boundary activity recently attributed to currently active copies of mouse B2 SINEs, which might be involved in the epigenomic and phenotypic diversification of mouse species ^[139].

The contribution of retrotransposons to chromatin regulatory domains is not limited to providing CTCF binding clusters. MIR retrotransposons, for example, have been shown to provide regulatory sequences, functioning as insulators in the human genome independently from CTCF ^[140]. The presence of binding sites for the multi-subunit DNA binding protein TFIIIC is a distinguishing feature of SINEs, and TFIIIC bound to Alu elements has been shown to influence gene regulation through its chromatin looping and histone acetylation capacities ^[141]^[142]. In the case of SINEs exapted as enhancers or TAD boundaries, their regulatory function might even take advantage of their Pol III-dependent transcription, which was recently demonstrated to occur with a marked cell-type specificity ^[123]^[143]^[144]. Retrotransposon transcription has also been shown to be required for the cell type- and species-specific chromatin architecture remodeling properties recently attributed to the primate-specific HERV-H TE family of LTR retrotransposons ^[145].

This entry is adapted from the peer-reviewed paper 10.3390/life11050376

References

Burns, K.H.; Boeke, J.D. Human transposon tectonics. Cell 2012, 149, 740–752.
Brosius, J. Exaptation at the molecular genetic level. Sci. China Life Sci. 2019, 62, 437–452.
Warren, I.A.; Naville, M.; Chalopin, D.; Levin, P.; Berger, C.S.; Galiana, D.; Volff, J.N. Evolutionary impact of transposable elements on genomic diversity and lineage-specific innovation in vertebrates. Chromosome Res. 2015, 23, 505–531.
Wells, J.N.; Feschotte, C. A Field Guide to Eukaryotic Transposable Elements. Annu. Rev. Genet. 2020, 54, 539–561.
Choi, J.Y.; Lee, Y.C.G. Double-edged sword: The evolutionary consequences of the epigenetic silencing of transposable elements. PLoS Genet. 2020, 16, e1008872.
Rebollo, R.; Romanish, M.T.; Mager, D.L. Transposable elements: An abundant and natural source of regulatory sequences for host genes. Annu. Rev. Genet. 2012, 46, 21–42.
Brosius, J. The persistent contributions of RNA to eukaryotic gen(om)e architecture and cellular function. Cold Spring Harb. Perspect. Biol. 2014, 6, a016089.
Shapiro, J.A. Exploring the read-write genome: Mobile DNA and mammalian adaptation. Crit. Rev. Biochem. Mol. Biol. 2017, 52, 1–17.
Paquola, A.C.M.; Erwin, J.A.; Gage, F.H. Insights into the role of somatic mosaicism in the brain. Curr. Opin. Syst. Biol. 2017, 1, 90–94.
Bodea, G.O.; McKelvey, E.G.Z.; Faulkner, G.J. Retrotransposon-induced mosaicism in the neural Genome. Open. Biol. 2018, 8.
Evans, T.A.; Erwin, J.A. Retroelement-derived RNA and its role in the brain. Semin. Cell. Dev. Biol. 2020.
Hantak, M.P.; Einstein, J.; Kearns, R.B.; Shepherd, J.D. Intercellular Communication in the Nervous System Goes Viral. Trends Neurosci. 2021.
Nishihara, H. Transposable elements as genetic accelerators of evolution: Contribution to genome size, gene regulatory network rewiring and morphological innovation. Genes Genet. Syst. 2020, 94, 269–281.
Luo, X.; Liu, Y.; Dang, D.; Hu, T.; Hou, Y.; Meng, X.; Zhang, F.; Li, T.; Wang, C.; Li, M.; et al. 3D Genome of macaque fetal brain reveals evolutionary innovations during primate corticogenesis. Cell 2021.
Ahmadi, A.; De Toma, I.; Vilor-Tejedor, N.; Eftekhariyan Ghamsari, M.R.; Sadeghi, I. Transposable elements in brain health and disease. Ageing Res. Rev. 2020, 64, 101153.
Reilly, M.T.; Faulkner, G.J.; Dubnau, J.; Ponomarev, I.; Gage, F.H. The role of transposable elements in health and diseases of the central nervous system. J. Neurosci. 2013, 33, 17577–17586.
Bitar, M.; Barry, G. Multiple Innovations in Genetic and Epigenetic Mechanisms Cooperate to Underpin Human Brain Evolution. Mol. Biol. Evol. 2018, 35, 263–268.
Larsen, P.A.; Hunnicutt, K.E.; Larsen, R.J.; Yoder, A.D.; Saunders, A.M. Warning SINEs: Alu elements, evolution of the human brain, and the spectrum of neurological disease. Chromosome Res. 2018, 26, 93–111.
Bourque, G.; Burns, K.H.; Gehring, M.; Gorbunova, V.; Seluanov, A.; Hammell, M.; Imbeault, M.; Izsvak, Z.; Levin, H.L.; Macfarlan, T.S.; et al. Ten things you should know about transposable elements. Genome Biol. 2018, 19, 199.
Fedoroff, N.V. Transposable elements, epigenetics, and genome evolution. Science 2012, 338, 758–767.
McClintock, B. The significance of responses of the genome to challenge. Science 1984, 226, 792–801.
Davidson, E.H.; Britten, R.J. Regulation of gene expression: Possible role of repetitive sequences. Science 1979, 204, 1052–1059.
Biemont, C. A brief history of the status of transposable elements: From junk DNA to major players in evolution. Genetics 2010, 186, 1085–1093.
Kazazian, H.H., Jr.; Moran, J.V. Mobile DNA in Health and Disease. N. Engl. J. Med. 2017, 377, 361–370.
Hancks, D.C.; Kazazian, H.H., Jr. Roles for retrotransposon insertions in human disease. Mob. DNA 2016, 7, 9.
Levin, H.L.; Moran, J.V. Dynamic interactions between transposable elements and their hosts. Nat. Rev. Genet. 2011, 12, 615–627.
Chuong, E.B.; Rumi, M.A.; Soares, M.J.; Baker, J.C. Endogenous retroviruses function as species-specific enhancer elements in the placenta. Nat. Genet. 2013, 45, 325–329.
Bruno, M.; Mahgoub, M.; Macfarlan, T.S. The Arms Race Between KRAB-Zinc Finger Proteins and Endogenous Retroelements and Its Impact on Mammals. Annu. Rev. Genet. 2019, 53, 393–416.
Kojima, K.K. Human transposable elements in Repbase: Genomic footprints from fish to humans. Mob. DNA 2018, 9, 2.
Storer, J.; Hubley, R.; Rosen, J.; Wheeler, T.J.; Smit, A.F. The Dfam community resource of transposable element families, sequence models, and genome annotations. Mob. DNA 2021, 12, 2.
Kajikawa, M.; Okada, N. LINEs mobilize SINEs in the eel through a shared 3’ sequence. Cell 2002, 111, 433–444.
Dewannieux, M.; Esnault, C.; Heidmann, T. LINE-mediated retrotransposition of marked Alu sequences. Nat. Genet. 2003, 35, 41–48.
Dieci, G.; Conti, A.; Pagano, A.; Carnevali, D. Identification of RNA polymerase III-transcribed genes in eukaryotic genomes. Biochim. Biophys. Acta 2013, 1829, 296–305.
Hermant, C.; Torres-Padilla, M.E. TFs for TEs: The transcription factor repertoire of mammalian transposable elements. Genes Dev. 2021, 35, 22–39.
Orioli, A.; Pascali, C.; Pagano, A.; Teichmann, M.; Dieci, G. RNA polymerase III transcription control elements: Themes and variations. Gene 2012, 493, 185–194.
Sotero-Caio, C.G.; Platt, R.N., 2nd; Suh, A.; Ray, D.A. Evolution and Diversity of Transposable Elements in Vertebrate Genomes. Genome Biol. Evol. 2017, 9, 161–177.
Richardson, S.R.; Doucet, A.J.; Kopera, H.C.; Moldovan, J.B.; Garcia-Perez, J.L.; Moran, J.V. The Influence of LINE-1 and SINE Retrotransposons on Mammalian Genomes. Microbiol. Spectr. 2015, 3.
Platt, R.N., 2nd; Vandewege, M.W.; Ray, D.A. Mammalian transposable elements and their impacts on genome evolution. Chromosome Res. 2018, 26, 25–43.
Blumenstiel, J.P. Birth, School, Work, Death, and Resurrection: The Life Stages and Dynamics of Transposable Element Proliferation. Genes 2019, 10, 336.
Hormozdiari, F.; Konkel, M.K.; Prado-Martinez, J.; Chiatante, G.; Herraez, I.H.; Walker, J.A.; Nelson, B.; Alkan, C.; Sudmant, P.H.; Huddleston, J.; et al. Rates and patterns of great ape retrotransposition. Proc. Natl. Acad. Sci. USA 2013, 110, 13457–13462.
Wang, H.; Xing, J.; Grover, D.; Hedges, D.J.; Han, K.; Walker, J.A.; Batzer, M.A. SVA elements: A hominid-specific retroposon family. J. Mol. Biol. 2005, 354, 994–1007.
Hancks, D.C.; Kazazian, H.H., Jr. SVA retrotransposons: Evolution and genetic instability. Semin. Cancer Biol. 2010, 20, 234–245.
Maquat, L.E. Short interspersed nuclear element (SINE)-mediated post-transcriptional effects on human and mouse gene expression: SINE-UP for active duty. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2020, 375, 20190344.
Kramerov, D.A.; Vassetzky, N.S. SINEs. Wiley. Interdiscip. Rev. RNA 2011, 2, 772–786.
Jurka, J.; Bao, W.; Kojima, K.K. Families of transposable elements, population structure and the origin of species. Biol. Direct. 2011, 6, 44.
Jurka, J.; Zietkiewicz, E.; Labuda, D. Ubiquitous mammalian-wide interspersed repeats (MIRs) are molecular fossils from the mesozoic era. Nucleic Acids Res. 1995, 23, 170–175.
Smit, A.F.; Riggs, A.D. MIRs are classic, tRNA-derived SINEs that amplified before the mammalian radiation. Nucleic Acids Res. 1995, 23, 98–102.
Carnevali, D.; Conti, A.; Pellegrini, M.; Dieci, G. Whole-genome expression analysis of mammalian-wide interspersed repeat elements in human cell lines. DNA Res. Int. J. Rapid. Publ. Rep. Genes Genomes 2017, 24, 59–69.
Greenwood, A.D.; Ishida, Y.; O’Brien, S.P.; Roca, A.L.; Eiden, M.V. Transmission, Evolution, and Endogenization: Lessons Learned from Recent Retroviral Invasions. Microbiol. Mol. Biol. Rev. 2018, 82.
Dewannieux, M.; Heidmann, T. Endogenous retroviruses: Acquisition, amplification and taming of genome invaders. Curr. Opin. Virol. 2013, 3, 646–656.
Hayward, A.; Cornwallis, C.K.; Jern, P. Pan-vertebrate comparative genomics unmasks retrovirus macroevolution. Proc. Natl. Acad. Sci. USA 2015, 112, 464–469.
Johnson, W.E. Origins and evolutionary consequences of ancient endogenous retroviruses. Nat. Rev. Microbiol. 2019, 17, 355–370.
Yohn, C.T.; Jiang, Z.; McGrath, S.D.; Hayden, K.E.; Khaitovich, P.; Johnson, M.E.; Eichler, M.Y.; McPherson, J.D.; Zhao, S.; Paabo, S.; et al. Lineage-specific expansions of retroviral insertions within the genomes of African great apes but not humans and orangutans. PLoS Biol. 2005, 3, e110.
Cordaux, R.; Batzer, M.A. The impact of retrotransposons on human genome evolution. Nat. Rev. Genet. 2009, 10, 691–703.
Grandi, N.; Tramontano, E. HERV Envelope Proteins: Physiological Role and Pathogenic Potential in Cancer and Autoimmunity. Front. Microbiol. 2018, 9, 462.
Grandi, N.; Tramontano, E. Type W Human Endogenous Retrovirus (HERV-W) Integrations and Their Mobilization by L1 Machinery: Contribution to the Human Transcriptome and Impact on the Host Physiopathology. Viruses 2017, 9, 162.
Liu, C.H.; Grandi, N.; Palanivelu, L.; Tramontano, E.; Lin, L.T. Contribution of Human Retroviruses to Disease Development-A Focus on the HIV- and HERV-Cancer Relationships and Treatment Strategies. Viruses 2020, 12, 852.
Vargiu, L.; Rodriguez-Tome, P.; Sperber, G.O.; Cadeddu, M.; Grandi, N.; Blikstad, V.; Tramontano, E.; Blomberg, J. Classification and characterization of human endogenous retroviruses; mosaic forms are common. Retrovirology 2016, 13, 7.
Grandi, N.; Cadeddu, M.; Blomberg, J.; Tramontano, E. Contribution of type W human endogenous retroviruses to the human genome: Characterization of HERV-W proviral insertions and processed pseudogenes. Retrovirology 2016, 13, 67.
Grandi, N.; Cadeddu, M.; Pisano, M.P.; Esposito, F.; Blomberg, J.; Tramontano, E. Identification of a novel HERV-K(HML10): Comprehensive characterization and comparative analysis in non-human primates provide insights about HML10 proviruses structure and diffusion. Mob. DNA 2017, 8, 15.
Pisano, M.P.; Grandi, N.; Cadeddu, M.; Blomberg, J.; Tramontano, E. Comprehensive Characterization of the Human Endogenous Retrovirus HERV-K(HML-6) Group: Overview of Structure, Phylogeny, and Contribution to the Human Genome . J. Virol. 2019, 93.
Subramanian, R.P.; Wildschutte, J.H.; Russo, C.; Coffin, J.M. Identification, characterization, and comparative genomic distribution of the HERV-K (HML-2) group of human endogenous retroviruses. Retrovirology 2011, 8, 90.
Pisano, M.P.; Grandi, N.; Tramontano, E. High-Throughput Sequencing is a Crucial Tool to Investigate the Contribution of Human Endogenous Retroviruses (HERVs) to Human Biology and Development. Viruses 2020, 12, 633.
Brosius, J. Retroposons--seeds of evolution. Science 1991, 251, 753.
Jurka, J. Repeats in genomic DNA: Mining and meaning. Curr. Opin. Struct. Biol. 1998, 8, 333–337.
Deininger, P.L.; Moran, J.V.; Batzer, M.A.; Kazazian, H.H., Jr. Mobile elements and mammalian genome evolution. Curr. Opin. Genet. Dev. 2003, 13, 651–658.
Kazazian, H.H., Jr. Mobile elements: Drivers of genome evolution. Science 2004, 303, 1626–1632.
Goodier, J.L.; Kazazian, H.H., Jr. Retrotransposons revisited: The restraint and rehabilitation of parasites. Cell 2008, 135, 23–35.
Okada, N.; Sasaki, T.; Shimogori, T.; Nishihara, H. Emergence of mammals by emergency: Exaptation. Genes Cells 2010, 15, 801–812.
Gifford, W.D.; Pfaff, S.L.; Macfarlan, T.S. Transposable elements as genetic regulatory substrates in early development. Trends Cell. Biol. 2013, 23, 218–226.
Friedli, M.; Trono, D. The Developmental Control of Transposable Elements and the Evolution of Higher Species. Annu. Rev. Cell. Dev. Biol. 2015, 31, 429–451.
Garcia-Perez, J.L.; Widmann, T.J.; Adams, I.R. The impact of transposable elements on mammalian development. Development 2016, 143, 4101–4114.
Cosby, R.L.; Chang, N.C.; Feschotte, C. Host-transposon interactions: Conflict, cooperation, and cooption. Genes Dev. 2019, 33, 1098–1116.
Mita, P.; Boeke, J.D. How retrotransposons shape genome regulation. Curr. Opin. Genet. Dev. 2016, 37, 90–100.
Etchegaray, E.; Naville, M.; Volff, J.N.; Haftek-Terreau, Z. Transposable element-derived sequences in vertebrate development. Mob. DNA 2021, 12, 1.
Chuong, E.B.; Elde, N.C.; Feschotte, C. Regulatory activities of transposable elements: From conflicts to benefits. Nat. Rev. Genet. 2017, 18, 71–86.
Elbarbary, R.A.; Lucas, B.A.; Maquat, L.E. Retrotransposons as regulators of gene expression. Science 2016, 351, aac7247.
Thompson, P.J.; Macfarlan, T.S.; Lorincz, M.C. Long Terminal Repeats: From Parasitic Elements to Building Blocks of the Transcriptional Regulatory Repertoire. Mol. Cell. 2016, 62, 766–776.
Chen, S.; Krinsky, B.H.; Long, M. New genes as drivers of phenotypic evolution. Nat. Rev. Genet. 2013, 14, 645–660.
Cheetham, S.W.; Faulkner, G.J.; Dinger, M.E. Overcoming challenges and dogmas to understand the functions of pseudogenes. Nat. Rev. Genet. 2020, 21, 191–201.
Carmi, S.; Church, G.M.; Levanon, E.Y. Large-scale DNA editing of retrotransposons accelerates mammalian genome evolution. Nat. Commun. 2011, 2, 519.
Daniel, C.; Silberberg, G.; Behm, M.; Ohman, M. Alu elements shape the primate transcriptome by cis-regulation of RNA editing. Genome Biol. 2014, 15, R28.
Sundaram, V.; Wysocka, J. Transposable elements as a potent source of diverse cis-regulatory sequences in mammalian genomes. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2020, 375, 20190347.
Drongitis, D.; Aniello, F.; Fucci, L.; Donizetti, A. Roles of Transposable Elements in the Different Layers of Gene Expression Regulation. Int. J. Mol. Sci. 2019, 20, 5755.
Lin, L.; Jiang, P.; Park, J.W.; Wang, J.; Lu, Z.X.; Lam, M.P.; Ping, P.; Xing, Y. The contribution of Alu exons to the human proteome. Genome Biol. 2016, 17, 15.
Kim, E.Z.; Wespiser, A.R.; Caffrey, D.R. The domain structure and distribution of Alu elements in long noncoding RNAs and mRNAs. RNA 2016, 22, 254–264.
Toki, N.; Takahashi, H.; Sharma, H.; Valentine, M.N.Z.; Rahman, F.M.; Zucchelli, S.; Gustincich, S.; Carninci, P. SINEUP long non-coding RNA acts via PTBP1 and HNRNPK to promote translational initiation assemblies. Nucleic Acids Res. 2020, 48, 11626–11644.
Chen, L.L.; Yang, L. ALUternative Regulation for Gene Expression. Trends Cell. Biol. 2017, 27, 480–490.
Damert, A.; Raiz, J.; Horn, A.V.; Lower, J.; Wang, H.; Xing, J.; Batzer, M.A.; Lower, R.; Schumann, G.G. 5’-Transducing SVA retrotransposon groups spread efficiently throughout the human Genome. Genome Res. 2009, 19, 1992–2008.
Wu, Y.; Zhao, W.; Liu, Y.; Tan, X.; Li, X.; Zou, Q.; Xiao, Z.; Xu, H.; Wang, Y.; Yang, X. Function of HNRNPC in breast cancer cells by controlling the dsRNA-induced interferon response. EMBO J. 2018, 37.
Smalheiser, N.R.; Torvik, V.I. Alu elements within human mRNAs are probable microRNA targets. Trends Genet. 2006, 22, 532–536.
Gong, C.; Maquat, L.E. lncRNAs transactivate STAU1-mediated mRNA decay by duplexing with 3′ UTRs via Alu elements. Nature 2011, 470, 284–288.
Welden, J.R.; Stamm, S. Pre-mRNA structures forming circular RNAs. Biochim. Biophys. Acta Gene Regul. Mech. 2019, 1862, 194410.
Patop, I.L.; Wust, S.; Kadener, S. Past, present, and future of circRNAs. EMBO J. 2019, 38, e100836.
Di Timoteo, G.; Rossi, F.; Bozzoni, I. Circular RNAs in cell differentiation and development. Development 2020, 147.
Lee, H.E.; Huh, J.W.; Kim, H.S. Bioinformatics Analysis of Evolution and Human Disease Related Transposable Element-Derived microRNAs. Life 2020, 10, 95.
Smalheiser, N.R.; Torvik, V.I. Mammalian microRNAs derived from genomic repeats. Trends Genet. 2005, 21, 322–326.
Piriyapongsa, J.; Marino-Ramirez, L.; Jordan, I.K. Origin and evolution of human microRNAs from transposable elements. Genetics 2007, 176, 1323–1337.
Roberts, J.T.; Cooper, E.A.; Favreau, C.J.; Howell, J.S.; Lane, L.G.; Mills, J.E.; Newman, D.C.; Perry, T.J.; Russell, M.E.; Wallace, B.M.; et al. Continuing analysis of microRNA origins: Formation from transposable element insertions and noncoding RNA mutations. Mob. Genet. Elem. 2013, 3, e27755.
Spengler, R.M.; Oakley, C.K.; Davidson, B.L. Functional microRNAs and target sites are created by lineage-specific transposition. Hum. Mol. Genet. 2014, 23, 1783–1793.
Lavialle, C.; Cornelis, G.; Dupressoir, A.; Esnault, C.; Heidmann, O.; Vernochet, C.; Heidmann, T. Paleovirology of ‘syncytins’, retroviral env genes exapted for a role in placentation. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2013, 368, 20120507.
Blaise, S.; de Parseval, N.; Benit, L.; Heidmann, T. Genomewide screening for fusogenic human endogenous retrovirus envelopes identifies syncytin 2, a gene conserved on primate evolution. Proc. Natl. Acad. Sci. USA 2003, 100, 13013–13018.
Lokossou, A.G.; Toudic, C.; Barbeau, B. Implication of human endogenous retrovirus envelope proteins in placental functions. Viruses 2014, 6, 4609–4627.
Long, H.K.; Prescott, S.L.; Wysocka, J. Ever-Changing Landscapes: Transcriptional Enhancers in Development and Evolution. Cell 2016, 167, 1170–1187.
Wagner, G.P.; Lynch, V.J. Evolutionary novelties. Curr. Biol. 2010, 20, R48–R52.
Sundaram, V.; Cheng, Y.; Ma, Z.; Li, D.; Xing, X.; Edge, P.; Snyder, M.P.; Wang, T. Widespread contribution of transposable elements to the innovation of gene regulatory networks. Genome Res. 2014, 24, 1963–1976.
Marnetto, D.; Mantica, F.; Molineris, I.; Grassi, E.; Pesando, I.; Provero, P. Evolutionary Rewiring of Human Regulatory Networks by Waves of Genome Expansion. Am. J. Hum. Genet. 2018, 102, 207–218.
Lynch, V.J.; Nnamani, M.C.; Kapusta, A.; Brayer, K.; Plaza, S.L.; Mazur, E.C.; Emera, D.; Sheikh, S.Z.; Grutzner, F.; Bauersachs, S.; et al. Ancient transposable elements transformed the uterine regulatory landscape and transcriptome during the evolution of mammalian pregnancy. Cell Rep. 2015, 10, 551–561.
Imbeault, M.; Helleboid, P.Y.; Trono, D. KRAB zinc-finger proteins contribute to the evolution of gene regulatory networks. Nature 2017, 543, 550–554.
Pontis, J.; Planet, E.; Offner, S.; Turelli, P.; Duc, J.; Coudray, A.; Theunissen, T.W.; Jaenisch, R.; Trono, D. Hominoid-Specific Transposable Elements and KZFPs Facilitate Human Embryonic Genome Activation and Control Transcription in Naive Human ESCs. Cell Stem Cell 2019, 24, 724–735.e725.
Chuong, E.B. Retroviruses facilitate the rapid evolution of the mammalian placenta. Bioessays 2013, 35, 853–861.
Sundaram, V.; Choudhary, M.N.; Pehrsson, E.; Xing, X.; Fiore, C.; Pandey, M.; Maricque, B.; Udawatta, M.; Ngo, D.; Chen, Y.; et al. Functional cis-regulatory modules encoded by mouse-specific endogenous retrovirus. Nat. Commun. 2017, 8, 14550.
Field, A.; Adelman, K. Evaluating Enhancer Function and Transcription. Annu. Rev. Biochem. 2020, 89, 213–234.
Ernst, J.; Kellis, M. Chromatin-state discovery and genome annotation with ChromHMM. Nat. Protoc. 2017, 12, 2478–2492.
Pehrsson, E.C.; Choudhary, M.N.K.; Sundaram, V.; Wang, T. The epigenomic landscape of transposable elements across normal human development and anatomy. Nat. Commun. 2019, 10, 5640.
Jacques, P.E.; Jeyakani, J.; Bourque, G. The majority of primate-specific regulatory sequences are derived from transposable elements. PLoS Genet. 2013, 9, e1003504.
Trizzino, M.; Park, Y.; Holsbach-Beltrame, M.; Aracena, K.; Mika, K.; Caliskan, M.; Perry, G.H.; Lynch, V.J.; Brown, C.D. Transposable elements are the primary source of novelty in primate gene regulation. Genome Res. 2017, 27, 1623–1633.
Chuong, E.B.; Elde, N.C.; Feschotte, C. Regulatory evolution of innate immunity through co-option of endogenous retroviruses. Science 2016, 351, 1083–1087.
Wang, T.; Zeng, J.; Lowe, C.B.; Sellers, R.G.; Salama, S.R.; Yang, M.; Burgess, S.M.; Brachmann, R.K.; Haussler, D. Species-specific endogenous retroviruses shape the transcriptional network of the human tumor suppressor protein p53. Proc. Natl. Acad. Sci. USA 2007, 104, 18613–18618.
Fuentes, D.R.; Swigut, T.; Wysocka, J. Systematic perturbation of retroviral LTRs reveals widespread long-range effects on human gene regulation. Elife 2018, 7.
Prescott, S.L.; Srinivasan, R.; Marchetto, M.C.; Grishina, I.; Narvaiza, I.; Selleri, L.; Gage, F.H.; Swigut, T.; Wysocka, J. Enhancer divergence and cis-regulatory evolution in the human and chimp neural crest. Cell 2015, 163, 68–83.
Su, M.; Han, D.; Boyd-Kirkup, J.; Yu, X.; Han, J.D. Evolution of Alu elements toward enhancers. Cell Rep. 2014, 7, 376–385.
Policarpi, C.; Crepaldi, L.; Brookes, E.; Nitarska, J.; French, S.M.; Coatti, A.; Riccio, A. Enhancer SINEs Link Pol III to Pol II Transcription in Neurons. Cell Rep. 2017, 21, 2879–2894.
Kempfer, R.; Pombo, A. Methods for mapping 3D chromosome architecture. Nat. Rev. Genet. 2020, 21, 207–226.
Szabo, Q.; Bantignies, F.; Cavalli, G. Principles of genome folding into topologically associating domains. Sci. Adv. 2019, 5, eaaw1668.
Lu, J.Y.; Chang, L.; Li, T.; Wang, T.; Yin, Y.; Zhan, G.; Han, X.; Zhang, K.; Tao, Y.; Percharde, M.; et al. Homotypic clustering of L1 and B1/Alu repeats compartmentalizes the 3D Genome. Cell Res. 2021.
Cavalheiro, G.R.; Pollex, T.; Furlong, E.E. To loop or not to loop: What is the role of TADs in enhancer function and gene regulation? Curr. Opin. Genet. Dev. 2021, 67, 119–129.
Schoenfelder, S.; Fraser, P. Long-range enhancer-promoter contacts in gene expression control. Nat. Rev. Genet. 2019, 20, 437–455.
Rowley, M.J.; Corces, V.G. Organizational principles of 3D genome architecture. Nat. Rev. Genet. 2018, 19, 789–800.
Schmitt, A.D.; Hu, M.; Ren, B. Genome-wide mapping and analysis of chromosome architecture. Nat. Rev. Mol. Cell. Biol. 2016, 17, 743–755.
Bonev, B.; Cavalli, G. Organization and function of the 3D Genome . Nat. Rev. Genet. 2016, 17, 661–678.
Pombo, A.; Dillon, N. Three-dimensional genome architecture: Players and mechanisms. Nat. Rev. Mol. Cell. Biol. 2015, 16, 245–257.
Xiang, J.F.; Corces, V.G. Regulation of 3D chromatin organization by CTCF. Curr. Opin. Genet. Dev. 2020, 67, 33–40.
Ong, C.T.; Corces, V.G. CTCF: An architectural protein bridging genome topology and function. Nat. Rev. Genet. 2014, 15, 234–246.
Nora, E.P.; Lajoie, B.R.; Schulz, E.G.; Giorgetti, L.; Okamoto, I.; Servant, N.; Piolot, T.; van Berkum, N.L.; Meisig, J.; Sedat, J.; et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature 2012, 485, 381–385.
Kentepozidou, E.; Aitken, S.J.; Feig, C.; Stefflova, K.; Ibarra-Soria, X.; Odom, D.T.; Roller, M.; Flicek, P. Clustered CTCF binding is an evolutionary mechanism to maintain topologically associating domains. Genome Biol. 2020, 21, 5.
Kaaij, L.J.T.; Mohn, F.; van der Weide, R.H.; de Wit, E.; Buhler, M. The ChAHP Complex Counteracts Chromatin Looping at CTCF Sites that Emerged from SINE Expansions in Mouse. Cell 2019, 178, 1437–1451.e1414.
Choudhary, M.N.; Friedman, R.Z.; Wang, J.T.; Jang, H.S.; Zhuo, X.; Wang, T. Co-opted transposons help perpetuate conserved higher-order chromosomal structures. Genome Biol. 2020, 21, 16.
Ichiyanagi, T.; Katoh, H.; Mori, Y.; Hirafuku, K.; Boyboy, B.A.; Kawase, M.; Ichiyanagi, K. B2 SINE copies serve as a transposable boundary of DNA methylation and histone modifications in the mouse. Mol. Biol. Evol. 2021.
Wang, J.; Vicente-Garcia, C.; Seruggia, D.; Molto, E.; Fernandez-Minan, A.; Neto, A.; Lee, E.; Gomez-Skarmeta, J.L.; Montoliu, L.; Lunyak, V.V.; et al. MIR retrotransposon sequences provide insulators to the human Genome . Proc. Natl. Acad. Sci. USA 2015, 112, E4428–E4437.
Ferrari, R.; de Llobet Cucalon, L.I.; Di Vona, C.; Le Dilly, F.; Vidal, E.; Lioutas, A.; Oliete, J.Q.; Jochem, L.; Cutts, E.; Dieci, G.; et al. TFIIIC Binding to Alu Elements Controls Gene Expression via Chromatin Looping and Histone Acetylation. Mol. Cell 2020, 77, 475–487.e411.
Crepaldi, L.; Policarpi, C.; Coatti, A.; Sherlock, W.T.; Jongbloets, B.C.; Down, T.A.; Riccio, A. Binding of TFIIIC to sine elements controls the relocation of activity-dependent neuronal genes to transcription factories. PLoS Genet. 2013, 9, e1003699.
Conti, A.; Carnevali, D.; Bollati, V.; Fustinoni, S.; Pellegrini, M.; Dieci, G. Identification of RNA polymerase III-transcribed Alu loci by computational screening of RNA-Seq data. Nucleic Acids Res. 2015, 43, 817–835.
Zhang, X.O.; Gingeras, T.R.; Weng, Z. Genome-wide analysis of polymerase III-transcribed Alu elements suggests cell-type-specific enhancer function. Genome Res. 2019, 29, 1402–1414.
Zhang, Y.; Li, T.; Preissl, S.; Amaral, M.L.; Grinstein, J.D.; Farah, E.N.; Destici, E.; Qiu, Y.; Hu, R.; Lee, A.Y.; et al. Transcriptionally active HERV-H retrotransposons demarcate topologically associating domains in human pluripotent stem cells. Nat. Genet. 2019, 51, 1380–1388.

© Text is available under the terms and conditions of the Creative Commons Attribution (CC BY) license; additional terms may apply. By using this site, you agree to the Terms and Conditions and Privacy Policy.