| Version | Summary | Created by | Modification | Content Size | Created at | Operation |
|---|---|---|---|---|---|---|
| 1 | Cecilia Winata | + 2995 word(s) | 2995 | 2021-04-13 11:45:44 |
Enhancers positively influence the activity of its target gene by operating at long-range distances in either direction of the nucleotide sequence. Early heart development is tightly controlled by these cis-regulatory elements and mutations affecting them have been shown to result in devastating forms of congenital heart defect. Therefore, identifying enhancers implicated in heart biology and understanding their mechanism is key to improve diagnosis and therapeutic options.
The cell specification and tissue remodeling that entails the formation of an organ are complex biological processes which are modulated by coordinated spatiotemporal execution of gene regulatory networks which dictates cell fates and organize specialized cell types into complex three-dimensional units of structure and function [1][2]. These networks are composed of diverse genes and their regulatory elements that evolve at different rates and can undergo various modifications. Non-coding DNA regulatory elements mediate the molecular networks of regulatory processes at the transcriptional, post-transcriptional and post-translational levels [3]. They include promoters [4], silencer [5], insulator [6], and cis- and trans-regulatory elements [7][8].
Enhancers are a major type of cis-regulatory element in the genome which increase the likelihood of transcription of one or more distally located genes [9]. The function of an enhancer was demonstrated for the first time in a non-coding region which contains two 72 base pair (bp) repeats of the simian virus 40 (SV40), which is able to drive efficient transcription of SV40 early genes [10][11][12][13]. Subsequently, it was found that enhancers were associated with genes that exhibit tissue-specific expression. The first cell type-specific enhancer was identified in mammalian B lymphocytes within the IgH locus [14][15][16]. Enormous progress has been made since the first discoveries about enhancer properties and their modus operandi. Enhancers serve as binding sites for transcription factors (TFs) which are necessary for the activation of target gene expression. Through chromatin looping, these regulatory factors are brought into direct physical contact with their target promoters [17][18] and thereby potentiate transcriptional initiation and elongation by interacting with the basal transcriptional machinery and the local chromatin remodeling of target genes [19][20]. More recently, it was found that active enhancer state is associated with the generation of bi-directional non-coding transcripts of the enhancer region known as enhancer RNAs (eRNAs) which are functionally required for its activity [20][21][22][23][24].
The heart is an essential organ which serves to circulate blood throughout the whole body. It is the first organ to form in the embryo. Heart development involves an extraordinary and precisely orchestrated series of molecular processes. The combination of complex morphogenetic events necessary for the formation of the heart results from an evolutionarily conserved gene regulatory network [25][26][27][28][29][30]. A core set of conserved cardiac TFs, such as NK2, MEF2, GATA, Hand and Tbx, is involved in the specification of cardiac cell fate, contractility, morphogenesis, segmentation, and growth of cardiac chambers [1][31]. This set, together with others TFs, govern and stabilize the developmental program of the heart [32][33][34]. Expansion and modifications of this ancestral cardiac network could give rise to hearts with higher structure and function complexity [35][36][37]. Homologs of the cardiac TFs were found in early multicellular organisms (~800 million years ago). These organisms possess a primitive coelom surrounded by cells which express NKX2.5/tinman, a TF necessary for the specification of cardiac fate in higher vertebrates [38]. The specialization of mesodermal cells around the coelom in the phylum Bilateria gave rise to the first primitive cardiac myocytes [39]. A further evolved heart structure is found in Protostomia (e.g., Drosophila) and Deuterostoma (e.g., Amphioxus), in the form of a peristaltic tubular heart arising from a monolayer of contracting mesoderm [38][40][41]. In the phylum Chordata and subsequently in the vertebrates, the primitive linear heart develops looping, unidirectional circulation, enclosed vasculature, and the conduction system. Fish and amphibians had further specializations with regional protein localization in cardiomyocytes, while, in reptiles, birds, and mammals, the heart is further sophisticated with the formation of septa and a four-chambered heart which has lost the ability to regenerate cardiomyocytes [38].
Cardiomyocytes (CMs) are the basic units of cardiac tissue which are specified from a pool of mesodermal progenitors located at the anterior portion of the embryonic lateral plate mesoderm [28][42]. Heart development initiates with the specification of cardiac cell fate with the expression of the homeobox TF tinman in invertebrates or Nkx2.5 in vertebrates. This TF is considered the earliest molecular marker of heart progenitors and is known to interact with the zinc finger TF of the GATA family. In flies, Tinman directly activates Mef2 gene, which encodes transcriptional elements that control myocyte differentiation [33]. The MADS-box protein MEF2 is the most ancient myogenic TF that, via specific combinations of cis-regulatory elements, regulates different muscle gene programs [1]. In zebrafish, the tinman-related gene orthologs nkx2.5 and nkx2.7 are responsible for the earliest step of cardiac genes initiation and regulate the expressions of tbx5 and tbx20 through the heart tube stage [43]. Bmp and Nodal signaling induce nkx2.5 expression and cardiogenic differentiation by inducing gata5 [44].
Following the specification of cardiac progenitors, the beating linear heart tube forms by the convergence of cardiac muscle cells along the ventral midline of the embryo. This linear cardiac structure is composed of extracellular matrix and myocardial and endocardial layers. The GATA family TFs play an essential role in forming the linear heart tube in both invertebrate and vertebrate species [45][46][47]. In vertebrate such as mice and zebrafish, the heart tube subsequently undergoes rightward looping which is essential for the alignment of the heart chambers. The rightward direction of cardiac looping is driven by asymmetric axial signals which include members of the transforming growth factor beta (TGF-β) family, Sonic hedgehog, and Nodal, which are expressed in the lateral mesoderm [44][48]. In mice, frogs, and in zebrafish, the TF Pitx2 is involved in the mediation of left-right signals, which is expressed along the developing organs [48][49]. In addition, in zebrafish, the asymmetric heart looping is controlled by both Nodal-dependent and -independent mechanisms [50].
Despite the wealth of knowledge on TFs regulating heart development, very few enhancers to which they bind and exert their functions have been identified. As TFs bind to enhancers to exert their function, mutations in these regulatory elements can equally affect developmental outcome as mutations in coding regions. Studies in humans have revealed a number of such mutations associated with increased risk to various heart diseases (reviewed in [51][52][53][54]). The discovery of enhancers regulating multiple steps of heart development and regeneration has been greatly facilitated by the development of various approaches which allow their large-scale discovery. These include genome-wide profiling techniques such as chromatin immunoprecipitation followed by DNA sequencing (ChIP-seq) [20][55][56][57][58], computational predictions [59][60], and in vivo assay [61][62][63][64]. These approaches have been used on various model organisms such as the fruit fly Drosophila melanogaster [65], mouse Mus musculus [66][67], zebrafish Danio rerio [68][69], and the African clawed frog Xenopus laevis [70][71].
The zebrafish (Danio rerio) was established as a model organism for genetic studies by George Streisinger at the Oregon University in the late 1960s [72]. Several unique biological properties make zebrafish an attractive model for studying heart development. The embryos are transparent, allowing direct in vivo imaging of the developing heart. The zebrafish embryos are not fully dependent upon a functional cardiovascular system, which allows loss of function analysis up to a relatively late developmental stage compared to mammals. Despite having only two chambers, the zebrafish heart develops by means of a mechanism conserved with its mammalian counterpart [73]. The high number of offspring and low cost of maintenance of the zebrafish render it a good model for genomics studies, harboring the potential for rapid discovery of enhancers and other genetic regulatory elements involved in heart development [44][69][74]. The zebrafish entered the genomic field relatively late compared to mammalian and other model organisms as evidenced by the lack of systematic efforts such as ENCODE [75] and modENCODE [65][76] to study its regulatory genetic elements. Nevertheless, such efforts are underway (DANIO-code) and more and more high-throughput techniques are employed to explore its genome [77].
Functional characterization of enhancers was traditionally performed by deletion mapping, in which putative regulatory regions upstream of transcription start sites (TSS) of candidate genes are tested by means of a reporter assay to determine its ability to drive specific expression pattern (Table 1). In the zebrafish, these reporter assays can be done in vivo, where a transgenic reporter construct carrying a green fluorescent protein (GFP) reporter can be directly injected into the embryo and its resulting expression observed live. In one of the earliest applications of the deletion mapping strategy in zebrafish, Meng and colleagues [78] characterized the promoter region of the zebrafish gata1 gene which encodes a TF that plays an important role in hematopoietic development. Using transgene constructs containing deleted and point mutated regions of the gata1 promoter previously identified in zebrafish [79], they identified distinct cis-acting elements that regulate gata1 transcription in various tissues. Systematic deletion on the 5.6 kb genomic fragment upstream of the zebrafish gata1 translation start codon, identified a CACCC box, located between −146 and −142 bp upstream of the translation start codon, which were critical for the initiation of gata1 expression. Furthermore, they showed that the hematopoietic expression of gata1 was maintained by double GATA motif in the distal region between −4635 and −4627 bp. In addition, a 49 bp element located 218 bp upstream of the CACCC element and a CCAAT box at −4643 bp adjacent to the double GATA motif enhanced the erythroid-specific activity of gata1 promoter. Interestingly, a region located between −1776 and −468 was found to repress the gata1 expression in the notochord (a nonhematopoietic tissue), making it one of the earliest discoveries of a repressive hematopoietic regulatory element in the zebrafish. In a similar approach, Muller and colleagues [80] investigated the cis-regulatory elements directing sonic hedgehog (shh) expression in the zebrafish embryo. They focused on the zebrafish shh region, employing an enhancer screening strategy based on co-injection of putative enhancer sequences with a reporter construct [81]. They identified three regulatory elements in introns 1 (ar-A and ar-B) and 2 (ar-C) that mediated floor plate and notochord expression. Deletion fine mapping strategy on ar-C delineated three sub regions of 40 bp essential for its activity. A T-box TF binding site was found in one of these subregions, but none of the three regions contained binding sites of Foxa2, which was previously shown to regulate of shh expression [82], suggesting that other regulatory mechanisms were involved in shh expression. Importantly, these enhancers were able to drive similar expression pattern in zebrafish as well as in mouse embryos, showing that the mechanism controlling shh expression in the midline were evolutionarily conserved. Both these studies therefore showed for the first time that the transient enhancer expression assay using zebrafish embryos could be exploited to identify novel regulatory elements in gene fragments.
Table 1. A list of methods known for enhancer identification.
| Biological Approaches | Work-Principle | Reference |
|---|---|---|
| Enhancer-Deletion Approach | Deletion of non-coding cis-regulatory DNA elements can severely disrupt the systemic functions. | [83] |
| Enhancer-Trap Assay | Through microinjection of embryos random integration of a vector-construct with minimal promoter and reporter gene, driving expression if enhancer is present. | [84][85][86][87][88] |
| Transient Transfection Assay | Luciferase reporter plasmid constructs containing promoter and 5′-flanking DNA sequence increases the luciferase expression in the presence of enhancer region. | [81] |
| High-Throughput Techniques | ||
| DNase I-seq | DNase I digestion and DHS fragments mostly comprises cis-regulatory regions (e.g., enhancers). | [89] |
| Epigenomic Profiling (ChIP-Seq) | Enrichment of H3K4me1, H3K4ac, and P300 histone modifications determines the active enhancers. | [20][55][56][60][90][91][92][93] |
| CAGE | High resolution map of TSS and bidirectional transcription patterns defines the precise location of enhancers. | [94][95] |
| NET-CAGE | Capturing 5′-ends of nascent transcripts by fusing two technologies helps to identify unstable transcripts (eRNA). | [8] |
| ATAC-Seq | Accessible chromatin regions encompass enhancer elements. | [58][96] |
The enhancer deletion mapping approach was applied to study the regulatory mechanism of gata4 [83], a TF which plays an essential role in specification of cardiomyocytes and formation of the linear heart tube [45][46][47]. A 14.8 kb fragment upstream of the gata4 transcription initiation site was found to drive GFP expression in both chambers and the valves of the zebrafish heart. Truncation of 7 kb of the distal sequences eliminated expression in the atrium and the atrioventricular valve, while expression was retained in the ventricle and bulboventricular valves. Within this 7 kb distal regulatory region, a 1300 bp region with a cluster of consensus binding sites for T-box TFs was delineated. Mutation of these binding sites significantly reduced reporter gene expression in the heart, providing the first evidence that T-box factors function by directly regulating gata4 expression. This study established that gata4 regulatory elements control gene expression differentially along the rostro-caudal axis and that T-box binding elements in the gata4 promoter contribute to heart-specific expression.
To perform large-scale discovery of enhancers, several methods were developed in model organisms, particularly the zebrafish (Table 1). One of these methods, known as enhancer trapping, offers ease of genetic manipulation by transgenesis (Figure 1). In 2000, Kawakami and colleagues developed a gene trap method in zebrafish using a modified Tol2 transposable element isolated from medaka fish (Oryzias latipes), which encodes a gene for a fully functional transposase capable of catalyzing transposition in the zebrafish germ lineage [84]. The Tol2 element could be inserted in the zebrafish genome and transmitted to the next generation with a low transgenic frequency which prevented to generate hundreds or thousands of transposon insertions. In 2004, they further optimized the Tol2 system, incorporating the Xenopus EF1α enhancer/promoter, the rabbit β-globin intron, the enhanced green fluorescent protein (EGFP) reporter gene, and the SV40 polyA signal, which results in EGFP expression that could be observed consistently up to the F4 generation [85]. This established a highly efficient transgenesis method with more than 50% of frequency compared to other methods using naked plasmid DNA or other transposon systems. Applying this technique, Kawakami and colleagues established a collection of transgenic zebrafish lines with a variety of EGFP expression patterns, including ubiquitous, and spatiotemporally restricted patterns. Unique reporter expression patterns were detected in the heart, forebrain, notochord, floor plate, neural crest, and other tissues, establishing a novel transposon-mediated gene trap approach in zebrafish and facilitating studies of vertebrate development and organogenesis. In a parallel effort, Parinov and colleagues also reported the application of an enhancer trap approach in zebrafish using the Tol2 transposon system [86]. They used an enhancer trap construct that carried the EGFP gene as reporter in live zebrafish embryos controlled by a partial promoter of the epithelial keratin4 (krt4) gene. In their initial screen, 37 founders (F0) transmitting the actively expressed EGFP gene to their offspring (F1) were identified. These founders were raised, outcrossed with wild-type, and analyzed up to the F2 generation. In total, they established 28 enhancer trap (ET) lines that exhibited distinct EGFP expression patterns apart from the basal expression from the modified krt4 promoter. The EGFP fluorescence was detected in a variety of tissues and organs, including the central nervous system (CNS), neural crest and its derivatives, notochord, heart, muscles, digestive organs, and kidney. This study, together with that of Kawakami and colleagues, demonstrated that the enhancer trap construct could produce high trapping frequency and specificity.

Figure 1. Enhancer trapping method. The synthetic transposase mRNA, the transposon donor plasmid containing transposable elements with a minimal promoter, and the gene encoding green fluorescent protein (GFP) are co-injected into fertilized zebrafish eggs. The construct is excised from the donor plasmid and integrated into the endogenous genome. The activity of the “trapped” enhancer can be visualized in the injected embryo when the inserted transgene is expressed under control of nearby enhancers.
Subsequently, Poon and colleagues [61] further screened the collection of zebrafish enhancer trap lines [86][87][88] and found 18 cardiac enhancer trap (CET) lines with EGFP expression in various part of the embryonic heart. They characterized the EGFP expression pattern in the embryonic heart in vivo using fast scanning confocal microscopy coupled with image reconstruction, producing three-dimensional movies in time. The transgenic lines exhibited EGFP expression in distinct cell layers of the heart, including the endocardium, myocardium, and epicardium. Subsequently, the genomic locations of the transposon insertions were identified by thermal asymmetric interlaced polymerase chain reaction (TAIL-PCR). This screen therefore established a collection of CET lines which could be utilized as a starting point for discovery of cardiac enhancers through further analysis. Furthermore, the cardiac EGFP expression is useful for in vivo studies of heart development.
Balciunas and colleagues [97] modulated the salmonid-originating Sleeping Beauty (SB) transposon-based transgenesis cassette [98] to establish an enhancer trapping in zebrafish. It belongs to the Tc1/mariner superfamily of transposons consisting of two components: the transposase enzyme and a transposon vector containing the terminal-inverted repeat/direct repeat (IR/DR) sequences which moves by a cut-and-paste mechanism. Optimization of this system allowed them to establish nine transgenic lines (ET1–ET9) with different tissue-specific patterns including various tissues of the nervous system, otic vesicle, and the heart in one of the lines (ET7). Detailed analysis on lines ET2 and ET7 revealed that the ET2 line harbors a transposon insertion in the gene encoding poly(ADP-ribose) glycohydrolase (PARG) expressed in caudal primary motoneurons. The GFP expression in the ET2 line recapitulated that of the endogenous gene (PARG), indicating that transgene expression was under control of an endogenous enhancer. The ET7 line had closely resembled part of the dual specificity phosphatase 6 (dusp6) expression domain, which is expressed in the midbrain-hindbrain boundary, forebrain, tailbud, branchial arches, developing ear, pectoral fin buds, and other tissues [99], suggesting that the enhancer trap transposon in that line was under control of a subset of dusp6 enhancer elements.
Despite its potential to discover a large number of enhancers, a major drawback of the enhancer trap approach is that the enhancers driving the specific expression are not always easy to identify. Only in some cases, analyses of expression by whole-mount RNA in situ hybridization demonstrated that EGFP expression patterns in enhancer trap lines reflected the tissue-specific expression of nearby genes. Nevertheless, these lines provide a valuable starting point for assays such as chromatin conformation capture, which would allow the identification of the enhancer responsible for driving the reporter expression. In addition, the enhancer trapping screens has generated a wealth of resources in the form of live transgenic markers for various tissues, which is useful for developmental studies.