The vitamin D metabolite 1α,25-dihydroxyvitamin D3 is the natural, high-affinity ligand of the transcription factor vitamin D receptor (VDR). In many tissues and cell types, VDR binds in a ligand-dependent fashion to thousands of genomic loci and modulates, via local chromatin changes, the expression of hundreds of primary target genes. Thus, the epigenome and transcriptome of VDR-expressing cells is directly affected by vitamin D. Vitamin D target genes encode for proteins with a large variety of physiological functions, ranging from the control of calcium homeostasis, innate and adaptive immunity, to cellular differentiation.
1. VDR: A Transcription Factor Activated by Vitamin D
Transcription factors are proteins that are able to bind sequence-specifically to genomic DNA and interact with other nuclear proteins, which modulates the activity of RNA polymerase II and mRNA production
[1][2]. Some of the approximately 1600 transcription factors encoded by the human genome are constitutively active and regulated primarily by their expression, while most of them are activated by extra- and intracellular signals. A few of these signal-dependent transcription factors are located either in a latent form in the cytosol and activated through translocation into the nucleus, or—most of them—are found in the nucleus and modulated in their activity by post-translational modifications, such as phosphorylation or acetylation. Furthermore, some members of the nuclear receptor superfamily have an additional mechanism of activation, which is a ligand-induced conformational change on the surface of their ligand-binding domain (LBD)
[3][4].
The inner surface of VDR’s LBD forms a ligand-binding pocket, where 40, mostly non-polar, amino acids snugly enclose the molecule 1,25(OH)2D3, so that it binds with an affinity of 0.1 nM
[5]. This is a very high affinity, even in comparison with other nuclear receptors
[6]. Ligand binding changes VDR’s interaction profile with many of the more than 50 nuclear proteins that have been reported to cooperate with the receptor
[7]. Some of these VDR-interacting proteins function as co-repressors, such as NCOR1 (nuclear receptor corepressor 1)
[8] or COPS2 (COP9 signalosome subunit 2, also called ALIEN)
[9], co-activators of the NCOA (nuclear receptor coactivator) family
[10], or members of the Mediator complex, such as MED1
[11][12][13] (
Figure 1). Other VDR partner proteins are chromatin-modifying enzymes, such as histone acetyltransferases (HATs)
[10], histone deacetylases (HDACs)
[9], lysine demethylases, such as KDM6B
[14] and KDM1A
[15] or chromatin remodeling proteins, such as BRD (bromodomain-containing) 7 and 9
[16]. The large variety of its protein interaction partners suggests that VDR is a dynamic member of a large nuclear protein complex
[17].
Figure 1. VDR as the key ligand-inducible component of a multi-protein complex. VDR is a part of a multi-protein complex that, e.g., contains co-receptors (RXR), pioneer factors (PU.1, CEBPα, GABPα, ETS1, RUNX2, BACH2), chromatin modifiers (KDM1A, KDM6B), chromatin remodelers (BRD7, BRD9), co-activators (MED1) and co-repressors (NCOR1, COPS2). The complex is activated through the binding of 1,25(OH)2D3 to VDR and attaches preferentially to DR3-type binding sites within enhancer regions. The mediator complex connects the activated VDR complex with the RNA polymerase II waiting on transcription start site (TSS) regions of vitamin D target genes. In most cases the linear distance of enhancer and TSS region are multiple kb, so that the intervening genomic DNA forms a regulatory loop. In this way the expression of the vitamin D target genes is either increased or decreased.
The low percentage of DR3-binding VDR complexes detected by ChIP-seq also suggests that there are a number of scenarios in which VDR acts independently of RXR. VDR may use other nuclear proteins as alternative cooperative binding partners on genomic DNA
[18][19][20] or may bind indirectly to DNA, such as “backpack”, to other transcription factors
[21]. For example, below VDR ChIP-seq peak binding sites for the pioneer transcription factor PU.1 (purine-rich box-1) are enriched
[22]. In fact, in THP-1 monocytic leukemia cells, the presence of PU.1 is observed on two-thirds of VDR’s genomic binding sites
[23]. This makes sense, since PU.1, VDR and the pioneer factor CEBPα (CCAAT enhancer binding protein alpha) are the key transcription factors directing the differentiation of myeloid progenitor cells into monocytes and granulocytes, during the process of hematopoiesis
[24]. Interestingly, THP-1 cells CEBPα
[25], GABPα (GA-binding protein transcription factor alpha)
[26] and ETS1 (ETS proto-oncogene 1, transcription factor)
[27] also co-locate with VDR binding sites and act as pioneer factors for vitamin D signaling (
Figure 1). Furthermore, in osteoblasts, vitamin D signaling is enhanced by the pioneer factors CEBPα and RUNX2 (RUNX family transcription factor 2)
[28], while in T cells, this is mediated by the transcription factor BACH2 (BTB domain and CNC homolog 2)
[29]. Thus, VDR uses help from RXR, but also from many other transcription factors, in order to form functional complexes with genomic DNA.
2. Vitamin D Target Gene Regulation in the Context of Chromatin
TADs are far larger loops of genomic DNA than regulatory loops, with a size of hundreds of kb to a few Mb
[30]. They subdivide the human genome into at least 2000 units, which are functionally insulated from each other
[31] (
Figure 2). The borders of TADs are defined by the binding of the chromatin-organizing protein CTCF (CCCTC-binding factor)
[32][33], forming together with cohesin and other proteins, so-called TAD anchors
[34]. The interaction of the VDR-bound enhancers with genomic regions outside of a TAD are prevented by these insulating TAD borders. This is the reason why genes are regulated almost exclusively by enhancers that are located within the same TAD. This also implies that the linear distance between VDR-bound enhancers and TSS region(s) cannot be larger than the size of the respective TAD.
Figure 2. Elements of chromatin. Different elements of chromatin are shown, such as densely packed heterochromatin, DNA loops, such as TADs that are anchored by CTCF proteins, accessible euchromatin, the structure of a single nucleosome, chromatin modification via histone acetylation and methylation as well DNA methylation and a TSS, from which RNA polymerase II starts gene transcription into mRNA.
The distribution of eu- and heterochromatin (including their specific epigenetic markers) of a given cell is referred to as its epigenetic landscape or epigenome
[35]. The epigenome is determined by patterns of DNA methylation, post-translational modification of histone tails and 3-dimensional chromatin organization
[36]. It depends on the activity of chromatin-modifying enzymes, such as DNA methyltransferases (DNMTs), which add methyl groups to cytosines within genomic DNA, TET (Tet methylcytosine dioxygenase) proteins that initiate DNA methylation, or HATs, HDACs, lysine methyltransferases (KTMs) and KDMs, which add or remove acetyl and methyl groups to histones
[37]. Histone acetylation is generally associated with transcriptional activation, but it is not important which exact amino acid is acetylated. In contrast, for histone methylation, the exact residue and its degree of methylation (mono-, di- or tri-methylation) is critical. Furthermore, the function of chromatin-remodeling enzymes is to shift or evict nucleosomes, in an ATP-dependent fashion. The projects ENCODE (
www.encodeproject.org, accessed on 5 March 2022)
[38] and Roadmap Epigenomics (
www.roadmapepigenomics.org, accessed on 5 March 2022)
[39] systematically assessed the epigenomes of more than 100 human cell lines, as well as primary cells, and serve as a reference for the epigenome of non-stimulated human tissues and cell types. However, in contrast to the static genome, the epigenome dynamically responds to intra- and extracellular signals, since chromatin modifying enzymes are often the endpoints of transduction cascades of peptide hormones, cytokines and growth factors
[40]. Thus, the response of the epigenome to signals, such as 1,25(OH)2D3, is even more important than its ground state.
Nuclear hormones, such as 1,25(OH)2D3, affect the epigenome via direct interaction of their receptors with chromatin-modifying enzymes, as well as through up- or down-regulating the genes encoding for chromatin modifiers. In this way:
-
Vitamin D affects histone markers for active chromatin, such as H3K27ac (acetylated histone H3 at lysine 27), and for TSS regions, such as H3K4me3 (tri-methylated histone H3 at lysine 4)
[25][29][41];
-
VDR initiates the demethylation of its binding sites via interaction with TET2
[42];
-
The accessibility of thousands of VDR-binding enhancer and TSS regions is affected by 1,25(OH)2D3
[43][44];
-
The binding of CTCF, to more than 1000 of its genomic sites, is modulated by 1,25(OH)2D3
[45];
-
The organization of some 400 TADs is dependent on 1,25(OH)2D3
[45], i.e., vitamin D affects the 3-dimensional chromatin structure.
Thus, there are multiple ways by which 1,25(OH)2D3 modulates the epigenome of its target tissues. Interestingly, some 1,25(OH)2D3-modulated chromatin loci, such as TSS regions, open only 2 h after ligand stimulation, while most sites take 24 h to reach maximal accessibility
[41][43]. This suggests that many effects of vitamin D on the epigenome are secondary, i.e., they are mediated by genes and proteins that are primary vitamin D targets
[46][47]. Nevertheless, the vitamin D-triggered effects on the epigenome facilitate the looping of VDR-bound enhancers, towards accessible TSS regions within the same TAD
[48]. This assembly of enhancer and TSS regions enables the formation of a large protein complex, containing VDR, nuclear adaptor proteins, chromatin-modifying enzymes and RNA polymerase II, modulating gene transcription (
Figure 1).
3. Vitamin D Target Genes
Vitamin D target genes are detected via a statistically significant change in their expression, within a given time frame (often 24 h), after ligand stimulation. Long-time known vitamin D target genes, such as BGLAP (bone gamma-carboxyglutamate protein, also called osteocalcin)
[49] or CAMP (cathelicidin antimicrobial peptide)
[50], were deduced based on the observation of the physiological effects of vitamin D, e.g., on calcium homeostasis or the defense against microbe infection, respectively. On the level of mRNA changes, vitamin D target genes were analyzed, initially, by single gene approaches, using northern blotting or quantitative PCR.
Time course analysis of vitamin D target genes allows one to classify them into rapidly responding (4–8 h) “primary” target genes and delayed-reacting “secondary” targets (
Figure 3A). Primary target genes are directly regulated by 1,25(OH)2D3-activated VDR, as described in the model of vitamin D signaling; i.e., these genes need to have, within the same TAD, a VDR-binding enhancer. In contrast, secondary target genes may be regulated by transcription factors, co-factors or chromatin modifiers, which are encoded by primary vitamin D target genes (
Figure 3A). Suitable proteins encoded by primary target genes are the transcription factors BCL6 (B-cell CLL/lymphoma 6), NFE2 (nuclear factor, erythroid 2), POU4F2 (POU class 4 homeobox 2) and ELF4 (E74-like factor 4) in THP-1 cells
[46], as well as IRF5 (interferon regulatory factor 5), MAFF (MAF BZIP transcription factor F), MYCL (MYCL proto-oncogene, BHLH transcription factor), NFXL1 (nuclear transcription factor, X-box binding-like 1) and TFEC (transcription factor EC), as well as the transcriptional co-regulators MAMLD1 (mastermind-like domain-containing 1), PPARGC1B (PPARG coactivator 1 beta), SRA1 (steroid receptor RNA activator 1) and ZBTB46 (zinc finger and BTB domain-containing 46) in human PBMCs (peripheral blood mononuclear cells)
[51]. In this way, secondary target genes do not have to carry a VDR-binding enhancer within their TADs. For example, although the time course study in human PBMCs used a strict statistical approach (threshold testing applying a fold change (FC) > 1.5 and 2), 662 vitamin D-responding genes were identified (FDR < 0.05), 179 of which are primary and 483 secondary targets
[51]. An alternative classification of the same set of genes suggests that 293 of them are direct and 369 indirect targets of vitamin D (
Figure 3B). Irrespective of the timing of their response to 1,25(OH)2D3, the expression change of direct targets is driven by VDR-bound enhancers, while indirect targets are primarily stabilized in their expression by 1,25(OH)2D3 and its receptor, against an up- or down-regulation by other transcription factors and/or their epigenetic effects.
Figure 3. Classification of vitamin D target genes. Primary vitamin D target genes are directly regulated by VDR, while secondary vitamin D targets are controlled by transcriptional regulators that are encoded by primary targets (
A). Time course analysis allows to differentiate vitamin D target genes in four different types based on cause and direction of expression change
[51] (
B). This suggests an alternative view on vitamin D signaling: 1,25(OH)2D3 either directly induces or reduces the expression of its target genes via VDR or prevents their expression change mediated by other factors. Red and grey lines indicate gene’s expression level in the presence or absence of 1,25(OH)2D3, respectively.
The model of vitamin D signaling illustrates the mechanisms of up-regulation of primary vitamin D target genes. However, the majority of vitamin D target genes are down-regulated, in particular, when cells are stimulated with 1,25(OH)2D3 for 24 h or longer. The down-regulation of a gene by vitamin D is only possible when the gene is first up-regulated by other transcription factors or epigenetic effects, mediated by chromatin modifiers. The most, likely mechanism of down-regulation of a gene is to block one or several of its up-regulating factors. Accordingly, the majority of down-regulated target genes could be classified as indirect targets, i.e., vitamin D counteracts their up-regulation rather than prominently down-regulating their expression
[51]. For example, VDR antagonizes pro-inflammatory transcription factors, such as NFAT, AP1 and NFκB, in immune cells
[52]. However, since each gene is up-regulated by an individual set of transcription factors and chromatin modifiers, there are also individual mechanisms of its down-regulation. Thus, there is no general model for describing the mechanism of the down-regulation of vitamin D target genes.
The expression of the majority of vitamin D target genes is less than 5-fold, up- or down-regulated (after a stimulation for 24 h with 1,25(OH)2D3); i.e., only a few genes respond with huge expression changes to vitamin D. For example, in THP-1 cells, the top five are the up-regulated genes CYP24A1, CAMP, TSPAN18 (tetraspanin 18), CD14 (CD14 molecule) and FBP1 (fructose-bisphosphatase 1), with an FC of expression ranging from 47 to 402
[43]. In human PBMCs, CYP24A1 and CAMP also have an FC of 409 and 158, respectively, the top up-regulated vitamin D target genes, but STEAP4 (STEAP4 metalloreductase), NRG1 (neuregulin 1) and CXCL10 (C-X-C motif chemokine ligand 10) show an FC of 471, 450 and 158, respectively, meaning comparable levels of down-regulation of their expression
[51]. This prominent down-regulation of expression is more remarkable than the up-regulation, since it is far easier to increase a very low basal expression than a down-regulation of a highly expressed gene. Nevertheless, one should remember that these in vitro 1,25(OH)2D3 experiments are designed for maximal effects and do not reflect the reality of the endocrinology of vitamin D in vivo
[53][54].