RNA Polymerase II | Encyclopedia MDPI

RNA Polymerase II: Comparison

Please note this is a comparison between Version 1 by Jennifer F. Kugel and Version 3 by Catherine Yang.

Central to the development and survival of all organisms is the regulation of gene expression, which begins with the process of transcription catalyzed by RNA polymerases. During transcription of protein-coding genes, the general transcription factors (GTFs) work alongside RNA polymerase II (Pol II) to assemble the preinitiation complex at the transcription start site, open the promoter DNA, initiate synthesis of the nascent messenger RNA, transition to productive elongation, and ultimately terminate transcription.

1. Introduction

Controlling gene expression is essential to normal growth, development, and sustained life. In metazoans, this requires regulating the spatial, temporal, and developmental expression of genes in a wide diversity of cell types. Mis-regulation of gene expression contributes to most disease states. The main control point for regulating gene expression is at the level of transcription. In eukaryotic cells, RNA polymerase II (Pol II) transcribes protein-coding genes into messenger RNA (mRNA) transcripts. Pol II also synthesizes long non-coding RNA (lncRNA) and most small nuclear RNA (snRNA) and microRNA (miRNA). Pol II transcription is vital for cell proliferation, proper expression of metabolic enzymes, signaling, cell fate, differentiation, gene expression, and nearly every cellular process. Although the RNA polymerase II transcription system is highly conserved across eukaryotes, this review is primarily focused on the human system, with some references to data from Drosophila and yeast systems.

The Pol II core enzyme can itself synthesize RNA using a template DNA, but promoter-specific transcription initiation requires the canonical general transcription factors (GTFs): TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIIH (Table 1). In addition, the large multi-subunit complex Mediator is essential for proper transcription in cells [1][2]. Chromatin remodeling/modifying complexes and additional co-regulatory factors function together with promoter-specific transcriptional activators and repressors to set the proper level and timing of transcription from individual genes in specific cell types.

Table 1. Summary of the general transcription factors and RNA polymerase II.

Protein	Subunits	Size (kDa)	Main Binding Partners	Function
TFIID	TBP, 13 TAFs	~1300	promoter, Pol II	Nucleates PIC assembly by binding multiple core promoter elements
TFIIA	TFIIAα, TFIIAβ, TFIIAγ	35, 19, and 12	TBP, TFIID	Stabilizes the TFIID-DNA interaction; enhances the effects of transcriptional co-activators
TFIIB	TFIIB	33	promoter, TBP, Pol II	Helps to define the start site of transcription and orient Pol II in the proper direction
TFIIF	RAP30, RAP74	30 and 74	promoter, Pol II, GTFs	Guides Pol II to the PIC and facilitates elongation
Pol II	Rpb1–Rpb12	~514	promoter, all GTFs	Catalyzes RNA synthesis; phosphorylation of the CTD tail of Rpb1 serves a regulatory role
TFIIE	TFIIEα, TFIIEβ	56 and 34	promoter, TFIIH, Pol II, TFIIF	Recruits TFIIH to the PIC; stimulates enzymatic activities of TFIIH; stabilizes the open DNA conformation
TFIIH	Core domain: XPD, XPB, p62, p52, p44, p34, p8; CAK domain: CDK7, MAT1, cyclin H	~500	downstream DNA, TFIIE, Pol II	CDK7 kinase phosphorylates the CTD; ATP-dependent XPB translocase opens the promoter DNA

2. RNA Polymerase II

Pol II is a large ~500 kDa complex made up of 12 protein subunits, named Rpb1–12. Studies have shown that 10 of the 12 subunits form the catalytic core of the Pol II complex and are either identical (Rpb5, 6, 8, 10, 12) or highly similar (Rpb1–3, 9, 11) to subunits found in RNA polymerase I and RNA polymerase III, which transcribe primarily rRNAs and tRNAs, respectively [3][4]. Recent ChIP-seq (chromatin immunoprecipitation followed by high-throughput sequencing) and mass spectrometry studies have shown that different sets of Rpb subunits differentially regulate select subsets of human genes, demonstrating the dense layers of regulation within the Pol II complex itself [5]. Crystal structures of yeast Pol II, and more recently cryo-EM structures of human and yeast Pol II, have revealed that the Pol II complex can be divided into the core, shelf, jaw lobe, and clamp structural domains, which interact with each other and undergo conformational changes during the stages of transcription (these structures have been extensively reviewed in the literature [6][7][8][9]). The core domain contains Rpb3 and Rpb10–12 as well as the positively charged active center cleft, formed by Rpb1 and Rpb2 [10][11]. The active site of Pol II is buried deep at the base of the active center cleft, so the template DNA strand must be translocated to the active site after entering the cleft. The shelf and jaw lobe elements have little observed movement but can rotate parallel to the active center cleft [11]. The clamp domain is connected to the active center cleft in the core domain through an array of flexible switches, and it swings nearly 30 Å upon opening or closing the cleft [3][11]. While not considered part of the catalytic core of Pol II, binding of the Rpb4 and Rpb7 subunits has been shown to be vital for maintaining the closed conformation of the Pol II clamp over the DNA during initiation [12][13]. It is hypothesized that the closing of this clamp domain over the cleft, coupled with DNA distortion, may facilitate promoter melting [14].

At 250 kDa, Rpb1 is the largest of all Pol II subunits and the principal catalytic subunit of the Pol II complex [15]. Beyond its catalytic role, Rpb1 plays a regulatory role in the transcription cycle that is mediated by its unstructured C-terminal domain (CTD). The CTD consists of a long tail comprising heptapeptide repeats of the consensus sequence YSPTSPS, with minor variability at the Ser7 position in repeats near the C-terminus [16][17]. Mammalian Pol II contains 52 repeats, with the number of repeats varying among different organisms in a manner that loosely correlates with genomic complexity [18]. The YSPTSPS consensus sequence is well conserved across eukaryotes, emphasizing the functional importance of each residue [18]. The CTD tail is not necessary for basal (i.e., unregulated) Pol II transcription in vitro [19][20]; however, it is required for accurate Pol II transcription and proper termination in cells [21][22][23]. The CTD is thought to function as a binding platform for numerous other protein complexes that help regulate steps in transcription and co-transcriptional processes, including RNA splicing and transcription termination [24][25][26].
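
The repeat structure described above can be made concrete with a minimal Python sketch. It assumes an idealized CTD built from 52 perfect copies of the consensus heptad; the real human CTD deviates from the consensus in some repeats, so the sequence and counts below are illustrative only. The function and constant names are hypothetical and chosen for this example.

```python
# Idealized model of the Pol II CTD: every repeat is the consensus heptad.
# ASSUMPTION: real CTDs contain non-consensus repeats, which this sketch ignores.
CONSENSUS_HEPTAD = "YSPTSPS"
MAMMALIAN_REPEAT_COUNT = 52  # repeat number stated in the text

# One-letter to three-letter codes for the residues that can be phosphorylated.
PHOSPHO_RESIDUES = {"Y": "Tyr", "S": "Ser", "T": "Thr"}


def phosphorylatable_sites(heptad=CONSENSUS_HEPTAD):
    """Return labels such as 'Ser5' for each Tyr, Ser, or Thr in one heptad."""
    return [
        f"{PHOSPHO_RESIDUES[aa]}{position}"
        for position, aa in enumerate(heptad, start=1)
        if aa in PHOSPHO_RESIDUES
    ]


def idealized_ctd(repeats=MAMMALIAN_REPEAT_COUNT):
    """Concatenate the consensus heptad `repeats` times."""
    return CONSENSUS_HEPTAD * repeats


sites = phosphorylatable_sites()
ctd = idealized_ctd()
print(sites)                                # ['Tyr1', 'Ser2', 'Thr4', 'Ser5', 'Ser7']
print(len(ctd))                             # 364 residues in the idealized tail
print(len(sites) * MAMMALIAN_REPEAT_COUNT)  # 260 candidate phosphorylation sites
```

Under this idealization, five of the seven positions in each heptad are potential phosphorylation sites, which gives a sense of how many distinct modification patterns the tail could in principle carry.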

The residues within the heptapeptide repeat are substrates for many post-translational modifications, with phosphorylation being the best characterized. The Tyr, Ser, and Thr residues can be reversibly phosphorylated/dephosphorylated, allowing for regulation of Pol II activity through the transcription reaction and of Pol II CTD affinity for various regulatory factors [16][17][24] (Figure 1). For example, the level of specific phosphorylation marks varies across different stages of transcription, depending on the purpose of the modification. The use of phospho-specific antibodies coupled to ChIP-seq, in addition to in vitro work, has enhanced understanding of how CTD phosphorylation patterns change throughout the transcription cycle. Pol II is recruited to preinitiation complexes on promoter DNA in a hypo-phosphorylated form. Phosphorylation of Ser5 by the CDK7 kinase subunit of TFIIH (which is one of the general transcription factors discussed below) facilitates initiation of transcription. The Ser5 mark is removed as Pol II moves through the gene body. As a counterpoint, Ser2 phosphorylation predominantly accumulates after initiation to help recruit elongation and RNA processing factors and peaks at the 3’ ends of genes, where it is thought to facilitate termination [27][28][29][30]. Beyond Ser5 and Ser2, other sites of phosphorylation on the Pol II CTD include Tyr1, Thr4, and Ser7, which have not been studied in the same detail as Ser2 and Ser5. Research has shown that the significance of these three residues can vary across species, with metazoan and yeast systems sometimes exhibiting different behaviors [31][32]. Exploring the function of these other important Pol II CTD residues provides numerous areas for future study.

Figure 1. Phosphorylation state of the Pol II CTD is regulated during transcription. As Pol II transcribes through a gene and progresses through the stages of transcription (shown from left to right), different phosphorylation marks are added or removed to promote unique functions. The phosphorylation patterns shown here pertain to human Pol II; other organisms may exhibit slight differences in these patterns. TSS, transcription start site; PAS, polyadenylation site.
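
The Ser5 and Ser2 pattern described above can be summarized as a small lookup table. The following is a rough sketch: the stage names and the qualitative labels ("low", "high", "falling", "rising") are assumptions made for illustration, not measured values, and only the two best-studied marks are included.

```python
# Qualitative summary of Ser5 and Ser2 phosphorylation across the transcription cycle.
# ASSUMPTION: stage names and level labels are illustrative, not quantitative data.
CTD_MARKS_BY_STAGE = {
    # Pol II joins the preinitiation complex in a hypo-phosphorylated form.
    "recruitment": {"Ser5P": "low", "Ser2P": "low"},
    # CDK7 (a TFIIH subunit) phosphorylates Ser5, facilitating initiation.
    "initiation": {"Ser5P": "high", "Ser2P": "low"},
    # Ser5P is removed through the gene body while Ser2P accumulates.
    "elongation": {"Ser5P": "falling", "Ser2P": "rising"},
    # Ser2P peaks at the 3' ends of genes.
    "three_prime_end": {"Ser5P": "low", "Ser2P": "high"},
}


def dominant_mark(stage):
    """Return the mark labeled 'high' at a stage, or None if there is none."""
    for mark, level in CTD_MARKS_BY_STAGE[stage].items():
        if level == "high":
            return mark
    return None


for stage in CTD_MARKS_BY_STAGE:
    print(stage, dominant_mark(stage))
```

Tyr1, Thr4, and Ser7 are left out of the table on purpose, since the text notes that their roles are less well characterized and can differ between species.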