RNA polymerase II
RNA polymerase II (also called RNAP II and Pol II) is an enzyme found in eukaryotic cells. It catalyzes the transcription of DNA to synthesize precursors of mRNA and most snRNA and microRNA. A 550 kDa complex of 12 subunits, RNAP II is the most studied type of RNA polymerase. A wide range of transcription factors are required for it to bind to upstream gene promoters and begin transcription.
The eukaryotic core RNA polymerase II was first purified using transcription assays. The purified enzyme has typically 10-12 subunits (12 in humans and yeast) and is incapable of specific promoter recognition. Many subunit-subunit interactions are known.
- DNA-directed RNA polymerase II subunit RPB1 - an enzyme that in humans is encoded by the POLR2A gene. RPB1 is the largest subunit of RNA polymerase II. It contains a carboxy terminal domain (CTD) composed of up to 52 heptapeptide repeats (YSPTSPS) that are essential for polymerase activity. In combination with several other polymerase subunits, it forms the DNA binding domain of the polymerase, a groove in which the DNA template is transcribed into RNA. It strongly interacts with RPB8.
- RPB2 (POLR2B) - the second-largest subunit that in combination with at least two other polymerase subunits forms a structure within the polymerase that maintains contact in the active site of the enzyme between the DNA template and the newly synthesized RNA.
- RPB3 (POLR2C) - the third-largest subunit. Exists as a heterodimer with another polymerase subunit, POLR2J forming a core subassembly. RPB3 strongly interacts with RPB1-5, 7, 10-12.
- RNA polymerase II subunit B4 (RPB4) - encoded by the POLR2D gene is the fourth-largest subunit and may have a stress protective role.
- RPB5 - In humans is encoded by the POLR2E gene. Two molecules of this subunit are present in each RNA polymerase II. RPB5 strongly interacts with RPB1, RPB3, and RPB6.
- RPB6 (POLR2F) - forms a structure with at least two other subunits that stabilizes the transcribing polymerase on the DNA template.
- RPB7 - encoded by POLR2G and may play a role in regulating polymerase function. RPB7 interacts strongly with RPB1 and RPB5.
- RPB9 - The groove in which the DNA template is transcribed into RNA is composed of RPB9 (POLR2I) and RPB1.
- RPB11 - the RPB11 subunit is itself composed of three subunits in humans: POLR2J (RPB11-a), POLR2J2 (RPB11-b), and POLR2J3 (RPB11-c).
RPB3 is involved in RNA polymerase II assembly. A subcomplex of RPB2 and RPB3 appears soon after subunit synthesis. This complex subsequently interacts with RPB1. RPB3, RPB5, and RPB7 interact with themselves to form homodimers, and RPB3 and RPB5 together are able to contact all of the other RPB subunits, except RPB9. Only RPB1 strongly binds to RPB5. The RPB1 subunit also contacts RPB7, RPB10, and more weakly but most efficiently with RPB8. Once RPB1 enters the complex, other subunits such as RPB5 and RPB7 can enter, where RPB5 binds to RPB6 and RPB8 and RPB3 brings in RPB10, RPB 11, and RPB12. RPB4 and RPB9 may enter once most of the complex is assembled. RPB4 forms a complex with RPB7.
Enzymes can catalyze up to several million reactions per second. Enzyme rates depend on solution conditions and substrate concentration. Like other enzymes POLR2 has a saturation curve and a maximum velocity (Vmax). It has a Km (substrate concentration required for one-half Vmax) and a kcat (the number of substrate molecules handled by one active site per second). The specificity constant is given by kcat/Km. The theoretical maximum for the specificity constant is the diffusion limit of about 108 to 109 (M−1 s−1), where every collision of the enzyme with its substrate results in catalysis.
The turnover number for RNA polymerase II is 0.16 s−1 subject to concentration. Bacterial RNA polymerase, a relative of RNA Polymerase II, switches between inactivated and activated states by translocating back and forth along the DNA. Concentrations of [NTP]eq = 10 μM GTP, 10 μM UTP, 5 μM ATP and 2.5 μM CTP, produce a mean elongation rate, turnover number, of ~1 bp (NTP)−1 for bacterial RNAP, a relative of RNA polymerase II.
RNA Polymerase II is inhibited by α-amanitin.
RNA polymerase II holoenzyme is a form of eukaryotic RNA polymerase II that is recruited to the promoters of protein-coding genes in living cells. It consists of RNA polymerase II, a subset of general transcription factors, and regulatory proteins known as SRB proteins.
Part of the assembly of the holoenzyme is referred to as the preinitiation complex, because its assembly takes place on the gene promoter before the initiation of transcription. The mediator complex acts as a bridge between RNA polymerase II and the transcription factors.
Control by chromatin structure
This is an outline of an example mechanism of yeast cells by which chromatin structure and histone posttranslational modification help regulate and record the transcription of genes by RNA polymerase II.
This pathway gives examples of regulation at these points of transcription:
- Pre-initiation (promotion by Bre1, histone modification)
- Initiation (promotion by TFIIH, Pol II modification AND promotion by COMPASS, histone modification)
- Elongation (promotion by Set2, Histone Modification)
Please note that this refers to various stages of the process as regulatory steps. It has not been proven that they are used for regulation, but is very likely they are.
RNA Pol II elongation promoters can be summarised in 3 classes.
- Drug/sequence-dependent arrest-affected factors (Various interfering proteins)
- Chromatin structure-oriented factors (Histone posttranscriptional modifiers, e.g., HMTs)
- RNA Pol II catalysis-improving factors (Various interfering proteins and Pol II cofactors; see RNA polymerase II).
Protein Complexes Involved
Chromatin structure oriented factors:
(HMTs (Histone MethylTransferases)):
COMPASS§† - (COMplex of Proteins ASsociated with Set1) - Methylates lysine 4 of histone H3.
Set2 - Methylates lysine 36 of histone H3.
(interesting irrelevant example: Dot1*‡ - Methylates lysine 79 of histone H3.)
(Other): Bre1 - Ubiquinates (adds ubiquitin to) lysine 123 of histone H2B. Associated with pre-initiation and allowing RNA Pol II binding.
The N-terminus (also known as the amino-terminus, NH2-terminus, N-terminal end or amine-terminus) refers to the start of a protein or polypeptide terminated by an amino acid with a free amine group (-NH2). The convention for writing peptide sequences is to put the N-terminus on the left and write the sequence from N- to C-terminus. When the protein is translated from messenger RNA, it is created from N-terminus to C-terminus.
The N-terminus is the first part of the protein that exits the ribosome during protein biosynthesis. It often contains sequences that act as targeting signals, basically intracellular zip codes, that allow for the protein to be delivered to its designated location within the cell. The targeting signal is usually cleaved off after successful targeting by a processing peptidase. Some proteins are modified posttranslationally.
The C-terminus (also known as the carboxyl-terminus, carboxy-terminus, C-terminal end, or COOH-terminus) of a protein or polypeptide is the end of the amino acid chain terminated by a free carboxyl group (-COOH). The convention for writing peptide sequences is to put the C-terminal end on the right and write the sequence from N- to C-terminus.
Each amino acid has a carboxyl group and an amine group, and amino acids link to one another to form a chain by a dehydration reaction by joining the amine group of one amino acid to the carboxyl group of the next. Thus polypeptide chains have an end with an unbound carboxyl group, the C-terminus, and an end with an amine group, the N-terminus. Proteins are naturally synthesized starting from the N-terminus and ending at the C-terminus.
The C-terminus can contain retention signals for protein sorting. The most common ER retention signal is the amino acid sequence -KDEL (or -HDEL) at the C-terminus, which keeps the protein in the endoplasmic reticulum and prevents it from entering the secretory pathway.
The C-terminus of proteins can be modified posttranslationally, for example, most commonly by the addition of a lipid anchor to the C-terminus that allows the protein to be inserted into a membrane without having a transmembrane domain. With Pol II, the C-terminus of RPB1 is appended to form the C-terminal domain (CTD).
CTD of RNA polymerase
The carboxy-terminal domain of RNA polymerase II typically consists of up to 52 repeats of the sequence Tyr-Ser-Pro-Thr-Ser-Pro-Ser. Other proteins often bind the C-terminal domain of RNA polymerase in order to activate polymerase activity. It is the protein domain that is involved in the initiation of transcription, the capping of the RNA transcript, and attachment to the spliceosome for RNA splicing.
- RNA polymerase I
- RNA polymerase III
- RNA polymerase II holoenzyme
- Post-transcriptional modification
- Transcription (genetics)
- Eukaryotic transcription
- Meyer PA, Ye P, Zhang M, Suh MH, Fu J (Jun 2006). "Phasing RNA polymerase II using intrinsically bound Zn atoms: an updated structural model". Structure. 14 (6): 973–82. doi:10.1016/j.str.2006.04.003. PMID 16765890.
- Kornberg R (1999). "Eukaryotic transcriptional control". Trends in Cell Biology 9 (12): M46. doi:10.1016/S0962-8924(99)01679-7. PMID 10611681.
- Sims RJ 3rd, Mandal SS, Reinberg D (Jun 2004). "Recent highlights of RNA-polymerase-II-mediated transcription". Current opinion in cell biology 16 (3): 263–271. doi:10.1016/j.ceb.2004.04.004. ISSN 0955-0674. PMID 15145350.
- Sawadogo M, Sentenac A (1990). "RNA polymerase B (II) and general transcription factors.". Annu Rev Biochem. 59: 711–54. doi:10.1146/annurev.bi.59.070190.003431. PMID 2197989.
- Myer VE, Young RA (October 1998). "RNA polymerase II holoenzymes and subcomplexes". J. Biol. Chem. 273 (43): 27757–60. doi:10.1074/jbc.273.43.27757. PMID 9774381.
- Acker J, de Graaff M, Cheynel I, Khazak V, Kedinger C, Vigneron M (Jul 1997). "Interactions between the human RNA polymerase II subunits". J Biol Chem. 272 (27): 16815–21. doi:10.1074/jbc.272.27.16815. PMID 9201987.
- Brickey WJ, Greenleaf AL (June 1995). "Functional studies of the carboxy-terminal repeat domain of Drosophila RNA polymerase II in vivo". Genetics 140 (2): 599–613. PMC 1206638. PMID 7498740.
- "Entrez Gene: POLR2A polymerase (RNA) II (DNA directed) polypeptide A, 220kDa".
- "Entrez Gene: POLR2B polymerase (RNA) II (DNA directed) polypeptide B, 140kDa".
- Khazak V, Estojak J, Cho H, Majors J, Sonoda G, Testa JR, Golemis EA (May 1998). "Analysis of the interaction of the novel RNA polymerase II (pol II) subunit hsRPB4 with its partner hsRPB7 and with pol II". Mol Cell Biol. 18 (4): 1935–45. PMC 121423. PMID 9528765.
- "Entrez Gene: POLR2E polymerase (RNA) II (DNA directed) polypeptide E, 25kDa".
- "Entrez Gene: POLR2F polymerase (RNA) II (DNA directed) polypeptide F".
- "Entrez Gene: POLR2G polymerase (RNA) II (DNA directed) polypeptide G".
- "POLR2J3 polymerase (RNA) II (DNA directed) polypeptide J3".
- Kolodziej PA, Young RA (Sep 1991). "Mutations in the three largest subunits of yeast RNA polymerase II that affect enzyme assembly". Mol Cell Biol. 11 (9): 4669–78. PMC 361357. PMID 1715023.
- Jin J, Dong W, Guarino LA (Dec 1998). "The LEF-4 subunit of Baculovirus RNA polymerase has RNA 5'-triphosphatase and ATPase activities". J Virol. 72 (12): 10011–9. PMC 110520. PMID 9811739.
- Abbondanzieri EA, Greenleaf WJ, Shaevitz JW, Landick R, Block SM (Nov 2005). "Direct observation of base-pair stepping by RNA polymerase". Nature. 438 (7067): 460–5. doi:10.1038/nature04268. PMC 1356566. PMID 16284617.
- Meinhart A, Cramer P (July 2004). "Recognition of RNA polymerase II carboxy-terminal domain by 3'-RNA-processing factors". Nature 430 (6996): 223–6. doi:10.1038/nature02679. PMID 15241417.
- More information at Berkeley National Lab
- RNA Polymerase II at the US National Library of Medicine Medical Subject Headings (MeSH)