A catalytic triad usually refers to the three amino acid residues that function together at the centre of the active site of certain hydrolase and transferase enzymes (e.g. proteases, amidases, esterases, acylases, lipases and β-lactamases). A common method for generating a nucleophilic residue for covalent catalysis is by using an Acid-Base-Nucleophile triad. The residues form a charge-relay network to polarise and activate the nucleophile, which attacks the substrate, forming a covalent intermediate which is then hydrolysed to regenerate free enzyme. The nucleophile is most commonly serine or cysteine but occasionally threonine.
Because enzymes fold into complex three-dimensional shapes, the residues of a catalytic triad can be far from each other along the amino-acid sequence (primary structure), however, they are brought close together in the final fold.
As well as divergent evolution of function (and even the triad's nucleophile), catalytic triads show some of the best examples of convergent evolution. Chemical constraints on catalysis have led to the same catalytic solution independently evolving in at least 23 separate superfamilies. Their mechanism of action is consequently one of the best studied in all of biochemistry.
- 1 History
- 2 The identity of triad members
- 3 Examples of triads
- 4 Comparison of serine and cysteine hydrolase mechanisms
- 5 Divergent evolution
- 6 Convergent evolution
- 7 See also
- 8 References
The structures of trypsin and chymotrypsin were first solved in the 1930s. The serine triad member of trypsin and chymotrypsin was identified as the nucleophile (by disopropyl fluorophosphate modification) in the 1950s. Other protease sequences were aligned in the 1960s to reveal a family of related proteases, now called the S1 family. Simultaneously, the structures of the evolutionarily unrelated papain and subtilisin proteases were found to contain analogous triads. The 'charge-relay' mechanism for the activation of the nucleophile by the other triad members was proposed in the late 1960s. As more protease structures were solved by X-ray crystallography in the 1970s and 80s, homologous (such as TEV protease) and analogous (such as papain) triads were found. The MEROPS classification system in the 1990s and 2010s started to class proteases into structurally related enzyme superfamilies and so acts as a database of the convergent evolution of triads in over 20 superfamilies. Understanding of the chemical constraints on evolution that have led to the convergence of so many enzyme families on the same triad geometries has begun to be understood in the 2010s. The massive body of work on the charge-relay, covalent catalysis of catalytic triads has led to the mechanism as being the best characterised in all of biochemistry.
The identity of triad members
The side-chain of the nucleophilic residue performs covalent catalysis on the substrate. The lone pair of electrons present on the oxygen or sulphur attack the electropositive carbonyl carbon. The 20 naturally occurring biological amino acids do not contain sufficiently nucleophilic functional groups for many difficult catalytic reactions. The most commonly used nucleophiles are the alcohol (OH) of serine and the thiol/thiolate ion (SH/S-) of cysteine. Embedding the nucleophile in a triad makes it more catalytically active. A few proteases use the secondary alcohol of threonine, however, due to the extra methyl group, such proteases use the N-terminal amide as the base, rather than a separate amino acid.
Since no natural amino acids are strongly nucleophilic, the base in a catalytic triad polarises and deprotonates the nucleophile to increase its reactivity. Additionally, it protonates the first product to aid leaving group departure. It is most commonly histidine since its pKa allows for effective base catalysis as well as both hydrogen bonding to the acid residue and deprotonating the nucleophile residue. β-lactamases such as TEM-1 use a lysine residue as the base. Because lysine's pKa is so high (pKa=11), a glutamate and several other residues act as the acid to stabilise its deprotonated state during the catalytic cycle. In order to avoid steric clashes, threonine proteases use their N-terminal amide as the base, to increase the reactivity of the catalytic threonine residue.
The acidic residue aligns and polarises the basic residue. It is commonly aspartate or glutamate. Some enzymes act only as a dyad as the acid member of the triad can be less necessary for cysteine proteases. For example papain uses asparagine as its third triad member which orients the histidine base but cannot act as an acid. Similarly, hepatitus A virus protease contains an ordered water in the position where an acid residue should be. Lastly, cytomegalovirus proteases uses a pair of histidines, one as the base, as usual, and one as the acid. The second histidine is not as effective an acid as the more common aspartate or glutamate, leading to a lower catalytic efficiency.
Examples of triads
- Chymotrypsin binds its substrate, an exposed loop containing a large hydrophobic residue.
- The aspartate is hydrogen bonded (possibly low-barrier hydrogen bond) with histidine, increasing the pKa of its imidazole nitrogen from 7 to about 12. This allows the histidine to act as a powerful general base, and deprotonate serine.
- The serine serves as a nucleophile, attacking the carbonyl carbon and forcing the carbonyl oxygen to accept an electron, leading to a tetrahedral intermediate. This intermediate is stabilized by an oxanion hole, involving the backbone amide of serine.
- Collapse of this intermediate back to a carbonyl causes histidine to donate its proton to the nitrogen attached to the alpha carbon. The nitrogen and the attached C-terminal peptide fragment leave by diffusion.
- A water molecule then donates a proton to histidine and the remaining OH- attacks the carbonyl carbon, forming another tetrahedral intermediate. The OH is a poorer leaving group than the C-terminal fragment, so, when the tetrahedral intermediate collapses again, the enzyme's serine leaves, regaining a proton from histidine.
- The N-terminus of the cleaved peptide now leaves by diffusion.
The same triad has also convergently evolved in α/β hydrolases such as some lipases and esterases, however the chirality is reversed. Additionally, brain acetyl hydrolase (which has the same fold as a small G-protein has also been found to have this triad. The equivalent Serine-Histidine-Glutamate triad is used in acetylcholinesterase.
Several families of cysteine proteases use this triad set, for example TEV protease (Superfamily PA, Family C4) and papain (Superfamily CA, Family C1). The triad acts similarly to serine protease triads, with notable differences discussed in 'Comparison of serine and cysteine hydrolase mechanisms'. It is still unclear how important the Asp of the papain triad is to catalysis and several cysteine proteases are effectively dyads (e.g. hepatitis A virus protease).
The triad of cytomegalovirus protease (Superfamily SH, Family S21) uses histidine as both the acid and base triad members. Removing the acid histidine only results in a 10-fold activity loss (compared to >10,000-fold when aspartate is removed from chymotrypsin). This triad has been interpreted as a possible way of generating a less active enzyme to control cleavage rate.
An unusual triad is found in seldolisin proteases (Superfamily SB, Family S53). The low pKa of the glutamate carboxylate group means that it only acts as a base in the triad at very low pH. The triad is hypothesised to be an adaptation to specific environments like acidic hot springs (e.g. kumamolysin) or cell lysosome (e.g. tripeptidyle peptidase).
Threonine proteases, such as the proteasome protease subunit (Superfamily PB, Family T1) and ornithine acyltransferases (Superfamily PE, Family T5) use the secondary alcohol of threonine in an manner analogous to the use of the serine primary alcohol. However, due to the steric interference of the extra methyl group of threonine, the base member of the triad is the N-terminal amide which polarises an ordered water which, in turn, deprotonates the catalytic alcohol to increase its reactivity.
Ser-Nterm and Cys-Nterm
In a similar manner to threonine proteases, there exist equivalent 'serine only' and 'cysteine only' configurations such as penicillin acylase G (Superfamily PB, Family S45) and penicillin acylase V (Superfamily PB, Family S59) which are evolutionarily related to the proteasome proteases. Again, these use their N-terminal amide as a base.
This unusual triad occurs only in one superfamily of amidases. In this case, The lysine acts to polarise the middle serine. The middle serine then forms two strong hydrogen bonds to he nucleophilic serine to activate it (one with the side chain alcohol and the other with the backbone amide). The middle serine is held in an unusual cis orientation to facilitate precise contacts with the other two triad residues. The triad is further unusual in that the lysine and cisserine both act as the base in activating the catalytic serine but the same lysine also performs the role of the acid member as well as making key structural contacts.
Comparison of serine and cysteine hydrolase mechanisms
Nucleophilic enzymes use an interconnected set of active site residues to achieve catalysis. The sophistication of the active site network causes residues involved in catalysis, and residues in contact with these, to be the most evolutionarily conserved within their families. In catalytic triads, the most common nucleophiles are serine (an alcohol) or cysteine (a thiol). Compared to oxygen, sulphur’s extra d orbital makes it larger (by 0.4 Å), softer, form longer bonds (dC-X and dX-H by 1.3-fold) and have lower pKa (by 5 units). Here I concentrate on chemical differences between cysteine and serine proteases on catalytic chemistry, however similar issues affect hydrolases and transferases in general.
The pKa of cysteine is low enough that some cysteine proteases (e.g. papain) have been shown to exist as an S- thiolate ion in the ground state enzyme (a) and many even lack the acidic triad member (b). Serine is also more dependent on other residues to reduce its pKa for concerted deprotonation with catalysis (c) by optimal orientation of the acid-base triad members (d). The low pKa of cysteine works to its disadvantage in the resolution of the first tetrahedral intermediate as unproductive reversal of the original nucleophilic attack is the more favourable breakdown product. The triad base is therefore preferentially oriented to protonate the leaving group amide (e) to ensure that it is ejected to leave the enzyme sulphur covalently bound to the substrate N-terminus. Finally, resolution of the acyl-enzyme (to release the substrate C-terminus) requires serine to be re-protonated (f) whereas cysteine can leave as S-.
Sterically, the sulphur of cysteine also has longer bonds and a bulkier Van der Waals radius to fit in the active site and a mutated nucleophile can be trapped in unproductive orientations. For example the crystal structure of thio-trypsin indicates that cysteine points away from the substrate, instead forming interactions with the oxyanion hole.
The evolutionary specialisation of enzymes around the needs of their nucleophile makes it unsurprising that nucleophiles cannot be interconverted in extant proteases (nor in most other enzymes) and the large activity reductions (>104) observed can be explained as a result of compromised reactivity or structural misalignment.
Despite the chemical differences described above, it is clear that some protease superfamilies have evolved to use different nucleophiles though divergent evolution. This can be inferred because of several superfamilies (with the same fold) contain families that use different nucleophiles, indicating that nucleophile switches have occurred several times during evolutionary history, however the evolutionary mechanisms by which this can happen are still unclear.
|PA clan||C3, C4, C24, C30, C37, C62, C74, C99||TEV protease (Tobacco etch virus)|
|S1, S3, S6, S7, S29, S30, S31, S32, S39, S46, S55, S64, S65, S75||Chymotrypsin (mammals, e.g. bos taurus)|
|PB clan||C44, C45, C59, C69, C89, C95||amidophosphoribosyltransferase precursor (Homo sapiens)|
|S45, S63||penicillin G acylase precursor (Escherichia coli)|
|T1, T2, T3, T6||archaean proteasome, beta component (Thermoplasma acidophilum)|
|PC clan||C26, C56||gamma-glutamyl hydrolase (Rattus norvegicus)|
|S51||dipeptidase E (Escherichia coli)|
|PD clan||C46||hedgehog protein (Drosophila melanogaster)|
|N9, N10, N11||intein-containing V-type proton ATPase catalytic subunit A (Saccharomyces cerevisiae)|
|PE clan||P1||DmpA aminopeptidase (Ochrobactrum anthropi)|
|T5||Ornithine acetyltransferase precursor (Saccharomyces cerevisiae)|
The enzymology of proteases provides some of the clearest examples of convergent evolution. The same geometric arrangement of triad residues have independently evolved over 20 times (in separate enzyme superfamilies). This is because there are limited productive ways to arrange three triad residues, the enzyme backbone and the substrate. These examples reflect the intrinsic chemical constraints on enzymes, leading evolution to independently converge on equivalent solutions repeatedly.
Cysteine and serine hydrolases
Serine and cysteine proteases use different amino acid functional groups (alcohol or thiol) as a nucleophile. In order to activate that nucleophile, they orient an acidic and basic residue in a catalytic triad. The chemical and physical constraints on enzyme catalysis have caused identical triad arrangements to have evolved independently over 20 times in different enzyme superfamilies.
The same triad geometries been converged upon by serine proteases such as chymotrypsin and subtilisin superfamilies. Similarly, the same has occurred with cysteine proteases such as viral C3 protease and papain superfamilies. Importantly, due to the mechanistic similarities in cysteine and serine proteases, all of these triads have converged to almost the same arrangement.
Threonine proteases use the amino acid threonine as their catalytic nucleophile. Unlike cysteine and serine, threonine is a secondary alcohol (i.e. has a methyl group). The methyl group of threonine greatly restricts the possible orientations of triad and substrate as the methyl clashes with either the enzyme backbone or histidine base. Consequently, most threonine proteases use an N-terminal threonine in order to avoid such steric clashes.
Several evolutionarily independent enzyme superfamilies with different protein folds use the N-terminal residue as a nucleophile. Firstly they occurs in Superfamily PB (proteosomes using the Ntn fold) and secondly in Superfamily PE (acetyltransferases using the DOM fold) This commonality of active site in completely different protein folds indicates that the active site evolved convergently in those superfamilies.
|Superfamily||Threonine protease families||Examples|
|PB clan||T1, T2, T3, T6||archaean proteasome, beta component (Thermoplasma acidophilum)|
|PE clan||T5||ornithine acetyltransferase (Saccharomyces cerevisiae)|
- Enzyme catalysis
- Functional groups
- Enzyme superfamily
- PA clan
- Convergent evolution
- Divergent evolution
- Dodson, G; Wlodawer, A (September 1998). "Catalytic triads and their relatives.". Trends in Biochemical Sciences 23 (9): 347–52. doi:10.1016/S0968-0004(98)01254-7. PMID 9787641.
- Buller, AR; Townsend, CA (Feb 19, 2013). "Intrinsic evolutionary constraints on protease structure, enzyme acylation, and the identity of the catalytic triad.". Proceedings of the National Academy of Sciences of the United States of America 110 (8): E653–61. doi:10.1073/pnas.1221050110. PMID 23382230.
- Perutz, Max (1992). Protein structure. New approaches to disease and therapy. New York: W.H. Freeman and Co.
- "Proteolytic enzymes past and present: the second golden era. Recollections, special section in honor of Max Perutz.". Protein Sci 3 (10): 1734–9. Oct 1994. doi:10.1002/pro.5560031013. PMID 7849591.
- Ohman, KP; Hoffman, A; Keiser, HR (April 1990). "Endothelin-induced vasoconstriction and release of atrial natriuretic peptides in the rat.". Acta physiologica Scandinavica 138 (4): 549–56. doi:10.1111/j.1748-1716.1990.tb08883.x. PMID 2141214.
- Dixon, Gordon H.; Kauffman, Dorothy L.; Neurath, Hans (5 March 1958). Journal of the American Chemical Society 80 (5): 1260–1261. doi:10.1021/ja01538a059.
- WALSH, KA; NEURATH, H (October 1964). "TRYPSINOGEN AND CHYMOTRYPSINOGEN AS HOMOLOGOUS PROTEINS.". Proceedings of the National Academy of Sciences of the United States of America 52: 884–9. PMID 14224394.
- de Haën, C; Neurath, H; Teller, DC (Feb 25, 1975). "The phylogeny of trypsin-related serine proteases and their zymogens. New methods for the investigation of distant evolutionary relationships.". Journal of Molecular Biology 92 (2): 225–59. PMID 1142424.
- Lesk, AM; Fordham, WD (May 10, 1996). "Conservation and variability in the structures of serine proteinases of the chymotrypsin family.". Journal of Molecular Biology 258 (3): 501–37. doi:10.1006/jmbi.1996.0264. PMID 8642605.
- Blow, DM; Birktoft, JJ; Hartley, BS (Jan 25, 1969). "Role of a buried acid group in the mechanism of action of chymotrypsin.". Nature 221 (5178): 337–40. PMID 5764436.
- Gorbalenya, AE; Blinov, VM; Donchenko, AP (Jan 6, 1986). "Poliovirus-encoded proteinase 3C: a possible evolutionary link between cellular serine and cysteine proteinase families.". FEBS Letters 194 (2): 253–7. PMID 3000829.
- Bazan, JF; Fletterick, RJ (November 1988). "Viral cysteine proteases are homologous to the trypsin-like family of serine proteases: structural and functional implications.". Proceedings of the National Academy of Sciences of the United States of America 85 (21): 7872–6. doi:10.1073/pnas.85.21.7872. PMC 282299. PMID 3186696.
- Phan, J; Zdanov, A; Evdokimov, AG; Tropea, JE; Peters HK, 3rd; Kapust, RB; Li, M; Wlodawer, A; Waugh, DS (Dec 27, 2002). "Structural basis for the substrate specificity of tobacco etch virus protease.". The Journal of Biological Chemistry 277 (52): 50564–72. doi:10.1074/jbc.M207224200. PMID 12377789.
- Rawlings, N.D. & Barrett, A.J. (1993) Evolutionary families of peptidases. Biochem J 290, 205-218.
- Rawlings ND, Barrett AJ, Bateman A (January 2010). "MEROPS: the peptidase database". Nucleic Acids Res. 38 (Database issue): D227–33. doi:10.1093/nar/gkp971. PMC 2808883. PMID 19892822.
- Ekici, OD; Paetzel, M; Dalbey, RE (December 2008). "Unconventional serine proteases: variations on the catalytic Ser/His/Asp triad configuration.". Protein science : a publication of the Protein Society 17 (12): 2023–37. doi:10.1110/ps.035436.108. PMID 18824507.
- Damblon, C; Raquet, X; Lian, LY; Lamotte-Brasseur, J; Fonze, E; Charlier, P; Roberts, GC; Frère, JM (Mar 5, 1996). "The catalytic mechanism of beta-lactamases: NMR titration of an active-site lysine residue of the TEM-1 enzyme.". Proceedings of the National Academy of Sciences of the United States of America 93 (5): 1747–52. PMID 8700829.
- Jelsch, C; Lenfant, F; Masson, JM; Samama, JP (Mar 9, 1992). "Beta-lactamase TEM1 of E. coli. Crystal structure determination at 2.5 A resolution.". FEBS Letters 299 (2): 135–42. PMID 1544485.
- Brannigan, JA; Dodson, G; Duggleby, HJ; Moody, PC; Smith, JL; Tomchick, DR; Murzin, AG (Nov 23, 1995). "A protein catalytic framework with an N-terminal nucleophile is capable of self-activation.". Nature 378 (6555): 416–9. doi:10.1038/378416a0. PMID 7477383.
- Cheng, H; Grishin, NV (July 2005). "DOM-fold: a structure with crossing loops found in DmpA, ornithine acetyltransferase, and molybdenum cofactor-binding domain.". Protein science : a publication of the Protein Society 14 (7): 1902–10. doi:10.1110/ps.051364905. PMID 15937278.
- Shin, S; Yun, YS; Koo, HM; Kim, YS; Choi, KY; Oh, BH (Jul 4, 2003). "Characterization of a novel Ser-cisSer-Lys catalytic triad in comparison with the classical Ser-His-Asp triad.". The Journal of Biological Chemistry 278 (27): 24937–43. doi:10.1074/jbc.M302156200. PMID 12711609.
- Halabi, N; Rivoire, O; Leibler, S; Ranganathan, R (Aug 21, 2009). "Protein sectors: evolutionary units of three-dimensional structure.". Cell 138 (4): 774–86. doi:10.1016/j.cell.2009.07.038. PMID 19703402.
- McGrath, ME; Wilke, ME; Higaki, JN; Craik, CS; Fletterick, RJ (Nov 28, 1989). "Crystal structures of two engineered thiol trypsins.". Biochemistry 28 (24): 9264–70. PMID 2611228.
- Polgár, L; Asbóth, B (Aug 7, 1986). "The basic difference in catalyses by serine and cysteine proteinases resides in charge stabilization in the transition state.". Journal of Theoretical Biology 121 (3): 323–6. PMID 3540454.
- Beveridge, AJ (July 1996). "A theoretical study of the active sites of papain and S195C rat trypsin: implications for the low reactivity of mutant serine proteinases.". Protein science : a publication of the Protein Society 5 (7): 1355–65. doi:10.1002/pro.5560050714. PMID 8819168.
- Abrahmsén, L; Tom, J; Burnier, J; Butcher, KA; Kossiakoff, A; Wells, JA (Apr 30, 1991). "Engineering subtilisin and its substrates for efficient ligation of peptide bonds in aqueous solution.". Biochemistry 30 (17): 4151–9. PMID 2021606.
- Neet, KE; Koshland DE, Jr (November 1966). "The conversion of serine at the active site of subtilisin to cysteine: a "chemical mutation".". Proceedings of the National Academy of Sciences of the United States of America 56 (5): 1606–11. PMID 5230319.
- "A theoretical study of the active sites of papain and S195C rat trypsin: implications for the low reactivity of mutant serine proteinases.". Protein Sci 5 (7): 1355–65. Jul 1996. doi:10.1002/pro.5560050714. PMID 8819168.
- Turkenburg, Johan P.; Lamers, Marieke B. A. C.; Brzozowski, A. Marek; Wright, Lisa M.; Hubbard, Roderick E.; Sturt, Simone L.; Williams, David H. (21 February 2002). "Structure of a Cys25→Ser mutant of human cathepsin S". Acta Crystallographica Section D Biological Crystallography 58 (3): 451–455. doi:10.1107/S0907444901021825.
- Lawson, MA; Semler, BL (Nov 15, 1991). "Poliovirus thiol proteinase 3C can utilize a serine nucleophile within the putative catalytic triad.". Proceedings of the National Academy of Sciences of the United States of America 88 (22): 9919–23. PMID 1658804.
- Cheah, KC; Leong, LE; Porter, AG (May 5, 1990). "Site-directed mutagenesis suggests close functional relationship between a human rhinovirus 3C cysteine protease and cellular trypsin-like serine proteases.". The Journal of Biological Chemistry 265 (13): 7180–7. PMID 2158990.
- "Site-directed mutagenesis of the proposed catalytic amino acids of the Sindbis virus capsid protein autoprotease.". J Virol 64 (6): 3069–73. Jun 1990. PMID 2335827.
- Kowal, AT; Werth, MT; Manodori, A; Cecchini, G; Schröder, I; Gunsalus, RP; Johnson, MK (Sep 26, 1995). "Effect of cysteine to serine mutations on the properties of the [4Fe-4S] center in Escherichia coli fumarate reductase.". Biochemistry 34 (38): 12284–93. PMID 7547971.
- Sigal, IS; Harwood, BG; Arentzen, R (December 1982). "Thiol-beta-lactamase: replacement of the active-site serine of RTEM beta-lactamase by a cysteine residue.". Proceedings of the National Academy of Sciences of the United States of America 79 (23): 7157–60. PMID 6818541.
- Amara, AA; Rehm, BH (Sep 1, 2003). "Replacement of the catalytic nucleophile cysteine-296 by serine in class II polyhydroxyalkanoate synthase from Pseudomonas aeruginosa-mediated synthesis of a new polyester: identification of catalytic residues.". The Biochemical journal 374 (Pt 2): 413–21. doi:10.1042/BJ20030431. PMID 12924980.
- Walker, Ian; Easton, Christopher J.; Ollis, David L. (1 January 2000). "Site-directed mutagenesis of dienelactone hydrolase produces dienelactone isomerase". Chemical Communications (8): 671–672. doi:10.1039/b000365o.
- Li, J; Szittner, R; Derewenda, ZS; Meighen, EA (Aug 6, 1996). "Conversion of serine-114 to cysteine-114 and the role of the active site nucleophile in acyl transfer by myristoyl-ACP thioesterase from Vibrio harveyi.". Biochemistry 35 (31): 9967–73. doi:10.1021/bi9605292. PMID 8756458.
- Sharp, JD; Pickard, RT; Chiou, XG; Manetta, JV; Kovacevic, S; Miller, JR; Varshavsky, AD; Roberts, EF; Strifler, BA; Brems, DN (Sep 16, 1994). "Serine 228 is essential for catalytic activities of 85-kDa cytosolic phospholipase A2.". The Journal of Biological Chemistry 269 (37): 23250–4. PMID 8083230.