A catalytic triad refers to the three amino acid residues that function together at the centre of the active site of some hydrolase and transferase enzymes (e.g. proteases, amidases, esterases, acylases, lipases and β-lactamases). An Acid-Base-Nucleophile triad is a common motif for generating a nucleophilic residue for covalent catalysis. The residues form a charge-relay network to polarise and activate the nucleophile, which attacks the substrate, forming a covalent intermediate which is then hydrolysed to regenerate free enzyme. The nucleophile is most commonly serine or cysteine but occasionally threonine.
Because enzymes fold into complex three-dimensional structures, the residues of a catalytic triad can be far from each other along the amino-acid sequence (primary structure), however, they are brought close together in the final fold.
As well as divergent evolution of function (and even the triad's nucleophile), catalytic triads show some of the best examples of convergent evolution. Chemical constraints on catalysis have led to the same catalytic solution independently evolving in at least 23 separate superfamilies. Their mechanism of action is consequently one of the best studied in biochemistry.
- 1 History
- 2 Identity of triad members
- 3 Examples of triads
- 4 Comparison of serine and cysteine hydrolase triads
- 5 Divergent evolution
- 6 Convergent evolution
- 7 See also
- 8 References
The structures of trypsin and chymotrypsin were first solved in the 1930s. The serine triad member of trypsin and chymotrypsin was identified as the nucleophile (by diisopropyl fluorophosphate modification) in the 1950s. Other protease sequences were aligned in the 1960s to reveal a family of related proteases, now called the S1 family. Simultaneously, the structures of the evolutionarily unrelated papain and subtilisin proteases were found to contain analogous triads. The 'charge-relay' mechanism for the activation of the nucleophile by the other triad members was proposed in the late 1960s. As more protease structures were solved by X-ray crystallography in the 1970s and 80s, homologous (such as TEV protease) and analogous (such as papain) triads were found. The MEROPS classification system in the 1990s and 2010s started to class proteases into structurally related enzyme superfamilies and so acts as a database of the convergent evolution of triads in over 20 superfamilies. Understanding how chemical constraints on evolution led to the convergence of so many enzyme families on the same triad geometries has developed in the 2010s. The massive body of work on the charge-relay, covalent catalysis of catalytic triads has led to the mechanism as being the best characterised in all of biochemistry.
Identity of triad members
The side-chain of the nucleophilic residue performs covalent catalysis on the substrate. The lone pair of electrons present on the oxygen or sulphur attack the electropositive carbonyl carbon. The 20 naturally occurring biological amino acids do not contain sufficiently nucleophilic functional groups for many difficult catalytic reactions. The most commonly used nucleophiles are the alcohol (OH) of serine and the thiol/thiolate ion (SH/S−) of cysteine. Embedding the nucleophile in a triad increases its catalytic activity. Finally, threonine proteases use the secondary alcohol of threonine, however such proteases use the N-terminal amide as the base, rather than a separate amino acid due to steric hindrance of the extra methyl group.
Since no natural amino acids are strongly nucleophilic, the base in a catalytic triad polarises and deprotonates the nucleophile to increase its reactivity. Additionally, it protonates the first product to aid leaving group departure. It is most commonly histidine since its pKa allows for effective base catalysis as well as both hydrogen bonding to the acid residue and deprotonating the nucleophile residue. β-lactamases such as TEM-1 use a lysine residue as the base. Because lysine's pKa is so high (pKa=11), a glutamate and several other residues act as the acid to stabilise its deprotonated state during the catalytic cycle. In order to avoid steric clashes, threonine proteases use their N-terminal amide as the base, to increase the reactivity of the catalytic threonine residue.
The acidic residue aligns and polarises the basic residue and is commonly aspartate or glutamate. Some enzymes act only as a dyad and the acid member of the triad can be less necessary. For example papain (a cysteine protease) uses asparagine as its third triad member which orients the histidine base but does not act as an acid. Similarly, hepatitis A virus protease contains an ordered water in the position where an acid residue should be. Lastly, cytomegalovirus proteases uses a pair of histidines, one as the base, as usual, and one as the acid. The second histidine is not as effective an acid as the more common aspartate or glutamate, leading to a lower catalytic efficiency.
Examples of triads
The Serine-Histidine-Aspartate motif is one of the most thoroughly characterised catalytic motifs in biochemistry. The triad is exemplified by chymotrypsin (Superfamily PA, Family S1), which is a model serine protease. The aspartate is hydrogen bonded (possibly via a low-barrier hydrogen bond) to the histidine, increasing the pKa of its imidazole nitrogen from 7 to around 12. This allows the histidine to act as a powerful general base, and deprotonate serine. The serine serves as a nucleophile, attacking the carbonyl carbon and forcing the carbonyl oxygen to accept an electron, leading to a tetrahedral intermediate. This intermediate is stabilized by an oxanion hole, involving the backbone amide of serine. Collapse of this intermediate back to a carbonyl causes histidine to donate its proton to the nitrogen attached to the alpha carbon. The nitrogen and the attached C-terminal peptide fragment leave by diffusion. A water molecule then donates a proton to histidine and the remaining OH− attacks the carbonyl carbon, forming another tetrahedral intermediate. The OH is a poorer leaving group than the C-terminal fragment, so, when the tetrahedral intermediate collapses again, the enzyme's serine leaves, regaining a proton from histidine. The N-terminus of the cleaved peptide now leaves by diffusion. The same triad has also convergently evolved in α/β hydrolases such as some lipases and esterases, however the chirality is reversed. Additionally, brain acetyl hydrolase (which has the same fold as a small G-protein) has also been found to have this triad. The equivalent Serine-Histidine-Glutamate triad is used in acetylcholinesterase.
The second most studied triad is the Cysteine-Histidine-Aspartate motif. Several families of cysteine proteases use this triad set, for example TEV protease (Superfamily PA, Family C4) and papain (Superfamily CA, Family C1). The triad acts similarly to serine protease triads, with notable differences (see below). The importance of the Asp to catalysis varies and several cysteine proteases are effectively Cys-His dyads (e.g. hepatitis A virus protease).
The triad of cytomegalovirus protease (Superfamily SH, Family S21) uses histidine as both the acid and base triad members. Removing the acid histidine only results in a 10-fold activity loss (compared to >10,000-fold when aspartate is removed from chymotrypsin). This triad has been interpreted as a possible way of generating a less active enzyme to control cleavage rate.
An unusual triad is found in seldolisin proteases (Superfamily SB, Family S53). The low pKa of the glutamate carboxylate group means that it only acts as a base in the triad at very low pH. The triad is hypothesised to be an adaptation to specific environments like acidic hot springs (e.g. kumamolysin) or cell lysosome (e.g. tripeptidyl peptidase).
Threonine proteases, such as the proteasome protease subunit (Superfamily PB, Family T1) and ornithine acyltransferases (Superfamily PE, Family T5) use the secondary alcohol of threonine in an manner analogous to the use of the serine primary alcohol. However, due to the steric interference of the extra methyl group of threonine, the base member of the triad is the N-terminal amide which polarises an ordered water which, in turn, deprotonates the catalytic alcohol to increase its reactivity.
Ser-Nterm and Cys-Nterm
In a similar manner to threonine proteases, there exist equivalent 'serine only' and 'cysteine only' configurations such as penicillin acylase G (Superfamily PB, Family S45) and penicillin acylase V (Superfamily PB, Family S59) which are evolutionarily related to the proteasome proteases. Again, these use their N-terminal amide as a base.
This unusual triad occurs only in one superfamily of amidases. In this case, the lysine acts to polarise the middle serine. The middle serine then forms two strong hydrogen bonds to he nucleophilic serine to activate it (one with the side chain alcohol and the other with the backbone amide). The middle serine is held in an unusual cis orientation to facilitate precise contacts with the other two triad residues. The triad is further unusual in that the lysine and cis-serine both act as the base in activating the catalytic serine but the same lysine also performs the role of the acid member as well as making key structural contacts.
Comparison of serine and cysteine hydrolase triads
Nucleophilic enzymes use an interconnected set of active site residues to achieve catalysis. The sophistication of the active site network causes residues involved in catalysis, and residues in contact with these, to be the most evolutionarily conserved within their families. In catalytic triads, the most common nucleophiles are serine (an alcohol) or cysteine (a thiol). Compared to oxygen, sulphur’s extra d orbital makes it larger (by 0.4 Å) and softer, allows it to form longer bonds (dC-X and dX-H by 1.3-fold), and gives it a lower pKa (by 5 units). The resulting chemical differences between cysteine and serine proteases on catalytic chemistry are described below; however similar issues affect hydrolases and transferases in general.
The pKa of cysteine is low enough that some cysteine proteases (e.g. papain) have been shown to exist as an S− thiolate ion in the ground state enzyme and many even lack the acidic triad member. Serine is also more dependent on other residues to reduce its pKa for concerted deprotonation with catalysis by optimal orientation of the acid-base triad members. The low pKa of cysteine works to its disadvantage in the resolution of the first tetrahedral intermediate as unproductive reversal of the original nucleophilic attack is the more favourable breakdown product. The triad base is therefore preferentially oriented to protonate the leaving group amide to ensure that it is ejected to leave the enzyme sulphur covalently bound to the substrate N-terminus. Finally, resolution of the acyl-enzyme (to release the substrate C-terminus) requires serine to be re-protonated whereas cysteine can leave as S−.
Sterically, the sulphur of cysteine also has longer bonds and a bulkier Van der Waals radius to fit in the active site and a mutated nucleophile can be trapped in unproductive orientations. For example the crystal structure of thio-trypsin indicates that cysteine points away from the substrate, instead forming interactions with the oxyanion hole.
There are examples of divergent evolution in catalytic triads in both the reaction catalysed, and the residues used in catalysis. The triad remains the core of the active site, but it is evolutionarily adapted to serve different functions.
Catalytic triads perform covalent catalysis via an acyl-enzyme intermediate. If this intermediate is resolved by water, the result is hydrolysis of the substrate. However, if the intermediate is resolved by attack by a second substrate, then the enzyme acts as a transferase. For example, attack by an acyl group results in an acyltransferase reaction. Several families of transferase enzymes have evolved from hydrolases by adaptation to exclude water and favour attack of a second substrate.
Despite chemical differences between the possible triad nucleophiles, some protease superfamilies have evolved from one nucleophile to another through divergent evolution. This can be inferred when a superfamily (with the same fold) contains families that use different nucleophiles. Such nucleophile switches have occurred several times during evolutionary history, however the mechanisms by which this happen are still unclear.
|PA clan||C3, C4, C24, C30, C37, C62, C74, C99||TEV protease (Tobacco etch virus)|
|S1, S3, S6, S7, S29, S30, S31, S32, S39, S46, S55, S64, S65, S75||Chymotrypsin (mammals, e.g. Bos taurus)|
|PB clan||C44, C45, C59, C69, C89, C95||Amidophosphoribosyltransferase precursor (Homo sapiens)|
|S45, S63||Penicillin G acylase precursor (Escherichia coli)|
|T1, T2, T3, T6||Archaean proteasome, beta component (Thermoplasma acidophilum)|
|PC clan||C26, C56||Gamma-glutamyl hydrolase (Rattus norvegicus)|
|S51||Dipeptidase E (Escherichia coli)|
|PD clan||C46||Hedgehog protein (Drosophila melanogaster)|
|N9, N10, N11||Intein-containing V-type proton ATPase catalytic subunit A (Saccharomyces cerevisiae)|
|PE clan||P1||DmpA aminopeptidase (Ochrobactrum anthropi)|
|T5||Ornithine acetyltransferase precursor (Saccharomyces cerevisiae)|
The enzymology of proteases provides some of the clearest known examples of convergent evolution. The same geometric arrangement of triad residues has independently evolved over 20 times (in separate enzyme superfamilies). This is because there are limited productive ways to arrange three triad residues, the enzyme backbone and the substrate. These examples reflect the intrinsic chemical constraints on enzymes, leading evolution to independently converge on equivalent solutions repeatedly.
Cysteine and serine hydrolases
Serine and cysteine proteases use different amino acid functional groups (alcohol or thiol) as a nucleophile. In order to activate that nucleophile, they orient an acidic and basic residue in a catalytic triad. The chemical and physical constraints on enzyme catalysis have caused identical triad arrangements to have evolved independently over 20 times in different enzyme superfamilies.
The same triad geometries been converged upon by serine proteases such as chymotrypsin and subtilisin superfamilies. Similarly, the same has occurred with cysteine proteases such as viral C3 protease and papain superfamilies. Importantly, due to the mechanistic similarities in cysteine and serine proteases, all of these triads have converged to almost the same arrangement.
Families of Cysteine proteases
Families of Serine proteases
|SB||S8, S53||Subtilisin (Bacillus licheniformis)|
|SC||S9, S10, S15, S28, S33, S37||Prolyl oligopeptidase (Sus scrofa)|
|SE||S11, S12, S13||D-Ala-D-Ala peptidase C (Escherichia coli)|
|SF||S24, S26||Signal peptidase I (Escherichia coli)|
|SH||S21, S73, S77, S78, S80||Cytomegalovirus assemblin (human herpesvirus 5)|
|SJ||S16, S50, S69||Lon-A peptidase (Escherichia coli)|
|SK||S14, S41, S49||Clp protease (Escherichia coli)|
|SO||S74||Phage K1F endosialidase CIMCD self-cleaving protein (Enterobacteria phage K1F)|
|SP||S59||Nucleoporin 145 (Homo sapiens)|
|SR||S60||Lactoferrin (Homo sapiens)|
|SS||S66||Murein tetrapeptidase LD-carboxypeptidase (Pseudomonas aeruginosa)|
|ST||S54||Rhomboid-1 (Drosophila melanogaster)|
|PA||S1, S3, S6, S7, S29, S30, S31, S32, S39, S46, S55, S64, S65, S75||Chymotrypsin A (Bos taurus)|
|PB||S45, S63||Penicillin G acylase precursor (Escherichia coli)|
|PC||S51||Dipeptidase E (Escherichia coli)|
|PE||P1||DmpA aminopeptidase (Ochrobactrum anthropi)|
|unassigned||S48, S62, S68, S71, S72, S79, S81|
Threonine proteases use the amino acid threonine as their catalytic nucleophile. Unlike cysteine and serine, threonine is a secondary alcohol (i.e. has a methyl group). The methyl group of threonine greatly restricts the possible orientations of triad and substrate as the methyl clashes with either the enzyme backbone or histidine base. Consequently, most threonine proteases use an N-terminal threonine in order to avoid such steric clashes.
Several evolutionarily independent enzyme superfamilies with different protein folds use the N-terminal residue as a nucleophile; for example, Superfamily PB (proteosomes using the Ntn fold) and Superfamily PE (acetyltransferases using the DOM fold) This commonality of active site structure in completely different protein folds indicates that the active site evolved convergently in those superfamilies.
Families of threonine proteases
|PB clan||T1, T2, T3, T6||Archaean proteasome, beta component (Thermoplasma acidophilum)|
|PE clan||T5||Ornithine acetyltransferase (Saccharomyces cerevisiae)|
- Enzyme catalysis
- Active site
- Functional groups
- Enzyme superfamily
- PA clan
- Convergent evolution
- Divergent evolution
- Dodson, G; Wlodawer, A (September 1998). "Catalytic triads and their relatives.". Trends in Biochemical Sciences 23 (9): 347–52. doi:10.1016/S0968-0004(98)01254-7. PMID 9787641.
- Buller, AR; Townsend, CA (Feb 19, 2013). "Intrinsic evolutionary constraints on protease structure, enzyme acylation, and the identity of the catalytic triad". Proceedings of the National Academy of Sciences of the United States of America 110 (8): E653–61. Bibcode:2013PNAS..110E.653B. doi:10.1073/pnas.1221050110. PMID 23382230.
- Stryer L, Berg JM, Tymoczko JL (2002). "9 Catalytic Strategies". Biochemistry (5th ed.). San Francisco: W.H. Freeman. ISBN 0-7167-4955-6.
- Perutz, Max (1992). Protein structure. New approaches to disease and therapy. New York: W.H. Freeman and Co.
- Neurath, H (Oct 1994). "Proteolytic enzymes past and present: the second golden era. Recollections, special section in honor of Max Perutz". Protein Sci 3 (10): 1734–9. doi:10.1002/pro.5560031013. PMC 2142620. PMID 7849591.
- Ohman, KP; Hoffman, A; Keiser, HR (April 1990). "Endothelin-induced vasoconstriction and release of atrial natriuretic peptides in the rat.". Acta physiologica Scandinavica 138 (4): 549–56. doi:10.1111/j.1748-1716.1990.tb08883.x. PMID 2141214.
- Dixon, Gordon H.; Kauffman, Dorothy L.; Neurath, Hans (5 March 1958). "Amino Acid Sequence in the Region of Diisopropyl Phosphoryl Binding in Dip-Trypsin". Journal of the American Chemical Society 80 (5): 1260–1261. doi:10.1021/ja01538a059.
- WALSH, KA; NEURATH, H (October 1964). "TRYPSINOGEN AND CHYMOTRYPSINOGEN AS HOMOLOGOUS PROTEINS". Proceedings of the National Academy of Sciences of the United States of America 52 (4): 884–9. Bibcode:1964PNAS...52..884W. doi:10.1073/pnas.52.4.884. PMC 300366. PMID 14224394.
- de Haën, C; Neurath, H; Teller, DC (Feb 25, 1975). "The phylogeny of trypsin-related serine proteases and their zymogens. New methods for the investigation of distant evolutionary relationships.". Journal of Molecular Biology 92 (2): 225–59. doi:10.1016/0022-2836(75)90225-9. PMID 1142424.
- Lesk, AM; Fordham, WD (May 10, 1996). "Conservation and variability in the structures of serine proteinases of the chymotrypsin family.". Journal of Molecular Biology 258 (3): 501–37. doi:10.1006/jmbi.1996.0264. PMID 8642605.
- Blow, DM; Birktoft, JJ; Hartley, BS (Jan 25, 1969). "Role of a buried acid group in the mechanism of action of chymotrypsin". Nature 221 (5178): 337–40. Bibcode:1969Natur.221..337B. doi:10.1038/221337a0. PMID 5764436.
- Gorbalenya, AE; Blinov, VM; Donchenko, AP (Jan 6, 1986). "Poliovirus-encoded proteinase 3C: a possible evolutionary link between cellular serine and cysteine proteinase families.". FEBS Letters 194 (2): 253–7. doi:10.1016/0014-5793(86)80095-3. PMID 3000829.
- Bazan, JF; Fletterick, RJ (November 1988). "Viral cysteine proteases are homologous to the trypsin-like family of serine proteases: structural and functional implications". Proceedings of the National Academy of Sciences of the United States of America 85 (21): 7872–6. Bibcode:1988PNAS...85.7872B. doi:10.1073/pnas.85.21.7872. PMC 282299. PMID 3186696.
- Phan, J; Zdanov, A; Evdokimov, AG; Tropea, JE; Peters HK, 3rd; Kapust, RB; Li, M; Wlodawer, A; Waugh, DS (Dec 27, 2002). "Structural basis for the substrate specificity of tobacco etch virus protease". The Journal of Biological Chemistry 277 (52): 50564–72. doi:10.1074/jbc.M207224200. PMID 12377789.
- Rawlings, N.D.; Barrett, A.J. (1993). "Evolutionary families of peptidases". Biochem J 290: 205–218.
- Rawlings ND, Barrett AJ, Bateman A; Barrett; Bateman (January 2010). "MEROPS: the peptidase database". Nucleic Acids Res. 38 (Database issue): D227–33. doi:10.1093/nar/gkp971. PMC 2808883. PMID 19892822.
- Ekici, OD; Paetzel, M; Dalbey, RE (December 2008). "Unconventional serine proteases: variations on the catalytic Ser/His/Asp triad configuration.". Protein science : a publication of the Protein Society 17 (12): 2023–37. doi:10.1110/ps.035436.108. PMID 18824507.
- Damblon, C; Raquet, X; Lian, LY; Lamotte-Brasseur, J; Fonze, E; Charlier, P; Roberts, GC; Frère, JM (Mar 5, 1996). "The catalytic mechanism of beta-lactamases: NMR titration of an active-site lysine residue of the TEM-1 enzyme". Proceedings of the National Academy of Sciences of the United States of America 93 (5): 1747–52. Bibcode:1996PNAS...93.1747D. doi:10.1073/pnas.93.5.1747. PMID 8700829.
- Jelsch, C; Lenfant, F; Masson, JM; Samama, JP (Mar 9, 1992). "Beta-lactamase TEM1 of E. coli. Crystal structure determination at 2.5 A resolution.". FEBS Letters 299 (2): 135–42. doi:10.1016/0014-5793(92)80232-6. PMID 1544485.
- Brannigan, JA; Dodson, G; Duggleby, HJ; Moody, PC; Smith, JL; Tomchick, DR; Murzin, AG (Nov 23, 1995). "A protein catalytic framework with an N-terminal nucleophile is capable of self-activation". Nature 378 (6555): 416–9. Bibcode:1995Natur.378..416B. doi:10.1038/378416a0. PMID 7477383.
- Cheng, H; Grishin, NV (July 2005). "DOM-fold: a structure with crossing loops found in DmpA, ornithine acetyltransferase, and molybdenum cofactor-binding domain.". Protein science : a publication of the Protein Society 14 (7): 1902–10. doi:10.1110/ps.051364905. PMID 15937278.
- Shin, S; Yun, YS; Koo, HM; Kim, YS; Choi, KY; Oh, BH (Jul 4, 2003). "Characterization of a novel Ser-cisSer-Lys catalytic triad in comparison with the classical Ser-His-Asp triad.". The Journal of Biological Chemistry 278 (27): 24937–43. doi:10.1074/jbc.M302156200. PMID 12711609.
- Halabi, N; Rivoire, O; Leibler, S; Ranganathan, R (Aug 21, 2009). "Protein sectors: evolutionary units of three-dimensional structure.". Cell 138 (4): 774–86. doi:10.1016/j.cell.2009.07.038. PMID 19703402.
- McGrath, ME; Wilke, ME; Higaki, JN; Craik, CS; Fletterick, RJ (Nov 28, 1989). "Crystal structures of two engineered thiol trypsins.". Biochemistry 28 (24): 9264–70. doi:10.1021/bi00450a005. PMID 2611228.
- Polgár, L; Asbóth, B (Aug 7, 1986). "The basic difference in catalyses by serine and cysteine proteinases resides in charge stabilization in the transition state.". Journal of Theoretical Biology 121 (3): 323–6. doi:10.1016/s0022-5193(86)80111-4. PMID 3540454.
- Beveridge, AJ (July 1996). "A theoretical study of the active sites of papain and S195C rat trypsin: implications for the low reactivity of mutant serine proteinases". Protein science : a publication of the Protein Society 5 (7): 1355–65. doi:10.1002/pro.5560050714. PMC 2143470. PMID 8819168.
- Abrahmsén, L; Tom, J; Burnier, J; Butcher, KA; Kossiakoff, A; Wells, JA (Apr 30, 1991). "Engineering subtilisin and its substrates for efficient ligation of peptide bonds in aqueous solution.". Biochemistry 30 (17): 4151–9. doi:10.1021/bi00231a007. PMID 2021606.
- Murzin, Alexey G (June 1998). "How far divergent evolution goes in proteins". Current Opinion in Structural Biology 8 (3): 380–387. doi:10.1016/S0959-440X(98)80073-0.
- Gerlt, John A.; Babbitt, Patricia C. (2001). "DIVERGENT EVOLUTION OF ENZYMATIC FUNCTION: Mechanistically Diverse Superfamilies and Functionally Distinct Suprafamilies". Annual Review of Biochemistry 70 (1): 209–246. doi:10.1146/annurev.biochem.70.1.209. PMID 11395407.
- Stehle, Felix; Brandt, Wolfgang; Stubbs, Milton T.; Milkowski, Carsten; Strack, Dieter (October 2009). "Sinapoyltransferases in the light of molecular evolution". Phytochemistry. Evolution of Metabolic Diversity 70 (15–16): 1652–1662. doi:10.1016/j.phytochem.2009.07.023.