Directed evolution

Not to be confused with directed evolution (transhumanism).
An example of directed evolution with comparison to natural evolution. The inner cycle indicates the three stages of the directed evolution cycle, with the natural process being mimicked in brackets. The outer circle shows the steps of a typical experiment. The red symbols indicate functional variants; the pale symbols indicate variants with reduced function.

Directed evolution (DE) is a method used in protein engineering that mimics the process of natural selection to evolve proteins or nucleic acids toward a user-defined goal.[1] It consists of subjecting a gene to iterative rounds of mutagenesis (creating a library of variants), selection (expressing the variants and isolating members with the desired function), and amplification (generating a template for the next round). It can be performed in vivo (in living cells) or in vitro (free in solution or in microdroplets). Directed evolution is used both for protein engineering, as an alternative to rationally designing modified proteins, and for studies of fundamental evolutionary principles in a controlled laboratory environment.

Principles

Directed evolution is analogous to climbing a hill on a 'fitness landscape' where elevation represents the desired property. Each round of selection samples mutants on all sides of the starting template (1) and selects the mutant with the highest elevation, thereby climbing the hill. This is repeated until a local summit is reached (2).

Directed evolution mimics the natural evolution cycle in a laboratory setting. Evolution requires three things: variation between replicators, fitness differences caused by that variation upon which selection can act, and heritability of the variation. In DE, a single gene is evolved by iterative rounds of mutagenesis, selection or screening, and amplification.[2] Rounds of these steps are typically repeated, using the best variant from one round as the template for the next, to achieve stepwise improvements.
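
The cycle described above can be sketched as a simple hill-climbing loop. The following is a minimal illustration only, not a model of any real experiment: the toy fitness function (number of residues matching a hypothetical target sequence), the function names and the parameter values are all assumptions made for the example.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def fitness(seq, target):
    """Toy fitness (assumption): number of positions matching a hypothetical target."""
    return sum(a == b for a, b in zip(seq, target))

def mutagenise(template, library_size, rng):
    """Mutagenesis: build a library of single-point mutants of the template."""
    library = []
    for _ in range(library_size):
        pos = rng.randrange(len(template))
        residue = rng.choice(AMINO_ACIDS)
        library.append(template[:pos] + residue + template[pos + 1:])
    return library

def directed_evolution(start, target, rounds=20, library_size=200, seed=0):
    """Repeat mutagenesis and selection, reusing the best variant as the next template."""
    rng = random.Random(seed)
    template = start
    history = [fitness(template, target)]
    for _ in range(rounds):
        library = mutagenise(template, library_size, rng)
        # Selection: pick the fittest member of this round's library.
        best = max(library, key=lambda s: fitness(s, target))
        # Amplification is implicit here: the winner becomes the next template.
        # The parent is kept if no variant improves on it.
        if fitness(best, target) > fitness(template, target):
            template = best
        history.append(fitness(template, target))
    return template, history
```

Calling `directed_evolution("A" * 10, "MKTAYIAKQR")` returns the final sequence together with the fitness recorded after each round; because the parent is retained whenever no variant is better, the recorded fitness never decreases.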

The likelihood of success in a directed evolution experiment is directly related to the total library size, as evaluating more mutants increases the chances of finding one with the desired properties.[3]
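
One simple way to make this relationship concrete is to assume that each variant independently has the same small probability of being improved. Under that assumption (which is a simplification introduced here, not a claim from the cited source), the chance that a library contains at least one improved variant can be sketched as follows; the function and parameter names are illustrative.

```python
def chance_of_hit(library_size, hit_fraction):
    """Probability that at least one of `library_size` variants is improved,
    assuming each is improved independently with probability `hit_fraction`."""
    return 1.0 - (1.0 - hit_fraction) ** library_size
```

With a hit fraction of 1 in 1,000, for example, a library of 1,000 variants gives a chance of roughly 63%, and the chance keeps rising as more variants are evaluated.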

Generating variation

Starting gene (left) and library of variants (right). Point mutations change single nucleotides. Insertions and deletions add or remove sections of DNA. Shuffling recombines segments of two (or more) similar genes.

The first step in performing a cycle of directed evolution is the generation of a library of variant genes. The sequence space of random sequences is vast (10^130 possible sequences for a 100 amino acid protein) and extremely sparsely populated by functional proteins. Neither experimental[4] nor natural[5] evolution can ever get close to sampling so many sequences. Natural evolution, however, samples variant sequences close to functional protein sequences, and this is imitated in DE by mutagenising an already functional gene.
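
The size of sequence space quoted above follows from 20 possible amino acids at each of 100 positions. A short sketch of the arithmetic, working with base-10 exponents to keep the numbers readable (the function names and the example library size are assumptions for illustration):

```python
import math

def sequence_space_exponent(length, alphabet_size=20):
    """Return x such that alphabet_size ** length == 10 ** x."""
    return length * math.log10(alphabet_size)

def sampled_fraction_exponent(library_exponent, length, alphabet_size=20):
    """Return y such that a library of 10 ** library_exponent distinct
    sequences covers a fraction 10 ** y of the whole sequence space."""
    return library_exponent - sequence_space_exponent(length, alphabet_size)
```

For a 100 amino acid protein the exponent is about 130.1, matching the figure in the text. Even a hypothetical library of 10^15 distinct sequences would cover only about 10^-115 of that space.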

The starting gene can be mutagenised by random point mutations (by chemical mutagens or error-prone PCR)[6][7] and insertions and deletions (by transposons).[8] Gene recombination can be mimicked by DNA shuffling[9][10] of several sequences (usually of more than 70% homology) to jump into regions of sequence space between the shuffled parent genes. Finally, specific regions of a gene can be systematically randomised[11] for a more focused approach based on structure and function knowledge. Depending on the method, the library generated will vary in the proportion of functional variants it contains. Even if an organism is used to express the gene of interest, only that gene is mutagenised, so the rest of the organism’s genome remains the same and can be ignored for the evolution experiment (to the extent that it provides a constant genetic environment).
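
The three kinds of variation described above can be imitated on DNA strings. This is a deliberately simplified sketch: the function names and parameters are invented for the example, and the shuffling function only swaps between aligned, equal-length parents at random crossover points rather than modelling the fragmentation and reassembly used in laboratory DNA shuffling.

```python
import random

BASES = "ACGT"

def point_mutate(gene, rate, rng):
    """Replace each base by a different base with probability `rate`."""
    out = []
    for base in gene:
        if rng.random() < rate:
            out.append(rng.choice([b for b in BASES if b != base]))
        else:
            out.append(base)
    return "".join(out)

def delete_segment(gene, length, rng):
    """Remove `length` consecutive bases starting at a random position."""
    start = rng.randrange(len(gene) - length + 1)
    return gene[:start] + gene[start + length:]

def shuffle_genes(parents, n_crossovers, rng):
    """Build a chimera by copying each segment between random crossover
    points from a randomly chosen parent (parents must be equal length)."""
    length = len(parents[0])
    points = sorted(rng.sample(range(1, length), n_crossovers))
    child, prev = [], 0
    for point in points + [length]:
        parent = rng.choice(parents)
        child.append(parent[prev:point])
        prev = point
    return "".join(child)
```

Passing a seeded `random.Random` instance as `rng` makes the generated library reproducible.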

Detecting fitness differences

The majority of mutations are deleterious and so libraries of mutants tend to mostly have variants with reduced activity.[12] Therefore, a high-throughput assay is vital for measuring activity to find the rare variants with beneficial mutations that improve the desired properties. Two main categories of method exist for isolating functional variants. Selection systems directly couple protein function to survival of the gene, whereas screening systems individually assay each variant and allow a quantitative threshold to be set for sorting a variant or population of variants of a desired activity. Both selection and screening can be performed in living cells (in vivo evolution) or performed directly on the protein or RNA without any cells (in vitro evolution).[13][14]

During in vivo evolution, each cell (usually bacteria or yeast) is transformed with a plasmid containing a different member of the variant library. In this way, only the gene of interest differs between the cells, with all other genes being kept the same. The cells express the protein either in their cytoplasm or on their surface, where its function can be tested. This format has the advantage of selecting for properties in a cellular environment, which is useful when the evolved protein or RNA is to be used in living organisms. When performed without cells, DE involves using in vitro transcription-translation to produce proteins or RNA free in solution or compartmentalised in artificial microdroplets. This method has the benefits of being more versatile in the selection conditions (e.g. temperature, solvent) and of being able to express proteins that would be toxic to cells. Furthermore, in vitro evolution experiments can generate far larger libraries (up to 10^15) because the library DNA need not be inserted into cells (often a limiting step).

Selection

Selection for binding activity is conceptually simple. The target molecule is immobilised on a solid support, a library of variant proteins is flowed over it, poor binders are washed away, and the remaining bound variants are recovered to isolate their genes.[15] Binding of an enzyme to an immobilised covalent inhibitor has also been used in an attempt to isolate active catalysts. This approach, however, only selects for a single catalytic turnover and is not a good model of substrate binding or true substrate reactivity. If an enzyme activity can be made necessary for cell survival, either by synthesising a vital metabolite or by destroying a toxin, then cell survival is a function of enzyme activity.[16][17] Such systems are generally only limited in throughput by the transformation efficiency of cells. They are also less expensive and labour intensive than screening; however, they are typically difficult to engineer, prone to artefacts and give no information on the range of activities present in the library.

Screening

An alternative to selection is a screening system. Each variant gene is individually expressed and assayed to quantitatively measure its activity (most often by a chromogenic or fluorogenic product). The variants are then ranked and the experimenter decides which variants to use as templates for the next round of DE. Even the most high-throughput assays usually have lower coverage than selection methods, but they have the advantage of producing detailed information on each of the screened variants. This disaggregated data can also be used to characterise the distribution of activities in libraries, which is not possible in simple selection systems. Screening systems, therefore, have advantages when it comes to experimentally characterising adaptive evolution and fitness landscapes.
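
The practical difference between the two approaches can be illustrated with a small sketch. Here `activity` stands for a hypothetical assay function supplied by the caller, and the threshold and names are assumptions for the example: selection returns only the survivors, whereas screening returns a measurement for every variant.

```python
def select(library, activity, survival_threshold):
    """Selection: variants below the threshold are lost; the survivors are
    recovered, but no activity values are recorded."""
    return [v for v in library if activity(v) >= survival_threshold]

def screen(library, activity):
    """Screening: assay every variant individually and return
    (variant, activity) pairs ranked from most to least active."""
    measured = [(v, activity(v)) for v in library]
    measured.sort(key=lambda pair: pair[1], reverse=True)
    return measured
```

The ranked list from `screen` lets the experimenter choose any number of templates for the next round and also describes the whole distribution of activities, at the cost of one assay per variant.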

Ensuring heredity

An expressed protein can either be covalently linked to its gene (as in mRNA display, left) or compartmentalised with it (cells or artificial compartments, right). Either way ensures that the gene can be isolated based on the activity of the encoded protein.

When functional proteins have been isolated, their genes must be isolated too, so a genotype-phenotype link is required.[18] This link can be covalent, as in mRNA display, where the mRNA gene is linked to the protein at the end of translation by puromycin.[19] Alternatively, the protein and its gene can be co-localised by compartmentalisation in living cells[20] or emulsion droplets.[21] The gene sequences isolated are then amplified by PCR or in transformed host bacteria. Either the single best sequence or a pool of sequences can be used as the template for the next round of mutagenesis. The repeated cycles of diversification, selection and amplification generate protein variants adapted to the applied selection pressures.

Comparison to rational protein design

Advantages of directed evolution

Rational design of a protein relies on an in-depth knowledge of the protein structure, as well as its catalytic mechanism.[22][23] Specific changes are then made by site-directed mutagenesis in an attempt to change the function of the protein. A drawback of this is that even when the structure and mechanism of action of the protein are well known, the change due to mutation is still difficult to predict. Therefore, an advantage of DE is that there is no need to understand the mechanism of the desired activity or how mutations would affect it.[24]

Limitations of directed evolution

A restriction of directed evolution is that a high-throughput assay is required in order to measure the effects of a large number of different random mutations. Developing such an assay can require extensive research and development before it can be used for directed evolution. Additionally, such assays are often highly specific to monitoring a particular activity and so are not transferable to new DE experiments.[25]

Additionally, selecting for improvement in the assayed function simply generates improvements in the assayed function. To understand how these improvements are achieved, the properties of the evolving enzyme have to be measured. Improvement of the assayed activity can be due to improvements in enzyme catalytic activity or enzyme concentration. There is also no guarantee that improvement on one substrate will improve activity on another. This is particularly important when the desired activity cannot be directly screened or selected for and so a ‘proxy’ substrate is used. DE can lead to evolutionary specialisation to the proxy without improving the desired activity. Consequently, choosing appropriate screening or selection conditions is vital for successful DE.

Combinatorial approaches

Combined, 'semi-rational' approaches are being investigated to address the limitations of both rational design and directed evolution.[26][27] Beneficial mutations are rare, so large numbers of random mutants have to be screened to find improved variants. 'Focussed libraries' concentrate on randomising regions thought to be richer in beneficial mutations for the mutagenesis step of DE. A focussed library contains fewer variants than a traditional random mutagenesis library and so does not require such high-throughput screening.

Creating a focussed library requires some knowledge of which residues in the structure to mutate. For example, knowledge of the active site of an enzyme may allow just the residues known to interact with the substrate to be randomised.[28][29] Alternatively, knowledge of which protein regions are variable in nature can guide mutagenesis in just those regions.[30][31]
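
As an illustration of how small a focussed library can be, the sketch below enumerates every protein variant obtained by fully randomising a chosen set of positions. The function name, the template and the chosen positions are hypothetical; the choice of positions would in practice come from structural or evolutionary knowledge as described above.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def focused_library(template, positions):
    """Yield every variant of `template` in which the residues at the given
    0-based `positions` take all combinations of the 20 amino acids."""
    for residues in product(AMINO_ACIDS, repeat=len(positions)):
        variant = list(template)
        for pos, residue in zip(positions, residues):
            variant[pos] = residue
        yield "".join(variant)
```

Randomising two positions gives 20 × 20 = 400 variants and three positions gives 8,000, numbers small enough to screen exhaustively, whereas the count grows by a factor of 20 with every additional position.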

Uses

Directed evolution is frequently used for protein engineering as an alternative to rational design,[32] but can also be used to investigate fundamental questions of enzyme evolution.[33]

Protein engineering

As a protein engineering tool, DE has been most successful in three areas:

  1. Improving protein stability for biotechnological use at high temperatures or in harsh solvents.[34][35]
  2. Improving binding affinity of therapeutic antibodies (affinity maturation)[36] and the activity of de novo designed enzymes.[37]
  3. Altering substrate specificity of existing enzymes[38][39][40][41] (often for use in industry).[42]

Evolution studies

The study of natural evolution is traditionally based on extant organisms and their genes. However, research is fundamentally limited by the lack of fossils (and particularly the lack of ancient DNA sequences)[43][44] and incomplete knowledge of ancient environmental conditions. Directed evolution investigates evolution in a controlled system of genes for individual enzymes[45][46][47], ribozymes[48] and replicators[49][50] (similar to experimental evolution of eukaryotes,[51][52] prokaryotes[53] and viruses[54]).

DE allows control of selection pressure, mutation rate and environment (both the abiotic environment, such as temperature, and the biotic environment, such as other genes in the organism). Additionally, there is a complete record of all evolutionary intermediate genes. This allows for detailed measurements of evolutionary processes, for example epistasis, evolvability, adaptive constraint,[55] fitness landscapes,[56] and neutral networks.[57]

References

  1. ^ Lutz, S (December 2010). "Beyond directed evolution--semi-rational protein engineering and design.". Current opinion in biotechnology 21 (6): 734–43. PMID 20869867. 
  2. ^ Voigt, CA; Kauffman, S; Wang, ZG (2000). "Rational evolutionary design: the theory of in vitro protein evolution.". Advances in protein chemistry 55: 79–160. PMID 11050933. 
  3. ^ Dalby, PA (August 2011). "Strategy and success for the directed evolution of enzymes.". Current opinion in structural biology 21 (4): 473–80. PMID 21684150. 
  4. ^ Lipovsek, D; Plückthun, A (July 2004). "In-vitro protein evolution by ribosome display and mRNA display.". Journal of immunological methods 290 (1-2): 51–67. PMID 15261571. 
  5. ^ Dryden, DT; Thomson, AR; White, JH (6 August 2008). "How much of protein sequence space has been explored by life on Earth?". Journal of the Royal Society, Interface / the Royal Society 5 (25): 953–6. PMID 18426772. 
  6. ^ Kuchner, O; Arnold, FH (December 1997). "Directed evolution of enzyme catalysts.". Trends in biotechnology 15 (12): 523–30. PMID 9418307. 
  7. ^ Sen, S; Venkata Dasu, V; Mandal, B (December 2007). "Developments in directed evolution for improving enzyme functions.". Applied biochemistry and biotechnology 143 (3): 212–23. PMID 18057449. 
  8. ^ Jones, DD (16 May 2005). "Triplet nucleotide removal at random positions in a target gene: the tolerance of TEM-1 beta-lactamase to an amino acid deletion.". Nucleic acids research 33 (9): e80. PMID 15897323. 
  9. ^ Stemmer, WP (4 August 1994). "Rapid evolution of a protein in vitro by DNA shuffling.". Nature 370 (6488): 389–91. PMID 8047147. 
  10. ^ Crameri, A; Raillard, SA; Bermudez, E; Stemmer, WP (15 January 1998). "DNA shuffling of a family of genes from diverse species accelerates directed evolution.". Nature 391 (6664): 288–91. PMID 9440693. 
  11. ^ Reetz, MT; Carballeira, JD (2007). "Iterative saturation mutagenesis (ISM) for rapid directed evolution of functional enzymes.". Nature protocols 2 (4): 891–903. PMID 17446890. 
  12. ^ Hartl, DL (October 2014). "What can we learn from fitness landscapes?". Current opinion in microbiology 21C: 51–57. PMID 25444121. 
  13. ^ Badran, AH; Liu, DR (7 November 2014). "In vivo continuous directed evolution.". Current opinion in chemical biology 24C: 1–10. PMID 25461718. 
  14. ^ Kumar, A; Singh, S (December 2013). "Directed evolution: tailoring biocatalysts for industrial applications.". Critical reviews in biotechnology 33 (4): 365–78. PMID 22985113. 
  15. ^ Willats, WG (December 2002). "Phage display: practicalities and prospects.". Plant molecular biology 50 (6): 837–54. PMID 12516857. 
  16. ^ Leemhuis, H; Stein, V; Griffiths, AD; Hollfelder, F (August 2005). "New genotype-phenotype linkages for directed evolution of functional proteins.". Current opinion in structural biology 15 (4): 472–8. PMID 16043338. 
  17. ^ Verhoeven, KD; Altstadt, OC; Savinov, SN (March 2012). "Intracellular detection and evolution of site-specific proteases using a genetic selection system.". Applied biochemistry and biotechnology 166 (5): 1340–54. PMID 22270548. 
  18. ^ Leemhuis, H; Stein, V; Griffiths, AD; Hollfelder, F (August 2005). "New genotype-phenotype linkages for directed evolution of functional proteins.". Current opinion in structural biology 15 (4): 472–8. PMID 16043338. 
  19. ^ Lipovsek, D; Plückthun, A (July 2004). "In-vitro protein evolution by ribosome display and mRNA display.". Journal of immunological methods 290 (1-2): 51–67. PMID 15261571. 
  20. ^ Nguyen, AW; Daugherty, PS (March 2005). "Evolutionary optimization of fluorescent proteins for intracellular FRET.". Nature biotechnology 23 (3): 355–60. PMID 15696158. 
  21. ^ Schaerli, Y; Hollfelder, F (December 2009). "The potential of microfluidic water-in-oil droplets in experimental biology.". Molecular bioSystems 5 (12): 1392–404. PMID 20023716. 
  22. ^ Marshall, SA; Lazar, GA; Chirino, AJ; Desjarlais, JR (1 March 2003). "Rational design and engineering of therapeutic proteins.". Drug discovery today 8 (5): 212–21. PMID 12634013. 
  23. ^ Wilson, CJ (27 October 2014). "Rational protein design: developing next-generation biological therapeutics and nanobiotechnological tools.". Wiley interdisciplinary reviews. Nanomedicine and nanobiotechnology. PMID 25348497. 
  24. ^ Giger, L; Caner, S; Obexer, R; Kast, P; Baker, D; Ban, N; Hilvert, D (August 2013). "Evolution of a designed retro-aldolase leads to complete active site remodeling.". Nature chemical biology 9 (8): 494–8. PMID 23748672. 
  25. ^ Bornscheuer, UT; Pohl, M (April 2001). "Improved biocatalysts by directed evolution and rational protein design.". Current opinion in chemical biology 5 (2): 137–43. PMID 11282339. 
  26. ^ Lutz, S (December 2010). "Beyond directed evolution--semi-rational protein engineering and design.". Current opinion in biotechnology 21 (6): 734–43. PMID 20869867. 
  27. ^ Goldsmith, M; Tawfik, DS (August 2012). "Directed enzyme evolution: beyond the low-hanging fruit.". Current opinion in structural biology 22 (4): 406–12. PMID 22579412. 
  28. ^ Chen, MM; Snow, CD; Vizcarra, CL; Mayo, SL; Arnold, FH (April 2012). "Comparison of random mutagenesis and semi-rational designed libraries for improved cytochrome P450 BM3-catalyzed hydroxylation of small alkanes.". Protein engineering, design & selection : PEDS 25 (4): 171–8. PMID 22334757. 
  29. ^ Acevedo-Rocha, CG; Hoebenreich, S; Reetz, MT (2014). "Iterative saturation mutagenesis: a powerful approach to engineer proteins by systematically simulating Darwinian evolution.". Methods in molecular biology (Clifton, N.J.) 1179: 103–28. PMID 25055773. 
  30. ^ Jochens, H; Bornscheuer, UT (3 September 2010). "Natural diversity to guide focused directed evolution.". Chembiochem : a European journal of chemical biology 11 (13): 1861–6. PMID 20680978. 
  31. ^ Jochens, H; Aerts, D; Bornscheuer, UT (December 2010). "Thermostabilization of an esterase by alignment-guided focussed directed evolution.". Protein engineering, design & selection : PEDS 23 (12): 903–9. PMID 20947674. 
  32. ^ Turner, NJ (August 2009). "Directed evolution drives the next generation of biocatalysts.". Nature chemical biology 5 (8): 567–73. PMID 19620998. 
  33. ^ Romero, PA; Arnold, FH (December 2009). "Exploring protein fitness landscapes by directed evolution.". Nature reviews. Molecular cell biology 10 (12): 866–76. PMID 19935669. 
  34. ^ Gatti-Lafranconi, P; Natalello, A; Rehm, S; Doglia, SM; Pleiss, J; Lotti, M (8 January 2010). "Evolution of stability in a cold-active enzyme elicits specificity relaxation and highlights substrate-related effects on temperature adaptation.". Journal of molecular biology 395 (1): 155–66. PMID 19850050. 
  35. ^ Zhao, H; Arnold, FH (January 1999). "Directed evolution converts subtilisin E into a functional equivalent of thermitase.". Protein engineering 12 (1): 47–53. PMID 10065710. 
  36. ^ Hawkins, RE; Russell, SJ; Winter, G (5 August 1992). "Selection of phage antibodies by binding affinity. Mimicking affinity maturation.". Journal of molecular biology 226 (3): 889–96. PMID 1507232. 
  37. ^ Giger, L; Caner, S; Obexer, R; Kast, P; Baker, D; Ban, N; Hilvert, D (August 2013). "Evolution of a designed retro-aldolase leads to complete active site remodeling.". Nature chemical biology 9 (8): 494–8. PMID 23748672. 
  38. ^ Shaikh, FA; Withers, SG (April 2008). "Teaching old enzymes new tricks: engineering and evolution of glycosidases and glycosyl transferases for improved glycoside synthesis.". Biochemistry and cell biology = Biochimie et biologie cellulaire 86 (2): 169–77. PMID 18443630. 
  39. ^ Cheriyan, M; Walters, MJ; Kang, BD; Anzaldi, LL; Toone, EJ; Fierke, CA (1 November 2011). "Directed evolution of a pyruvate aldolase to recognize a long chain acyl substrate.". Bioorganic & medicinal chemistry 19 (21): 6447–53. PMID 21944547. 
  40. ^ MacBeath, G; Kast, P; Hilvert, D (20 March 1998). "Redesigning enzyme topology by directed evolution.". Science (New York, N.Y.) 279 (5358): 1958–61. PMID 9506949. 
  41. ^ Toscano, MD; Woycechowsky, KJ; Hilvert, D (2007). "Minimalist active-site redesign: teaching old enzymes new tricks.". Angewandte Chemie (International ed. in English) 46 (18): 3212–36. PMID 17450624. 
  42. ^ Turner, NJ (August 2009). "Directed evolution drives the next generation of biocatalysts.". Nature chemical biology 5 (8): 567–73. PMID 19620998. 
  43. ^ Pääbo, S; Poinar, H; Serre, D; Jaenicke-Despres, V; Hebler, J; Rohland, N; Kuch, M; Krause, J; Vigilant, L; Hofreiter, M (2004). "Genetic analyses from ancient DNA.". Annual review of genetics 38: 645–79. PMID 15568989. 
  44. ^ Höss, M; Jaruga, P; Zastawny, TH; Dizdaroglu, M; Pääbo, S (1 April 1996). "DNA damage and DNA sequence retrieval from ancient tissues.". Nucleic acids research 24 (7): 1304–7. PMID 8614634. 
  45. ^ Bloom, JD; Arnold, FH (16 June 2009). "In the light of directed evolution: pathways of adaptive protein evolution.". Proceedings of the National Academy of Sciences of the United States of America. 106 Suppl 1: 9995–10000. PMID 19528653. 
  46. ^ Moses, AM; Davidson, AR (17 May 2011). "In vitro evolution goes deep.". Proceedings of the National Academy of Sciences of the United States of America 108 (20): 8071–2. PMID 21551096. 
  47. ^ Goldsmith, M; Tawfik, DS (August 2012). "Directed enzyme evolution: beyond the low-hanging fruit.". Current opinion in structural biology 22 (4): 406–12. PMID 22579412. 
  48. ^ Salehi-Ashtiani, K; Szostak, JW (1 November 2001). "In vitro evolution suggests multiple origins for the hammerhead ribozyme.". Nature 414 (6859): 82–4. PMID 11689947. 
  49. ^ Sumper, M; Luce, R (January 1975). "Evidence for de novo production of self-replicating and environmentally adapted RNA structures by bacteriophage Qbeta replicase.". Proceedings of the National Academy of Sciences of the United States of America 72 (1): 162–6. PMID 1054493. 
  50. ^ Mills, DR; Peterson, RL; Spiegelman, S (July 1967). "An extracellular Darwinian experiment with a self-duplicating nucleic acid molecule.". Proceedings of the National Academy of Sciences of the United States of America 58 (1): 217–24. PMID 5231602. 
  51. ^ Marden, JH; Wolf, MR; Weber, KE (November 1997). "Aerial performance of Drosophila melanogaster from populations selected for upwind flight ability.". The Journal of experimental biology 200 (Pt 21): 2747–55. PMID 9418031. 
  52. ^ Ratcliff, WC; Denison, RF; Borrello, M; Travisano, M (31 January 2012). "Experimental evolution of multicellularity.". Proceedings of the National Academy of Sciences of the United States of America 109 (5): 1595–600. PMID 22307617. 
  53. ^ Barrick, JE; Yu, DS; Yoon, SH; Jeong, H; Oh, TK; Schneider, D; Lenski, RE; Kim, JF (29 October 2009). "Genome evolution and adaptation in a long-term experiment with Escherichia coli.". Nature 461 (7268): 1243–7. PMID 19838166. 
  54. ^ Heineman, RH; Molineux, IJ; Bull, JJ (August 2005). "Evolutionary robustness of an optimal phenotype: re-evolution of lysis in a bacteriophage deleted for its lysin gene.". Journal of molecular evolution 61 (2): 181–91. PMID 16096681. 
  55. ^ Arnold, FH; Wintrode, PL; Miyazaki, K; Gershenson, A (February 2001). "How enzymes adapt: lessons from directed evolution.". Trends in biochemical sciences 26 (2): 100–6. PMID 11166567. 
  56. ^ Aita, T; Hamamatsu, N; Nomiya, Y; Uchiyama, H; Shibanaka, Y; Husimi, Y (5 July 2002). "Surveying a local fitness landscape of a protein with epistatic sites for the study of directed evolution.". Biopolymers 64 (2): 95–105. PMID 11979520. 
  57. ^ Bloom, JD; Raval, A; Wilke, CO (January 2007). "Thermodynamics of neutral protein evolution.". Genetics 175 (1): 255–66. PMID 17110496. 
