What is directed protein evolution?
Directed evolution (link) is a powerful method for altering the properties of biological parts and systems. Directed protein evolution employs iterative rounds of mutation and artificial selection to generate new proteins with desirable functions.
Biological molecules have a unique ability to rapidly evolve in response to strong selective pressure. Protein engineers exploit this evolvability to generate new and useful proteins through successive rounds of mutation and selection. This approach is known as directed protein evolution, and it involves four basic steps:
1. A parent protein sequence is selected. 2. The parent sequence is mutated to generate a library of functional variants. 3. Variants are evaluated for their ability to perform the desired function. 4. The process is repeated until the desired function is achieved.
The parent sequence is chosen based on its perceived similarity to the desired function, and a library of functional variants is generated using one or several of a variety of sequence diversification techniques. High-throughput functional screens and genetic selection methods are used to identify library members with enhanced target function, and those variants are used as parent sequences in successive rounds of mutation and selection. This process is repeated until the desired function is achieved.
After a parent sequence is chosen, a library of functional mutants must be generated. Common methods used for library construction include error-prone PCR and DNA shuffling.
Error-prone PCR is a technique for introducing random point mutations into cloned sequences, in which modifications to standard PCR conditions increase the error rate of nucleotide incorporation during amplification. Common methods for decreasing polymerase fidelity include the addition of manganese ions, an increase in the concentration of magnesium, and using an imbalanced ration of dNTPs. There are a number of commercially available kits for error-prone PCR, such as the GeneMorph II random mutagenesis kit from Agilent Technologies (link).
DNA shuffling (link) is a technique for “in vitro homologous recombination of pools of selected mutant genes." In this method, parental sequences are fragmented by DNase I and then reassembled by PCR. Recombination events occur as fragments anneal at regions of sufficient sequence homology. After reassembly, chimeric sequences are amplified by PCR and coned into an appropriate vector.
Screening and selection
Once a library of mutants is generated, they must be evaluated for their ability to perform the desired function. To do this, protein engineers employ a variety of high-throughput assays. Successful assays allow research to test a large number of functional variants while maintaining a connection between phenotype (the evolved protein function) and genotype (the DNA sequence encoding the evolved function). These assays can be categorized as either “in vivo” versus “in vitro” or “selection” versus “screening." The most important distinction is between a screen and a selection. Selections allow for only cells expressing proteins that exhibit the desired function to survive. In contrast, a screen allows for cells expressing any functional variant to survive yet be distinguished by phenotype.
The most basic cell-based screening methods involve transforming a library of mutants into bacteria and identifying individual colonies or cultures that exhibit the desired function. These assays maintain a link between phenotype and genotype that is “achieved naturally by introducing plasmid DNA encoding the protein into a cell." These methods allow millions of sequence variants to be transformed into cells, and by manipulating the statistic of DNA transformation allow for each cell to contain a single vector containing a single sequence variant. These individual cells can then be isolated by growth on solid media.
List of more advanced high-throughput screening and selection methods:
1. phage display 2. ribosome display 3. mRNA-peptide fusion 4. plasmid display 5. cell-surface display 6. genetics 7. n-hybrid systems 8. in vitro compartmentalization 9. spatial address 10. mass spectrometry
Improving whole cell fluorescence of GFP by directed evolution
Wild type green fluorescent protein (GFP) is routinely used as a reporter of gene regulation. However, for some applications, a stronger whole cell fluorescence signal is required. Thus, the research team of Willem Stemmer et al. set out to construct a GFP mutant that would exhibit enhanced whole cell fluorescence. To do this, the group first constructed a synthetic GFP gene with improved codon usage. They then performed successive cycles of DNA shuffling followed by a visual screen for the brightest E. coli colonies. Using this method, they were able to generate a mutant with a whole cell fluorescence signal 45-fold greater than a standard commercially available GFP sequence.
Directed evolution of a thermostable esterase
It had been previously suggested that enzyme thermostability is incompatible with high catalytic activity at low temperature. However, by constraining both properties, the Frances Arnold group was able to create a thermostable esterase that maintains high catalytic activity at lower temperatures using directed protein evolution. To do this, the Arnold group generated a mutant library through error-prone PCR and DNA shuffling and developed a 96-well plate thermostability and activity screen. After six generations of mutagenesis and screening they were able to generate a thermostable p-nitrobenzyl esterase mutant that retains catalytic activity at low temperature.
The UC Davis 2012 iGEM team used directed evolution to engineer an E. coli strain that more efficiently degrades ethylene glycol than previous strains (link). Although not strictly a directed protein evolution project, their work demonstrates the ability of biological molecules and systems to rapidly evolve under strong selective pressure.