Protein engineering has been used to create antibody drugs, very bright fluorescent proteins and enzymes that cut down on toxic waste. Genentech's drug for macular degeneration, Lucentis, is an antibody fragment created through site-specific mutagenesis and successive screening in phage to bind its target 100-fold more tightly than its precursor1. More recently, researchers at Codexis and Merck crafted an enzyme to replace an organo-metallic catalyst in the synthesis of sitagliptin, a diabetes drug with more than a billion dollars in annual sales. The researchers began with an enzyme that could create the appropriate bonds but had no activity for the desired substrate; about a dozen rounds of mutagenesis and modeling eventually produced an enzyme with high activity2. The enzyme had yields about 10% higher and produced 20% less waste compared with the original rhodium-based catalyst. The companies won a Presidential Green Chemistry Award for the achievement, and Merck is now working with the US Food and Drug Administration to switch to a manufacturing route using the engineered enzyme.

Success in designing useful proteins is largely a matter of trying over and over again. But protein engineers often find themselves having to choose between trying to make bigger libraries and focusing libraries on sequences most likely to be effective. “Everything depends on how frequently successful solutions appear in the libraries. At the outset of any experiment, you don't really have an idea,” explains Donald Hilvert, a protein engineer and laboratory head at the Swiss Federal Institute of Technology in Zurich (ETH Zurich).

Strategies for protein engineering are manifold. Many rely on display technologies (Table 1), which link protein to gene by putting proteins on the outer surface of viral particles or cells, or by physically linking mRNA to protein in cell-free systems. In a process called directed evolution, protein libraries are screened for desired activity; the best of the lot are collected and modified in hopes of further improvements, generating another library for another screen. Deciding how to find the best versions depends on the particular application and personal preference, but options include screening large random libraries, screening focused libraries as well as using rational design and computer tools.

Table 1 Display technologies

Finding focus

It is impossible to screen every possible protein sequence: even for proteins containing a mere 100 amino acids, the options are immense. Twenty possible amino acids at each of 100 positions works out to over 10130 combinations3. To put that in perspective, the number of atoms in the observable universe is around 1080. Though many groups are working on ways to make screening more efficient (Box 1), the largest protein libraries screened have had around 1015 members.

“Larger libraries and greater diversity are better, but you can't cover everywhere,” says Gregory Weiss, who has adapted phage display to study membrane proteins at the University of California, Irvine. Generally fewer than 20 residues, if that, will be varied in an initial protein sequence of about 200 amino acids, he says. “That doesn't give you a lot of room to work in. You have to choose your 10% very carefully.”

Donald Hilvert at ETH Zurich says engineering enzymes to perform the same chemistry on new substrates is becoming straightforward. Redesigning enzymes for new reactions is more difficult.

Several long-standing techniques can help researchers decide which sections of proteins to vary: alanine scanning creates proteins in which one position at a time is converted to alanine, one of the simplest amino acids. This technique can reveal which residues are particularly important for a protein's function but does not necessarily indicate how to improve it. Saturation mutagenesis samples all 20 amino acids at a particular residue, but creates too many variants to test many residues. Intermediate approaches can be used to interrogate more residues by sampling subsets of amino acids, such as whether they are hydrophobic versus hydrophilic or contain bulky versus tiny side chains.

David Baker at the University of Washington, Seattle designs new enzymes and protein-protein interactions computationally, then tests them in the real world6.

Iterative saturation mutagenesis is a conceptual framework to systematically explore amino acids at several sites of interest (each site typically contains one to three amino acids). This approach was formalized by Manfred Reetz and colleagues at the Max Planck Institute for Coal Research, who have used it to improve an enzyme's stability and also to select enzymes that favor one enantiomer over another4. The process requires multiple rounds of screening; hits from earlier libraries are used as starting points for subsequent rounds that explore other sites, hunting for 'couples' of amino acids with synergistic effects. A freely available program called the combinatorial active-site saturation test, more commonly known as CASTER, can help researchers plan which sites to explore and how to explore them efficiently.

Gene-synthesis companies can manufacture libraries by randomizing codons at particular locations, but this still makes for more possibilities than can be screened, explains Markus Enzelberger, vice president of research and development at MorphoSys. For antibodies, screening becomes considerably harder when researchers move past binding affinity to properties such as stability, aggregation and specificity. Even if researchers limit random variation to, say, all 20 amino acids at six particular residues, that would produce 1010 DNA sequences, and because different sequences of nucleotides can code for the same amino acid, many of these are redundant. Cutting out redundant clones and stop codons reduces the six-residue library to 107, but ordering 107 distinct sequences from a gene-synthesis company is not feasible. The library size could be reduced to a few dozen by varying one amino acid at a time, but then effects of combining variation at different positions would be lost, says Enzelberger. To explore the effects of combinations, his company recently acquired the company Sloning BioTechnology, which developed a technology that relies on a clever combination of double-stranded hairpin DNA and restriction enzymes to transfer desired codons into specific regions of a gene. “That makes the difference between being able to screen all the variants and not,” says Enzelberger, who says his company will make the proprietary technology available to other researchers under certain conditions.

Protein engineering strategies may shift depending on the project, says Herren Wu, vice president of research and development at MedImmune. “If you know what you want to do, and you have a lot of structure and function information, you can use a more focused library or do rational design. If you know less, you really need a big library.” To find an antibody that binds to a particular target, researchers at his company sometimes begin with genes from individuals whose immune cells make antibodies to a pathogen. But because such starting points are often unavailable, most projects begin with phage display, a robust technique capable of displaying many sorts of antibodies. If an antibody with promising affinity is identified, discovery teams may then turn to ribosome display for optimization. Ribosome display can screen much larger libraries, he says, but is currently not as useful for de novo screening because not all antibodies can be displayed stably and efficiently.

Different goals require different approaches, agrees Sachdev Sidhu at the University of Toronto, who is using phage display to design new protein-protein interactions. To find naive interactions without natural precedents, large simple libraries work better, but once an interaction is actually identified, the strategy shifts to optimizing a specific region of the protein. “Then you need a very precise interaction in an existing interface,” says Sidhu, so focused, structure-based and computational approaches can make an existing interaction stronger.

Screening sequences in silico

A computationally designed hemagglutinin (HA) binder HB36 (blue) overlaps well with its actual structure (pink). Reprinted from reference 6 with permission from the American Association for the Advancement of Science.

Compared to wet-lab approaches, computational design can be used to screen many more variants more quickly. Designing an enzyme starts with making models of how an enzyme could facilitate catalysis of a given substrate. “There are a number of ways that a reaction can be catalyzed, and we come up with constellations of amino acid residues in ideal active sites,” explains David Baker, a computational biologist at the University of Washington, Seattle.

For each constellation, in silico screening initially identifies appropriate positions in protein scaffolds where the active site can be placed. After a few rounds of in silico optimization, the most promising sequences are made into actual proteins and tested. Most of these do not work, but the approach has created functional enzymes for a handful of specific reactions. Last year, Baker's group described two enzymes with very different scaffolds that each catalyze the Diels-Alder reaction, creating two carbon bonds and four stereoisomers from appropriate substrates5.

To design an enzyme computationally, researchers model ideal active sites. Reprinted from reference 5 with permission from the American Association for the Advancement of Science.

Still, rationally designed enzymes usually are not very good compared to their natural counterparts. Nature has created enzymes that boost rates of reactions by up to 1014 fold, says Baker, but the best computationally designed enzymes enhance reaction rates by only 106, even after a few rounds of directed evolution.

There are many possibilities why billions of years of evolution have outperformed the last few years of computational engineering. For one thing, whereas natural enzymes often have six or more amino acids that carry out an enzyme's chemistry, designed active sites usually start with three or four amino acids; additional residues are highly constrained by the protein scaffold. More importantly, the actual active site may not have the structure that was computationally predicted. The amino-acid changes necessary to give the enzyme its activity could shift the protein's structure to produce an active site with a suboptimal conformation.

“Computation allows you to explore sequence spaces that are not accessible by experiment,” says Hilvert. But before asking whether a new configuration works as predicted, researchers have to answer another question, he says. “How do I fit all of that into a scaffold that hasn't seen this kind of apparatus before?”

A broader choice of scaffolds could boost the quality of computationally designed proteins, says Baker. In fact, some options have been inspired by online volunteers playing the computer game Foldit, which Baker and coworkers developed to see whether game players could solve problems in predicting protein structures.

Another application of computational design is in creating protein-protein interactions. Baker and colleagues recently reported a small protein that binds a small, well-conserved region on influenza hemagglutinin, the protein which the virus uses to gain access to host cells6. The predicted structure of the designed protein matched the crystallographically observed structure almost perfectly, even down to the side chains on individual amino acids. Nonetheless, says Baker, researchers have much to learn before they can reliably design proteins that act as expected.

In chance we trust

Right now, protein engineers might do better to trust in chance than design, says Frances Arnold at the California Institute of Technology, whose directed evolution techniques using random mutagenesis are used in laboratories around the world. Researchers who design libraries for beneficial mutations rarely compare libraries to randomly generated ones, she says. When researchers in her lab analyze beneficial mutations in enzymes, results show a surprising lack of patterns. “You get mutations all over the enzyme. You couldn't explain them, let alone predict them,” she says. “What we've learned is that we know less than we thought we did. People learned the hard way that you couldn't just go in and make a mutation where you thought and expect it to improve an enzyme.”

“There are lots of ways to evolve a protein,” says Frances Arnold at California Institute of Technology. “If one doesn't work, you can try another.”

Error-prone PCR makes on average one variation per gene, so Arnold increases variation by recombining genes encoding pieces of homologous proteins from different species. This produces highly varied libraries, whose members may differ from each other by sometimes hundreds of residues. At the same time, the number of sequences that could code for a stable protein is high, explains Arnold. “Recombination is a great way to make many mutations simultaneously. Because these have already been vetted by nature, you know that they can make a folded protein.”

Chimeric protein libraries, made by combining parts of related proteins, create diverse libraries with a high proportion of functioning proteins. Credit: M. Smith, Arnold lab, California Institute of Technology

Given the number of sequences to explore, even a rough guide to productive sequence space can make a big difference. “If your choice of building blocks is a sensible one, then your chance of getting a hit is higher,” says Hilvert. Nonetheless, he cautions that finding more hits does not guarantee finding the best possible hit. “You do make a sacrifice because you're not looking at everything.”