Large-scale design and refinement of stable proteins using sequence-only models

doi:10.1371/journal.pone.0265020

Large-scale design and refinement of stable proteins using sequence-only models

Fig 5

Refinement of GM designs, overall and as a function of novelty.

(A) Effect of guided and random substitutions on designs created by the GM. The base stability score was much higher for this population of designs than for the expert-designed proteins tested, with a mean of 0.67; EM-guided refinement further increased it to 1.67. As with the expert-designed proteins, this demonstrates a ten-fold increase in stability. Random substitutions again had a deleterious effect, dropping mean stability to 0.29. (B) Stability of GM designs, and guided and random substitutions within those designs, as novelty increases. We consider designs to be more novel when BLAST percent identity with the most-similar design in the training corpus is lower.

doi: https://doi.org/10.1371/journal.pone.0265020.g005