Computational scoring and experimental evaluation of enzymes generated by neural networks

Sean R Johnson; Xiaozhi Fu; Sandra Viknander; Clara Goldin; Sarah Monaco; Aleksej Zelezniak; Kevin K Yang

doi:10.1038/s41587-024-02214-2

Computational scoring and experimental evaluation of enzymes generated by neural networks

Nat Biotechnol. 2024 Apr 23. doi: 10.1038/s41587-024-02214-2. Online ahead of print.

Authors

Sean R Johnson^#¹, Xiaozhi Fu^#², Sandra Viknander², Clara Goldin², Sarah Monaco³, Aleksej Zelezniak^{4

5

6}, Kevin K Yang⁷

Affiliations

¹ New England Biolabs, Ipswich, MA, USA.
² Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden.
³ Invitae, San Francisco, CA, USA.
⁴ Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden. aleksej.zelezniak@chalmers.se.
⁵ Institute of Biotechnology, Life Sciences Centre, Vilnius University, Vilnius, Lithuania. aleksej.zelezniak@chalmers.se.
⁶ Randall Centre for Cell & Molecular Biophysics, King's College London, Guy's Campus, London, UK. aleksej.zelezniak@chalmers.se.
⁷ Microsoft Research, Cambridge, MA, USA. yang.kevin@microsoft.com.

^# Contributed equally.

PMID: 38653796
DOI: 10.1038/s41587-024-02214-2

Abstract

In recent years, generative protein sequence models have been developed to sample novel sequences. However, predicting whether generated proteins will fold and function remains challenging. We evaluate a set of 20 diverse computational metrics to assess the quality of enzyme sequences produced by three contrasting generative models: ancestral sequence reconstruction, a generative adversarial network and a protein language model. Focusing on two enzyme families, we expressed and purified over 500 natural and generated sequences with 70-90% identity to the most similar natural sequences to benchmark computational metrics for predicting in vitro enzyme activity. Over three rounds of experiments, we developed a computational filter that improved the rate of experimental success by 50-150%. The proposed metrics and models will drive protein engineering research by serving as a benchmark for generative protein sequence models and helping to select active variants for experimental testing.

Abstract

Grants and funding