PROSTATA: a framework for protein stability assessment using transformers

Bioinformatics. 2023 Nov 1;39(11):btad671. doi: 10.1093/bioinformatics/btad671.

Abstract

Motivation: Accurate prediction of change in protein stability due to point mutations is an attractive goal that remains unachieved. Despite the high interest in this area, little consideration has been given to the transformer architecture, which is dominant in many fields of machine learning.

Results: In this work, we introduce PROSTATA, a predictive model built in a knowledge-transfer fashion on a new curated dataset. PROSTATA demonstrates advantage over existing solutions based on neural networks. We show that the large improvement margin is due to both the architecture of the model and the quality of the new training dataset. This work opens up opportunities to develop new lightweight and accurate models for protein stability assessment.

Availability and implementation: PROSTATA is available at https://github.com/AIRI-Institute/PROSTATA and https://prostata.airi.net.

MeSH terms

  • Machine Learning*
  • Neural Networks, Computer*
  • Point Mutation
  • Protein Stability