Limitations and challenges in protein stability prediction upon genome variations: towards future applications in precision medicine

Comput Struct Biotechnol J. 2020 Jul 24:18:1968-1979. doi: 10.1016/j.csbj.2020.07.011. eCollection 2020.

Abstract

Protein stability predictions are becoming essential in medicine to develop novel immunotherapeutic agents and for drug discovery. Despite the large number of computational approaches for predicting the protein stability upon mutation, there are still critical unsolved problems: 1) the limited number of thermodynamic measurements for proteins provided by current databases; 2) the large intrinsic variability of ΔΔG values due to different experimental conditions; 3) biases in the development of predictive methods caused by ignoring the anti-symmetry of ΔΔG values between mutant and native protein forms; 4) over-optimistic prediction performance, due to sequence similarity between proteins used in training and test datasets. Here, we review these issues, highlighting new challenges required to improve current tools and to achieve more reliable predictions. In addition, we provide a perspective of how these methods will be beneficial for designing novel precision medicine approaches for several genetic disorders caused by mutations, such as cancer and neurodegenerative diseases.

Keywords: Computational tools and databases; Machine learning; Mutations; Non-synonymous single nucleotide variants; Performance bias; Protein function; Protein stability.

Publication types

  • Review