Accurate Prediction of Hydration Sites of Proteins Using Energy Model With Atom Embedding

Front Mol Biosci. 2021 Sep 20:8:756075. doi: 10.3389/fmolb.2021.756075. eCollection 2021.

Abstract

We propose a method based on neural networks to accurately predict hydration sites in proteins. In our approach, high-quality data of protein structures are used to parametrize our neural network model, which is a differentiable score function that can evaluate an arbitrary position in 3D structures on proteins and predict the nearest water molecule that is not present. The score function is further integrated into our water placement algorithm to generate explicit hydration sites. In experiments on the OppA protein dataset used in previous studies and our selection of protein structures, our method achieves the highest model quality in terms of F1 score, compared to several previous studies.

Keywords: atom embedding; hydration sites; machine learning; prediction; protein.