Multi-order graph attention network for water solubility prediction and interpretation

Sci Rep. 2023 Mar 2;13(1):957. doi: 10.1038/s41598-022-25701-5.

Abstract

The water solubility of molecules is one of the most important properties in various chemical and medical research fields. Recently, machine learning-based methods for predicting molecular properties, including water solubility, have been extensively studied due to the advantage of effectively reducing computational costs. Although machine learning-based methods have made significant advances in predictive performance, the existing methods were still lacking in interpreting the predicted results. Therefore, we propose a novel multi-order graph attention network (MoGAT) for water solubility prediction to improve the predictive performance and interpret the predicted results. We extracted graph embeddings in every node embedding layer to consider the information of diverse neighboring orders and merged them by attention mechanism to generate a final graph embedding. MoGAT can provide the atomic-specific importance scores of a molecule that indicate which atoms significantly influence the prediction so that it can interpret the predicted results chemically. It also improves prediction performance because the graph representations of all neighboring orders, which contain diverse range of information, are employed for the final prediction. Through extensive experiments, we demonstrated that MoGAT showed better performance than the state-of-the-art methods, and the predicted results were consistent with well-known chemical knowledge.