Statistical Analysis, Investigation, and Prediction of the Water Positions in the Binding Sites of Proteins

J Chem Inf Model. 2017 Jul 24;57(7):1517-1528. doi: 10.1021/acs.jcim.6b00620. Epub 2017 Jun 27.

Abstract

Water molecules play a crucial role in biomolecular associations by mediating a hydrogen bond network or filling spaces with van der Waals interactions. Although current drug design technologies have taken water molecule interactions into account, their applications are still limited to their reliance on either excessive computer resources or a particular potential energy model. Here, we introduce a statistical method that is based on experimentally determined water molecules in the binding sites of high-resolution X-ray crystal structures to predict the potential hydration sites in the binding sites of crystal structures of interest. By clustering and analyzing the various interaction patterns of water molecules in the training data set, we derived a tetrahedron-water-cluster model based on a series of residue group triplets that form feature triangles of different shapes. In the tetrahedral-water-cluster model, a triplet of three polar atoms in the residue group triplet acts as the vertices of the bottom triangle of the tetrahedron, and the water molecule that interacts with these three polar atoms is set as the top vertex of the tetrahedron. By comparing the shapes of the bottom triangles in the training data set with the shape of the triangle in the residue group triplets in the crystal structure of interest, we can identify the bottom triangle that is most similar to the one in the residue group triplet of the crystal structure of interest. According to the tetrahedron-water-cluster model, the hydration site for the residue group triplet in the crystal structure of interest can be predicted based on the height of the tetrahedron that has the most similar bottom triangle in the training data set. A test set containing 193 crystal structures was used to evaluate model performance, and extensive comparison with the recently published program Dowser++ revealed that our model is at least as good at providing an accurate set of the potential hydration sites in crystal structures of interest.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binding Sites
  • Hydrogen Bonding
  • Models, Molecular*
  • Protein Conformation
  • Proteins / chemistry*
  • Proteins / metabolism*
  • Statistics as Topic
  • Water / chemistry*

Substances

  • Proteins
  • Water