Cluster analysis of molecular simulation trajectories for systems where both conformation and orientation of the sampled states are important

J Comput Chem. 2016 Aug 5;37(21):1973-82. doi: 10.1002/jcc.24416. Epub 2016 Jun 12.

Abstract

Clustering methods have been widely used to group together similar conformational states from molecular simulations of biomolecules in solution. For applications such as the interaction of a protein with a surface, the orientation of the protein relative to the surface is also an important clustering parameter because of its potential effect on adsorbed-state bioactivity. This study presents cluster analysis methods specifically designed for systems where both molecular orientation and conformation are important, and validates them on test cases of adsorbed proteins. Additionally, because cluster analysis can be a very subjective process, an objective procedure is presented for identifying both the optimal number of clusters and the best clustering algorithm for a given dataset. The procedure is demonstrated for several agglomerative hierarchical clustering algorithms used in conjunction with three cluster validation techniques. © 2016 Wiley Periodicals, Inc.

Keywords: Cluster analysis; conformation; molecular dynamics; orientation; protein adsorption.
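
The selection procedure outlined in the abstract can be illustrated with a minimal sketch. The abstract does not name the linkage algorithms, the three validation techniques, or the distance metric, so everything below is an assumption made for illustration. Each frame is reduced to a hypothetical pair (conformational coordinate, orientation angle in degrees). `frame_distance` and its scale factors are invented here. The silhouette score stands in for the unspecified validation techniques, and `FRAMES` is synthetic toy data, not simulation output.

```python
import math
from itertools import combinations

def frame_distance(a, b, conf_scale=1.0, angle_scale=30.0):
    """Hypothetical combined metric; a, b = (conformational coord, angle in deg)."""
    d_conf = (a[0] - b[0]) / conf_scale
    d_ang = abs(a[1] - b[1]) % 360.0
    d_ang = min(d_ang, 360.0 - d_ang) / angle_scale  # periodic angle difference
    return math.hypot(d_conf, d_ang)

def distance_matrix(frames):
    n = len(frames)
    return [[frame_distance(frames[i], frames[j]) for j in range(n)] for i in range(n)]

# Three common agglomerative linkage criteria (assumed, not taken from the paper).
LINKAGES = {
    "single": min,
    "complete": max,
    "average": lambda ds: sum(ds) / len(ds),
}

def agglomerate(dist, k, linkage):
    """Naive agglomerative clustering: merge the closest pair until k clusters remain."""
    clusters = [[i] for i in range(len(dist))]
    link = LINKAGES[linkage]
    while len(clusters) > k:
        best = None
        for p, q in combinations(range(len(clusters)), 2):
            d = link([dist[i][j] for i in clusters[p] for j in clusters[q]])
            if best is None or d < best[0]:
                best = (d, p, q)
        _, p, q = best
        clusters[p].extend(clusters.pop(q))  # q > p, so index p stays valid
    labels = [0] * len(dist)
    for c, members in enumerate(clusters):
        for i in members:
            labels[i] = c
    return labels

def silhouette(dist, labels):
    """Mean silhouette width; singleton clusters contribute 0 by convention."""
    groups = {}
    for i, lab in enumerate(labels):
        groups.setdefault(lab, []).append(i)
    total = 0.0
    for i, lab in enumerate(labels):
        own = groups[lab]
        if len(own) == 1:
            continue
        a = sum(dist[i][j] for j in own if j != i) / (len(own) - 1)
        b = min(sum(dist[i][j] for j in g) / len(g)
                for other, g in groups.items() if other != lab)
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(labels)

def select_clustering(frames, k_range=range(2, 6)):
    """Score every (linkage, k) pair and return the best (score, linkage, k, labels)."""
    dist = distance_matrix(frames)
    results = []
    for name in LINKAGES:
        for k in k_range:
            labels = agglomerate(dist, k, name)
            results.append((silhouette(dist, labels), name, k, labels))
    return max(results, key=lambda r: r[0])

# Synthetic toy frames: same conformation / two orientations, plus a second conformation.
FRAMES = [
    (1.0, 5.0), (1.1, 355.0), (0.9, 10.0), (1.0, 0.0),      # state A
    (1.0, 180.0), (1.1, 175.0), (0.9, 185.0), (1.0, 190.0),  # state B: flipped orientation
    (6.0, 5.0), (6.1, 0.0), (5.9, 10.0), (6.0, 355.0),       # state C: new conformation
]

if __name__ == "__main__":
    score, name, k, labels = select_clustering(FRAMES)
    print(f"best linkage={name}, k={k}, silhouette={score:.3f}")
    print(labels)
```

In this toy set, states A and B share a conformation and differ only in orientation, so a purely conformational metric would merge them. The periodic angle term is what keeps them apart. For real trajectories, a library implementation of hierarchical clustering would normally replace the naive O(n^3) merge loop used here.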

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Molecular Dynamics Simulation*
  • Protein Conformation
  • Proteins / chemistry*

Substances

  • Proteins