Ward Clustering Improves Cross-Validated Markov State Models of Protein Folding

J Chem Theory Comput. 2017 Mar 14;13(3):963-967. doi: 10.1021/acs.jctc.6b01238. Epub 2017 Feb 17.

Abstract

Markov state models (MSMs) are a powerful framework for analyzing protein dynamics. MSMs require the decomposition of conformation space into states via clustering, which can be cross-validated when a prediction method is available for the clustering method. We present an algorithm for predicting cluster assignments of new data points with Ward's minimum variance method. We then show that clustering with Ward's method produces better or equivalent cross-validated MSMs for protein folding than other clustering algorithms.

MeSH terms

  • Cluster Analysis
  • Markov Chains*
  • Molecular Dynamics Simulation*
  • Protein Conformation
  • Protein Folding*
  • Proteins / chemistry*

Substances

  • Proteins