Understanding protein flexibility through dimensionality reduction

J Comput Biol. 2003;10(3-4):617-34. doi: 10.1089/10665270360688228.

Abstract

This work shows how to decrease the complexity of modeling flexibility in proteins by reducing the number of dimensions necessary to model important macromolecular motions such as the induced-fit process. Induced fit occurs during the binding of a protein to other proteins, nucleic acids, or small molecules (ligands) and is a critical part of protein function. It is now widely accepted that conformational changes of proteins can affect their ability to bind other molecules and that any progress in modeling protein motion and flexibility will contribute to the understanding of key biological functions. However, modeling protein flexibility has proven a very difficult task. Experimental laboratory methods, such as x-ray crystallography, produce rather limited information, while computational methods such as molecular dynamics are too slow for routine use with large systems. In this work, we show how to use the principal component analysis method, a dimensionality reduction technique, to transform the original high-dimensional representation of protein motion into a lower dimensional representation that captures the dominant modes of motions of proteins. For a medium-sized protein, this corresponds to reducing a problem with a few thousand degrees of freedom to one with less than fifty. Although there is inevitably some loss in accuracy, we show that we can obtain conformations that have been observed in laboratory experiments, starting from different initial conformations and working in a drastically reduced search space.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Aldehyde Reductase / chemistry
  • Binding Sites
  • Computational Biology / methods*
  • Crystallography, X-Ray
  • Data Interpretation, Statistical*
  • HIV Protease / chemistry
  • Ligands
  • Magnetic Resonance Spectroscopy
  • Protein Binding
  • Protein Conformation*
  • Protein Structure, Tertiary

Substances

  • Ligands
  • Aldehyde Reductase
  • HIV Protease