Defining coarse-grained representations of large biomolecules and biomolecular complexes from elastic network models

Biophys J. 2009 Oct 21;97(8):2327-37. doi: 10.1016/j.bpj.2009.08.007.

Abstract

Coarse-grained (CG) models of large biomolecular complexes enable simulations of these systems over long timescales that are not accessible for atomistic molecular dynamics (MD) simulations. A systematic methodology, called essential dynamics coarse-graining (ED-CG), has been developed for defining coarse-grained sites in a large biomolecule. The method variationally determines the CG sites so that key dynamic domains in the protein are preserved in the CG representation. The original ED-CG method relies on a principal component analysis (PCA) of a MD trajectory. However, for many large proteins and multi-protein complexes such an analysis may not converge or even be possible. This work develops a new ED-CG scheme using an elastic network model (ENM) of the protein structure. In this procedure, the low-frequency normal modes obtained by ENM are used to define dynamic domains and to define the CG representation accordingly. The method is then applied to several proteins, such as the HIV-1 CA protein dimer, ATP-bound G-actin, and the Arp2/3 complex. Numerical results show that ED-CG with ENM (ENM-ED-CG) is much faster than ED-CG with PCA because no MD is necessary. The ENM-ED-CG models also capture functional essential dynamics of the proteins almost as well as those using full MD with PCA. Therefore, the ENM-ED-CG method may be better suited to coarse-grain a very large biomolecule or biomolecular complex that is too computationally expensive to be simulated by conventional MD, or when a high resolution atomic structure is not even available.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Actin-Related Protein 2-3 Complex / chemistry
  • Actins / chemistry
  • Adenosine Triphosphate / chemistry
  • Algorithms
  • Computer Simulation
  • Elasticity*
  • HIV-1
  • Models, Molecular
  • Neural Networks, Computer*
  • Principal Component Analysis
  • Protein Conformation
  • Protein Multimerization
  • Proteins / chemistry*
  • Time Factors
  • Viral Proteins / chemistry

Substances

  • Actin-Related Protein 2-3 Complex
  • Actins
  • Proteins
  • Viral Proteins
  • Adenosine Triphosphate