MIAX: a new paradigm for modeling biomacromolecular interactions and complex formation in condensed phases

Proteins. 2002 Sep 1;48(4):696-732. doi: 10.1002/prot.10122.

Abstract

A new paradigm is proposed for modeling biomacromolecular interactions and complex formation in solution (protein-protein interactions so far in this report) that constitutes the scaffold of the automatic system MIAX (acronym for Macromolecular Interaction Assessment X). It combines in a rational way a series of computational methodologies, the goal being the prediction of the most native-like protein complex that may be formed when two isolated (unbound) protein monomers interact in a liquid environment. The overall strategy consists of first inferring putative precomplex structures by identification of binding sites or epitopes on the proteins surfaces and a simultaneous rigid-body docking process using geometric instances alone. Precomplex configurations are defined here as all those decoys the interfaces of which comply substantially with the inferred binding sites and whose free energy values are lower. Retaining all those precomplex configurations with low energies leads to a reasonable number of decoys for which a flexible treatment is amenable. A novel algorithm is introduced here for automatically inferring binding sites in proteins given their 3-D structure. The procedure combines an unsupervised learning algorithm based on the self-organizing map or Kohonen network with a 2-D Fourier spectral analysis. To model interaction, the potential function proposed here plays a central role in the system and is constituted by empirical terms expressing well-characterized factors influencing biomacromolecular interaction processes, essentially electrostatic, van der Waals, and hydrophobic. Each of these procedures is validated by comparing results with observed instances. Finally, the more demanding process of flexible docking is performed in MIAX embedding the potential function in a simulated annealing optimization procedure. Whereas search of the entire configuration hyperspace is a major factor precluding hitherto systems from efficiently modeling macromolecular interaction modes and complex structures, the paradigm presented here may constitute a step forward in the field because it is shown that a rational treatment of the information available from the 3-D structure of the interacting monomers combined with conveniently selected computational techniques can assist to elude search of regions of low probability in configuration space and indeed lead to a highly efficient system oriented to solve this intriguing and fundamental biologic problem.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms
  • Amino Acids / chemistry
  • Animals
  • Binding Sites
  • Chymotrypsinogen / chemistry
  • Chymotrypsinogen / metabolism
  • Computational Biology / methods*
  • Humans
  • Hydrophobic and Hydrophilic Interactions
  • Leukocyte Elastase / chemistry
  • Leukocyte Elastase / metabolism
  • Ligands
  • Macromolecular Substances
  • Models, Molecular*
  • Molecular Structure
  • Ovomucin / chemistry
  • Protein Binding
  • Proteins / chemistry*
  • Proteins / metabolism*
  • Sequence Homology, Amino Acid
  • Solvents / chemistry
  • Static Electricity
  • Trypsin Inhibitor, Kazal Pancreatic / chemistry
  • Trypsin Inhibitor, Kazal Pancreatic / metabolism
  • Ubiquitins / chemistry
  • Ubiquitins / metabolism

Substances

  • Amino Acids
  • Ligands
  • Macromolecular Substances
  • Proteins
  • Solvents
  • Ubiquitins
  • ovomucoid inhibitors
  • Ovomucin
  • Trypsin Inhibitor, Kazal Pancreatic
  • Chymotrypsinogen
  • Leukocyte Elastase