Optimally-connected hidden markov models for predicting MHC-binding peptides

J Bioinform Comput Biol. 2006 Oct;4(5):959-80. doi: 10.1142/s0219720006002314.

Abstract

Hidden Markov models (HMMs) are one of various methods that have been applied to prediction of major histo-compatibility complex (MHC) binding peptide. In terms of model topology, a fully-connected HMM (fcHMM) has the greatest potential to predict binders, at the cost of intensive computation. While a profile HMM (pHMM) performs dramatically fewer computations, it potentially merges overlapping patterns into one which results in some patterns being missed. In a profile HMM a state corresponds to a position on a peptide while in an fcHMM a state has no specific biological meaning. This work proposes optimally-connected HMMs (ocHMMs), which do not merge overlapping patterns and yet, by performing topological reductions, a model's connectivity is greatly reduced from an fcHMM. The parameters of ocHMMs are initialized using a novel amino acid grouping approach called "multiple property grouping." Each group represents a state in an ocHMM. The proposed ocHMMs are compared to a pHMM implementation using HMMER, based on performance tests on two MHC alleles HLA (Human Leukocyte Antigen)-A*0201 and HLA-B*3501. The results show that the heuristic approaches can be adjusted to make an ocHMM achieve higher predictive accuracy than HMMER. Hence, such obtained ocHMMs are worthy of trial for predicting MHC-binding peptides.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Artificial Intelligence*
  • Binding Sites
  • Computer Simulation
  • Histocompatibility Antigens Class I / chemistry*
  • Markov Chains
  • Models, Chemical*
  • Models, Molecular*
  • Molecular Sequence Data
  • Peptides / chemistry*
  • Protein Binding
  • Protein Interaction Mapping / methods*
  • Sequence Analysis, Protein / methods*

Substances

  • Histocompatibility Antigens Class I
  • Peptides