Evidence theoretic protein fold classification based on the concept of hyperfold

Math Biosci. 2012 Dec;240(2):148-60. doi: 10.1016/j.mbs.2012.07.001. Epub 2012 Jul 20.

Abstract

In current computational biology, assigning a protein domain to a fold class is a complicated and controversial task. It can be more challenging in the much harder task of correct identification of protein domain fold pattern solely through using extracted information from protein sequence. To deal with such a challenging problem, the concepts of hyperfold and interlaced folds are introduced for the first time. Each hyperfold is a set of interlaced folds with a centroid fold. These concepts are used to construct a framework for handling the uncertainty involved with the fold classification problem. In this approach, an unknown query protein is assigned to a hyperfold rather than a single fold. Ten different sequence based features are used to predicting the correct hyperfold. This architecture is featured by the Dempster-Shafer theory of evidence through the bodies of evidence and Dempster's rule of combination to combine the hyperfolds. The classification architecture thus developed was applied for identifying protein folds among the 27 famous SCOP fold patterns from a stringent well-known dataset. Compared with the existing predictors tested by the same benchmark dataset, our approach might achieve the better results.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acid Sequence*
  • Databases, Protein
  • Protein Folding*
  • Protein Structure, Tertiary
  • Proteins / chemistry*
  • Structure-Activity Relationship

Substances

  • Proteins