Using ensemble of classifiers for predicting HIV protease cleavage sites in proteins

Amino Acids. 2009 Mar;36(3):409-16. doi: 10.1007/s00726-008-0076-z. Epub 2008 Apr 10.

Abstract

The focus of this work is the use of ensembles of classifiers for predicting HIV protease cleavage sites in proteins. Due to the complex relationships in the biological data, several recent works show that often ensembles of learning algorithms outperform stand-alone methods. We show that the fusion of approaches based on different encoding models can be useful for improving the performance of this classification problem. In particular, in this work four different feature encodings for peptides are described and tested. An extensive evaluation on a large dataset according to a blind testing protocol is reported which demonstrates how different feature extraction methods and classifiers can be combined for obtaining a robust and reliable system. The comparison with other stand-alone approaches allows quantifying the performance improvement obtained by the ensembles proposed in this work.

MeSH terms

  • Algorithms
  • Artificial Intelligence
  • Computational Biology / methods*
  • HIV Protease / chemistry*
  • HIV Protease / genetics
  • HIV Protease / metabolism
  • HIV-1 / enzymology*
  • Humans
  • Proteins / chemistry
  • Proteins / metabolism
  • Sequence Analysis, Protein / methods*
  • Software

Substances

  • Proteins
  • HIV Protease