Predicting protein secondary structure by a support vector machine based on a new coding scheme

Genome Inform. 2004;15(2):181-90.

Abstract

Protein structure prediction is one of the most important problems in modern computational biology. Protein secondary structure prediction is a key step in prediction of protein tertiary structure. There have emerged many methods based on machine learning techniques, such as neural networks (NN) and support vector machine (SVM) etc., to focus on the prediction of the secondary structures. In this paper, a new method was proposed based on SVM. Different from the existing methods, this method takes into account of the physical-chemical properties and structure properties of amino acids. When tested on the most popular dataset CB513, it achieved a Q(3) accuracy of 0.7844, which illustrates that it is one of the top range methods for protein of secondary structure prediction.

Publication types

  • Comparative Study

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Models, Chemical
  • Models, Molecular
  • Protein Structure, Secondary*
  • Proteins / chemistry*
  • Proteins / classification*
  • Sequence Alignment / methods*
  • Sequence Analysis, Protein / methods*

Substances

  • Proteins