DomSVR: domain boundary prediction with support vector regression from sequence information alone

Amino Acids. 2010 Aug;39(3):713-26. doi: 10.1007/s00726-010-0506-6. Epub 2010 Feb 18.

Abstract

Protein domains are structural and fundamental functional units of proteins. The information of protein domain boundaries is helpful in understanding the evolution, structures and functions of proteins, and also plays an important role in protein classification. In this paper, we propose a support vector regression-based method to address the problem of protein domain boundary identification based on novel input profiles extracted from AAindex database. As a result, our method achieves an average sensitivity of approximately 36.5% and an average specificity of approximately 81% for multi-domain protein chains, which is overall better than the performance of published approaches to identify domain boundary. As our method used sequence information alone, our method is simpler and faster.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Databases, Protein
  • Humans
  • Protein Structure, Tertiary
  • Proteins / chemistry*
  • Regression Analysis
  • Sequence Alignment / methods*

Substances

  • Proteins