A new scoring function and associated statistical significance for structure alignment by CE

J Comput Biol. 2004;11(5):787-99. doi: 10.1089/cmb.2004.11.787.

Abstract

A new scoring function for assessing the statistical significance of protein structure alignment has been developed. The new scores were tested empirically using the combinatorial extension (CE) algorithm. The significance of a given score was given a p-value by curve-fitting the distribution of the scores generated by a random comparison of proteins taken from the PDB_SELECT database and the structural classification of proteins (SCOP) database. Although the scoring function was developed based on the CE algorithm, it is portable to any other protein structure alignment algorithm. The new scoring function is examined by sensitivity, specificity, and ROC curves.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms
  • Computational Biology*
  • Data Interpretation, Statistical
  • Protein Structure, Tertiary*
  • Sequence Alignment*
  • Sequence Analysis, Protein*
  • Software