A low-complexity add-on score for protein remote homology search with COMER

Bioinformatics. 2018 Jun 15;34(12):2037-2045. doi: 10.1093/bioinformatics/bty048.

Abstract

Motivation: Protein sequence alignment forms the basis for comparative modeling, the most reliable approach to protein structure prediction, among many other applications. Alignment between sequence families, or profile-profile alignment, represents one of the most, if not the most, sensitive means for homology detection but still necessitates improvement. We aim at improving the quality of profile-profile alignments and the sensitivity induced by them by refining profile-profile substitution scores.

Results: We have developed a new score that represents an additional component of profile-profile substitution scores. A comprehensive evaluation shows that the new add-on score statistically significantly improves both the sensitivity and the alignment quality of the COMER method. We discuss why the score leads to the improvement and its almost optimal computational complexity that makes it easily implementable in any profile-profile alignment method.

Availability and implementation: An implementation of the add-on score in the open-source COMER software and data are available at https://sourceforge.net/projects/comer. The COMER software is also available on Github at https://github.com/minmarg/comer and as a Docker image (minmar/comer).

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Amino Acid Sequence
  • Computational Biology / methods
  • Data Accuracy
  • Protein Conformation*
  • Sensitivity and Specificity
  • Sequence Alignment / methods*
  • Sequence Analysis, Protein / methods*
  • Software*