Revisiting Evaluation of Multiple Sequence Alignment Methods

Methods Mol Biol. 2021:2231:299-317. doi: 10.1007/978-1-0716-1036-7_17.

Abstract

Multiple sequence alignment is a core first step in many bioinformatics analyses, and errors in these alignments can have negative consequences for scientific studies. In this article, we review some of the recent literature evaluating multiple sequence alignment methods and identify specific challenges that arise when performing these evaluations. In particular, we discuss the different trends observed in simulation studies and when using biological benchmarks. Overall, we find that multiple sequence alignment, far from being a "solved problem," would benefit from new attention.

Keywords: Model misspecification; Multiple sequence alignment; Phylogeny estimation; Statistical alignment; Structural alignment.

Publication types

  • Evaluation Study
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Review

MeSH terms

  • Algorithms
  • Benchmarking / methods*
  • Computational Biology / methods*
  • Computer Simulation
  • Databases, Genetic
  • Phylogeny
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods
  • Sequence Analysis, Protein / methods
  • Software*