A Potential Golden Age to Come-Current Tools, Recent Use Cases, and Future Avenues for De Novo Sequencing in Proteomics

Proteomics. 2018 Sep;18(18):e1700150. doi: 10.1002/pmic.201700150. Epub 2018 Aug 22.

Abstract

In shotgun proteomics, peptide and protein identification is most commonly conducted using database search engines, the method of choice when reference protein sequences are available. Despite its widespread use the database-driven approach is limited, mainly because of its static search space. In contrast, de novo sequencing derives peptide sequence information in an unbiased manner, using only the fragment ion information from the tandem mass spectra. In recent years, with the improvements in MS instrumentation, various new methods have been proposed for de novo sequencing. This review article provides an overview of existing de novo sequencing algorithms and software tools ranging from peptide sequencing to sequence-to-protein mapping. Various use cases are described for which de novo sequencing was successfully applied. Finally, limitations of current methods are highlighted and new directions are discussed for a wider acceptance of de novo sequencing in the community.

Keywords: bioinformatics; de novo sequencing; peptide identification; protein identification; software tools.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Algorithms
  • Animals
  • Computational Biology / methods
  • Humans
  • Proteins / analysis
  • Proteins / metabolism
  • Proteomics / methods*
  • Sequence Analysis, Protein / methods*
  • Software*

Substances

  • Proteins