Reannotation of Genomes by Means of Proteomics Data

Methods Enzymol. 2017:585:201-216. doi: 10.1016/bs.mie.2016.09.019. Epub 2016 Oct 19.

Abstract

Omics approaches have become popular in biology as powerful discovery tools, and currently gain in interest for diagnostic applications. Establishing the accurate genome sequence of any organism is easy, but the outcome of its annotation by means of automatic pipelines remains imprecise. Some protein-encoding genes may be missed as soon as they are specific and poorly conserved in a given taxon, while important to explain the specific traits of the organism. Translational starts are also poorly predicted in a relatively important number of cases, thus impacting the protein sequence database used in proteomics, comparative genomics, and systems biology. The use of high-throughput proteomics data to improve genome annotation is an attractive option to obtain a more comprehensive molecular picture of a given organism. Here, protocols for reannotating prokaryote genomes are described based on shotgun proteomics and derivatization of protein N-termini with a positively charged reagent coupled to high-resolution tandem mass spectrometry.

Keywords: Derivatization; Gene annotation; Gene detection; Genome; N-terminal modification; N-terminomics; Next-generation proteomics; Proteogenomics.

MeSH terms

  • Genomics / methods*
  • Proteomics / methods*
  • Systems Biology