A Practical Guide to Small Protein Discovery and Characterization Using Mass Spectrometry

J Bacteriol. 2022 Jan 18;204(1):e0035321. doi: 10.1128/JB.00353-21. Epub 2021 Nov 8.

Abstract

Small proteins of up to ∼50 amino acids are an abundant class of biomolecules across all domains of life. Yet due to the challenges inherent in their size, they are often missed in genome annotations, and are difficult to identify and characterize using standard experimental approaches. Consequently, we still know few small proteins even in well-studied prokaryotic model organisms. Mass spectrometry (MS) has great potential for the discovery, validation, and functional characterization of small proteins. However, standard MS approaches are poorly suited to the identification of both known and novel small proteins due to limitations at each step of a typical proteomics workflow, i.e., sample preparation, protease digestion, liquid chromatography, MS data acquisition, and data analysis. Here, we outline the major MS-based workflows and bioinformatic pipelines used for small protein discovery and validation. Special emphasis is placed on highlighting the adjustments required to improve detection and data quality for small proteins. We discuss both the unbiased detection of small proteins and the targeted analysis of small proteins of interest. Finally, we provide guidelines to prioritize novel small proteins, and an outlook on methods with particular potential to further improve comprehensive discovery and characterization of small proteins.

Keywords: LC-MS/MS; SEP; genome annotation; microprotein; proteomics; sample preparation; shotgun proteomics; small protein; sproteins; top-down proteomics.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Archaea / genetics
  • Archaea / metabolism*
  • Archaeal Proteins / chemistry
  • Archaeal Proteins / genetics
  • Archaeal Proteins / metabolism
  • Bacteria / genetics
  • Bacteria / metabolism*
  • Bacterial Proteins / chemistry*
  • Bacterial Proteins / genetics
  • Bacterial Proteins / metabolism*
  • Computational Biology
  • Gene Expression Regulation, Archaeal / physiology
  • Gene Expression Regulation, Bacterial / physiology
  • Mass Spectrometry / methods*

Substances

  • Archaeal Proteins
  • Bacterial Proteins