Small proteins in bacteria - Big challenges in prediction and identification

Proteomics. 2023 Dec;23(23-24):e2200421. doi: 10.1002/pmic.202200421. Epub 2023 Aug 23.

Abstract

Proteins with up to 100 amino acids have been largely overlooked due to the challenges associated with predicting and identifying them using traditional methods. Recent advances in bioinformatics and machine learning, DNA sequencing, RNA and Ribo-seq technologies, and mass spectrometry (MS) have greatly facilitated the detection and characterisation of these elusive proteins in recent years. This has revealed their crucial role in various cellular processes including regulation, signalling and transport, as toxins and as folding helpers for protein complexes. Consequently, the systematic identification and characterisation of these proteins in bacteria have emerged as a prominent field of interest within the microbial research community. This review provides an overview of different strategies for predicting and identifying these proteins on a large scale, leveraging the power of these advanced technologies. Furthermore, the review offers insights into the future developments that may be expected in this field.

Keywords: bioinformatics; bottom-up proteomics; databases; mass spectrometry; protein identification; proteogenomics; top-down proteomics.

Publication types

  • Review

MeSH terms

  • Computational Biology* / methods
  • Mass Spectrometry / methods
  • Proteins* / metabolism

Substances

  • Proteins