Recent Trends on the Development of Machine Learning Approaches for the Prediction of Lysine Acetylation Sites

Curr Med Chem. 2022;29(2):235-250. doi: 10.2174/0929867328999210902125308.

Abstract

Acetylation on lysine residues is considered one of the most potent protein post-translational modifications, owing to its crucial role in cellular metabolism and regulatory processes. Recent advances in experimental techniques have unraveled several lysine acetylation substrates and sites. However, owing to its cost-ineffectiveness, cumbersome process, time-consumption, and labor-intensiveness, several efforts have been geared towards the development of computational tools. In particular, machine learning (ML)-based approaches hold great promise in the rapid discovery of lysine acetylation modification sites, which could be witnessed by the growing number of prediction tools. Recently, several ML methods have been developed for the prediction of lysine acetylation sites, owing to their time- and cost-effectiveness. In this review, we present a complete survey of the state-of-the-art ML predictors for lysine acetylation. We discuss a variety of key aspects for developing a successful predictor, including operating ML algorithms, feature selection methods, validation techniques, and software utility. Initially, we review lysine acetylation site databases, current ML approaches, working principles, and their performances. Lastly, we discuss the shortcomings and future directions of ML approaches in the prediction of lysine acetylation sites. This review may act as a useful guide for the experimentalists in choosing the right ML tool for their research. Moreover, it may help bioinformaticians in the development of more accurate and advanced MLbased predictors in protein research.

Keywords: Protein; acetylation; feature encoding; lysine; machine learning; post-translational modification; prediction model..

Publication types

  • Review

MeSH terms

  • Acetylation
  • Computational Biology*
  • Humans
  • Lysine*
  • Machine Learning
  • Protein Processing, Post-Translational

Substances

  • Lysine