TIPred: a novel stacked ensemble approach for the accelerated discovery of tyrosinase inhibitory peptides

BMC Bioinformatics. 2023 Sep 21;24(1):356. doi: 10.1186/s12859-023-05463-1.

Abstract

Background: Tyrosinase is an enzyme involved in melanin production in the skin. Several hyperpigmentation disorders involve the overproduction of melanin and instability of tyrosinase activity resulting in darker, discolored patches on the skin. Therefore, discovering tyrosinase inhibitory peptides (TIPs) is of great significance for basic research and clinical treatments. However, the identification of TIPs using experimental methods is generally cost-ineffective and time-consuming.

Results: Herein, a stacked ensemble learning approach, called TIPred, is proposed for the accurate and quick identification of TIPs by using sequence information. TIPred explored a comprehensive set of various baseline models derived from well-known machine learning (ML) algorithms and heterogeneous feature encoding schemes from multiple perspectives, such as chemical structure properties, physicochemical properties, and composition information. Subsequently, 130 baseline models were trained and optimized to create new probabilistic features. Finally, the feature selection approach was utilized to determine the optimal feature vector for developing TIPred. Both tenfold cross-validation and independent test methods were employed to assess the predictive capability of TIPred by using the stacking strategy. Experimental results showed that TIPred significantly outperformed the state-of-the-art method in terms of the independent test, with an accuracy of 0.923, MCC of 0.757 and an AUC of 0.977.

Conclusions: The proposed TIPred approach could be a valuable tool for rapidly discovering novel TIPs and effectively identifying potential TIP candidates for follow-up experimental validation. Moreover, an online webserver of TIPred is publicly available at http://pmlabstack.pythonanywhere.com/TIPred .

Keywords: Bioinformatics; Feature selection; Machine learning; Sequence analysis; Stacking strategy; Tyrosinase inhibitory peptides.

MeSH terms

  • Algorithms
  • Machine Learning
  • Melanins*
  • Monophenol Monooxygenase*
  • Peptides

Substances

  • Melanins
  • Monophenol Monooxygenase
  • Peptides