Machine learning on protein-protein interaction prediction: models, challenges and trends

Brief Bioinform. 2023 Mar 19;24(2):bbad076. doi: 10.1093/bib/bbad076.

Abstract

Protein-protein interactions (PPIs) carry out the cellular processes of all living organisms. Experimental methods for PPI detection suffer from high cost and false-positive rate, hence efficient computational methods are highly desirable for facilitating PPI detection. In recent years, benefiting from the enormous amount of protein data produced by advanced high-throughput technologies, machine learning models have been well developed in the field of PPI prediction. In this paper, we present a comprehensive survey of the recently proposed machine learning-based prediction methods. The machine learning models applied in these methods and details of protein data representation are also outlined. To understand the potential improvements in PPI prediction, we discuss the trend in the development of machine learning-based methods. Finally, we highlight potential directions in PPI prediction, such as the use of computationally predicted protein structures to extend the data source for machine learning models. This review is supposed to serve as a companion for further improvements in this field.

Keywords: computational PPI prediction; deep learning; machine learning; protein–protein interaction.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods
  • Machine Learning*
  • Protein Interaction Mapping* / methods
  • Proteins / metabolism

Substances

  • Proteins