Prediction of Protein-Protein Interaction Sites Using Convolutional Neural Network and Improved Data Sets

Int J Mol Sci. 2020 Jan 11;21(2):467. doi: 10.3390/ijms21020467.

Abstract

Protein-protein interaction (PPI) sites play a key role in the formation of protein complexes, which is the basis of a variety of biological processes. Experimental methods for determining PPI sites are expensive and time-consuming, which has led to the development of many kinds of prediction algorithms. We propose a convolutional neural network for PPI site prediction and use residue binding propensity to improve the positive samples. Our method achieves an area under the curve (AUC) of 0.912 on the improved data set. In addition, it yields much better results on samples with high binding propensity than on randomly selected samples. This suggests that the positive samples defined by the distance between residue atoms contain a considerable number of false-positive PPI sites.
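
For readers unfamiliar with the AUC metric reported above, the following is a minimal sketch of how it can be computed for a binary per-residue classifier. It is not the authors' evaluation code: the function name `roc_auc`, the label encoding (1 = interaction site, 0 = non-site), and the toy scores are all assumptions made for illustration. The sketch uses the standard rank interpretation of AUC, namely the probability that a randomly chosen positive residue receives a higher score than a randomly chosen negative one.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: iterable of 0/1 (assumed encoding: 1 = PPI site, 0 = non-site)
    scores: iterable of predicted scores, higher = more likely a site
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")

    # Count positive/negative pairs ranked correctly; ties count as half.
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: four residues, two labelled as sites.
toy_labels = [1, 1, 0, 0]
toy_scores = [0.9, 0.4, 0.6, 0.1]
toy_auc = roc_auc(toy_labels, toy_scores)
```

The pairwise loop is quadratic in the number of residues, which is fine for illustration. A sort-based implementation would be preferable for full data sets.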

Keywords: convolutional neural network; protein–protein interaction sites; residue binding propensity.

MeSH terms

  • Animals
  • Binding Sites
  • Datasets as Topic / standards
  • Humans
  • Neural Networks, Computer*
  • Protein Binding
  • Protein Interaction Mapping / methods*
  • Protein Interaction Mapping / standards
  • Reproducibility of Results