IILLS: predicting virus-receptor interactions based on similarity and semi-supervised learning

BMC Bioinformatics. 2019 Dec 27;20(Suppl 23):651. doi: 10.1186/s12859-019-3278-3.

Abstract

Background: Viral infectious diseases are the serious threat for human health. The receptor-binding is the first step for the viral infection of hosts. To more effectively treat human viral infectious diseases, the hidden virus-receptor interactions must be discovered. However, current computational methods for predicting virus-receptor interactions are limited.

Result: In this study, we propose a new computational method (IILLS) to predict virus-receptor interactions based on Initial Interaction scores method via the neighbors and the Laplacian regularized Least Square algorithm. IILLS integrates the known virus-receptor interactions and amino acid sequences of receptors. The similarity of viruses is calculated by the Gaussian Interaction Profile (GIP) kernel. On the other hand, we also compute the receptor GIP similarity and the receptor sequence similarity. Then the sequence similarity is used as the final similarity of receptors according to the prediction results. The 10-fold cross validation (10CV) and leave one out cross validation (LOOCV) are used to assess the prediction performance of our method. We also compare our method with other three competing methods (BRWH, LapRLS, CMF).

Conlusion: The experiment results show that IILLS achieves the AUC values of 0.8675 and 0.9061 with the 10-fold cross validation and leave-one-out cross validation (LOOCV), respectively, which illustrates that IILLS is superior to the competing methods. In addition, the case studies also further indicate that the IILLS method is effective for the virus-receptor interaction prediction.

Keywords: Gaussian interaction profile (GIP) kernel; Laplacian regularized least squares classifier; Semi-supervised learning; Similarity; Virus-receptor interaction.

MeSH terms

  • Algorithms
  • Humans
  • Least-Squares Analysis
  • ROC Curve
  • Receptors, Virus / metabolism*
  • Reproducibility of Results
  • Software*
  • Supervised Machine Learning*
  • Viruses / metabolism*

Substances

  • Receptors, Virus