Integrating temporal and spatial variabilities for identifying ion binding proteins in phage

J Bioinform Comput Biol. 2023 Jun;21(3):2350010. doi: 10.1142/S0219720023500105. Epub 2023 Jun 15.

Abstract

Recent studies reported that ion binding proteins (IBPs) in phage play a key role in developing drugs to treat diseases caused by drug-resistant bacteria. Therefore, correct recognition of IBPs is an urgent task, which is beneficial for understanding their biological functions. To explore this issue, a new computational model was developed to identify IBPs in this study. First, we used the physicochemical (PC) property and Pearson's correlation coefficient (PCC) to denote protein sequences, and the temporal and spatial variabilities were employed to extract features. Next, a similarity network fusion algorithm was employed to capture the correlation characteristics between these two different kinds of features. Then, a feature selection method called F-score was utilized to remove the influence of redundant and irrelative information. Finally, these reserved features were fed into support vector machine (SVM) to discriminate IBPs from non-IBPs. Experimental results showed that the proposed method has significant improvement in the classification performance, as compared with the state-of-the-art approach. The Matlab codes and dataset used in this study are available at https://figshare.com/articles/online_resource/iIBP-TSV/21779567 for academic use.

Keywords: F-score; Ion binding proteins; Pearson’s correlation coefficient; physicochemical property; support vector machine.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Bacteriophages*
  • Carrier Proteins*
  • Proteins
  • Support Vector Machine

Substances

  • Carrier Proteins
  • Proteins