Accurate Identification of Antioxidant Proteins Based on a Combination of Machine Learning Techniques and Hidden Markov Model Profiles

Comput Math Methods Med. 2021 Aug 7:2021:5770981. doi: 10.1155/2021/5770981. eCollection 2021.

Abstract

Antioxidant proteins (AOPs) play important roles in the management and prevention of several human diseases due to their ability to neutralize excess free radicals. However, the identification of AOPs by using wet-lab experimental techniques is often time-consuming and expensive. In this study, we proposed an accurate computational model, called AOP-HMM, to predict AOPs by extracting discriminatory evolutionary features from hidden Markov model (HMM) profiles. First, auto cross-covariance (ACC) variables were applied to transform the HMM profiles into fixed-length feature vectors. Then, we performed the analysis of variance (ANOVA) method to reduce the dimensionality of the raw feature space. Finally, a support vector machine (SVM) classifier was adopted to conduct the prediction of AOPs. To comprehensively evaluate the performance of the proposed AOP-HMM model, the 10-fold cross-validation (CV), the jackknife CV, and the independent test were carried out on two widely used benchmark datasets. The experimental results demonstrated that AOP-HMM outperformed most of the existing methods and could be used to quickly annotate AOPs and guide the experimental process.

MeSH terms

  • Algorithms
  • Amino Acids / analysis
  • Antioxidants / chemistry*
  • Antioxidants / classification
  • Computational Biology
  • Databases, Protein / statistics & numerical data
  • Evolution, Molecular
  • Humans
  • Machine Learning*
  • Markov Chains
  • Peroxiredoxins / chemistry*
  • Peroxiredoxins / classification
  • Proteins / chemistry*
  • Proteins / classification

Substances

  • Amino Acids
  • Antioxidants
  • Proteins
  • Peroxiredoxins