pSSbond-PseAAC: Prediction of disulfide bonding sites by integration of PseAAC and statistical moments

J Theor Biol. 2019 Feb 21:463:47-55. doi: 10.1016/j.jtbi.2018.12.015. Epub 2018 Dec 12.

Abstract

The structure of protein gains additional stability against various detrimental effects by the presence of disulfide bonds. The formation of correct disulfide bonds between cysteine residues ensures proper in vivo and in vitro folding of the protein. Many cysteine residues can be present in the polypeptide chain of a protein, however, not all cysteine residues are involved in the formation of a disulfide bond, and therefore, accurate prediction of these bonds is crucial for identifying biophysical characteristics of a protein. In the present study, a novel method is proposed for the prediction of intramolecular disulfide bonds accurately using statistical moments and PseAAC. The pSSbond-PseAAC uses PseAAC along with position and composition relative features to calculate statistical moments. Statistical moments are important as they are very sensitive regarding the position of data sequences and for prediction of intramolecular disulfide bonds, moments are combined together to train neural networks. The overall accuracy of the pSSbond-PseAAC is 98.97% to sensitivity value 98.92%, specificity 98.99% and 0.98 MCC; and it outperforms various previously reported studies.

Keywords: 10-fold Cross Validation; 5-step rule; Neural networks; Position relative features.

MeSH terms

  • Computational Biology / methods
  • Cysteine / metabolism*
  • Disulfides / chemistry*
  • Neural Networks, Computer
  • Proteins / chemistry*
  • Supervised Machine Learning

Substances

  • Disulfides
  • Proteins
  • Cysteine