iHyd-PseCp: Identify hydroxyproline and hydroxylysine in proteins by incorporating sequence-coupled effects into general PseAAC

Oncotarget. 2016 Jul 12;7(28):44310-44321. doi: 10.18632/oncotarget.10027.

Abstract

Protein hydroxylation is a posttranslational modification (PTM), in which a CH group in Pro (P) or Lys (K) residue has been converted into a COH group, or a hydroxyl group (-OH) is converted into an organic compound. Closely associated with cellular signaling activities, this type of PTM is also involved in some major diseases, such as stomach cancer and lung cancer. Therefore, from the angles of both basic research and drug development, we are facing a challenging problem: for an uncharacterized protein sequence containing many residues of P or K, which ones can be hydroxylated, and which ones cannot? With the explosive growth of protein sequences in the post-genomic age, the problem has become even more urgent. To address such a problem, we have developed a predictor called iHyd-PseCp by incorporating the sequence-coupled information into the general pseudo amino acid composition (PseAAC) and introducing the "Random Forest" algorithm to operate the calculation. Rigorous jackknife tests indicated that the new predictor remarkably outperformed the existing state-of-the-art prediction method for the same purpose. For the convenience of most experimental scientists, a user-friendly web-server for iHyd-PseCp has been established at http://www.jci-bioinfo.cn/iHyd-PseCp, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved.

Keywords: PTMs; general PseAAC; hydroxylysine; hydroxyproline; sequence-coupling model.

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Datasets as Topic
  • Humans
  • Hydroxylation
  • Hydroxylysine / chemistry
  • Hydroxylysine / metabolism*
  • Hydroxyproline / chemistry
  • Hydroxyproline / metabolism*
  • Models, Chemical*
  • Protein Processing, Post-Translational*
  • Proteins / chemistry
  • Proteins / metabolism*

Substances

  • Proteins
  • Hydroxylysine
  • Hydroxyproline