NTyroSite: Computational Identification of Protein Nitrotyrosine Sites Using Sequence Evolutionary Features

Molecules. 2018 Jul 9;23(7):1667. doi: 10.3390/molecules23071667.

Abstract

Nitrotyrosine is a product of tyrosine nitration mediated by reactive nitrogen species. As an indicator of cell damage and inflammation, protein nitrotyrosine can reveal biological changes associated with various diseases or oxidative stress. Accurate identification of nitrotyrosine sites provides an important foundation for further elucidating the mechanism of protein nitrotyrosination. However, experimental identification of nitrotyrosine sites through traditional methods is laborious and expensive. In silico prediction of nitrotyrosine sites based on protein sequence information is thus highly desirable. Here, we report a novel predictor, NTyroSite, for accurate prediction of nitrotyrosine sites using sequence evolutionary information. The generated features were optimized using a Wilcoxon rank-sum test. A random forest classifier was then trained on these features to build the predictor. The final NTyroSite predictor achieved an area under the receiver operating characteristic curve (AUC) of 0.904 in a 10-fold cross-validation test. It also significantly outperformed other existing implementations in an independent test. In addition, to make the prediction model easier to interpret, the predominant rules and informative features were extracted from the NTyroSite model to explain the prediction results. We expect that the NTyroSite predictor may serve as a useful computational resource for high-throughput nitrotyrosine site prediction. The online interface of the software is publicly available at https://biocomputer.bio.cuhk.edu.hk/NTyroSite/.
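
The feature-optimization step described above can be illustrated with a minimal sketch of rank-sum filtering. This is not the NTyroSite implementation: the function names, the significance threshold `alpha=0.05`, the normal approximation without tie correction, and the toy feature matrix are all assumptions made for illustration. The sketch only shows how a two-sided Wilcoxon rank-sum test could be used to keep features whose values differ between nitrated (label 1) and non-nitrated (label 0) sites before a classifier such as a random forest is trained.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def rank_sum_test(pos, neg):
    """Two-sided Wilcoxon rank-sum test (normal approximation, no tie
    correction; an illustrative simplification). Returns (z, p_value)."""
    n1, n2 = len(pos), len(neg)
    ranks = average_ranks(list(pos) + list(neg))
    w = sum(ranks[:n1])                       # rank sum of the positive class
    mean = n1 * (n1 + n2 + 1) / 2.0           # expected rank sum under H0
    sd = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    z = (w - mean) / sd
    p = math.erfc(abs(z) / math.sqrt(2.0))    # equals 2 * (1 - Phi(|z|))
    return z, p


def select_features(X, y, alpha=0.05):
    """Return indices of columns whose values differ between the classes.
    alpha is a hypothetical threshold, not one taken from the paper."""
    keep = []
    for col in range(len(X[0])):
        pos = [row[col] for row, label in zip(X, y) if label == 1]
        neg = [row[col] for row, label in zip(X, y) if label == 0]
        _, p = rank_sum_test(pos, neg)
        if p < alpha:
            keep.append(col)
    return keep


# Toy data (made up): column 0 separates the classes, column 1 does not.
X_toy = [[10, 1], [11, 4], [12, 5], [13, 8], [14, 9], [15, 12],
         [1, 2], [2, 3], [3, 6], [4, 7], [5, 10], [6, 11]]
y_toy = [1] * 6 + [0] * 6
selected = select_features(X_toy, y_toy)  # selected == [0] for this toy data
```

In a fuller pipeline, the selected columns would be passed to a random forest and scored by cross-validated AUC; that part is omitted here to keep the sketch dependency-free.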

Keywords: Wilcoxon rank-sum test; post-translational modification; random forest; rule extraction.

MeSH terms

  • Area Under Curve
  • Computational Biology / methods*
  • Computer Simulation
  • Protein Processing, Post-Translational
  • Proteins / chemistry*
  • ROC Curve
  • Software
  • Tyrosine / analogs & derivatives*
  • Tyrosine / chemistry

Substances

  • Proteins
  • 3-nitrotyrosine
  • Tyrosine