pLoc-mVirus: Predict subcellular localization of multi-location virus proteins via incorporating the optimal GO information into general PseAAC

Gene. 2017 Sep 10:628:315-321. doi: 10.1016/j.gene.2017.07.036. Epub 2017 Jul 18.

Abstract

Knowledge of subcellular locations of proteins is crucially important for in-depth understanding their functions in a cell. With the explosive growth of protein sequences generated in the postgenomic age, it is highly demanded to develop computational tools for timely annotating their subcellular locations based on the sequence information alone. The current study is focused on virus proteins. Although considerable efforts have been made in this regard, the problem is far from being solved yet. Most existing methods can be used to deal with single-location proteins only. Actually, proteins with multi-locations may have some special biological functions. This kind of multiplex proteins is particularly important for both basic research and drug design. Using the multi-label theory, we present a new predictor called "pLoc-mVirus" by extracting the optimal GO (Gene Ontology) information into the general PseAAC (Pseudo Amino Acid Composition). Rigorous cross-validation on a same stringent benchmark dataset indicated that the proposed pLoc-mVirus predictor is remarkably superior to iLoc-Virus, the state-of-the-art method in predicting virus protein subcellular localization. To maximize the convenience of most experimental scientists, a user-friendly web-server for the new predictor has been established at http://www.jci-bioinfo.cn/pLoc-mVirus/, by which users can easily get their desired results without the need to go through the complicated mathematics involved.

Keywords: GO; Multi-label system; PseAAC, ML-GKR, Chou's metrics.

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Databases, Protein
  • Gene Ontology*
  • Intracellular Space
  • Protein Transport
  • Viral Proteins / genetics
  • Viral Proteins / metabolism*
  • Web Browser

Substances

  • Viral Proteins