Significantly improved prediction of subcellular localization by integrating text and protein sequence data

Pac Symp Biocomput. 2006:16-27.

Abstract

Computational prediction of protein subcellular localization is a challenging problem. Several approaches have been presented during the past few years; some attempt to cover a wide variety of localizations, while others focus on a small number of localizations and on specific organisms. We present a comprehensive system, integrating protein sequence-derived data and text-based information. Itis tested on three large data sets, previously used by leading prediction methods. The results demonstrate that our system performs significantly better than previously reported results, for a wide range of eukaryotic subcellular localizations.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence
  • Computer Simulation*
  • Databases, Protein
  • Models, Biological*
  • Proteins / genetics*
  • Proteins / metabolism*
  • Subcellular Fractions / metabolism*

Substances

  • Proteins