A probabilistic model of nuclear import of proteins

Bioinformatics. 2011 May 1;27(9):1239-46. doi: 10.1093/bioinformatics/btr121. Epub 2011 Mar 3.

Abstract

Motivation: Nucleo-cytoplasmic trafficking of proteins is a core regulatory process that sustains the integrity of the nuclear space of eukaryotic cells via an interplay between numerous factors. Despite progress on experimentally characterizing a number of nuclear localization signals, their presence alone remains an unreliable indicator of actual translocation.

Results: This article introduces a probabilistic model that explicitly recognizes a variety of nuclear localization signals, and integrates relevant amino acid sequence and interaction data for any candidate nuclear protein. In particular, we develop and incorporate scoring functions based on distinct classes of classical nuclear localization signals. Our empirical results show that the model accurately predicts whether a protein is imported into the nucleus, surpassing the classification accuracy of similar predictors when evaluated on the mouse and yeast proteomes (area under the receiver operator characteristic curve of 0.84 and 0.80, respectively). The model also predicts the sequence position of a nuclear localization signal and whether it interacts with importin-α.

Availability: http://pprowler.itee.uq.edu.au/NucImport

MeSH terms

  • Active Transport, Cell Nucleus
  • Amino Acid Sequence
  • Animals
  • Bayes Theorem
  • Cell Nucleus / metabolism*
  • Mice
  • Models, Biological
  • Nuclear Localization Signals / metabolism*
  • Nuclear Proteins / metabolism*
  • Protein Interaction Mapping
  • Saccharomyces cerevisiae / metabolism
  • Support Vector Machine
  • alpha Karyopherins / metabolism

Substances

  • Nuclear Localization Signals
  • Nuclear Proteins
  • alpha Karyopherins