SigUNet: signal peptide recognition based on semantic segmentation

BMC Bioinformatics. 2019 Dec 20;20(Suppl 24):677. doi: 10.1186/s12859-019-3245-z.

Abstract

Background: Signal peptides play an important role in protein sorting, which is the mechanism whereby proteins are transported to their destination. Recognition of signal peptides is an important first step in determining the active locations and functions of proteins. Many computational methods have been proposed to facilitate signal peptide recognition. In recent years, deep learning methods have produced significant advances in many research fields. However, most existing models for signal peptide recognition use one-hidden-layer neural networks or hidden Markov models, which are relatively simple in comparison with the deep neural networks that are used in other fields.

Results: This study proposes a convolutional neural network without fully connected layers, a design that has been an important network improvement in computer vision. The proposed network is more complex than current signal peptide predictors. The experimental results show that the proposed network outperforms current signal peptide predictors on eukaryotic data. This study also demonstrates how model reduction and data augmentation help the proposed network to make predictions on bacterial data.
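To illustrate what "without fully connected layers" means for sequence data, the following is a minimal sketch of a fully convolutional per-residue labeler. It is not the authors' architecture: the layer sizes, the kernel widths, the two-class output (signal peptide vs. other), the random untrained weights, and the helper names `one_hot`, `conv1d_same` and `segment` are all assumptions chosen for illustration. The point it shows is only that a network built purely from convolutions yields one prediction per residue and accepts sequences of any length.

```python
import numpy as np

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
AA_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def one_hot(seq):
    """Encode a protein sequence as a (length, 20) one-hot matrix."""
    x = np.zeros((len(seq), len(AMINO_ACIDS)))
    for pos, aa in enumerate(seq):
        x[pos, AA_INDEX[aa]] = 1.0
    return x


def conv1d_same(x, kernel, bias):
    """1D convolution, zero-padded so the output length equals the input length.

    x: (length, in_ch); kernel: (width, in_ch, out_ch) with odd width;
    bias: (out_ch,).
    """
    width = kernel.shape[0]
    pad = width // 2
    padded = np.pad(x, ((pad, pad), (0, 0)))
    out = np.empty((x.shape[0], kernel.shape[2]))
    for pos in range(x.shape[0]):
        window = padded[pos:pos + width]  # (width, in_ch)
        out[pos] = np.tensordot(window, kernel, axes=([0, 1], [0, 1])) + bias
    return out


def segment(seq, params):
    """Return per-residue class probabilities, shape (len(seq), n_classes)."""
    h = one_hot(seq)
    for kernel, bias in params[:-1]:
        h = np.maximum(conv1d_same(h, kernel, bias), 0.0)  # ReLU
    kernel, bias = params[-1]
    logits = conv1d_same(h, kernel, bias)
    logits = logits - logits.max(axis=1, keepdims=True)  # stable softmax
    probs = np.exp(logits)
    return probs / probs.sum(axis=1, keepdims=True)


# Hypothetical, untrained weights: one width-5 layer, then a width-1 classifier.
rng = np.random.default_rng(0)
params = [
    (rng.normal(scale=0.1, size=(5, 20, 8)), np.zeros(8)),
    (rng.normal(scale=0.1, size=(1, 8, 2)), np.zeros(2)),
]

probs = segment("MKTLLLAVAVLSASA", params)
print(probs.shape)  # one row of class probabilities per residue
```

Because no layer depends on a fixed input size, the same `params` can be applied to a sequence of any length without padding or truncation.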

Conclusions: The study makes three contributions to this subject: (a) an accurate signal peptide recognizer is developed, (b) the potential to leverage advanced networks from other fields is demonstrated, and (c) important modifications are proposed for adopting complex networks in signal peptide recognition.

Keywords: Deep learning; Semantic segmentation; Signal peptide.

MeSH terms

  • Deep Learning
  • Neural Networks, Computer
  • Protein Sorting Signals
  • Semantics*
  • Software

Substances

  • Protein Sorting Signals