pcPromoter-CNN: A CNN-Based Prediction and Classification of Promoters

Genes (Basel). 2020 Dec 21;11(12):1529. doi: 10.3390/genes11121529.

Abstract

A promoter is a small region within the DNA structure that has an important role in initiating transcription of a specific gene in the genome. Different types of promoters are recognized by their different functions. Due to the importance of promoter functions, computational tools for the prediction and classification of promoters are highly desired. Promoters resemble each other; therefore, their precise classification is an important challenge. In this study, we propose a convolutional neural network (CNN)-based tool, pcPromoter-CNN, for predicting promoters and classifying them into the subclasses σ70, σ54, σ38, σ32, σ28 and σ24. This CNN-based tool uses a one-hot encoding scheme for promoter classification. The tool's architecture was trained and tested on a benchmark dataset. To evaluate its classification performance, we used four evaluation metrics. The model exhibited notable improvement over existing state-of-the-art tools.
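The one-hot encoding scheme mentioned above can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `one_hot_encode`, the A, C, G, T channel order, and the all-zero rows for ambiguous bases are assumptions made for this example, since the abstract specifies none of them.

```python
# Assumed channel order; the abstract does not state which order the tool uses.
NUCLEOTIDES = "ACGT"


def one_hot_encode(sequence):
    """Encode a DNA string as a list of 4-element rows, one row per base.

    Each row holds a single 1 in the column of its nucleotide. Characters
    outside A/C/G/T (for example N) become all-zero rows, a choice made
    for this sketch only.
    """
    index = {base: i for i, base in enumerate(NUCLEOTIDES)}
    encoded = []
    for base in sequence.upper():
        row = [0] * len(NUCLEOTIDES)
        column = index.get(base)
        if column is not None:
            row[column] = 1
        encoded.append(row)
    return encoded


# Hypothetical six-base input, used only to show the output shape.
example = one_hot_encode("TATAAT")
```

A sequence of length L becomes an L x 4 matrix, which is the kind of input a one-dimensional convolutional layer can scan along the sequence axis.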

Keywords: bioinformatics; computational biology; convolution neural network (CNN); non-promoters; promoters.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Benchmarking
  • Classification / methods
  • DNA, Bacterial / genetics
  • Datasets as Topic
  • Escherichia coli K12 / genetics
  • Genes, Bacterial
  • Models, Genetic*
  • Neural Networks, Computer*
  • Promoter Regions, Genetic*

Substances

  • DNA, Bacterial