iPromoter-BnCNN: a novel branched CNN-based predictor for identifying and classifying sigma promoters

Bioinformatics. 2020 Dec 8;36(19):4869-4875. doi: 10.1093/bioinformatics/btaa609.

Abstract

Motivation: Promoter is a short region of DNA which is responsible for initiating transcription of specific genes. Development of computational tools for automatic identification of promoters is in high demand. According to the difference of functions, promoters can be of different types. Promoters may have both intra- and interclass variation and similarity in terms of consensus sequences. Accurate classification of various types of sigma promoters still remains a challenge.

Results: We present iPromoter-BnCNN for identification and accurate classification of six types of promoters-σ24,σ28,σ32,σ38,σ54,σ70. It is a CNN-based classifier which combines local features related to monomer nucleotide sequence, trimer nucleotide sequence, dimer structural properties and trimer structural properties through the use of parallel branching. We conducted experiments on a benchmark dataset and compared with six state-of-the-art tools to show our supremacy on 5-fold cross-validation. Moreover, we tested our classifier on an independent test dataset.

Availability and implementation: Our proposed tool iPromoter-BnCNN web server is freely available at http://103.109.52.8/iPromoter-BnCNN. The runnable source code can be found https://colab.research.google.com/drive/1yWWh7BXhsm8U4PODgPqlQRy23QGjF2DZ.

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • DNA
  • Promoter Regions, Genetic
  • Sequence Analysis, DNA
  • Sigma Factor*
  • Software*

Substances

  • Sigma Factor
  • DNA