TIMER is a Siamese neural network-based framework for identifying both general and species-specific bacterial promoters

Brief Bioinform. 2023 Jul 20;24(4):bbad209. doi: 10.1093/bib/bbad209.

Abstract

Background: Promoters are DNA regions that initiate the transcription of specific genes near the transcription start sites. In bacteria, promoters are recognized by RNA polymerases and associated sigma factors. Effective promoter recognition is essential for synthesizing the gene-encoded products by bacteria to grow and adapt to different environmental conditions. A variety of machine learning-based predictors for bacterial promoters have been developed; however, most of them were designed specifically for a particular species. To date, only a few predictors are available for identifying general bacterial promoters with limited predictive performance.

Results: In this study, we developed TIMER, a Siamese neural network-based approach for identifying both general and species-specific bacterial promoters. Specifically, TIMER uses DNA sequences as the input and employs three Siamese neural networks with the attention layers to train and optimize the models for a total of 13 species-specific and general bacterial promoters. Extensive 10-fold cross-validation and independent tests demonstrated that TIMER achieves a competitive performance and outperforms several existing methods on both general and species-specific promoter prediction. As an implementation of the proposed method, the web server of TIMER is publicly accessible at http://web.unimelb-bioinfortools.cloud.edu.au/TIMER/.

Keywords: Siamese neural networks; bacterial promoter; bioinformatics; deep learning; machine learning; prediction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacteria* / genetics
  • Bacteria* / metabolism
  • Base Sequence
  • DNA-Directed RNA Polymerases / genetics
  • DNA-Directed RNA Polymerases / metabolism
  • Neural Networks, Computer*
  • Promoter Regions, Genetic

Substances

  • DNA-Directed RNA Polymerases