Predicting enhancer-promoter interactions by deep learning and matching heuristic

Brief Bioinform. 2021 Jul 20;22(4):bbaa254. doi: 10.1093/bib/bbaa254.

Abstract

Enhancer-promoter interactions (EPIs) play an important role in transcriptional regulation. Recently, machine learning-based methods have been widely used in the genome-scale identification of EPIs due to their promising predictive performance. In this paper, we propose a novel method, termed EPI-DLMH, for predicting EPIs with the use of DNA sequences only. EPI-DLMH consists of three major steps. First, a two-layer convolutional neural network is used to learn local features, and an bidirectional gated recurrent unit network is used to capture long-range dependencies on the sequences of promoters and enhancers. Second, an attention mechanism is used for focusing on relatively important features. Finally, a matching heuristic mechanism is introduced for the exploration of the interaction between enhancers and promoters. We use benchmark datasets in evaluating and comparing the proposed method with existing methods. Comparative results show that our model is superior to currently existing models in multiple cell lines. Specifically, we found that the matching heuristic mechanism introduced into the proposed model mainly contributes to the improvement of performance in terms of overall accuracy. Additionally, compared with existing models, our model is more efficient with regard to computational speed.

Keywords: DNA sequence; deep learning; enhancer-promoter interactions; matching heuristic; pretraining.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology
  • Deep Learning*
  • Enhancer Elements, Genetic*
  • HeLa Cells
  • Heuristics
  • Human Umbilical Vein Endothelial Cells
  • Humans
  • K562 Cells
  • Models, Genetic*
  • Promoter Regions, Genetic*