CENTRE: a gradient boosting algorithm for Cell-type-specific ENhancer-Target pREdiction

Bioinformatics. 2023 Nov 1;39(11):btad687. doi: 10.1093/bioinformatics/btad687.

Abstract

Motivation: Identifying the target promoters of active enhancers is a crucial step toward understanding gene regulation and deciphering phenotypes and diseases. To date, several computational methods have been developed to predict enhancer-gene interactions, but they require either many epigenomic and transcriptomic experimental assays to generate cell-type (CT)-specific predictions, or a single experiment applied to a large cohort of CTs to extract correlations between the activities of regulatory elements. Inferring CT-specific enhancer-gene interactions in unstudied or poorly annotated CTs is therefore laborious and costly.

Results: Here, we aim to infer CT-specific enhancer-target interactions using minimal experimental input. We introduce Cell-type-specific ENhancer-Target pREdiction (CENTRE), a machine learning framework that predicts enhancer-target interactions in a CT-specific manner, using only gene expression and ChIP-seq data for three histone modifications for the CT of interest. CENTRE exploits the wealth of available datasets and extracts CT-agnostic statistics to complement the CT-specific information. CENTRE is thoroughly tested across many datasets and CTs and achieves performance equivalent or superior to that of existing algorithms that require massive amounts of experimental data.
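
As an illustration of the general idea in the Results paragraph, the following minimal sketch trains a toy gradient boosting classifier on feature vectors that combine CT-specific signals with a CT-agnostic statistic. It is not CENTRE's implementation, which is in the repository linked under Availability and implementation. The feature names, the toy values, the labels, the use of decision stumps, and all function names are assumptions made for this example only.

```python
import math

# Hypothetical feature layout for one enhancer-gene pair (an assumption for
# this sketch, not CENTRE's actual feature set):
#   [histone_mark_1, histone_mark_2, histone_mark_3, gene_expression,
#    cross_ct_correlation]
# The first four stand in for CT-specific inputs, the last for a CT-agnostic
# statistic precomputed from many other cell types.
X = [
    [2.1, 1.8, 3.0, 5.2, 0.81],
    [1.9, 2.2, 2.7, 4.8, 0.74],
    [2.5, 1.5, 3.3, 6.1, 0.69],
    [1.7, 2.0, 2.9, 3.9, 0.77],
    [0.3, 0.4, 2.8, 4.5, 0.12],
    [0.2, 0.6, 0.5, 0.3, 0.05],
    [0.5, 0.3, 3.1, 5.0, -0.10],
    [0.4, 0.2, 0.4, 0.2, 0.20],
]
y = [1, 1, 1, 1, 0, 0, 0, 0]  # toy labels: 1 = interacting pair, 0 = not


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_stump(X, residuals):
    """Return the one-feature threshold split with the lowest squared error.

    Assumes at least one feature takes two or more distinct values.
    """
    best = None
    for j in range(len(X[0])):
        values = sorted(set(row[j] for row in X))
        for lo, hi in zip(values, values[1:]):
            thr = (lo + hi) / 2.0
            left = [r for row, r in zip(X, residuals) if row[j] <= thr]
            right = [r for row, r in zip(X, residuals) if row[j] > thr]
            lmean = sum(left) / len(left)
            rmean = sum(right) / len(right)
            sse = (sum((r - lmean) ** 2 for r in left)
                   + sum((r - rmean) ** 2 for r in right))
            if best is None or sse < best[0]:
                best = (sse, j, thr, lmean, rmean)
    return best[1:]


def stump_predict(stump, row):
    j, thr, lmean, rmean = stump
    return lmean if row[j] <= thr else rmean


def fit_boosting(X, y, n_rounds=20, learning_rate=0.5):
    """Gradient boosting for log loss with stumps as weak learners.

    Assumes both classes are present, so the base log-odds is finite.
    """
    pos = sum(y) / len(y)
    base = math.log(pos / (1.0 - pos))
    scores = [base] * len(X)
    stumps = []
    for _ in range(n_rounds):
        # Negative gradient of the log loss with respect to the raw score.
        residuals = [yi - sigmoid(s) for yi, s in zip(y, scores)]
        stump = fit_stump(X, residuals)
        stumps.append(stump)
        scores = [s + learning_rate * stump_predict(stump, row)
                  for s, row in zip(scores, X)]
    return base, learning_rate, stumps


def predict_proba(model, row):
    base, learning_rate, stumps = model
    raw = base + learning_rate * sum(stump_predict(s, row) for s in stumps)
    return sigmoid(raw)


model = fit_boosting(X, y)
new_pair = [2.0, 1.9, 3.0, 5.0, 0.80]  # hypothetical unseen enhancer-gene pair
print(predict_proba(model, new_pair))
```

In practice one would use an established gradient boosting library and evaluate on held-out data. The stumps here only keep the example self-contained.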

Availability and implementation: CENTRE's open-source code is available on GitHub at https://github.com/slrvv/CENTRE.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Enhancer Elements, Genetic*
  • Epigenomics
  • Gene Expression Regulation
  • Humans
  • Promoter Regions, Genetic