SpecLoop predicts cell type-specific chromatin loop via transcription factor cooperation

Comput Biol Med. 2024 Mar:171:108182. doi: 10.1016/j.compbiomed.2024.108182. Epub 2024 Feb 21.

Abstract

Cell-type-Specific Chromatin Loops (CSCLs) are crucial for gene regulation and cell fate determination. However, the mechanisms governing their establishment remain elusive. Here, we present SpecLoop, a network regularization-based machine learning framework, to investigate the role of transcription factors (TFs) cooperation in CSCL formation. SpecLoop integrates multi-omics data, including gene expression, chromatin accessibility, sequence, protein-protein interaction, and TF binding motif data, to predict CSCLs and identify TF cooperations. Using high resolution Hi-C data as the gold standard, SpecLoop accurately predicts CSCL in GM12878, IMR90, HeLa-S3, K562, HUVEC, HMEC, and NHEK seven cell types, with the AUROC values ranging from 0.8645 to 0.9852 and AUPR values ranging from 0.8654 to 0.9734. Notably SpecLoop demonstrates improved accuracy in predicting long-distance CSCLs and identifies TF complexes with strong predictive ability. Our study systematically explores the TFs and TF pairs associated with CSCL through effective integration of diverse omics data. SpecLoop is freely available at https://github.com/AMSSwanglab/SpecLoop.

MeSH terms

  • Binding Sites / genetics
  • Chromatin / genetics
  • Gene Expression Regulation*
  • Protein Binding
  • Transcription Factors* / genetics
  • Transcription Factors* / metabolism

Substances

  • Transcription Factors
  • Chromatin