Genome-wide prediction of topoisomerase IIβ binding by architectural factors and chromatin accessibility

PLoS Comput Biol. 2021 Jan 19;17(1):e1007814. doi: 10.1371/journal.pcbi.1007814. eCollection 2021 Jan.

Abstract

DNA topoisomerase II-β (TOP2B) is fundamental to remove topological problems linked to DNA metabolism and 3D chromatin architecture, but its cut-and-reseal catalytic mechanism can accidentally cause DNA double-strand breaks (DSBs) that can seriously compromise genome integrity. Understanding the factors that determine the genome-wide distribution of TOP2B is therefore not only essential for a complete knowledge of genome dynamics and organization, but also for the implications of TOP2-induced DSBs in the origin of oncogenic translocations and other types of chromosomal rearrangements. Here, we conduct a machine-learning approach for the prediction of TOP2B binding using publicly available sequencing data. We achieve highly accurate predictions, with accessible chromatin and architectural factors being the most informative features. Strikingly, TOP2B is sufficiently explained by only three features: DNase I hypersensitivity, CTCF and cohesin binding, for which genome-wide data are widely available. Based on this, we develop a predictive model for TOP2B genome-wide binding that can be used across cell lines and species, and generate virtual probability tracks that accurately mirror experimental ChIP-seq data. Our results deepen our knowledge on how the accessibility and 3D organization of chromatin determine TOP2B function, and constitute a proof of principle regarding the in silico prediction of sequence-independent chromatin-binding factors.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Cells, Cultured
  • Chromatin* / chemistry
  • Chromatin* / genetics
  • Chromatin* / metabolism
  • DNA Topoisomerases, Type II* / chemistry
  • DNA Topoisomerases, Type II* / genetics
  • DNA Topoisomerases, Type II* / metabolism
  • Genome / genetics*
  • Genomics
  • Humans
  • MCF-7 Cells
  • Machine Learning
  • Mice
  • Models, Genetic*
  • Protein Binding
  • Thymocytes

Substances

  • Chromatin
  • DNA Topoisomerases, Type II

Grants and funding

This research was supported by grants from the Spanish Government and the European Regional Development Fund (SAF2017-89619-R, FCL, and TIN2015-64776-C3-2-R, FD), and the European Research Council (ERC-CoG-2014-647359, FCL). CABIMER is supported by the Andalusian Regional Government (Junta de Andalucía). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.