Bound2Learn: a machine learning approach for classification of DNA-bound proteins from single-molecule tracking experiments

Nucleic Acids Res. 2021 Aug 20;49(14):e79. doi: 10.1093/nar/gkab186.

Abstract

DNA-bound proteins are essential elements for the maintenance, regulation, and use of the genome. The time they spend bound to DNA provides useful information on their stability within protein complexes and insight into biological processes. Single-particle tracking allows direct visualization of protein-DNA kinetics; however, identifying whether a molecule is bound to DNA can be non-trivial. Further complications arise when tracking molecules for extended durations in processes with slow kinetics. We developed a machine learning approach, termed Bound2Learn, that uses output from widely used tracking software to robustly classify tracks and thereby accurately estimate residence times. We validated our approach in silico and in live-cell data from Escherichia coli and Saccharomyces cerevisiae. Our method has the potential for broad utility and is applicable to other organisms.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Computer Simulation
  • DNA / genetics
  • DNA / metabolism
  • DNA-Binding Proteins / classification
  • DNA-Binding Proteins / genetics
  • DNA-Binding Proteins / metabolism*
  • Escherichia coli / genetics
  • Escherichia coli / metabolism
  • Kinetics
  • Machine Learning*
  • Protein Binding
  • Saccharomyces cerevisiae / genetics
  • Saccharomyces cerevisiae / metabolism
  • Single Molecule Imaging / methods*
  • Time-Lapse Imaging / methods*

Substances

  • DNA-Binding Proteins
  • DNA

Grants and funding