Classification of particle trajectories in living cells: Machine learning versus statistical testing hypothesis for fractional anomalous diffusion

Phys Rev E. 2020 Sep;102(3-1):032402. doi: 10.1103/PhysRevE.102.032402.

Abstract

Single-particle tracking (SPT) has become a popular tool to study the intracellular transport of molecules in living cells. Inferring the character of their dynamics is important, because it determines the organization and functions of the cells. For this reason, one of the first steps in the analysis of SPT data is the identification of the diffusion type of the observed particles. The most popular method to identify the class of a trajectory is based on the mean-square displacement (MSD). However, due to its known limitations, several other approaches have been already proposed. With the recent advances in algorithms and the developments of modern hardware, the classification attempts rooted in machine learning (ML) are of particular interest. In this work, we adopt two ML ensemble algorithms, i.e., random forest and gradient boosting, to the problem of trajectory classification. We present a new set of features used to transform the raw trajectories data into input vectors required by the classifiers. The resulting models are then applied to real data for G protein-coupled receptors and G proteins. The classification results are compared to recent statistical methods going beyond MSD.

MeSH terms

  • Biological Transport
  • Cell Survival
  • Diffusion
  • GTP-Binding Proteins / metabolism
  • Machine Learning*
  • Models, Biological*
  • Receptors, G-Protein-Coupled / metabolism
  • Single Molecule Imaging*

Substances

  • Receptors, G-Protein-Coupled
  • GTP-Binding Proteins