Machine learning approaches reveal subtle differences in breathing and sleep fragmentation in Phox2b-derived astrocytes ablated mice

J Neurophysiol. 2021 Apr 1;125(4):1164-1179. doi: 10.1152/jn.00155.2020. Epub 2021 Jan 27.

Abstract

Modern neurophysiology research requires the interrogation of high-dimensionality data sets. Machine learning and artificial intelligence (ML/AI) workflows have permeated into nearly all aspects of daily life in the developed world but have not been implemented routinely in neurophysiological analyses. The power of these workflows includes the speed at which they can be deployed, their availability of open-source programming languages, and the objectivity permitted in their data analysis. We used classification-based algorithms, including random forest, gradient boosted machines, support vector machines, and neural networks, to test the hypothesis that the animal genotypes could be separated into their genotype based on interpretation of neurophysiological recordings. We then interrogate the models to identify what were the major features utilized by the algorithms to designate genotype classification. By using raw EEG and respiratory plethysmography data, we were able to predict which recordings came from genotype class with accuracies that were significantly improved relative to the no information rate, although EEG analyses showed more overlap between groups than respiratory plethysmography. In comparison, conventional methods where single features between animal classes were analyzed, differences between the genotypes tested using baseline neurophysiology measurements showed no statistical difference. However, ML/AI workflows successfully were capable of providing successful classification, indicating that interactions between features were different in these genotypes. ML/AI workflows provide new methodologies to interrogate neurophysiology data. However, their implementation must be done with care so as to provide high rigor and reproducibility between laboratories. We provide a series of recommendations on how to report the utilization of ML/AI workflows for the neurophysiology community.NEW & NOTEWORTHY ML/AI classification workflows are capable of providing insight into differences between genotypes for neurophysiology research. Analytical techniques utilized in the neurophysiology community can be augmented by implementing ML/AI workflows. Random forest is a robust classification algorithm for respiratory plethysmography data. Utilization of ML/AI workflows in neurophysiology research requires heightened transparency and improved community research standards.

Keywords: Phox2B; machine learning; random forest; supervised learning; unsupervised learning.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Astrocytes
  • Electroencephalography* / methods
  • Gene Expression Profiling* / methods
  • Genotype
  • Homeodomain Proteins
  • Machine Learning*
  • Mice
  • Neurophysiology / methods*
  • Plethysmography* / methods
  • Respiration*
  • Sleep / physiology*
  • Transcription Factors
  • Workflow

Substances

  • Homeodomain Proteins
  • Phox2b protein, mouse
  • Transcription Factors