Consensus-Based Sorting of Neuronal Spike Waveforms

PLoS One. 2016 Aug 18;11(8):e0160494. doi: 10.1371/journal.pone.0160494. eCollection 2016.

Abstract

Optimizing spike-sorting algorithms is difficult because sorted clusters can rarely be checked against independently obtained "ground truth" data. In most spike-sorting algorithms in use today, the optimality of a clustering solution is assessed relative to some assumption on the distribution of the spike shapes associated with a particular single unit (e.g., Gaussianity) and by visual inspection of the clustering solution followed by manual validation. When the spatiotemporal waveforms of spikes from different cells overlap, the decision as to whether two spikes should be assigned to the same source can be quite subjective, if it is not based on reliable quantitative measures. We propose a new approach, whereby spike clusters are identified from the most consensual partition across an ensemble of clustering solutions. Using the variability of the clustering solutions across successive iterations of the same clustering algorithm (template matching based on K-means clusters), we estimate the probability of spikes being clustered together and identify groups of spikes that are not statistically distinguishable from one another. Thus, we identify spikes that are most likely to be clustered together and therefore correspond to consistent spike clusters. This method has the potential advantage that it does not rely on any model of the spike shapes. It also provides estimates of the proportion of misclassified spikes for each of the identified clusters. We tested our algorithm on several datasets for which there exists a ground truth (simultaneous intracellular data), and show that it performs close to the optimum reached by a support vector machine trained on the ground truth. We also show that the estimated rate of misclassification matches the proportion of misclassified spikes measured from the ground truth data.

MeSH terms

  • Action Potentials*
  • Algorithms
  • Animals
  • Cluster Analysis
  • Computer Simulation
  • Electrodes
  • Electrophysiology / methods
  • Humans
  • Models, Neurological
  • Neurons / physiology*
  • Normal Distribution
  • Probability
  • Signal Processing, Computer-Assisted
  • Support Vector Machine

Grants and funding

This work was funded by the Max Planck Society, the European Research Council, and the Human Frontiers Science Program. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The Max Planck Society paid the salaries of GL and CMM. The HFSP and the Max Planck Society paid the salary of JF.