Unsupervised clustering algorithm for N-dimensional data

Erwin B Montgomery Jr; He Huang; Amir Assadi

doi:10.1016/j.jneumeth.2004.10.015

Unsupervised clustering algorithm for N-dimensional data

J Neurosci Methods. 2005 May 15;144(1):19-24. doi: 10.1016/j.jneumeth.2004.10.015. Epub 2004 Dec 18.

Authors

Erwin B Montgomery Jr¹, He Huang, Amir Assadi

Affiliation

¹ Department of Neurology, National Primate Research Center, University of Wisconsin-Madison, 600 Highland Ave., Madison, WI 53792, USA. montgomery@neurology.wisc.edu

PMID: 15848235
DOI: 10.1016/j.jneumeth.2004.10.015

Abstract

Cluster analysis is an important tool for classifying data. Established techniques include k-means and k-median cluster analysis. However, these methods require the user to provide a priori estimations of the number of clusters and their approximate location in the parameter space. Often these estimations can be made based on some prior understanding about the nature of the data. Alternatively, the user makes these estimations based on visualization of the data. However, the latter is problematic in data sets with large numbers of dimensions. Presented here is an algorithm that can automatically provide these estimates without human intervention based on the inherent structure of the data set. The number of dimensions does not limit it.

Publication types

Comparative Study
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Algorithms*
Cluster Analysis*
Computer Simulation
Data Interpretation, Statistical
Sensitivity and Specificity

Grants and funding

5P51 RR000167/RR/NCRR NIH HHS/United States