Learning weighted metrics to minimize nearest-neighbor classification error

IEEE Trans Pattern Anal Mach Intell. 2006 Jul;28(7):1100-10. doi: 10.1109/TPAMI.2006.145.

Abstract

In order to optimize the accuracy of the Nearest-Neighbor classification rule, a weighted distance is proposed, along with algorithms to automatically learn the corresponding weights. These weights may be specific for each class and feature, for each individual prototype, or for both. The learning algorithms are derived by (approximately) minimizing the Leaving-One-Out classification error of the given training set. The proposed approach is assessed through a series of experiments with UCI/STATLOG corpora, as well as with a more specific task of text classification which entails very sparse data representation and huge dimensionality. In all these experiments, the proposed approach shows a uniformly good behavior, with results comparable to or better than state-of-the-art results published with the same data so far.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Cluster Analysis
  • Computer Simulation
  • Data Interpretation, Statistical
  • Information Storage and Retrieval / methods*
  • Models, Statistical*
  • Numerical Analysis, Computer-Assisted
  • Pattern Recognition, Automated / methods*