Learning weighted metrics to minimize nearest-neighbor classification error

Roberto Paredes; Enrique Vidal

doi:10.1109/TPAMI.2006.145

Learning weighted metrics to minimize nearest-neighbor classification error

IEEE Trans Pattern Anal Mach Intell. 2006 Jul;28(7):1100-10. doi: 10.1109/TPAMI.2006.145.

Authors

Roberto Paredes¹, Enrique Vidal

Affiliation

¹ Departamento de Sistemas Informáticos y Computación, Instituto Tecnológico de Informática, Universidad Politiécnica de Valencia, Spain. rparedes@iti.upv.es

PMID: 16792099
DOI: 10.1109/TPAMI.2006.145

Abstract

In order to optimize the accuracy of the Nearest-Neighbor classification rule, a weighted distance is proposed, along with algorithms to automatically learn the corresponding weights. These weights may be specific for each class and feature, for each individual prototype, or for both. The learning algorithms are derived by (approximately) minimizing the Leaving-One-Out classification error of the given training set. The proposed approach is assessed through a series of experiments with UCI/STATLOG corpora, as well as with a more specific task of text classification which entails very sparse data representation and huge dimensionality. In all these experiments, the proposed approach shows a uniformly good behavior, with results comparable to or better than state-of-the-art results published with the same data so far.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Artificial Intelligence*
Cluster Analysis
Computer Simulation
Data Interpretation, Statistical
Information Storage and Retrieval / methods*
Models, Statistical*
Numerical Analysis, Computer-Assisted
Pattern Recognition, Automated / methods*