A Novel Normalized-Cut Solver With Nearest Neighbor Hierarchical Initialization

IEEE Trans Pattern Anal Mach Intell. 2024 Jan;46(1):659-666. doi: 10.1109/TPAMI.2023.3279394. Epub 2023 Dec 5.

Abstract

Normalized-Cut (N-Cut) is a well-known model of spectral clustering. Traditional N-Cut solvers work in two stages: 1) computing the continuous spectral embedding of the normalized Laplacian matrix; 2) discretizing it via K-means or spectral rotation. However, this paradigm has two major problems: 1) two-stage methods solve a relaxed version of the original problem, so they cannot obtain good solutions for the original N-Cut problem; 2) solving the relaxed problem requires eigenvalue decomposition, which has O(n^3) time complexity (n is the number of nodes). To address these problems, we propose a novel N-Cut solver based on the coordinate descent method. Since the vanilla coordinate descent method also has O(n^3) time complexity, we design several acceleration strategies that reduce the time complexity to O(|E|) (|E| is the number of edges). To avoid relying on random initialization, which makes clustering results uncertain, we propose an efficient initialization method that gives deterministic outputs. Extensive experiments on several benchmark datasets demonstrate that the proposed solver obtains larger N-Cut objective values while also achieving better clustering performance than traditional solvers.
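To make the idea of coordinate descent on a discrete N-Cut objective concrete, the following is a minimal sketch and not the solver proposed in the paper. It assumes the maximization form of the objective (the sum over clusters of within-cluster edge weight divided by cluster volume), which is consistent with the "larger objective values" wording above. The function names `ncut_objective` and `naive_coordinate_descent` and the toy graph are hypothetical. The sketch recomputes the objective from scratch for every candidate move, which is the kind of cost the acceleration strategies are meant to avoid, and it starts from a hand-picked labeling rather than the paper's deterministic initialization.

```python
def ncut_objective(adj, labels, k):
    """Sum over clusters of (within-cluster edge weight) / (cluster volume).

    adj is a dense symmetric weight matrix (list of lists); labels[i] is the
    cluster index of node i. Larger values mean a better partition under the
    assumed maximization form of N-Cut.
    """
    assoc = [0.0] * k  # within-cluster weight, each edge counted in both directions
    vol = [0.0] * k    # sum of degrees of the nodes in each cluster
    n = len(adj)
    for i in range(n):
        for j in range(n):
            w = adj[i][j]
            vol[labels[i]] += w
            if labels[i] == labels[j]:
                assoc[labels[i]] += w
    return sum(a / v for a, v in zip(assoc, vol) if v > 0)


def naive_coordinate_descent(adj, labels, k, max_sweeps=20):
    """Greedily relabel one node at a time while the objective improves.

    Illustrative baseline only: every candidate move triggers a full
    recomputation of the objective.
    """
    labels = list(labels)
    best = ncut_objective(adj, labels, k)
    for _ in range(max_sweeps):
        changed = False
        for i in range(len(adj)):
            origin = labels[i]
            best_c = origin
            for c in range(k):
                if c == origin:
                    continue
                labels[i] = c
                if origin not in labels:
                    continue  # this move would empty the node's old cluster
                value = ncut_objective(adj, labels, k)
                if value > best + 1e-12:
                    best, best_c = value, c
            labels[i] = best_c  # keep the best move found, or restore the old label
            if best_c != origin:
                changed = True
        if not changed:
            break  # no single-node move improves the objective
    return labels, best


# Toy graph (assumption for illustration): two triangles joined by the edge 2-3.
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
adj = [[0.0] * 6 for _ in range(6)]
for u, v in edges:
    adj[u][v] = adj[v][u] = 1.0

start = [0, 0, 0, 0, 1, 1]  # deliberately poor starting partition
final_labels, final_value = naive_coordinate_descent(adj, start, k=2)
print(final_labels, final_value)
```

On this toy graph the starting partition has objective 0.8 + 0.5 = 1.3, and the sweep moves node 3 into the second cluster, ending at the two-triangle split with objective 12/7.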