Multi-label feature selection based on HSIC and sparrow search algorithm

Math Biosci Eng. 2023 Jun 26;20(8):14201-14221. doi: 10.3934/mbe.2023635.

Abstract

Feature selection is a long-standing topic in machine learning and data mining. In multi-label learning tasks, each sample in the dataset is associated with multiple labels, and the labels are usually related to one another. Multi-label learning also suffers from the "curse of dimensionality", which makes feature selection a difficult task. To address this problem, this paper proposes a multi-label feature selection method based on the Hilbert-Schmidt independence criterion (HSIC) and the sparrow search algorithm (SSA). The method uses SSA to search over feature subsets and HSIC as the feature selection criterion, which describes the dependence between the features and all labels, so as to select the optimal feature subset. Experimental results demonstrate the effectiveness of the proposed method.
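
To make the selection criterion concrete, the following is a minimal sketch of how a candidate feature subset could be scored with the standard biased empirical HSIC estimator, trace(KHLH)/(n-1)^2. The abstract does not state the kernels or the exact fitness function, so the Gaussian kernel on features, the linear kernel on the label matrix, the bandwidth `sigma`, and the helper names `rbf_kernel`, `hsic`, and `subset_score` are all assumptions made for illustration. The SSA search itself is not shown.

```python
import numpy as np

def rbf_kernel(X, sigma=1.0):
    """Gaussian kernel matrix for the rows of X (kernel choice is an assumption)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    d2 = np.maximum(d2, 0.0)  # clip tiny negatives caused by round-off
    return np.exp(-d2 / (2.0 * sigma ** 2))

def hsic(X, Y, sigma=1.0):
    """Biased empirical HSIC between features X (n x d) and labels Y (n x q)."""
    n = X.shape[0]
    K = rbf_kernel(X, sigma)              # kernel on the feature side
    L = Y @ Y.T                           # linear kernel on the label matrix (assumption)
    H = np.eye(n) - np.ones((n, n)) / n   # centering matrix
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2

def subset_score(X, Y, mask, sigma=1.0):
    """Hypothetical fitness: HSIC between the selected columns of X and all labels."""
    mask = np.asarray(mask, dtype=bool)
    if not mask.any():
        return 0.0                        # an empty subset carries no dependence
    return hsic(X[:, mask], Y, sigma)
```

In a wrapper of this kind, a search procedure such as SSA would propose binary masks over the features and keep those with higher `subset_score`. A practical fitness function would probably also need to penalize subset size, which this sketch leaves out.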

Keywords: Hilbert-Schmidt independence criterion (HSIC); data mining; feature selection; multi-label classification; sparrow search algorithm.