Multilabel Distribution Learning Based on Multioutput Regression and Manifold Learning

IEEE Trans Cybern. 2022 Jun;52(6):5064-5078. doi: 10.1109/TCYB.2020.3026576. Epub 2022 Jun 16.

Abstract

Real-world multilabel data are high dimensional, and using them directly for label distribution learning (LDL) incurs substantial computational cost. We propose a multilabel distribution learning algorithm based on multioutput regression and manifold learning, referred to as MDLRML. Manifold learning on the samples and LDL both indicate that the feature space and the label space are smooth and similar; we exploit this information to link the manifolds of the two spaces. The topological relationship of the manifold in the feature space can then guide the construction of the manifold in the label space. The smoothest regression function is used to fit the manifold data, and a locally constrained multioutput regression is designed to improve the local fit. Based on the regression results, we enhance the logical labels into label distributions, thereby revealing hidden information about the relative importance of each label. Extensive experiments on real-world multilabel datasets show that the proposed MDLRML algorithm significantly improves the accuracy and efficiency of multilabel distribution learning over several existing state-of-the-art schemes.
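
The general idea of using feature-space neighborhood structure to turn logical (0/1) labels into label distributions can be illustrated with a minimal sketch. This is not the MDLRML algorithm: the abstract does not give its objective function, so the sketch substitutes a generic technique, LLE-style local reconstruction weights in the feature space followed by a regularized least-squares smoothing of the labels and a softmax normalization. The function names `reconstruction_weights` and `enhance_labels`, the parameters `k`, `reg`, and `lam`, and the random toy data are all assumptions made for illustration.

```python
import numpy as np


def reconstruction_weights(X, k=3, reg=1e-3):
    """LLE-style weights: each sample is rebuilt from its k nearest neighbors.

    Illustrative helper (hypothetical, not from the paper).
    """
    n = X.shape[0]
    W = np.zeros((n, n))
    # Pairwise squared Euclidean distances in the feature space.
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    for i in range(n):
        order = np.argsort(d2[i])
        idx = order[order != i][:k]          # k nearest neighbors, excluding i
        Z = X[idx] - X[i]                    # neighbors centered on sample i
        G = Z @ Z.T                          # local Gram matrix, shape (k, k)
        G += reg * (np.trace(G) + 1e-12) * np.eye(k)  # keep G well conditioned
        w = np.linalg.solve(G, np.ones(k))
        W[i, idx] = w / w.sum()              # weights sum to one per sample
    return W


def enhance_labels(X, L, k=3, lam=1.0):
    """Smooth logical labels along the feature-space structure.

    Minimizes ||F - L||^2 + lam * ||F - W F||^2 in closed form, then applies
    a row-wise softmax so every row is a valid label distribution.
    """
    n = X.shape[0]
    W = reconstruction_weights(X, k=k)
    I = np.eye(n)
    M = (I - W).T @ (I - W)
    F = np.linalg.solve(I + lam * M, L.astype(float))
    E = np.exp(F - F.max(axis=1, keepdims=True))  # numerically stable softmax
    return E / E.sum(axis=1, keepdims=True)


# Toy data (assumed): 20 samples, 5 features, 4 logical labels.
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 5))
L = (rng.random((20, 4)) > 0.5).astype(int)
D = enhance_labels(X, L)
```

Each row of `D` is non-negative and sums to one, so it can be read as a description degree per label for that sample. Larger `lam` pulls the result toward the feature-space neighborhood structure; smaller `lam` keeps it closer to the original logical labels.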