ℓ₁-norm based safe semi-supervised learning

Haitao Gan; Zhi Yang; Ji Wang; Bing Li

doi:10.3934/mbe.2021383

Math Biosci Eng. 2021 Sep 7;18(6):7727-7742. doi: 10.3934/mbe.2021383.

Authors

Haitao Gan¹ ², Zhi Yang¹ ³, Ji Wang¹, Bing Li⁴

Affiliations

¹ School of Computer Science, Hubei University of Technology, Wuhan 430068, China.
² Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province, Hangzhou 310018, China.
³ State Key Laboratory of Biocatalysis and Enzyme Engineering, Wuhan 430062, China.
⁴ School of Traffic and Transportation Engineering, Wuhan Institute of Shipbuilding Technology, Wuhan 430050, China.

PMID: 34814272
DOI: 10.3934/mbe.2021383

Abstract

In the past few years, Safe Semi-Supervised Learning (S3L) has received considerable attention in the machine learning field. Researchers have proposed many S3L methods for the safe exploitation of risky unlabeled samples, which can degrade the performance of Semi-Supervised Learning (SSL). Nevertheless, existing methods have some shortcomings: (1) the risk degrees of the unlabeled samples are defined in advance by analyzing prediction differences between Supervised Learning (SL) and SSL; (2) the negative impact of labeled samples on learning performance is not investigated. Therefore, it is essential to design a novel method that adaptively estimates the importance and risk of both unlabeled and labeled samples. For this purpose, in this paper we present an ℓ₁-norm based S3L method that simultaneously achieves safe exploitation of the labeled and unlabeled samples. To solve the proposed optimization problem, we utilize an effective iterative approach. In each iteration, the weights of both labeled and unlabeled samples are estimated adaptively. These weights reflect the importance or risk of each sample. Hence, the negative effects of the labeled and unlabeled samples are expected to be reduced. Experimental results on different datasets verify that the proposed S3L method obtains performance comparable to existing SL, SSL and S3L methods and achieves the expected goal.
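The abstract does not give the objective function or the update rules, so the following is only an illustrative sketch of how an ℓ₁-norm loss can produce adaptive per-sample weights through an iterative scheme. It assumes a linear model, a standard iteratively reweighted least squares (IRLS) solver, and pseudo-targets `f_unl` for the unlabeled samples supplied by some SSL predictor. The function name `l1_irls`, the parameters `lam` and `ridge`, and the objective itself are hypothetical and are not taken from the paper.

```python
import numpy as np

def l1_irls(X_lab, y_lab, X_unl, f_unl, lam=0.5, ridge=1e-3, n_iter=30, eps=1e-6):
    """Hypothetical sketch (not the authors' algorithm).

    Approximately minimizes
        sum_i |x_i^T w - y_i|  +  lam * sum_j |x_j^T w - f_j|  +  ridge * ||w||^2
    with IRLS. Each |r| term is replaced by s * r^2 with s = 1 / (2 * |r|),
    so samples with large residuals (risky ones) receive small weights.
    """
    X = np.vstack([X_lab, X_unl])
    t = np.concatenate([y_lab, f_unl])
    # Fixed trade-off between the labeled and unlabeled terms.
    base = np.concatenate([np.ones(len(y_lab)), lam * np.ones(len(f_unl))])
    d = X.shape[1]
    s = np.ones(len(t))  # adaptive per-sample weights, start uniform
    for _ in range(n_iter):
        sw = base * s
        # Weighted ridge regression step with the current sample weights.
        A = X.T @ (X * sw[:, None]) + ridge * np.eye(d)
        b = X.T @ (sw * t)
        w = np.linalg.solve(A, b)
        # Re-estimate the weights from the current absolute residuals.
        r = np.abs(X @ w - t)
        s = 1.0 / (2.0 * np.maximum(r, eps))
    n_lab = len(y_lab)
    return w, s[:n_lab], s[n_lab:]

# Synthetic usage example: one corrupted label and one corrupted pseudo-target.
rng = np.random.default_rng(0)
w_true = np.array([1.0, -2.0, 0.5])
X_lab = rng.normal(size=(20, 3))
X_unl = rng.normal(size=(30, 3))
y_lab = X_lab @ w_true
f_unl = X_unl @ w_true
y_lab[0] += 5.0   # risky labeled sample
f_unl[0] -= 5.0   # risky unlabeled sample
w_hat, s_lab, s_unl = l1_irls(X_lab, y_lab, X_unl, f_unl)
```

In this sketch the returned `s_lab` and `s_unl` play the role of the importance or risk weights mentioned in the abstract: the corrupted samples end up with much smaller weights than the clean ones, which limits their influence on `w_hat`.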

Keywords: importance estimation; performance degradation; safe semi-supervised learning; semi-supervised learning; ℓ₁ norm.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Machine Learning*
Supervised Machine Learning*