Human-in-the-Loop Low-Shot Learning

Sen Wan; Yimin Hou; Feng Bao; Zhiquan Ren; Yunfeng Dong; Qionghai Dai; Yue Deng

doi:10.1109/TNNLS.2020.3011559

Human-in-the-Loop Low-Shot Learning

IEEE Trans Neural Netw Learn Syst. 2021 Jul;32(7):3287-3292. doi: 10.1109/TNNLS.2020.3011559. Epub 2021 Jul 6.

Authors

Sen Wan, Yimin Hou, Feng Bao, Zhiquan Ren, Yunfeng Dong, Qionghai Dai, Yue Deng

PMID: 32813663
DOI: 10.1109/TNNLS.2020.3011559

Abstract

We consider a human-in-the-loop scenario in the context of low-shot learning. Our approach was inspired by the fact that the viability of samples in novel categories cannot be sufficiently reflected by those limited observations. Some heterogeneous samples that are quite different from existing labeled novel data can inevitably emerge in the testing phase. To this end, we consider augmenting an uncertainty assessment module into low-shot learning system to account into the disturbance of those out-of-distribution (OOD) samples. Once detected, these OOD samples are passed to human beings for active labeling. Due to the discrete nature of this uncertainty assessment process, the whole Human-In-the-Loop Low-shot (HILL) learning framework is not end-to-end trainable. We hence revisited the learning system from the aspect of reinforcement learning and introduced the REINFORCE algorithm to optimize model parameters via policy gradient. The whole system gains noticeable improvements over existing low-shot learning approaches.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Feedback
Humans
Learning / physiology*
Machine Learning*
Neural Networks, Computer
Problem Solving
Reinforcement, Psychology
Uncertainty