Hypersphere-Based Weight Imprinting for Few-Shot Learning on Embedded Devices

IEEE Trans Neural Netw Learn Syst. 2021 Feb;32(2):925-930. doi: 10.1109/TNNLS.2020.2979745. Epub 2021 Feb 4.

Abstract

Weight imprinting (WI) was recently introduced as a way to perform few-shot learning without gradient descent. For this reason, WI was almost immediately adapted for few-shot learning on embedded neural network accelerators that do not support backpropagation, e.g., edge tensor processing units. However, WI suffers from several limitations: it cannot handle novel categories with multimodal distributions, and special care must be taken to avoid overfitting the learned embeddings to the training classes, since this can have a devastating effect on classification accuracy for the novel categories. In this article, we propose a novel hypersphere-based WI approach that trains neural networks in a regularized, imprinting-aware way, effectively overcoming these limitations. The effectiveness of the proposed method is demonstrated through extensive experiments on three image data sets.
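For readers unfamiliar with the baseline that the abstract builds on, the following is a minimal sketch of plain weight imprinting as it is commonly described: each novel class receives a classifier weight equal to the normalized mean of its normalized support embeddings, and a query is assigned to the class with the highest cosine similarity. No gradient step is involved. This is not the hypersphere-based method proposed in the article, whose details the abstract does not give. The function names (`l2_normalize`, `imprint_weights`, `classify`) and the two-dimensional toy embeddings are illustrative assumptions; in practice the embeddings would come from a frozen feature extractor.

```python
import math


def l2_normalize(vector):
    """Scale a vector to unit length (an all-zero vector is returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector] if norm > 0 else list(vector)


def imprint_weights(embeddings, labels):
    """Return one unit-length weight vector per class, without any gradient step.

    Normalizing the per-class sum gives the same direction as normalizing
    the per-class mean, so the division by the sample count is skipped.
    """
    sums = {}
    for embedding, label in zip(embeddings, labels):
        unit = l2_normalize(embedding)
        if label not in sums:
            sums[label] = [0.0] * len(unit)
        sums[label] = [s + u for s, u in zip(sums[label], unit)]
    return {label: l2_normalize(total) for label, total in sums.items()}


def classify(embedding, weights):
    """Pick the class whose imprinted weight has the highest cosine similarity."""
    unit = l2_normalize(embedding)
    return max(
        weights,
        key=lambda label: sum(a * b for a, b in zip(unit, weights[label])),
    )


# Hypothetical toy embeddings: two support samples for each of two novel classes.
support_embeddings = [[1.0, 0.1], [0.9, 0.0], [0.0, 1.0], [0.1, 0.8]]
support_labels = ["cat", "cat", "dog", "dog"]

weights = imprint_weights(support_embeddings, support_labels)
prediction = classify([0.8, 0.2], weights)
```

Because each class is summarized by a single direction, a class whose samples form several separate clusters is poorly represented by its averaged weight, which is the multimodality limitation the abstract mentions.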

Publication types

  • Research Support, Non-U.S. Gov't