Random sketch learning for deep neural networks in edge computing

Bin Li; Peijun Chen; Hongfu Liu; Weisi Guo; Xianbin Cao; Junzhao Du; Chenglin Zhao; Jun Zhang

doi:10.1038/s43588-021-00039-6

Random sketch learning for deep neural networks in edge computing

Nat Comput Sci. 2021 Mar;1(3):221-228. doi: 10.1038/s43588-021-00039-6. Epub 2021 Mar 25.

Authors

Bin Li^{1

2}, Peijun Chen³, Hongfu Liu³, Weisi Guo^{4

5}, Xianbin Cao⁶, Junzhao Du⁷, Chenglin Zhao³, Jun Zhang^{8

9}

Affiliations

¹ School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, China. Binli@bupt.edu.cn.
² School of Information and Electronics, Beijing Institute of Technology, Beijing, China. Binli@bupt.edu.cn.
³ School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, China.
⁴ The Alan Turing Institute, London, UK.
⁵ Centre for Autonomous and Cyberphysical Systems, Cranfield University, Cranfield, UK.
⁶ School of Electronic and Information Engineering, Beihang University, Beijing, China. xbcao@buaa.edu.cn.
⁷ The 6th Research Institute of China Electronics Corporation, Beijing, China.
⁸ School of Information and Electronics, Beijing Institute of Technology, Beijing, China.
⁹ School of Electronic and Information Engineering, Beihang University, Beijing, China.

PMID: 38183196
DOI: 10.1038/s43588-021-00039-6

Abstract

Despite the great potential of deep neural networks (DNNs), they require massive weights and huge computational resources, creating a vast gap when deploying artificial intelligence at low-cost edge devices. Current lightweight DNNs, achieved by high-dimensional space pre-training and post-compression, present challenges when covering the resources deficit, making tiny artificial intelligence hard to be implemented. Here we report an architecture named random sketch learning, or Rosler, for computationally efficient tiny artificial intelligence. We build a universal compressing-while-training framework that directly learns a compact model and, most importantly, enables computationally efficient on-device learning. As validated on different models and datasets, it attains substantial memory reduction of ~50-90× (16-bits quantization), compared with fully connected DNNs. We demonstrate it on low-cost hardware, whereby the computation is accelerated by >180× and the energy consumption is reduced by ~10×. Our method paves the way for deploying tiny artificial intelligence in many scientific and industrial applications.

Abstract

Grants and funding