Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition

Lunyi Guo; Shining Mu; Yijie Deng; Chaofan Shi; Bo Yan; Zhuoling Xiao

doi:10.3390/s23031530

Efficient Binary Weight Convolutional Network Accelerator for Speech Recognition

Sensors (Basel). 2023 Jan 30;23(3):1530. doi: 10.3390/s23031530.

Authors

Lunyi Guo¹, Shining Mu¹, Yijie Deng¹, Chaofan Shi¹, Bo Yan¹, Zhuoling Xiao¹

Affiliation

¹ School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China.

Abstract

Speech recognition has progressed tremendously in the area of artificial intelligence (AI). However, the performance of the real-time offline Chinese speech recognition neural network accelerator for edge AI needs to be improved. This paper proposes a configurable convolutional neural network accelerator based on a lightweight speech recognition model, which can dramatically reduce hardware resource consumption while guaranteeing an acceptable error rate. For convolutional layers, the weights are binarized to reduce the number of model parameters and improve computational and storage efficiency. A multichannel shared computation (MCSC) architecture is proposed to maximize the reuse of weight and feature map data. The binary weight-sharing processing engine (PE) is designed to avoid limiting the number of multipliers. A custom instruction set is established according to the variable length of voice input to configure parameters for adapting to different network structures. Finally, the ping-pong storage method is used when the feature map is an input. We implemented this accelerator on Xilinx ZYNQ XC7Z035 under the working frequency of 150 MHz. The processing time for 2.24 s and 8 s of speech was 69.8 ms and 189.51 ms, respectively, and the convolution performance reached 35.66 GOPS/W. Compared with other computing platforms, accelerators perform better in terms of energy efficiency, power consumption and hardware resource consumption.

Keywords: ZYNQ; binary weights; hardware accelerator; multichannel shared computation; speech recognize.

Grants and funding

61973056/National Natural Science Foundation of China