A Cooperative Lightweight Translation Algorithm Combined with Sparse-ReLU

Xintao Xu; Yi Liu; Gang Chen; Junbin Ye; Zhigang Li; Huaxiang Lu

doi:10.1155/2022/4398839

A Cooperative Lightweight Translation Algorithm Combined with Sparse-ReLU

Comput Intell Neurosci. 2022 May 28:2022:4398839. doi: 10.1155/2022/4398839. eCollection 2022.

Authors

Xintao Xu^{1

2}, Yi Liu², Gang Chen², Junbin Ye², Zhigang Li², Huaxiang Lu^{2

3

4

5}

Affiliations

¹ School of Microelectronics, University of Science and Technology of China, Hefei, China.
² Institute of Semiconductors, Chinese Academy of Sciences, Beijing, China.
³ Materials and Optoelectronics Research Center, University of Chinese Academy of Sciences, Beijing, China.
⁴ College of Microelectronics, University of Chinese Academy of Sciences, Beijing, China.
⁵ Semiconductor Neural Network Intelligent Perception and Computing Technology Beijing Key Laboratory, Beijing, China.

Abstract

In the field of natural language processing (NLP), machine translation algorithm based on Transformer is challenging to deploy on hardware due to a large number of parameters and low parametric sparsity of the network weights. Meanwhile, the accuracy of lightweight machine translation networks also needs to be improved. To solve this problem, we first design a new activation function, Sparse-ReLU, to improve the parametric sparsity of weights and feature maps, which facilitates hardware deployment. Secondly, we design a novel cooperative processing scheme with CNN and Transformer and use Sparse-ReLU to improve the accuracy of the translation algorithm. Experimental results show that our method, which combines Transformer and CNN with the Sparse-ReLU, achieves a 2.32% BLEU improvement in prediction accuracy and reduces the number of parameters of the model by 23%, and the sparsity of the inference model increases by more than 50%.

MeSH terms

Algorithms*
Computers
Natural Language Processing
Neural Networks, Computer*
Translations