Jump-GRS: a multi-phase approach to structured pruning of neural networks for neural decoding

Xiaomin Wu; Da-Ting Lin; Rong Chen; Shuvra S Bhattacharyya

doi:10.1088/1741-2552/ace5dc

Jump-GRS: a multi-phase approach to structured pruning of neural networks for neural decoding

J Neural Eng. 2023 Jul 31;20(4):10.1088/1741-2552/ace5dc. doi: 10.1088/1741-2552/ace5dc.

Authors

Xiaomin Wu^{1

2}, Da-Ting Lin³, Rong Chen², Shuvra S Bhattacharyya^{1

4}

Affiliations

¹ Department of Electrical and Computer Engineering, University of Maryland, College Park, MD 20742, United States of America.
² Department of Diagnostic Radiology and Nuclear Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, United States of America.
³ National Institute on Drug Abuse, Gaithersburg, MD 20892, United States of America.
⁴ Institute for Advanced Computer Studies (UMIACS), University of Maryland at College Park, College Park, MD 20742, United States of America.

Abstract

Objective.Neural decoding, an important area of neural engineering, helps to link neural activity to behavior. Deep neural networks (DNNs), which are becoming increasingly popular in many application fields of machine learning, show promising performance in neural decoding compared to traditional neural decoding methods. Various neural decoding applications, such as brain computer interface applications, require both high decoding accuracy and real-time decoding speed. Pruning methods are used to produce compact DNN models for faster computational speed. Greedy inter-layer order with Random Selection (GRS) is a recently-designed structured pruning method that derives compact DNN models for calcium-imaging-based neural decoding. Although GRS has advantages in terms of detailed structure analysis and consideration of both learned information and model structure during the pruning process, the method is very computationally intensive, and is not feasible when large-scale DNN models need to be pruned within typical constraints on time and computational resources. Large-scale DNN models arise in neural decoding when large numbers of neurons are involved. In this paper, we build on GRS to develop a new structured pruning algorithm called jump GRS (JGRS) that is designed to efficiently compress large-scale DNN models.Approach.On top of GRS, JGRS implements a 'jump mechanism', which bypasses retraining intermediate models when model accuracy is relatively less sensitive to pruning operations. Design of the jump mechanism is motivated by identifying different phases of the structured pruning process, where retraining can be done infrequently in earlier phases without sacrificing accuracy. The jump mechanism helps to significantly speed up execution of the pruning process and greatly enhance its scalability. We compare the pruning performance and speed of JGRS and GRS with extensive experiments in the context of neural decoding.Main results.Our results demonstrate that JGRS provides significantly faster pruning speed compared to GRS, and at the same time, JGRS provides pruned models that are similarly compact as those generated by GRS.Significance.In our experiments, we demonstrate that JGRS achieves on average 9%-20% more compressed models compared to GRS with 2-8 times faster speed (less time required for pruning) across four different initial models on a relevant dataset for neural data analysis.

Keywords: machine learning; neural decoding; neural network pruning.

Creative Commons Attribution license.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Brain-Computer Interfaces*
Calcium
Neural Networks, Computer*
Neurons

Substances

Calcium

Grants and funding

R01 NS110421/NS/NINDS NIH HHS/United States