Computation and memory optimized spectral domain convolutional neural network for throughput and energy-efficient inference

Shahriyar Masud Rizvi; Ab Al-Hadi Ab Rahman; Usman Ullah Sheikh; Kazi Ahmed Asif Fuad; Hafiz Muhammad Faisal Shehzad

doi:10.1007/s10489-022-03756-1

Appl Intell (Dordr). 2023;53(4):4499-4523. doi: 10.1007/s10489-022-03756-1. Epub 2022 Jun 11.

Authors

Shahriyar Masud Rizvi¹, Ab Al-Hadi Ab Rahman¹, Usman Ullah Sheikh¹, Kazi Ahmed Asif Fuad², Hafiz Muhammad Faisal Shehzad³

Affiliations

¹ VeCAD Research Laboratory, School of Electrical Engineering, Universiti Teknologi Malaysia, Johor Bahru, 81310 Johor, Malaysia.
² School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97331, USA.
³ Department of Computer Science and IT, University of Sargodha, Sargodha, 40100 Punjab, Pakistan.

Abstract

Conventional convolutional neural networks (CNNs) incur a high computational workload and memory access cost (CMC). Spectral domain CNNs (SpCNNs) offer a computationally efficient approach to CNN training and inference. This paper analytically investigates the CMC of SpCNNs and its contributing components, and then proposes a methodology that optimizes CMC under three strategies to enhance inference performance. In this methodology, the output feature map (OFM) size, the OFM depth, or both are progressively reduced under an accuracy constraint to obtain performance-optimized CNN inference. Before any training or testing is conducted, the methodology can provide designers with guidelines and preliminary insights regarding techniques for optimum performance, least degradation in accuracy, and a balanced performance-accuracy trade-off. The methodology was evaluated on the MNIST and Fashion MNIST datasets using the LeNet-5 and AlexNet architectures. When compared to state-of-the-art SpCNN models, LeNet-5 achieves up to 4.2× (batch inference) and 4.1× (single-image inference) higher throughput and 10.5× (batch inference) and 4.2× (single-image inference) greater energy efficiency at a maximum loss of 3% in test accuracy. When compared to the baseline model used in this study, AlexNet delivers 11.6× (batch inference) and 5× (single-image inference) higher throughput and 25× (batch inference) and 8.8× (single-image inference) more energy-efficient inference with only a 4.4% reduction in accuracy.

Keywords: Computational workload; Convolutional neural network; Energy efficiency; Memory access cost; Spectral domain CNN.
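
The abstract's claim that spectral domain CNNs are computationally efficient rests on the convolution theorem: a convolution in the spatial domain becomes an element-wise product in the frequency domain. The sketch below illustrates that general principle only and is not the authors' implementation. The function names `spatial_circular_conv2d` and `spectral_conv2d` are hypothetical, and the example assumes a single-channel input and circular (wrap-around) boundary handling.

```python
import numpy as np


def spatial_circular_conv2d(x, k):
    """Direct 2-D circular convolution: one multiply-add per kernel tap per output pixel."""
    h, w = x.shape
    out = np.zeros((h, w), dtype=float)
    for i in range(h):
        for j in range(w):
            for a in range(k.shape[0]):
                for b in range(k.shape[1]):
                    # Indices wrap around the image borders (circular boundary).
                    out[i, j] += k[a, b] * x[(i - a) % h, (j - b) % w]
    return out


def spectral_conv2d(x, k):
    """Same result via the FFT: transform, multiply element-wise, transform back."""
    X = np.fft.fft2(x)
    K = np.fft.fft2(k, s=x.shape)  # zero-pads the kernel to the input size
    return np.real(np.fft.ifft2(X * K))


# Illustrative data only (assumption: 8x8 input, 3x3 kernel).
rng = np.random.default_rng(0)
x = rng.standard_normal((8, 8))
k = rng.standard_normal((3, 3))
assert np.allclose(spatial_circular_conv2d(x, k), spectral_conv2d(x, k))
```

In the spectral form, the per-pixel cost of the product no longer depends on the kernel size, which is why the size and depth of the feature maps become the main levers on workload and memory traffic. CNN layers usually compute cross-correlation rather than true convolution, so a practical spectral layer would flip the kernel or conjugate its spectrum; that detail is omitted here.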