On the Reduction of Computational Complexity of Deep Convolutional Neural Networks

Entropy (Basel). 2018 Apr 23;20(4):305. doi: 10.3390/e20040305.

Abstract

Deep convolutional neural networks (ConvNets), which are at the heart of many emerging applications, achieve remarkable performance in audio and visual recognition tasks. Unfortunately, this accuracy typically comes at significant computational cost, which limits deployability. In modern ConvNets, the convolution layers consume the vast majority of computational resources during inference, making the acceleration of these layers an important research area in both academia and industry. In this paper, we examine the effects of co-optimizing the internal structures of the convolutional layers and the underlying implementation of the fundamental convolution operation. We demonstrate that combining these methods can substantially speed up a ConvNet, achieving a ten-fold improvement over the baseline. We also introduce a new class of fast one-dimensional (1D) convolutions for ConvNets based on the Toom-Cook algorithm. We show that the proposed scheme is mathematically well-grounded, robust, and does not require time-consuming retraining, while achieving speedups solely from the convolutional layers with no loss in baseline accuracy.
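As a rough illustration of the Toom-Cook family of fast convolutions mentioned in the abstract, the sketch below computes the classic F(2,3) instance: two outputs of a 3-tap 1D convolution (cross-correlation, as used in ConvNets) from a four-element input tile, using four element-wise multiplications instead of the six required by the direct method. The transform matrices and the NumPy verification are the standard textbook construction (the same family popularized as Winograd convolution); this is an assumed minimal example, not the paper's specific derivation.

```python
import numpy as np

# Toom-Cook / Winograd F(2,3): two outputs of a 3-tap 1D convolution
# from a 4-element input tile with 4 multiplications instead of 6.
# Standard transform matrices; illustrative sketch only.

B_T = np.array([[1,  0, -1,  0],     # input transform
                [0,  1,  1,  0],
                [0, -1,  1,  0],
                [0,  1,  0, -1]], dtype=np.float64)

G = np.array([[1.0,  0.0, 0.0],      # filter transform
              [0.5,  0.5, 0.5],
              [0.5, -0.5, 0.5],
              [0.0,  0.0, 1.0]])

A_T = np.array([[1, 1,  1,  0],      # output transform
                [0, 1, -1, -1]], dtype=np.float64)

def f23_conv(d, g):
    """Return [y0, y1] where y_i = sum_k d[i+k] * g[k] for a 3-tap filter g."""
    U = G @ g            # transformed filter (reusable across tiles)
    V = B_T @ d          # transformed input tile
    M = U * V            # 4 element-wise multiplications
    return A_T @ M       # 2 outputs

# Verify against the direct sliding-window computation.
rng = np.random.default_rng(0)
d = rng.standard_normal(4)   # input tile
g = rng.standard_normal(3)   # filter
direct = np.array([d[0:3] @ g, d[1:4] @ g])
assert np.allclose(f23_conv(d, g), direct)
print(f23_conv(d, g), direct)
```

In practice, the filter transform is computed once and reused across all tiles of a longer input, which is where the arithmetic savings of this class of algorithms accumulate.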

Keywords: computational optimization; convolutional neural network; deep learning; hardware implementation.
