DualConv: Dual Convolutional Kernels for Lightweight Deep Neural Networks

Jiachen Zhong; Junying Chen; Ajmal Mian

doi:10.1109/TNNLS.2022.3151138

IEEE Trans Neural Netw Learn Syst. 2023 Nov;34(11):9528-9535. doi: 10.1109/TNNLS.2022.3151138. Epub 2023 Oct 27.

PMID: 35230955

Abstract

Convolutional neural network (CNN) architectures generally have heavy memory and computational requirements, which makes them infeasible for embedded systems with limited hardware resources. We propose dual convolutional kernels (DualConv) for constructing lightweight deep neural networks. DualConv combines 3×3 and 1×1 convolutional kernels to process the same input feature map channels simultaneously and exploits the group convolution technique to arrange convolutional filters efficiently. DualConv can be employed in any CNN model, such as VGG-16 and ResNet-50 for image classification, you only look once (YOLO) and R-CNN for object detection, or fully convolutional network (FCN) for semantic segmentation. In this work, we extensively test DualConv for classification, since classification network architectures form the backbone for many other tasks. We also test DualConv for object detection on YOLO-V3. Experimental results show that, combined with our structural innovations, DualConv significantly reduces the computational cost and number of parameters of deep neural networks while, surprisingly, achieving slightly higher accuracy than the original models in some cases. We use DualConv to further reduce the number of parameters of the lightweight MobileNetV2 by 54% with only a 0.68% drop in accuracy on the CIFAR-100 dataset. When the number of parameters is not an issue, DualConv increases the accuracy of MobileNetV1 by 4.11% on the same dataset. Furthermore, DualConv significantly improves the YOLO-V3 object detection speed and improves its accuracy by 4.4% on the PASCAL visual object classes (VOC) dataset.
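
To make the parameter savings concrete, the following is a minimal sketch of how the weight count of a DualConv-style layer might compare with a standard 3×3 convolution. It rests on an assumption that the abstract does not confirm: the layer is modelled as a grouped 3×3 convolution and a full 1×1 convolution applied to the same input, with their outputs summed. The function names `conv_params` and `dualconv_params`, the omission of bias terms, and the channel sizes are all hypothetical choices made for illustration.

```python
def conv_params(c_in, c_out, kernel_size, groups=1):
    """Weight count of a 2-D convolution without bias.

    With `groups` > 1, each filter sees only c_in / groups input channels.
    """
    if c_in % groups or c_out % groups:
        raise ValueError("channel counts must be divisible by groups")
    return kernel_size * kernel_size * (c_in // groups) * c_out


def dualconv_params(c_in, c_out, groups):
    """Weight count of the ASSUMED DualConv-style layer.

    Assumption (not stated in the abstract): a grouped 3x3 branch plus a
    full 1x1 branch over the same input channels.
    """
    grouped_3x3 = conv_params(c_in, c_out, 3, groups)
    pointwise_1x1 = conv_params(c_in, c_out, 1)
    return grouped_3x3 + pointwise_1x1


# Illustrative channel sizes only; they are not taken from the paper.
c_in, c_out = 64, 128
standard = conv_params(c_in, c_out, 3)
for g in (2, 4, 8):
    dual = dualconv_params(c_in, c_out, g)
    print(f"G={g}: {dual} vs {standard} weights, ratio {dual / standard:.3f}")
```

Under these assumptions, a 64-to-128-channel layer with 4 groups needs 26,624 weights instead of 73,728, roughly 36% of the standard layer. The saving grows with the number of groups because only the 3×3 branch shrinks, while the 1×1 branch stays fixed. These figures describe a single layer and say nothing about the whole-network reductions reported above.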