ReSTiNet: An Efficient Deep Learning Approach to Improve Human Detection Accuracy

Shahriar Shakir Sumit; Dayang Rohaya Awang Rambli; Seyedali Mirjalili; M Saef Ullah Miah; Muhammad Mudassir Ejaz

doi:10.1016/j.mex.2022.101936

MethodsX. 2022 Dec 2:10:101936. doi: 10.1016/j.mex.2022.101936. eCollection 2023.

Authors

Shahriar Shakir Sumit¹, Dayang Rohaya Awang Rambli¹, Seyedali Mirjalili²,³,⁴, M Saef Ullah Miah⁵, Muhammad Mudassir Ejaz⁶

Affiliations

¹ Department of Computer & Information Sciences, Universiti Teknologi PETRONAS (UTP), Seri Iskandar, Perak 32610, Malaysia.
² Centre for Artificial Intelligence Research and Optimization, Torrens University Australia, Fortitude Valley, QLD 4006, Australia.
³ Yonsei Frontier Lab, Yonsei University, Seoul, South Korea.
⁴ University Research and Innovation Center, Obuda University, 1034 Budapest, Hungary.
⁵ Faculty of Computing, College of Computing and Applied Sciences, Universiti Malaysia Pahang, Pekan 26600, Malaysia.
⁶ Electrical & Electronics Engineering, Universiti Teknologi PETRONAS (UTP), Seri Iskandar, Perak 32610, Malaysia.

Abstract

Human detection is an important computer vision task and a key component of global security and safety monitoring. Deep learning has recently improved human detection technology. Even so, few techniques yield networks that combine a small size, a deep architecture, and a fast training time while maintaining accuracy. ReSTiNet is a novel small convolutional neural network that addresses the problems of network size, detection speed, and accuracy. ReSTiNet incorporates fire modules, whose number and position in the network were evaluated to minimize the model parameters and network size. To improve the detection speed and accuracy of ReSTiNet, a residual block within the fire modules is designed to increase feature propagation and maximize information flow in the network. The approach compresses the well-known Tiny-YOLO architecture while offering: (i) a small model size, (ii) faster detection speed, (iii) mitigation of overfitting, and (iv) better performance than other compact networks such as SqueezeNet and MobileNet in terms of mAP on the PASCAL VOC and MS COCO datasets. ReSTiNet is 10.7 MB, five times smaller than Tiny-YOLO. On a Tesla K80, it achieves an mAP of 27.3% on MS COCO and 63.74% on PASCAL VOC. The proposed ReSTiNet model was validated on the INRIA person dataset using the Tesla K80.

• All the necessary steps, algorithms, and mathematical formulas for building the network are provided.
• The network is small in size yet offers faster detection speed with high accuracy.
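To illustrate why fire modules reduce the parameter count, the following minimal sketch compares a plain 3×3 convolution with a SqueezeNet-style fire module (a 1×1 squeeze layer followed by parallel 1×1 and 3×3 expand layers whose outputs are concatenated). The channel counts and the helper names `conv_params` and `fire_params` are illustrative assumptions, not the configuration used in ReSTiNet. When the concatenated output has as many channels as the input, an identity residual connection can be added without any extra parameters.

```python
def conv_params(c_in, c_out, k, bias=True):
    """Learnable parameters of a k x k convolution with c_in input and c_out output channels."""
    return k * k * c_in * c_out + (c_out if bias else 0)


def fire_params(c_in, squeeze, expand1x1, expand3x3):
    """Learnable parameters of a SqueezeNet-style fire module.

    The module applies a 1x1 squeeze convolution, then two parallel expand
    convolutions (1x1 and 3x3) whose outputs are concatenated channel-wise.
    """
    return (
        conv_params(c_in, squeeze, 1)
        + conv_params(squeeze, expand1x1, 1)
        + conv_params(squeeze, expand3x3, 3)
    )


# Hypothetical channel sizes chosen only for illustration (assumption).
C_IN = 128
SQUEEZE, EXPAND_1X1, EXPAND_3X3 = 16, 64, 64

plain = conv_params(C_IN, EXPAND_1X1 + EXPAND_3X3, 3)
fire = fire_params(C_IN, SQUEEZE, EXPAND_1X1, EXPAND_3X3)

# Output channels of the fire module after concatenation. An identity
# residual connection (output + input) needs these to match C_IN and
# contributes no additional parameters.
fire_out_channels = EXPAND_1X1 + EXPAND_3X3

print(f"plain 3x3 conv parameters: {plain}")
print(f"fire module parameters:    {fire}")
print(f"reduction factor:          {plain / fire:.1f}x")
print(f"identity residual allowed: {fire_out_channels == C_IN}")
```

With these assumed sizes, the fire module needs roughly a twelfth of the parameters of the plain convolution while producing the same number of output channels.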

Keywords: Computer vision; Human detection; Low memory devices; Object detection; ReSTiNet; Tiny convolutional neural network.