A Novel Image Classification Method Based on Residual Network, Inception, and Proposed Activation Function

Ali Abdullah Yahya; Kui Liu; Ammar Hawbani; Yibin Wang; Ali Naser Hadi

doi:10.3390/s23062976

A Novel Image Classification Method Based on Residual Network, Inception, and Proposed Activation Function

Sensors (Basel). 2023 Mar 9;23(6):2976. doi: 10.3390/s23062976.

Authors

Ali Abdullah Yahya¹, Kui Liu¹, Ammar Hawbani², Yibin Wang¹, Ali Naser Hadi³

Affiliations

¹ School of Computer and Information, Anqing Normal University, Anqing 246011, China.
² School of Computer and Technology, University of Science and Technology of China, Hefei 230027, China.
³ School of Computer and Information, Hefei University of Technology, Hefei 230009, China.

Abstract

In deeper layers, ResNet heavily depends on skip connections and Relu. Although skip connections have demonstrated their usefulness in networks, a major issue arises when the dimensions between layers are not consistent. In such cases, it is necessary to use techniques such as zero-padding or projection to match the dimensions between layers. These adjustments increase the complexity of the network architecture, resulting in an increase in parameter number and a rise in computational costs. Another problem is the vanishing gradient caused by utilizing Relu. In our model, after making appropriate adjustments to the inception blocks, we replace the deeper layers of ResNet with modified inception blocks and Relu with our non-monotonic activation function (NMAF). To reduce parameter number, we use symmetric factorization and 1×1 convolutions. Utilizing these two techniques contributed to reducing the parameter number by around 6 M parameters, which has helped reduce the run time by 30 s/epoch. Unlike Relu, NMAF addresses the deactivation problem of the non-positive number by activating the negative values and outputting small negative numbers instead of zero in Relu, which helped in enhancing the convergence speed and increasing the accuracy by 5%, 15%, and 5% for the non-noisy datasets, and 5%, 6%, 21% for non-noisy datasets.

Keywords: 1 × 1 convolutions; inception; non-monotonic activation function (NMAF); residual networks; symmetric factorization.

Grants and funding

61603003/National Natural Science Foundation of China