Deep ConvNet: Non-Random Weight Initialization for Repeatable Determinism, Examined with FSGM

Sensors (Basel). 2021 Jul 13;21(14):4772. doi: 10.3390/s21144772.

Abstract

This paper presents a repeatable and deterministic non-random weight initialization method for the convolutional layers of neural networks, examined with the Fast Gradient Sign Method (FGSM). FGSM is used as a technique to measure the effect of the initialization under controlled distortions in transferred learning, by varying the numerical similarity of the datasets. The focus is on convolutional layers, where earlier learning is induced through the use of striped forms for image classification. This yielded higher accuracy in the first epoch, with improvements of 3-5% in a well-known benchmark model and ~10% on a color image dataset (MTARSI2) using a dissimilar model architecture. The proposed method is robust compared with limit-optimization approaches such as Glorot/Xavier and He initialization. Arguably, the approach falls within a new category of weight initialization methods, as it substitutes a number sequence for random numbers without being tethered to the dataset. When examined under the FGSM approach with transferred learning at higher distortions (numerically dissimilar datasets), the proposed method is less compromised against the original cross-validation dataset, retaining ~31% accuracy instead of ~9%. This indicates higher retention of the original fitting in transferred learning.
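For readers unfamiliar with the measurement technique named in the abstract, the Fast Gradient Sign Method perturbs an input by a small step of size epsilon in the sign direction of the loss gradient with respect to that input. The following is a minimal NumPy sketch of this idea on a toy logistic model; the function name `fgsm_perturb`, the weight vector `w`, and the epsilon value are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def fgsm_perturb(x, grad, eps):
    """Fast Gradient Sign Method: shift each input element by eps
    in the direction that increases the loss (sign of the gradient)."""
    return x + eps * np.sign(grad)

# Toy logistic model (illustrative): loss = -log(sigmoid(w.x)) for a
# true label of 1; the gradient of the loss w.r.t. x is (sigmoid(w.x) - 1) * w.
w = np.array([0.5, -1.0, 2.0])
x = np.array([1.0, 0.2, -0.3])
p = 1.0 / (1.0 + np.exp(-(w @ x)))   # model confidence in label 1
grad_x = (p - 1.0) * w               # analytic input gradient

x_adv = fgsm_perturb(x, grad_x, eps=0.1)

# The perturbed input stays within an L-infinity ball of radius eps,
# which is what makes the distortion "controlled" and comparable
# across datasets of varying numerical similarity.
assert np.max(np.abs(x_adv - x)) <= 0.1 + 1e-12
```

In the paper's setting, larger epsilon values stand in for numerically more dissimilar datasets, so accuracy under increasing perturbation serves as a proxy for robustness in transferred learning.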

Keywords: FSGM; adversarial perturbation attack; convolutional layers; machine learning; repeatable determinism; smart sensors; transferred learning; weight initialization.

MeSH terms

  • Humans
  • Machine Learning*
  • Male
  • Neural Networks, Computer*