Feedforward neural networks initialization based on discriminant learning

Kateryna Chumachenko; Alexandros Iosifidis; Moncef Gabbouj

doi:10.1016/j.neunet.2021.11.020

Neural Netw. 2022 Feb:146:220-229. doi: 10.1016/j.neunet.2021.11.020. Epub 2021 Nov 25.

Authors

Kateryna Chumachenko¹, Alexandros Iosifidis², Moncef Gabbouj³

Affiliations

¹ Faculty of Information Technology and Communication Sciences, Tampere University, FI 33720, Tampere, Finland. Electronic address: kateryna.chumachenko@tuni.fi.
² Department of Electrical and Computer Engineering, Aarhus University, DK 8200, Aarhus, Denmark. Electronic address: ai@ece.au.dk.
³ Faculty of Information Technology and Communication Sciences, Tampere University, FI 33720, Tampere, Finland. Electronic address: moncef.gabbouj@tuni.fi.

PMID: 34902796

Abstract

In this paper, a novel data-driven method for weight initialization of Multilayer Perceptrons and Convolutional Neural Networks based on discriminant learning is proposed. The approach relaxes some of the limitations of competing data-driven methods, including unimodality assumptions, restrictions on the architectures imposed by the limited maximal dimensionality of the corresponding projection spaces, and high computational requirements due to the need for eigendecomposition of high-dimensional data. We also consider the assumptions the method makes about the data and propose a way to account for them in the form of a new normalization layer. Experiments on three large-scale image datasets show improved accuracy of the trained models compared to competing random-based and data-driven weight initialization methods, as well as better convergence properties in certain cases.
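
For context, the limitations attributed above to competing data-driven methods can be illustrated with classical Linear Discriminant Analysis (LDA) used to initialize a single dense layer. The sketch below is not the method proposed in the paper; it is a generic baseline written under stated assumptions. The function name `lda_init_weights` and the regularization parameter `reg` are hypothetical. It shows that the projection requires an eigendecomposition of a d x d matrix and yields at most C - 1 useful directions for C classes, which caps the width of a layer initialized this way.

```python
import numpy as np

def lda_init_weights(X, y, reg=1e-3):
    """Illustrative baseline (not the paper's method): return a (d, C-1)
    matrix of LDA directions that could initialize a dense layer."""
    classes = np.unique(y)
    d = X.shape[1]
    mean_all = X.mean(axis=0)
    S_w = np.zeros((d, d))  # within-class scatter
    S_b = np.zeros((d, d))  # between-class scatter
    for c in classes:
        X_c = X[y == c]
        mean_c = X_c.mean(axis=0)
        centered = X_c - mean_c
        S_w += centered.T @ centered
        diff = (mean_c - mean_all)[:, None]
        S_b += X_c.shape[0] * (diff @ diff.T)
    S_w += reg * np.eye(d)  # assumed regularizer to keep S_w invertible

    # Whiten with the Cholesky factor of S_w so the problem becomes symmetric.
    L = np.linalg.cholesky(S_w)
    A = np.linalg.solve(L, S_b)        # L^{-1} S_b
    M = np.linalg.solve(L, A.T)        # L^{-1} S_b L^{-T}
    M = 0.5 * (M + M.T)                # remove round-off asymmetry

    # Eigendecomposition of a d x d matrix: costly when d is large.
    eigvals, U = np.linalg.eigh(M)     # eigenvalues in ascending order
    k = len(classes) - 1               # S_b has rank at most C - 1
    top = np.argsort(eigvals)[::-1][:k]
    W = np.linalg.solve(L.T, U[:, top])  # map back to the input space
    return W / np.linalg.norm(W, axis=0, keepdims=True)
```

With 3 classes and 5-dimensional inputs, the returned matrix has shape (5, 2): a layer wider than C - 1 units cannot be filled from this projection alone, which is the architectural restriction the abstract refers to.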

Keywords: Discriminant learning; Neural networks initialization.

MeSH terms

Learning
Machine Learning*
Neural Networks, Computer*