Learning in Convolutional Neural Networks Accelerated by Transfer Entropy

Adrian Moldovan; Angel Caţaron; Răzvan Andonie

doi:10.3390/e23091218

Learning in Convolutional Neural Networks Accelerated by Transfer Entropy

Entropy (Basel). 2021 Sep 16;23(9):1218. doi: 10.3390/e23091218.

Authors

Adrian Moldovan^{1

2}, Angel Caţaron^{1

2}, Răzvan Andonie^{1

3}

Affiliations

¹ Department of Electronics and Computers, Transilvania University, 500024 Braşov, Romania.
² Technology, Siemens SRL, 500007 Braşov, Romania.
³ Department of Computer Science, Central Washington University, Ellensburg, WA 98926, USA.

Abstract

Recently, there is a growing interest in applying Transfer Entropy (TE) in quantifying the effective connectivity between artificial neurons. In a feedforward network, the TE can be used to quantify the relationships between neuron output pairs located in different layers. Our focus is on how to include the TE in the learning mechanisms of a Convolutional Neural Network (CNN) architecture. We introduce a novel training mechanism for CNN architectures which integrates the TE feedback connections. Adding the TE feedback parameter accelerates the training process, as fewer epochs are needed. On the flip side, it adds computational overhead to each epoch. According to our experiments on CNN classifiers, to achieve a reasonable computational overhead-accuracy trade-off, it is efficient to consider only the inter-neural information transfer of the neuron pairs between the last two fully connected layers. The TE acts as a smoothing factor, generating stability and becoming active only periodically, not after processing each input sample. Therefore, we can consider the TE is in our model a slowly changing meta-parameter.

Keywords: Convolutional Neural Network; causality; deep learning; transfer entropy.