Why Dilated Convolutional Neural Networks: A Proof of Their Optimality

Entropy (Basel). 2021 Jun 18;23(6):767. doi: 10.3390/e23060767.

Abstract

Convolutional neural networks, built from convolutional layers, are among the most effective image processing techniques. In each such layer, the value of the layer's output signal at each point is a combination of the layer's input signals at several neighboring points. To improve accuracy, researchers have developed a version of this technique in which only data from some of the neighboring points is processed. It turns out that the most efficient case, called dilated convolution, is when we select the neighboring points whose differences from the given point in both coordinates are divisible by some constant. In this paper, we explain this empirical efficiency by proving that, for all reasonable optimality criteria, dilated convolution is indeed better than possible alternatives.

Keywords: convolutional neural networks; dilated neural networks; optimality.
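
As an illustration of the selection rule described in the abstract, the following is a minimal Python sketch, not the authors' implementation. The function names `dilated_offsets` and `dilated_conv2d`, the parameters `radius` and `step`, the dictionary-of-weights layout, and the zero padding at the image border are all assumptions made for this example. It keeps only the neighbors whose coordinate differences are both divisible by `step` and combines them as a weighted sum.

```python
import numpy as np


def dilated_offsets(radius, step):
    """Return the offsets (di, dj) inside a (2*radius+1)^2 window
    whose two coordinates are both divisible by `step` (hypothetical helper)."""
    return [
        (di, dj)
        for di in range(-radius, radius + 1)
        for dj in range(-radius, radius + 1)
        if di % step == 0 and dj % step == 0
    ]


def dilated_conv2d(image, weights, radius, step):
    """Weighted sum over the selected neighbors of every pixel.

    `weights` maps each offset (di, dj) to a scalar (an assumed layout).
    Neighbors that fall outside the image are treated as zero.
    """
    h, w = image.shape
    padded = np.pad(image, radius)  # zero padding on all sides
    out = np.zeros((h, w), dtype=float)
    for di, dj in dilated_offsets(radius, step):
        # Shifted view: entry (i, j) of the slice equals image[i + di, j + dj].
        shifted = padded[radius + di : radius + di + h,
                         radius + dj : radius + dj + w]
        out += weights[(di, dj)] * shifted
    return out


# Usage: a 5x5 window with step 2 keeps 9 of the 25 neighbors.
image = np.arange(25, dtype=float).reshape(5, 5)
offsets = dilated_offsets(radius=2, step=2)
weights = {offset: 1.0 for offset in offsets}
result = dilated_conv2d(image, weights, radius=2, step=2)
```

With `step=1` every neighbor in the window is kept, which corresponds to an ordinary (non-dilated) convolutional layer over the same window.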