Rosenblatt's First Theorem and Frugality of Deep Learning

Alexander Kirdin; Sergey Sidorov; Nikolai Zolotykh

doi:10.3390/e24111635

Rosenblatt's First Theorem and Frugality of Deep Learning

Entropy (Basel). 2022 Nov 10;24(11):1635. doi: 10.3390/e24111635.

Authors

Alexander Kirdin^{1

2}, Sergey Sidorov¹, Nikolai Zolotykh¹

Affiliations

¹ Institute of Information Technologies, Mathematics and Mechanics, Lobachevsky State University, 603022 Nizhni Novgorod, Russia.
² Institute for Computational Modelling, Russian Academy of Sciences, Siberian Branch, 660036 Krasnoyarsk, Russia.

Abstract

The Rosenblatt's first theorem about the omnipotence of shallow networks states that elementary perceptrons can solve any classification problem if there are no discrepancies in the training set. Minsky and Papert considered elementary perceptrons with restrictions on the neural inputs: a bounded number of connections or a relatively small diameter of the receptive field for each neuron at the hidden layer. They proved that under these constraints, an elementary perceptron cannot solve some problems, such as the connectivity of input images or the parity of pixels in them. In this note, we demonstrated Rosenblatt's first theorem at work, showed how an elementary perceptron can solve a version of the travel maze problem, and analysed the complexity of that solution. We also constructed a deep network algorithm for the same problem. It is much more efficient. The shallow network uses an exponentially large number of neurons on the hidden layer (Rosenblatt's A-elements), whereas for the deep network, the second-order polynomial complexity is sufficient. We demonstrated that for the same complex problem, the deep network can be much smaller and reveal a heuristic behind this effect.

Keywords: classification; complexity; deep network; elementary perceptron; shallow network; travel maze problem.

Grants and funding

075-15-2020-808/Ministry of Science and Higher Education of the Russian Federation