New optimization algorithms for neural network training using operator splitting techniques

Neural Netw. 2020 Jun;126:178-190. doi: 10.1016/j.neunet.2020.03.018. Epub 2020 Mar 26.

Abstract

In this paper we present a new class of optimization algorithms adapted to neural network training. These algorithms are based on a sequential operator splitting technique applied to dynamical systems associated with the optimization problem. Furthermore, we investigate through numerical simulations the empirical rate of convergence of these iterative schemes toward a local minimum of the loss function, for suitable choices of the underlying hyper-parameters. We validate the convergence of these optimizers using the accuracy and loss values obtained on the MNIST, Fashion-MNIST and CIFAR-10 classification datasets.
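The abstract does not reproduce the authors' schemes; as a rough illustration of the general idea, the sketch below applies a sequential (Lie) splitting step to the heavy-ball ODE x'' + gamma*x' + grad f(x) = 0, a dynamical system commonly associated with momentum-type training methods. The function names, the step size h, the damping gamma and the quadratic toy loss are illustrative assumptions, not the algorithms proposed in the paper.

import numpy as np

def lie_splitting_step(x, v, grad_f, h, gamma):
    # Heavy-ball ODE as a first-order system: x' = v,  v' = -gamma*v - grad f(x).
    # Sequential (Lie) splitting: solve the two sub-flows one after the other.
    # Sub-flow B ("kick"): x' = 0, v' = -grad f(x), solved exactly over h.
    v = v - h * grad_f(x)
    # Sub-flow A ("drift with friction"): x' = v, v' = -gamma*v, solved exactly over h.
    decay = np.exp(-gamma * h)
    x = x + (1.0 - decay) / gamma * v
    v = decay * v
    return x, v

# Toy usage on a quadratic loss f(x) = 0.5 * ||A x - b||^2 (hypothetical test problem)
rng = np.random.default_rng(0)
A = rng.standard_normal((20, 5))
b = rng.standard_normal(20)
grad_f = lambda x: A.T @ (A @ x - b)

x, v = np.zeros(5), np.zeros(5)
for _ in range(500):
    x, v = lie_splitting_step(x, v, grad_f, h=0.05, gamma=2.0)
print(0.5 * np.linalg.norm(A @ x - b) ** 2)  # approaches the least-squares optimum

Composing the exactly solved sub-flows yields a momentum-like update with damping factor exp(-gamma*h), which is one simple way operator splitting turns a continuous dynamical system into an iterative optimizer.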

Keywords: CIFAR10; Dynamical system; MNIST; Nesterov; Neural network; Splitting.

MeSH terms

  • Algorithms*
  • Neural Networks, Computer*