Approximation of smooth functionals using deep ReLU networks

Neural Netw. 2023 Sep:166:424-436. doi: 10.1016/j.neunet.2023.07.012. Epub 2023 Jul 18.

Abstract

In recent years, deep neural networks have been employed to approximate nonlinear continuous functionals F defined on Lp([-1,1]s) for 1≤p≤∞. However, the existing theoretical analysis in the literature either is unsatisfactory due to the poor approximation results, or does not apply to the rectified linear unit (ReLU) activation function. This paper aims to investigate the approximation power of functional deep ReLU networks in two settings: F is continuous with restrictions on the modulus of continuity, and F has higher order Fréchet derivatives. A novel functional network structure is proposed to extract features of higher order smoothness harbored by the target functional F. Quantitative rates of approximation in terms of the depth, width and total number of weights of neural networks are derived for both settings. We give logarithmic rates when measuring the approximation error on the unit ball of a Hölder space. In addition, we establish nearly polynomial rates (i.e., rates of the form exp-a(logM)b with a>0,0<b<1) when measuring the approximation error on a space of analytic functions.

Keywords: Approximation theory; Deep learning theory; Fréchet derivative; Polynomial rates; ReLU; Smooth functionals.

MeSH terms

  • Algorithms*
  • Neural Networks, Computer*