摘要

In recent years, deep neural networks have been employed to approximate nonlinear continuous functionals F defined on L-p([-1, 1](s)) for 1 < p < 8. However, the existing theoretical analysis in the literature either is unsatisfactory due to the poor approximation results, or does not apply to the rectified linear unit (ReLU) activation function. This paper aims to investigate the approximation power of functional deep ReLU networks in two settings: F is continuous with restrictions on the modulus of continuity, and F has higher order Frechet derivatives. A novel functional network structure is proposed to extract features of higher order smoothness harbored by the target functional F. Quantitative rates of approximation in terms of the depth, width and total number of weights of neural networks are derived for both settings. We give logarithmic rates when measuring the approximation error on the unit ball of a Holder space. In addition, we establish nearly polynomial rates (i.e., rates of the form exp (-a(logM)(b)) with a > 0, 0 < b < 1) when measuring the approximation error on a space of analytic functions.

  • 单位
    北京航空航天大学