We develop a theoretical foundation for the application of Nesterov's accelerated gradient descent method (AGD) to the approximation of solutions of a wide class of partial differential equations (PDEs). This is achieved by proving the existence of an invariant set and exponential convergence rates when its preconditioned version (PAGD) is applied to minimize locally Lipschitz smooth, strongly convex objective functionals. We introduce a second-order ordinary differential equation (ODE) with a preconditioner built-in and show that PAGD is an explicit time-discretization of this ODE, which requires a natural time step restriction for energy stability. At the continuous time level, we show an exponential convergence of the ODE solution to its steady state using a simple energy argument. At the discrete level, assuming the aforementioned step size restriction, the existence of an invariant set is proved and a matching exponential rate of convergence of the PAGD scheme is derived by mimicking the energy argument and the convergence at the continuous level. Applications of the PAGD method to numerical PDEs are demonstrated with certain nonlinear elliptic PDEs using pseudo-spectral methods for spatial discretization, and several numerical experiments are conducted. The results confirm the global geometric and mesh size-independent convergence of the PAGD method, with an accelerated rate that is improved over the preconditioned gradient descent (PGD) method.
翻译:我们为采用Nesterov的加速梯度下降法(AGD)来接近一系列局部差异方程式(PDEs)的解决方案的近似解决方案奠定了一个理论基础。在持续的时间层面,我们用简单的能源参数来证明ODE解决方案与稳定状态的指数趋同率。在离散的层面上,假设上述步数限制,就证明存在一个惯性方程式,通过模拟能源参数和连续水平的趋同率来推导PAGD计划的指数趋同率。 将PAGD方法应用到数字PDE方法(使用伪光谱级的PDE方法)与某些非直线性PDE方法对能源稳定性进行明确的时间分解。 在离散的层面上,我们用一个简单的能源参数来证明ODE方法与其稳定状态的指数趋同率呈指数趋同率。 在离散的分级模型中,将快速递增的PAGDGD方法与某些非直线性PDE方法进行对比。