We consider stability issues in minimizing a continuous (probably parameterized, nonconvex and nonsmooth) real-valued function $f$. We call a point stationary if all its possible directional derivatives are nonnegative. In this work, we focus on two notions of stability on stationary points of $f$: parametric stability and convergence stability. Parametric considerations are widely studied in various fields, including smoothed analysis, numerical stability, condition numbers and sensitivity analysis for linear programming. Parametric stability asks whether minor perturbations on parameters lead to dramatic changes in the position and $f$ value of a stationary point. Meanwhile, convergence stability indicates a non-escapable solution: Any point sequence iteratively produced by an optimization algorithm cannot escape from a neighborhood of a stationary point but gets close to it in the sense that such stationary points are stable to the precision parameter and algorithmic numerical errors. It turns out that these notions have deep connections to geometry theory. We show that parametric stability is linked to deformations of graphs of functions. On the other hand, convergence stability is concerned with area partitioning of the function domain. Utilizing these connections, we prove quite tight conditions of these two stability notions for a wide range of functions and optimization algorithms with small enough step sizes and precision parameters. These conditions are subtle in the sense that a slightly weaker function requirement goes to the opposite of primitive intuitions and leads to wrong conclusions. We present three applications of this theory. These applications reveal some understanding on Nash equilibrium computation, nonconvex and nonsmooth optimization, as well as the new optimization methodology of deep neural networks.
翻译:我们考虑的是稳定问题,最大限度地减少连续(可能是参数化的、非混凝土的和不毛的)实际价值的美元(ff美元)的功能。如果所有可能的定向衍生物都是非负面的,我们称之为一个点固定。在这项工作中,我们侧重于固定点上的两个稳定概念:参数稳定性和趋同稳定性。在各个领域,包括平滑分析、数字稳定性、条件数数和对线性编程的敏感性分析,对参数的轻微扰动是否会导致一个固定点的位置和美元值的急剧变化。同时,趋同稳定性稳定性稳定性表明一种不可避免的解决办法:由优化算法产生的任何点迭代序列都无法从一个固定点的附近逃脱,但从这种固定点稳定与精确参数和算法误差的意义上看,这些概念与测线理论的精度有密切联系。我们发现,对参数的轻微扭曲性的稳定性是否导致一个不固定性的变化,另一方面,趋同精确度的精确度的精确度的精确度的精确度的精确度的精确度的精确度的精确度是这些关系。我们发现,这些精确度的精确度和精确度的精确度的精确度的精确度的精度的精确度的精确度的精确度的精确度的精确度的精确度的精确度是这些联系。