Accelerated gradient methods are the cornerstones of large-scale, data-driven optimization problems that arise naturally in machine learning and other fields concerning data analysis. We introduce a gradient-based optimization framework for achieving acceleration, based on the recently introduced notion of fixed-time stability of dynamical systems. The method presents itself as a generalization of simple gradient-based methods suitably scaled to achieve convergence to the optimizer in a fixed-time, independent of the initialization. We achieve this by first leveraging a continuous-time framework for designing fixed-time stable dynamical systems, and later providing a consistent discretization strategy, such that the equivalent discrete-time algorithm tracks the optimizer in a practically fixed number of iterations. We also provide a theoretical analysis of the convergence behavior of the proposed gradient flows, and their robustness to additive disturbances for a range of functions obeying strong convexity, strict convexity, and possibly nonconvexity but satisfying the Polyak-{\L}ojasiewicz inequality. We also show that the regret bound on the convergence rate is constant by virtue of the fixed-time convergence. The hyperparameters have intuitive interpretations and can be tuned to fit the requirements on the desired convergence rates. We validate the accelerated convergence properties of the proposed schemes on a range of numerical examples against the state-of-the-art optimization algorithms. Our work provides insights on developing novel optimization algorithms via discretization of continuous-time flows.
翻译:加速梯度方法是机械学习和其他数据分析领域自然产生的大规模、数据驱动优化问题的基石。我们根据最近推出的动态系统固定时间稳定性概念,采用基于梯度的优化框架来加速实现加速。该方法本身是简单的基于梯度的方法的概括化,其规模可以适当调整,以便在固定时间与优化一致,而不必与初始化无关。我们首先利用一个持续的时间框架来设计固定时间稳定的动态系统,然后提供一致的离散战略,例如等同的离散时间算法在几乎固定的迭代数中跟踪优化者。我们还从理论上分析了拟议梯度流的趋同行为,以及它们对一系列功能的累加性干扰的稳健性,这些功能服从强的趋同性、严格的趋同性、可能不相趋同性,但满足了Polyak-L}ojasiewicz的不平等性。我们还表明,由于固定时间趋同性趋同性,对趋同率的遗憾程度与实际固定时间趋同性相适应。超常数的梯度算法性调整了预期的趋同性趋同性趋同性比率。我们提出的压性平比性平级平级平级平级办法,使我们的轨平级平时平级平级平级平级平比的轨率性地解释。我们提出的平时平级平级平级平级平的轨数。