贪婪对等平衡:非消费-非消费-非消费最优化的有效替代方法 (Greedy Adversarial Equilibrium: An Efficient Alternative to Nonconvex-Nonconcave Min-Max Optimization)

Min-max optimization of an objective function $f: \mathbb{R}^d \times \mathbb{R}^d \rightarrow \mathbb{R}$ is an important model for robustness in an adversarial setting, with applications to many areas including optimization, economics, and deep learning. In many applications $f$ may be nonconvex-nonconcave, and finding a global min-max point may be computationally intractable. There is a long line of work that seeks computationally tractable algorithms for alternatives to the min-max optimization model. However, many of the alternative models have solution points which are only guaranteed to exist under strong assumptions on $f$, such as convexity, monotonicity, or special properties of the starting point. We propose an optimization model, the $\varepsilon$-greedy adversarial equilibrium, and show that it can serve as a computationally tractable alternative to the min-max optimization model. Roughly, we say that a point $(x^\star, y^\star)$ is an $\varepsilon$-greedy adversarial equilibrium if $y^\star$ is an $\varepsilon$-approximate local maximum for $f(x^\star,\cdot)$, and $x^\star$ is an $\varepsilon$-approximate local minimum for a "greedy approximation" to the function $\max_z f(x, z)$ which can be efficiently estimated using second-order optimization algorithms. We prove the existence of such a point for any smooth function which is bounded and has Lipschitz Hessian. To prove existence, we introduce an algorithm that converges from any starting point to an $\varepsilon$-greedy adversarial equilibrium in a number of evaluations of the function $f$, the max-player's gradient $\nabla_y f(x,y)$, and its Hessian $\nabla^2_y f(x,y)$, that is polynomial in the dimension $d$, $1/\varepsilon$, and the bounds on $f$ and its Lipschitz constant.

翻译：目标函数的 Min- max 优化 $f: $, 美元, 美元, 美元, 美元, 美元, 美元, 美元, 美元, 美元, 美元, 美元, 一個數值。美元, 一個目標, 一個目標, 一個目標, 一個目標。在许多應用程式中, 美元, 可能不是Convex, 找到一個全球的 min- mass 點, 算法不易操作。有一大串工作, 以可計算的方式, 以利得的方式, 美元, 美元; 然而, 许多替代模型的解决方案点, 只有在$的强假设下才有保障存在。例如, 共性, 一元, 或深點, 美元, 數點, 數點, 數點, 數點, 數點, 數點, 數點, 數點, 數點, 數點, 數點, 數點, 美元, 數點, 數點, 數點, 數點, 數, 數, 數, 美元, 數, 數, 數點, 數, 數點, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數, 數,