解决最优化问题等级的分散适应动力方法 (A Decentralized Adaptive Momentum Method for Solving a Class of Min-Max Optimization Problems)

Min-max saddle point games have recently been intensely studied, due to their wide range of applications, including training Generative Adversarial Networks (GANs). However, most of the recent efforts for solving them are limited to special regimes such as convex-concave games. Further, it is customarily assumed that the underlying optimization problem is solved either by a single machine or in the case of multiple machines connected in centralized fashion, wherein each one communicates with a central node. The latter approach becomes challenging, when the underlying communications network has low bandwidth. In addition, privacy considerations may dictate that certain nodes can communicate with a subset of other nodes. Hence, it is of interest to develop methods that solve min-max games in a decentralized manner. To that end, we develop a decentralized adaptive momentum (ADAM)-type algorithm for solving min-max optimization problem under the condition that the objective function satisfies a Minty Variational Inequality condition, which is a generalization to convex-concave case. The proposed method overcomes shortcomings of recent non-adaptive gradient-based decentralized algorithms for min-max optimization problems that do not perform well in practice and require careful tuning. In this paper, we obtain non-asymptotic rates of convergence of the proposed algorithm (coined DADAM$^3$) for finding a (stochastic) first-order Nash equilibrium point and subsequently evaluate its performance on training GANs. The extensive empirical evaluation shows that DADAM$^3$ outperforms recently developed methods, including decentralized optimistic stochastic gradient for solving such min-max problems.

翻译：最近,由于应用范围很广,包括培训Genemental Adversarial Networks(GANs)等应用范围很广,对最底层马鞍游戏进行了深入的研究。然而,最近解决这些游戏的多数努力仅限于诸如 convex concave games等特殊机制。此外,人们通常认为,最底层的优化问题要么由单一机器解决,要么由集中式的多台机器解决,每个机器都与中央节点沟通。当基础通信网络的宽度较低时,后一种方法就变得具有挑战性。此外,隐私因素可能决定某些节点可以与其他节点的一组节点进行交流。因此,开发以分散方式解决微量游戏游戏等特殊机制。为此,我们开发了一种分散式的适应性动力(ADAM)型算法,在目标功能满足微量挥发性不平等状态的条件下,这是对调调价调情况的一种一般化。拟议方法可以克服最近一些不适应性基调的基调美元与其他节点的节点通信网点进行交流。因此,很有兴趣地设计了以分散式的节流的方法, MAMAMADL3 级的平级算法,我们最近需要对硬化的伸缩的不进行精确的平压方法,从而进行细的平压的平压。