Min-max saddle point games have recently been intensely studied, due to their wide range of applications, including training Generative Adversarial Networks~(GANs). However, most of the recent efforts for solving them are limited to special regimes such as convex-concave games. Further, it is customarily assumed that the underlying optimization problem is solved either by a single machine or in the case of multiple machines connected in centralized fashion, wherein each one communicates with a central node. The latter approach becomes challenging, when the underlying communications network has low bandwidth. In addition, privacy considerations may dictate that certain nodes can communicate with a subset of other nodes. Hence, it is of interest to develop methods that solve min-max games in a decentralized manner. To that end, we develop a decentralized adaptive momentum (ADAM)-type algorithm for solving min-max optimization problem under the condition that the objective function satisfies a Minty Variational Inequality condition, which is a generalization to convex-concave case. The proposed method overcomes shortcomings of recent non-adaptive gradient-based decentralized algorithms for min-max optimization problems that do not perform well in practice and require careful tuning. In this paper, we obtain non-asymptotic rates of convergence of the proposed algorithm (coined DADAM$^3$) for finding a (stochastic) first-order Nash equilibrium point and subsequently evaluate its performance on training GANs. The extensive empirical evaluation shows that DADAM$^3$ outperforms recently developed methods, including decentralized optimistic stochastic gradient for solving such min-max problems.
翻译:最近,由于应用范围很广,包括培训General Adversarial Networks~(GANs),对最底层马鞍游戏进行了深入的研究。然而,最近解决这些游戏的多数努力仅限于诸如 convex concave games等特殊机制。此外,人们通常认为,最底层的优化问题要么由单一机器解决,要么由集中式的多台机器解决,每个机器都与中央节点沟通。当基础通信网络的宽度较低时,后一种方法就变得具有挑战性。此外,隐私因素可能决定某些节点可以与其他节点的一组节点进行交流。因此,制定以分散化方式解决微点游戏的大部分努力。为此,我们开发了一种分散化的适应性动力(ADAM)型算法,在目标功能满足最微弱的挥发性不平等状态的条件下,这是对调的节流化节点的比较。 拟议的方法可以克服最近一些非适应性基调的基调的基调价格的节率评估的缺点,3 显示广泛的方法以分散式的方式解决微调调的平级的平级的平级的平压的平压。