Gradient-based methods for two-player games produce rich dynamics that can solve challenging problems, yet can be difficult to stabilize and understand. Part of this complexity originates from the discrete update steps given by simultaneous or alternating gradient descent, which causes each player to drift away from the continuous gradient flow -- a phenomenon we call discretization drift. Using backward error analysis, we derive modified continuous dynamical systems that closely follow the discrete dynamics. These modified dynamics provide an insight into the notorious challenges associated with zero-sum games, including Generative Adversarial Networks. In particular, we identify distinct components of the discretization drift that can alter performance and in some cases destabilize the game. Finally, quantifying discretization drift allows us to identify regularizers that explicitly cancel harmful forms of drift or strengthen beneficial forms of drift, and thus improve performance of GAN training.
翻译:两个玩家游戏的渐进式渐进式方法产生丰富的动态,可以解决具有挑战性的问题,但可能难以稳定和理解。这种复杂因素的一部分源于同时或交替梯度下降带来的离散更新步骤,这些步骤导致每个玩家远离连续的梯度流 -- -- 我们称之为离散漂移现象。我们利用后向错误分析,得出了与离散动态密切关联的经修改的连续动态系统。这些经修改的动态提供了对零和游戏相关臭名昭著的挑战的洞察力,包括基因反向网络。特别是,我们确定了可改变性能的离散漂移的不同组成部分,并在某些情况下破坏了游戏的稳定性。最后,对离散漂移的量化使我们能够识别明确取消有害漂移形式或加强有益漂移形式的常规化者,从而改进GAN培训的绩效。