The minimax optimization over Riemannian manifolds (possibly nonconvex constraints) has been actively applied to solve many problems, such as robust dimensionality reduction and deep neural networks with orthogonal weights (Stiefel manifold). Although many optimization algorithms for minimax problems have been developed in the Euclidean setting, it is difficult to convert them into Riemannian cases, and algorithms for nonconvex minimax problems with nonconvex constraints are even rare. On the other hand, to address the big data challenges, decentralized (serverless) training techniques have recently been emerging since they can reduce communications overhead and avoid the bottleneck problem on the server node. Nonetheless, the algorithm for decentralized Riemannian minimax problems has not been studied. In this paper, we study the distributed nonconvex-strongly-concave minimax optimization problem over the Stiefel manifold and propose both deterministic and stochastic minimax methods. The Steifel manifold is a non-convex set. The global function is represented as the finite sum of local functions. For the deterministic setting, we propose DRGDA and prove that our deterministic method achieves a gradient complexity of $O( \epsilon^{-2})$ under mild conditions. For the stochastic setting, we propose DRSGDA and prove that our stochastic method achieves a gradient complexity of $O(\epsilon^{-4})$. The DRGDA and DRSGDA are the first algorithms for distributed minimax optimization with nonconvex constraints with exact convergence. Extensive experimental results on the Deep Neural Networks (DNNs) training over the Stiefel manifold demonstrate the efficiency of our algorithms.
翻译:里格曼方程式的迷你最大优化( 可能无法调试 ) 已被积极用于解决许多问题, 比如强力的维度降低和高神经网络( Stiefel 元件 ) 。 虽然在 Euclidean 设置中已经开发了许多针对迷你最大问题的优化算法, 但是很难将其转换成里格曼尼特案例, 而在非康纳特限制下, 非康纳克斯小型问题的算法甚至甚至很少。 另一方面, 为了应对大数据挑战, 最近出现了分散( 无服务器) 培训技术, 因为这些技术可以减少通信管理, 避免服务器节点的瓶颈问题 。 尽管如此, 尚未研究分散的里格曼尼特小算法问题 。 在本文中, 我们研究在斯特里菲勒 中分布的非convex- concol- conconmax 优化问题, 并提议稳定性RS- 和 Stochacial listal IMex 方法。 Stteifel 是非康ladeal- dal- dal- dal- dal- ladeal ladeal laft lade ex lax lax 。