In decentralized optimization environments, each agent $i$ in a network of $n$ nodes has its own private function $f_i$, and nodes communicate with their neighbors to cooperatively minimize the aggregate objective $\sum_{i=1}^n f_i$. In this setting, synchronizing the nodes' updates incurs significant communication overhead and computational costs, so much of the recent literature has focused on the analysis and design of asynchronous optimization algorithms, where agents activate and communicate at arbitrary times without needing a global synchronization enforcer. However, most works assume that when a node activates, it selects the neighbor to contact based on a fixed probability (e.g., uniformly at random), a choice that ignores the optimization landscape at the moment of activation. Instead, in this work we introduce an optimization-aware selection rule that chooses the neighbor providing the highest dual cost improvement (a quantity related to a dualization of the problem based on consensus). This scheme is related to the coordinate descent (CD) method with the Gauss-Southwell (GS) rule for coordinate updates; in our setting however, only a subset of coordinates is accessible at each iteration (because each node can communicate only with its neighbors), so the existing literature on GS methods does not apply. To overcome this difficulty, we develop a new analytical framework for smooth and strongly convex $f_i$ that covers the class of set-wise CD algorithms -- a class that directly applies to decentralized scenarios, but is not limited to them -- and we show that the proposed set-wise GS rule achieves a speedup factor of up to the maximum degree in the network (which is in the order of $\Theta(n)$ for highly connected graphs). The speedup predicted by our analysis is validated in numerical experiments with synthetic data.
翻译:在分散化的优化环境中,一个由美元组成的节点网络中的每个代理商$1美元都有自己的私人功能 $f_i$,并且与邻居通信,合作最小化总目标$sum ⁇ i=1 ⁇ n f_i$。在此背景下,同步节点更新会产生巨大的通信间接费用和计算成本,因此,最近的许多文献都侧重于分析和设计非同步优化算法,即代理商在任意时间在不需要全球同步执行器的情况下激活和通信。然而,大多数文献都假设当节点启动时,它根据固定概率选择邻居进行联系(例如,统一随机),这种选择忽略了启动时的优化场景。相反,我们在工作中引入了优化认知选择提供最高双重成本改进的邻居规则(数量与基于共识的问题的双重化有关) 。这个办法与调离子(CD) 方法与我们Gaus- Southwell(GS) 规则相协调,但基于固定的频率更新规则,而不是随机随机调,在我们设置的级别中, 只能使用一个可访问的基数的基点的基点 来显示我们现有的基数的基数 。