Motivated by decentralized sensing and policy evaluation problems, we consider a particular type of distributed optimization problem that involves averaging several stochastic, online observations on a network. We design a dual-based method for this consensus problem with Polyak--Ruppert averaging and analyze its behavior. We show that this algorithm attains an accelerated deterministic error depending optimally on the condition number of the network, and also that it has order-optimal stochastic error. This improves on the guarantees of state-of-the-art distributed optimization algorithms when specialized to this setting, and yields -- among other things -- corollaries for decentralized policy evaluation. Our proofs rely on explicitly studying the evolution of several relevant linear systems, and may be of independent interest. Numerical experiments are provided, which validate our theoretical results and demonstrate that our approach outperforms existing methods in finite-sample scenarios on several natural network topologies.
翻译:在分散遥感和政策评估问题的推动下,我们考虑了一种特定的分布式优化问题,它涉及在网络上平均进行数个随机在线观测。我们设计了一种双基方法,用Polyak-Ruppert的平均值来应对这一共识问题,并分析其行为。我们表明,这种算法取得了加速的确定性错误,最理想地取决于网络的条件数量,而且它有定点的最佳随机误差。这改进了在专门为此设置时对最先进的分布式优化算法的保障,并产生了 -- -- 除其他外 -- -- 分散化政策评估的卷轴。我们的证据依赖于明确研究若干相关线性系统的演变,并可能具有独立的兴趣。我们提供了数字实验,这些实验证实了我们的理论结果,并表明我们的方法在一些自然网络图理学上超越了在有限抽样假设中的现有方法。