This work examines adaptive distributed learning strategies designed to operate under communication constraints. We consider a network of agents that must solve an online optimization problem from continual observation of streaming data. The agents implement a distributed cooperative strategy where each agent is allowed to perform local exchange of information with its neighbors. In order to cope with communication constraints, the exchanged information must be unavoidably compressed. We propose a diffusion strategy nicknamed as ACTC (Adapt-Compress-Then-Combine), which relies on the following steps: i) an adaptation step where each agent performs an individual stochastic-gradient update with constant step-size; ii) a compression step that leverages a recently introduced class of stochastic compression operators; and iii) a combination step where each agent combines the compressed updates received from its neighbors. The distinguishing elements of this work are as follows. First, we focus on adaptive strategies, where constant (as opposed to diminishing) step-sizes are critical to respond in real time to nonstationary variations. Second, we consider the general class of directed graphs and left-stochastic combination policies, which allow us to enhance the interplay between topology and learning. Third, in contrast with related works that assume strong convexity for all individual agents' cost functions, we require strong convexity only at a network level, a condition satisfied even if a single agent has a strongly-convex cost and the remaining agents have non-convex costs. Fourth, we focus on a diffusion (as opposed to consensus) strategy. Under the demanding setting of compressed information, we establish that the ACTC iterates fluctuate around the desired optimizer, achieving remarkable savings in terms of bits exchanged between neighboring agents.
翻译:这项工作考察了适应性分布式学习战略,目的是在通信限制下运作。 我们考虑的是, 一个代理商网络, 它必须从持续观测流数据中解决在线优化问题。 代理商实施一个分布式合作战略, 允许每个代理商与邻居进行本地信息交流。 为了应对通信限制, 交换的信息必须不可避免压缩。 我们建议了一个推广战略, 名称为ACT( Adapt- Compress- then-Combine), 它依赖于以下步骤 : (一) 一个适应步骤, 每个代理商必须用不断的步数来持续观测流数据, 解决在线优化问题。 (二) 一个压缩步骤, 利用最近推出的随机压缩操作压缩操作操作操作操作员类别; 和 (三) 一个组合步骤, 每个代理商可以合并从邻居那里收到的压缩更新的更新信息。 这项工作的区别性内容如下。 首先, 我们侧重于适应性战略, 持续( 而不是不断减少) 级规模对于实时应对不固定的变化至关重要。 第二, 我们考虑普通的直线图形和左端混合组合政策, 将常规的类别在不断调整中, 。 使得我们开始在高端的服务器上, 在高端网络上, 运行中, 运行中, 需要一种稳定的网络上, 高端的连接的连接的连接的连接的网络功能的连接 。