Conventional frequentist learning, as assumed by existing federated learning protocols, is limited in its ability to quantify uncertainty, incorporate prior knowledge, guide active learning, and enable continual learning. Bayesian learning provides a principled approach to address all these limitations, at the cost of an increase in computational complexity. This paper studies distributed Bayesian learning in a wireless data center setting encompassing a central server and multiple distributed workers. Prior work on wireless distributed learning has focused exclusively on frequentist learning, and has introduced the idea of leveraging uncoded transmission to enable "over-the-air" computing. Unlike frequentist learning, Bayesian learning aims at evaluating approximations or samples from a global posterior distribution in the model parameter space. This work investigates for the first time the design of distributed one-shot, or "embarrassingly parallel", Bayesian learning protocols in wireless data centers via consensus Monte Carlo (CMC). Uncoded transmission is introduced not only as a way to implement "over-the-air" computing, but also as a mechanism to deploy channel-driven MC sampling: Rather than treating channel noise as a nuisance to be mitigated, channel-driven sampling utilizes channel noise as an integral part of the MC sampling process. A simple wireless CMC scheme is first proposed that is asymptotically optimal under Gaussian local posteriors. Then, for arbitrary local posteriors, a variational optimization strategy is introduced. Simulation results demonstrate that, if properly accounted for, channel noise can indeed contribute to MC sampling and does not necessarily decrease the accuracy level.
翻译:现有联邦学习协议所假设的常规常年学习,其量化不确定性、吸收先前知识、指导积极学习和不断学习的能力有限。 巴伊西亚学习以计算复杂性的增加为代价,提供了解决所有这些限制的原则性方法。本文研究将巴伊西亚学习分散在一个无线数据中心,由中央服务器和多分布工人组成。无线分布式学习先前的工作完全侧重于常年学习,并引入了利用未编码传输实现“超空”计算的想法。与常年学习不同,巴伊西亚学习旨在从模型参数空间的全球精确分布中正确计算近似或样本。这项工作首次调查了分发的一发或“平行”的图像设计,通过共识蒙特卡洛(CMC)在无线数据中心中传播拜伊斯学习协议。 引入了无编码传输不仅作为实施“超空”计算的一种方法,而且作为部署频道驱动的监控取样的一种机制:而不是将频道噪音作为正确核算的准确的直径分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式分布式学习,而不是在模型中,通过一个最佳的循环浏览式浏览式浏览式浏览式浏览式浏览式浏览式浏览式浏览式浏览式浏览式浏览式浏览式浏览式浏览式取样器系统,而将开始采用一个完整的取样式取样制成一个拟议系统,而将采用一个整体式取样式取样式取样制制成一个整体式取样制制成一个比例。