We consider federated learning in tiered communication networks. Our network model consists of a set of silos, each holding a vertical partition of the data. Each silo contains a hub and a set of clients, with the silo's vertical data shard partitioned horizontally across its clients. We propose Tiered Decentralized Coordinate Descent (TDCD), a communication-efficient decentralized training algorithm for such two-tiered networks. To reduce communication overhead, the clients in each silo perform multiple local gradient steps before sharing updates with their hub. Each hub then updates its block of coordinates by averaging its clients' updates, and the hubs exchange intermediate updates with one another. We present a theoretical analysis of our algorithm and show the dependence of the convergence rate on the number of vertical partitions, the number of local updates, and the number of clients in each hub. We further validate our approach empirically via simulation-based experiments using a variety of datasets and objectives.
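To make the two-tier structure concrete, the following is a minimal sketch of a TDCD-style training loop on a synthetic least-squares problem. It is not the authors' exact algorithm: the loss, the even coordinate/sample splits, and the parameters K (silos), M (clients per silo), Q (local steps), T (communication rounds), and lr are all illustrative assumptions. Hubs exchange partial predictions as the "intermediate updates," clients take Q local gradient steps on their silo's coordinate block while holding the other silos' contributions fixed, and each hub averages its clients' results.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic least-squares problem: min_w (1/2n) ||X w - y||^2  (illustrative objective)
n, d = 200, 8
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.1 * rng.normal(size=n)

K = 2     # number of silos (vertical partitions of the d coordinates)
M = 4     # clients per silo (horizontal partitions of the n samples)
Q = 5     # local gradient steps between hub communications
T = 50    # hub communication rounds
lr = 0.05

# Hypothetical even splits: coordinate blocks per silo, row blocks per client
col_blocks = np.array_split(np.arange(d), K)
row_blocks = np.array_split(np.arange(n), M)

# Each silo k maintains its own coordinate block w_k
w_blocks = [np.zeros(len(cols)) for cols in col_blocks]

for t in range(T):
    # Hubs exchange intermediate updates: partial predictions X_k w_k for all samples
    partial = [X[:, cols] @ w_blocks[k] for k, cols in enumerate(col_blocks)]
    total_pred = np.sum(partial, axis=0)

    new_w_blocks = []
    for k, cols in enumerate(col_blocks):
        # Contribution of the other silos is held fixed (stale) during local steps
        others = total_pred - partial[k]
        client_updates = []
        for rows in row_blocks:
            wk = w_blocks[k].copy()
            Xk = X[np.ix_(rows, cols)]          # this client's rows, this silo's columns
            for _ in range(Q):
                resid = Xk @ wk + others[rows] - y[rows]
                grad = Xk.T @ resid / len(rows)
                wk -= lr * grad
            client_updates.append(wk)
        # Hub k averages its clients' updated coordinate blocks
        new_w_blocks.append(np.mean(client_updates, axis=0))
    w_blocks = new_w_blocks

# Reassemble the full coordinate vector from the silo blocks
w_est = np.zeros(d)
for k, cols in enumerate(col_blocks):
    w_est[cols] = w_blocks[k]
print("estimation error:", np.linalg.norm(w_est - w_true))
```

Increasing Q reduces how often clients communicate with their hub but makes the other silos' contributions staler during local steps, which is the trade-off the convergence analysis quantifies.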