Federated learning (FL) is a new machine learning paradigm, the goal of which is to build a machine learning model based on data sets distributed on multiple devices--so called Isolated Data Island--while keeping their data secure and private. Most existing work manually splits commonly-used public datasets into partitions to simulate real-world Isolated Data Island while failing to capture the intrinsic characteristics of real-world domain data, like medicine, finance or AIoT. To bridge this huge gap, this paper presents and characterizes an Isolated Data Island benchmark suite, named FLBench, for benchmarking federated learning algorithms. FLBench contains three domains: medical, financial and AIoT. By configuring various domains, FLBench is qualified for evaluating the important research aspects of federated learning, and hence become a promising platform for developing novel federated learning algorithms. Finally, FLBench is fully open-sourced and in fast-evolution. We package it as an automated deployment tool. The benchmark suite will be publicly available from http://www.benchcouncil.org/FLBench.
翻译:联邦学习(FL)是一种新的机器学习模式,其目的是根据在多种设备(即所谓的孤立数据岛)上分布的数据集建立一个机器学习模式,这些数据集被称作“孤立数据岛”,同时保持数据安全和隐私。大多数现有工作手工将常用的公共数据集分割成分区,模拟现实世界孤立数据岛,同时不能捕捉真实世界域数据的内在特征,如医学、金融或AIOT。为了缩小这一巨大差距,本文展示并描述一个孤立的数据岛基准套件,名为FLBench,用于为联合学习算法进行基准化。FLBench包含三个领域:医疗、财务和AIoT。通过配置不同领域,FLBench有资格评估联邦学习的重要研究方面,从而成为开发新的联合学习算法的有前途的平台。最后,FLBench是完全开放来源,在快速进化中。我们将其包装成一个自动部署工具。基准套件将从http://www.benchcounil.org/FLFLBechnch。