Many researchers have proposed replacing the aggregation server in federated learning with a blockchain system to improve privacy, robustness, and scalability. In this approach, clients would upload their updated models to the blockchain ledger and use a smart contract to perform model averaging. However, the significant delay and limited computational capabilities of blockchain systems make it inefficient to support machine learning applications on the blockchain. In this paper, we propose a new public blockchain architecture called DFL, which is specially optimized for distributed federated machine learning. Our architecture inherits the merits of traditional blockchain systems while achieving low latency and low resource consumption by waiving global consensus. To evaluate the performance and robustness of our architecture, we implemented a prototype and tested it on a physical four-node network, and also developed a simulator to simulate larger networks and more complex situations. Our experiments show that the DFL architecture can reach over 90\% accuracy for non-I.I.D. datasets, even in the presence of model poisoning attacks, while ensuring that the blockchain part consumes less than 5\% of hardware resources.
翻译:许多研究人员已经提出用区块链系统取代联邦学习中的汇聚服务器,以提高隐私性、鲁棒性和可扩展性。在这种方法中,客户端将更新后的模型上传至区块链账本,并使用智能合约进行模型平均。然而,区块链系统的显著延迟和有限的计算能力使得在区块链上支持机器学习应用变得效率低下。在本文中,我们提出了一种新的公共区块链体系结构(命名为 DFL),专门针对分布式联邦机器学习进行了优化。我们的架构继承了传统区块链系统的优点,同时通过放弃全局一致性实现低延迟和低资源消耗。为了评估我们的架构的性能和鲁棒性,我们实现了一个原型,并在一个物理的四节点网络上进行了测试,同时还开发了一个模拟器来模拟更大的网络和更复杂的情况。我们的实验表明,DFL架构可以在非I.I.D.数据集上实现超过90%的准确度,即使在存在模型中毒攻击的情况下也能保证区块链部分消耗不到硬件资源的5%。