Many researchers are trying to replace the aggregation server in federated learning with a blockchain system to achieve better privacy, robustness and scalability. In this case, clients will upload their updated models to the blockchain ledger, and use a smart contract on the blockchain system to perform model averaging. However, running machine learning applications on the blockchain is almost impossible because a blockchain system, which usually takes over half minute to generate a block, is extremely slow and unable to support machine learning applications. This paper proposes a completely new public blockchain architecture called DFL, which is specially optimized for distributed federated machine learning. This architecture inherits most traditional blockchain merits and achieves extremely high performance with low resource consumption by waiving global consensus. To characterize the performance and robustness of our architecture, we implement the architecture as a prototype and test it on a physical four-node network. To test more nodes and more complex situations, we build a simulator to simulate the network. The LeNet results indicate our system can reach over 90% accuracy for non-I.I.D. datasets even while facing model poisoning attacks, with the blockchain consuming less than 5% of hardware resources.
翻译:许多研究人员试图用联结式学习系统来取代总合服务器,以联结式学习系统取代总合服务器,以达到更好的隐私、稳健性和可缩缩性。在这种情况下,客户将把最新模型上传到链链分类账上,并使用块链系统上的智能合同来平均地运行模型。然而,在块链上运行机器学习应用程序几乎是不可能的,因为块链系统通常需要半分钟以上的时间来生成一个区块,这种系统非常缓慢,无法支持机器学习应用程序。本文提议建立一个全新的公共块链结构,称为DFL, 专门优化用于分布式联结式机器学习。这个结构继承了大多数传统的链条优点,并且通过放弃全球共识,以低资源消耗实现了极高的性能。为了描述我们架构的性能和稳健性,我们把结构作为原型,并在物理四点网络上测试它。要测试更多的节点和更复杂的情况,我们就建立一个模拟网络的模拟器。 LeNet结果显示,我们的系统可以达到90%以上的非I.D.数据集的精度,即使面临模型中毒攻击,使用不到5 %的硬件。