Federated learning (FL), as a distributed machine learning approach, has drawn a great amount of attention in recent years. FL shows an inherent advantage in privacy preservation, since users' raw data are processed locally. However, it relies on a centralized server to perform model aggregation. Therefore, FL is vulnerable to server malfunctions and external attacks. In this paper, we propose a novel framework by integrating blockchain into FL, namely, blockchain assisted decentralized federated learning (BLADE-FL), to enhance the security of FL. The proposed BLADE-FL has a good performance in terms of privacy preservation, tamper resistance, and effective cooperation of learning. However, it gives rise to a new problem of training deficiency, caused by lazy clients who plagiarize others' trained models and add artificial noises to conceal their cheating behaviors. To be specific, we first develop a convergence bound of the loss function with the presence of lazy clients and prove that it is convex with respect to the total number of generated blocks $K$. Then, we solve the convex problem by optimizing $K$ to minimize the loss function. Furthermore, we discover the relationship between the optimal $K$, the number of lazy clients, and the power of artificial noises used by lazy clients. We conduct extensive experiments to evaluate the performance of the proposed framework using the MNIST and Fashion-MNIST datasets. Our analytical results are shown to be consistent with the experimental results. In addition, the derived optimal $K$ achieves the minimum value of loss function, and in turn the optimal accuracy performance.
翻译:联邦学习(FL)作为一种分散式的机器学习方法,近年来引起了人们的极大关注。联邦学习(FL)由于用户的原始数据是在当地处理的,因此在保护隐私方面具有内在的优势,因为用户的原始数据是在当地处理的。但是,它依靠一个中央服务器来进行模型聚合。因此,FL很容易受到服务器故障和外部攻击的影响。在本文中,我们提出一个新的框架,将块链结合到FL(BLADE-FL)中,以加强FL(BLADE-FL)的安全。拟议的BLADE-FL在保护隐私、篡改阻力和有效合作学习方面表现良好。然而,这引起了培训不足的新问题,这是由懒惰的客户造成的,他们把别人的训练模型弄脏了,增加了人为的噪音以掩盖他们的欺骗行为。我们首先将损失功能与懒惰客户的存在结合起来,证明它与产生的FISTFK(K)块的总数有关。然后,我们通过优化的美元来解决 Convex问题,把成本降到最低值,以尽量减少结果功能。此外,我们使用最优的实验客户之间的实验性实验行为是最佳的。我们使用最优K(美元) 和最优的实验性能的成绩,我们使用最优的试验的试验的进度的试验的进度的进度。