In a vertical federated learning (VFL) scenario where features and model are split into different parties, communications of sample-specific updates are required for correct gradient calculations but can be used to deduce important sample-level label information. An immediate defense strategy is to protect sample-level messages communicated with Homomorphic Encryption (HE), and in this way only the batch-averaged local gradients are exposed to each party (termed black-boxed VFL). In this paper, we first explore the possibility of recovering labels in the vertical federated learning setting with HE-protected communication, and show that private labels can be reconstructed with high accuracy by training a gradient inversion model. Furthermore, we show that label replacement backdoor attacks can be conducted in black-boxed VFL by directly replacing encrypted communicated messages (termed gradient-replacement attack). As it is a common presumption that batch-averaged information is safe to share, batch label inference and replacement attacks are a severe challenge to VFL. To defend against batch label inference attack, we further evaluate several defense strategies, including confusional autoencoder (CoAE), a technique we proposed based on autoencoder and entropy regularization. We demonstrate that label inference and replacement attacks can be successfully blocked by this technique without hurting as much main task accuracy as compared to existing methods.
翻译:在纵向联合学习(VFL)情景中,特征和模型分为不同的方,需要提供具体抽样更新的通信,以便准确计算梯度,但可以用来推断重要的样本标签信息。即时防御战略是保护与智障加密(HHE)传递的样本级信息,以这种方式,只有批量平均本地梯度暴露于各方(黑箱VFL)中。在本文中,我们首先探讨是否有可能在与HE保护通信的垂直联合学习环境中恢复标签,并表明通过培训梯度转换模型,可以以高度精确的方式重建私人标签。此外,我们表明标签替换后门攻击可以通过黑箱VFLL进行,直接替换加密发送的加密信息(定期梯度替换攻击 ) 。 通常的假设是,批量平均信息可以安全地共享,批量标签推断和替换攻击对VFFLL是一个严峻的挑战。 为防范批量标签攻击,我们进一步评估了几种防御战略,包括混带自动编码(CoAEE),可以直接替换黑箱 VFLFL(W),这是我们提议的一种技术,可以成功地进行升级。