A novel form of inference attack in vertical federated learning (VFL) is proposed, where two parties collaborate in training a machine learning (ML) model. Logistic regression is considered for the VFL model. One party, referred to as the active party, possesses the ground truth labels of the samples in the training phase, while the other, referred to as the passive party, only shares a separate set of features corresponding to these samples. It is shown that the active party can carry out inference attacks on both training and prediction phase samples by acquiring an ML model independently trained on the training samples available to them. This type of inference attack does not require the active party to be aware of the score of a specific sample, hence it is referred to as an agnostic inference attack. It is shown that utilizing the observed confidence scores during the prediction phase, before the time of the attack, can improve the performance of the active party's autonomous model, and thus improve the quality of the agnostic inference attack. As a countermeasure, privacy-preserving schemes (PPSs) are proposed. While the proposed schemes preserve the utility of the VFL model, they systematically distort the VFL parameters corresponding to the passive party's features. The level of the distortion imposed on the passive party's parameters is adjustable, giving rise to a trade-off between privacy of the passive party and interpretabiliy of the VFL outcomes by the active party. The distortion level of the passive party's parameters could be chosen carefully according to the privacy and interpretabiliy concerns of the passive and active parties, respectively, with the hope of keeping both parties (partially) satisfied. Finally, experimental results demonstrate the effectiveness of the proposed attack and the PPSs.
翻译:在垂直联合学习(VFL)中,提出了一种新型的隐性推断攻击形式,即双方合作培训机器学习模型(ML),为VFL模型考虑后勤回归。一个被称为活跃方的方面拥有培训阶段样本的地面真实标签,而另一个被称为被动方的一方则拥有与这些样本相对应的一组单独特征。这表明活跃方可以通过获得一个ML模型,独立培训其可获得的培训样本,对培训和预测阶段样本进行被动攻击。这种推断攻击并不要求活跃方仔细了解特定样本的分数,因此被称为“主动方”在培训阶段拥有样本的地面真实标签,而另一个被称为被动方的一方则在攻击阶段之前,只使用观察到的信任分数,可以改善活跃方自主模型的性能,从而提高预测性攻击的质量。作为反制衡、隐私保护计划(PPSS)是针对对方提出的,而拟议选择方的精确度的参数则在系统扭曲VFLFL的数值上,使主动方的主动性参数能够对主动性值做出精确性解释。