The goal of a machine learning model is to make correct predictions for a specific task by learning important properties and patterns from data. In doing so, the model may also learn properties that are unrelated to its primary task. Property Inference Attacks exploit this and aim to infer from a given model (\ie the target model) properties of the training dataset that are seemingly unrelated to the model's primary goal. If the training data is sensitive, such an attack could lead to privacy leakage. This paper investigates the influence of the target model's complexity on the accuracy of this type of attack, focusing on convolutional neural network classifiers. We perform attacks on models trained on facial images to predict whether a person's mouth is open. The goal of our attacks is to infer whether the training dataset is gender-balanced. Our findings reveal that the risk of a privacy breach is present independently of the target model's complexity: for all studied architectures, the attack's accuracy is clearly above the baseline. We discuss the implications of property inference on personal data in light of Data Protection Regulations and Guidelines.
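To make the attack setting concrete, below is a minimal, fully synthetic sketch of a property inference attack built from shadow models and a meta-classifier, a common construction for this attack family in the literature. It is not the paper's exact procedure: the abstract's setting (CNN face classifiers for a "mouth open" task, with gender balance as the inferred property) is replaced here by synthetic data and logistic-regression stand-ins, purely to illustrate the pipeline.

```python
# Synthetic sketch of a property inference attack: train shadow models on data
# that does / does not satisfy a property, then train a meta-classifier that
# predicts the property from a model's parameters. All data, models, and the
# sensitive attribute are illustrative assumptions, not the paper's setup.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

def make_training_set(balanced: bool, n: int = 500):
    """Synthetic training set: the second feature plays the role of a sensitive
    binary attribute; 'balanced' controls its proportion (50% vs 90%)."""
    p = 0.5 if balanced else 0.9
    sensitive = rng.binomial(1, p, n)
    x = np.column_stack([rng.normal(size=n),
                         sensitive + 0.1 * rng.normal(size=n)])
    y = (x[:, 0] + 0.3 * sensitive + 0.2 * rng.normal(size=n) > 0).astype(int)
    return x, y

def shadow_model_features(balanced: bool) -> np.ndarray:
    """Train one shadow model and return its parameters as attack features."""
    x, y = make_training_set(balanced)
    clf = LogisticRegression(max_iter=1000).fit(x, y)
    return np.concatenate([clf.coef_.ravel(), clf.intercept_])

# 1) Train shadow models whose training sets do / do not hold the property.
labels = np.array([0, 1] * 100)
features = np.stack([shadow_model_features(bool(b)) for b in labels])

# 2) Meta-classifier: map model parameters -> property of its training data.
meta = LogisticRegression(max_iter=1000).fit(features[:150], labels[:150])

# 3) Evaluate on held-out "target" models the attacker did not train on.
print("attack accuracy:", meta.score(features[150:], labels[150:]))
```

In the paper's white-box setting the meta-classifier would instead be applied to the parameters (or derived features) of the trained CNN target model; the accuracy reported above is only for this toy stand-in.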