Neural networks pose a privacy risk to training data due to their propensity to memorise and leak information. Focusing on image classification, we show that neural networks also unintentionally memorise unique features even when they occur only once in training data. An example of a unique feature is a person's name that is accidentally present on a training image. Assuming access to the inputs and outputs of a trained model, the domain of the training data, and knowledge of unique features, we develop a score estimating the model's sensitivity to a unique feature by comparing the KL divergences of the model's output distributions given modified out-of-distribution images. Our results suggest that unique features are memorised by multi-layer perceptrons and convolutional neural networks trained on benchmark datasets, such as MNIST, Fashion-MNIST and CIFAR-10. We find that strategies to prevent overfitting (e.g.\ early stopping, regularisation, batch normalisation) do not prevent memorisation of unique features. These results imply that neural networks pose a privacy risk to rarely occurring private information. These risks can be more pronounced in healthcare applications if patient information is present in the training data.
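The abstract only outlines the scoring procedure at a high level. Below is a minimal, hypothetical sketch (Python with NumPy/SciPy) of one way a KL-divergence-based sensitivity score of this kind could be computed; the stand-in linear classifier, the `insert_unique_feature` patching helper, and the simple averaging over out-of-distribution images are illustrative assumptions, not the paper's actual implementation.

```python
# Hedged sketch: the abstract only states that the score compares KL divergences
# of the model's output distributions on modified out-of-distribution images.
# The model, feature insertion, and aggregation below are illustrative assumptions.
import numpy as np
from scipy.special import softmax
from scipy.stats import entropy  # entropy(p, q) computes KL(p || q)

rng = np.random.default_rng(0)

def model_logits(x, W, b):
    # Stand-in for a trained classifier: a single linear layer on the flattened image.
    return x.reshape(-1) @ W + b

def insert_unique_feature(image, patch, top=0, left=0):
    # Overwrite a small region with the unique feature (e.g. a rendered name).
    out = image.copy()
    h, w = patch.shape
    out[top:top + h, left:left + w] = patch
    return out

def sensitivity_score(images, patch, W, b):
    # Average KL divergence between output distributions with and without the
    # unique feature; larger values suggest higher sensitivity to the feature.
    kls = []
    for img in images:
        p_clean = softmax(model_logits(img, W, b))
        p_feat = softmax(model_logits(insert_unique_feature(img, patch), W, b))
        kls.append(entropy(p_feat, p_clean))
    return float(np.mean(kls))

# Toy usage with random weights and random out-of-distribution images.
W, b = rng.normal(size=(28 * 28, 10)), rng.normal(size=10)
ood_images = [rng.uniform(size=(28, 28)) for _ in range(16)]
feature_patch = np.ones((4, 8))  # hypothetical unique-feature patch
print(sensitivity_score(ood_images, feature_patch, W, b))
```

In practice the random-weight stand-in would be replaced by the trained model under analysis, and the out-of-distribution images would be drawn from the assumed data domain rather than generated at random.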