To ensure the privacy of sensitive data used in the training of deep learning models, a number of privacy-preserving methods have been designed by the research community. However, existing schemes are generally designed to work with textual data, or are not efficient when a large number of images is used for training. Hence, in this paper we propose a lightweight and efficient approach to preserve image privacy while maintaining the availability of the training set. Specifically, we design the pixel block mixing algorithm for image classification privacy preservation in deep learning. To evaluate its utility, we use the mixed training set to train the ResNet50, VGG16, InceptionV3 and DenseNet121 models on the WIKI dataset and the CNBC face dataset. Experimental findings on the testing set show that our scheme preserves image privacy while maintaining the availability of the training set in the deep learning models. Additionally, the experimental results demonstrate that we achieve good performance for the VGG16 model on the WIKI dataset and both ResNet50 and DenseNet121 on the CNBC dataset. The pixel block algorithm achieves fairly high efficiency in the mixing of the images, and it is computationally challenging for the attackers to restore the mixed training set to the original training set. Moreover, data augmentation can be applied to the mixed training set to improve the training's effectiveness.
翻译:为确保深层学习模式培训中使用的敏感数据的保密性,研究界设计了一些保护隐私的方法,但是,现有计划的设计一般是为了与文本数据合作,或者在大量图像用于培训时效率不高。因此,在本文件中,我们提议了一种轻量和高效的方法,以维护图像隐私,同时保持培训成套材料的可用性。具体地说,我们设计了像素块混合算法,以便在深层学习中保护图像分类隐私。为了评估其效用,我们使用混合培训套件来培训ResNet50、VGG16、InceptionV3和DenseNet121模型,在WIKI数据集和CNBC脸数据集方面实现了相当高的效率。测试组的实验结果显示,我们的计划保存了图像隐私,同时保持了深层学习模式中培训成套材料的可用性。此外,实验结果表明,我们在WIKI数据集的VGG16模型以及CNBC数据集的ResNet50和DenseNet121上取得了良好的性能。像素块算算法在图像的混合培训中实现了相当高的效率,因此,为改进了混合培训的升级数据集。