With the advent of big data era and the development of artificial intelligence and other technologies, data security and privacy protection have become more important. Recommendation systems have many applications in our society, but the model construction of recommendation systems is often inseparable from users' data. Especially for deep learning-based recommendation systems, due to the complexity of the model and the characteristics of deep learning itself, its training process not only requires long training time and abundant computational resources but also needs to use a large amount of user data, which poses a considerable challenge in terms of data security and privacy protection. How to train a distributed recommendation system while ensuring data security has become an urgent problem to be solved. In this paper, we implement two schemes, Horizontal Federated Learning and Secure Distributed Training, based on Intel SGX(Software Guard Extensions), an implementation of a trusted execution environment, and TensorFlow framework, to achieve secure, distributed recommendation system-based learning schemes in different scenarios. We experiment on the classical Deep Learning Recommendation Model (DLRM), which is a neural network-based machine learning model designed for personalization and recommendation, and the results show that our implementation introduces approximately no loss in model performance. The training speed is within acceptable limits.
翻译:随着大数据时代的到来以及人工智能和其他技术的开发,数据安全和隐私保护变得更加重要。建议系统在我们的社会中有许多应用,但建议系统的模型构建往往与用户的数据密不可分。特别是对于深层次的基于学习的建议系统,由于模型的复杂性和深层次学习本身的特点,其培训过程不仅需要很长的培训时间和丰富的计算资源,而且还需要使用大量用户数据,这在数据安全和隐私保护方面构成相当大的挑战。如何在确保数据安全的同时培训分布式建议系统已成为需要解决的紧迫问题。在本文件中,我们实施了两种计划:基于Intel SGX(软件保护扩展)的横向联邦学习和安全分配培训、实施信任的执行环境以及TensorFlow框架,以便在不同的情景下实现安全、分布式基于建议系统的学习计划。我们试验了经典深层学习建议模型(DLRM),这是一个基于神经网络的机器学习模型,目的是个性化和建议,结果显示我们的实施速度在模型范围内是可以接受的。