通过模拟研究数据披露机制对建议者系统的影响 (Studying the Impact of Data Disclosure Mechanism in Recommender Systems via Simulation)

Recently, privacy issues in web services that rely on users' personal data have raised great attention. Unlike existing privacy-preserving technologies such as federated learning and differential privacy, we explore another way to mitigate users' privacy concerns, giving them control over their own data. For this goal, we propose a privacy aware recommendation framework that gives users delicate control over their personal data, including implicit behaviors, e.g., clicks and watches. In this new framework, users can proactively control which data to disclose based on the trade-off between anticipated privacy risks and potential utilities. Then we study users' privacy decision making under different data disclosure mechanisms and recommendation models, and how their data disclosure decisions affect the recommender system's performance. To avoid the high cost of real-world experiments, we apply simulations to study the effects of our proposed framework. Specifically, we propose a reinforcement learning algorithm to simulate users' decisions (with various sensitivities) under three proposed platform mechanisms on two datasets with three representative recommendation models. The simulation results show that the platform mechanisms with finer split granularity and more unrestrained disclosure strategy can bring better results for both end users and platforms than the "all or nothing" binary mechanism adopted by most real-world applications. It also shows that our proposed framework can effectively protect users' privacy since they can obtain comparable or even better results with much less disclosed data.

翻译：最近,依赖用户个人数据的网络服务隐私问题引起了人们的极大关注。与现有的隐私保护技术,如联合学习和差异隐私等不同,我们探索了另一种方法来减轻用户隐私关切,让用户控制自己的数据。为此,我们提议了一个隐私意识建议框架,让用户对其个人数据进行微妙的控制,包括隐含行为,例如点击和手表等。在这个新框架内,用户可以积极主动地控制根据预期隐私风险和潜在公用事业之间的权衡而披露哪些数据。然后,我们研究用户在不同的数据披露机制和建议模式下作出的隐私保护决策,以及他们的数据披露决定如何影响推荐人系统的绩效。为了避免真实世界实验的高成本,我们采用模拟来研究我们拟议框架的影响。具体地说,我们提议了一个强化学习算法,在三个拟议平台机制下,用三个具有代表性的建议模型来模拟用户的决定。模拟结果表明,平台机制以细化的颗粒度和更具节制的披露战略可以给终端用户和平台带来更好的结果,即使他们的数据披露决定会影响推荐人系统的业绩。为避免真实世界实验成本,我们提议的框架也比实际应用了更低得多。我们提出的“所有”的软件机制可以更好地保护用户。我们所提出的数据机制可以更好地利用“所有或完全的软件机制。