Internet of things (IoT) devices, such as smart meters, smart speakers and activity monitors, have become highly popular thanks to the services they offer. However, in addition to their many benefits, they raise privacy concerns since they share fine-grained time-series user data with untrusted third parties. In this work, we consider a user releasing her data containing personal information in return of a service from an honest-but-curious service provider (SP). We model user's personal information as two correlated random variables (r.v.'s), one of them, called the secret variable, is to be kept private, while the other, called the useful variable, is to be disclosed for utility. We consider active sequential data release, where at each time step the user chooses from among a finite set of release mechanisms, each revealing some information about the user's personal information, i.e., the true values of the r.v.'s, albeit with different statistics. The user manages data release in an online fashion such that the maximum amount of information is revealed about the latent useful variable as quickly as possible, while the confidence for the sensitive variable is kept below a predefined level. For privacy measure, we consider both the probability of correctly detecting the true value of the secret and the mutual information (MI) between the secret and the released data. We formulate both problems as partially observable Markov decision processes (POMDPs), and numerically solve them by advantage actor-critic (A2C) deep reinforcement learning (DRL). We evaluate the privacy-utility trade-off (PUT) of the proposed policies on both the synthetic data and smoking activity dataset, and show their validity by testing the activity detection accuracy of the SP modeled by a long short-term memory (LSTM) neural network.
翻译:诸如智能仪表、智能扬声器和活动监测器等事物的互联网设备,由于所提供的服务而变得非常受欢迎。然而,除了许多好处外,它们还引起隐私关切,因为它们与不信任的第三方分享精细的时序用户数据。在这项工作中,我们考虑用户发布载有个人信息的数据,以回报一个诚实但可靠的服务提供商(SP)提供的服务。我们用两个相关随机变量(r.v.s)来模拟用户的个人信息,其中一种称为秘密变量,是保密的,而另一个称为有用的变量,是公开的,以方便使用。我们考虑积极的连续发布数据,每次用户从一组有限的发布机制中选择其中的个人数据,每个都披露一些关于用户个人信息的信息,即,以诚实但可靠的服务供应商(SP.v.s)的真实价值,尽管有不同的统计。用户以在线方式管理数据发布数据,以便尽可能快速披露潜在的有用变量,另一个称为有用的变量,称为有用的变量,另一个称为有用的变量,是有用的变量,为了便于披露。我们考虑主动的连续发布数据,每个步骤,每个步骤都是从有限的一组开放的节流流数据,我们测量的精确度的精确度, 数据记录的精确度,然后我们考虑它们的精确度的精确度活动。我们思考的精确度, 的精确度的计算,我们思考的深度测量度, 和精确度,我们思考的深度的计算了它们之间的数据活动。