Recently, differential privacy (DP) is getting attention as a privacy definition when publishing statistics of a dataset. However, when answering a decision problem with a DP mechanism, it causes a two-sided error. This characteristic of DP is not desirable when publishing risk information such as concerning COVID-19. This paper proposes relaxing DP to mitigate the limitation and improve the utility of published information. First, we define a policy that separates information into sensitive and non-sensitive. Then, we define asymmetric differential privacy (ADP) that provides the same privacy guarantee as DP to sensitive information. This partial protection induces asymmetricity in privacy protection to improve utility and allow a one-sided error mechanism. Following ADP, we propose two mechanisms for two tasks based on counting query with utilizing these characteristics: top-$k$ query and publishing risk information of viruses with an accuracy guarantee. Finally, we conducted experiments to evaluate proposed algorithms using real-world datasets and show their practicality and improvement of the utility, comparing state-of-the-art algorithms.
翻译:最近,在公布数据集统计数据时,差异隐私(DP)作为隐私定义受到注意。然而,在对DP机制的决定问题作出答复时,它造成双向错误。在公布诸如COVID-19等风险信息时,这种DP的特点并不可取。本文件提议放松DP,以减少限制和改进已公布信息的效用。首先,我们界定了将信息分为敏感和非敏感信息的政策。然后,我们定义了不对称差异隐私(ADP),对敏感信息提供与DP相同的隐私保障。这种部分保护导致隐私保护不对称,以改善其效用并允许单向错误机制。在ADP之后,我们提议了两个机制,两个基于利用这些特性进行计数的任务:最高至千美元的查询和以准确性保证的方式公布病毒风险信息。最后,我们进行了实验,用真实世界数据集来评价拟议的算法,并显示其实用性和改进性,比较了最先进的算法。