Conventional private data publication mechanisms aim to retain as much data utility as possible while ensuring sufficient privacy protection on sensitive data. Such data publication schemes implicitly assume that all data analysts and users have the same data access privilege levels. However, it is not applicable for the scenario that data users often have different levels of access to the same data, or different requirements of data utility. The multi-level privacy requirements for different authorization levels pose new challenges for private data publication. Traditional PPDP mechanisms only publish one perturbed and private data copy satisfying some privacy guarantee to provide relatively accurate analysis results. To find a good tradeoff between privacy preservation level and data utility itself is a hard problem, let alone achieving multi-level data utility on this basis. In this paper, we address this challenge in proposing a novel framework of data publication with compressive sensing supporting multi-level utility-privacy tradeoffs, which provides differential privacy. Specifically, we resort to compressive sensing (CS) method to project a $n$-dimensional vector representation of users' data to a lower $m$-dimensional space, and then add deliberately designed noise to satisfy differential privacy. Then, we selectively obfuscate the measurement vector under compressive sensing by adding linearly encoded noise, and provide different data reconstruction algorithms for users with different authorization levels. Extensive experimental results demonstrate that ML-DPCS yields multi-level of data utility for specific users at different authorization levels.
翻译:常规私营数据公布机制旨在尽可能保留数据效用,同时确保敏感数据的充分隐私保护;这类数据公布机制暗含地假定所有数据分析员和用户拥有相同的数据访问特权水平;然而,数据用户往往有不同程度的获取相同数据的机会,或对数据效用的不同要求,这种假设并不适用;不同授权水平的多层次隐私要求给私人数据出版带来了新的挑战;传统的PPDP机制只公布一个受干扰的和私人的数据副本,满足一些隐私保证,以提供相对准确的分析结果;要找到隐私保护水平与数据效用本身之间的良好平衡是一个棘手问题,更不用说在此基础上实现多层次的数据数据效用;在本文件中,我们应对这一挑战,提出一个新的数据出版框架,采用压缩的遥感,支持多层次的通用数据交换;具体地说,我们采用压缩的遥感(CS)方法,将用户的数据以一元为单位,向较低维度的空间提供比较准确的矢量代表,然后刻意地设计噪音以满足差异隐私权。