For protecting users' private data, local differential privacy (LDP) has been leveraged to provide the privacy-preserving range query, thus supporting further statistical analysis. However, existing LDP-based range query approaches are limited by their properties, i.e., collecting user data according to a pre-defined structure. These static frameworks would incur excessive noise added to the aggregated data especially in the low privacy budget setting. In this work, we propose an Adaptive Hierarchical Decomposition (AHEAD) protocol, which adaptively and dynamically controls the built tree structure, so that the injected noise is well controlled for maintaining high utility. Furthermore, we derive a guideline for properly choosing parameters for AHEAD so that the overall utility can be consistently competitive while rigorously satisfying LDP. Leveraging multiple real and synthetic datasets, we extensively show the effectiveness of AHEAD in both low and high dimensional range query scenarios, as well as its advantages over the state-of-the-art methods. In addition, we provide a series of useful observations for deploying AHEAD in practice.
翻译:为了保护用户的私人数据,已利用当地差异隐私(LDP)来提供隐私保存范围查询,从而支持进一步的统计分析。然而,现有的基于LDP的范围的查询方法由于其特性而受到限制,即根据预先界定的结构收集用户数据。这些静态框架将特别在低隐私预算环境中给综合数据增加过多噪音。在这项工作中,我们提议采用适应性和动态控制已建树结构的适应性高等级分解程序,从而对注入的噪音进行妥善控制,以保持高用途。此外,我们为AHEAD的正确选择参数制定指导方针,以使总体效用在严格满足LDP的同时能够保持一贯的竞争性。我们利用多种真实和合成数据集,广泛展示AHEAD在低和高维范围查询情景中的有效性,以及它对州一级方法的优势。此外,我们为在实践中部署AHEAD提供了一系列有用的观察。