Although query-based systems (QBS) have become one of the main solutions to share data anonymously, building QBSes that robustly protect the privacy of individuals contributing to the dataset is a hard problem. Theoretical solutions relying on differential privacy guarantees are difficult to implement correctly with reasonable accuracy, while ad-hoc solutions might contain unknown vulnerabilities. Evaluating the privacy provided by QBSes must thus be done by evaluating the accuracy of a wide range of privacy attacks. However, existing attacks require time and expertise to develop, need to be manually tailored to the specific systems attacked, and are limited in scope. In this paper, we develop QuerySnout (QS), the first method to automatically discover vulnerabilities in QBSes. QS takes as input a target record and the QBS as a black box, analyzes its behavior on one or more datasets, and outputs a multiset of queries together with a rule to combine answers to them in order to reveal the sensitive attribute of the target record. QS uses evolutionary search techniques based on a novel mutation operator to find a multiset of queries susceptible to lead to an attack, and a machine learning classifier to infer the sensitive attribute from answers to the queries selected. We showcase the versatility of QS by applying it to two attack scenarios, three real-world datasets, and a variety of protection mechanisms. We show the attacks found by QS to consistently equate or outperform, sometimes by a large margin, the best attacks from the literature. We finally show how QS can be extended to QBSes that require a budget, and apply QS to a simple QBS based on the Laplace mechanism. Taken together, our results show how powerful and accurate attacks against QBSes can already be found by an automated system, allowing for highly complex QBSes to be automatically tested "at the pressing of a button".
翻译:尽管基于查询的系统(QBS)已成为匿名分享数据的主要解决方案之一,但建立能有力保护为数据集作出贡献的个人隐私的QBS是一个棘手的问题。依靠不同隐私保障的理论解决方案很难以合理的准确性正确执行,而临时解决方案可能包含未知的脆弱性。因此,评估QBS提供的隐私必须通过评价范围广泛的隐私攻击的准确性来完成。然而,现有的攻击需要时间和专门知识来开发,有时需要手工定制特定系统被攻击,而且范围有限。在本文件中,我们开发了QurySnout(QS),这是在QBSes中自动发现弱点的第一个方法。QS将目标记录和QBS作为黑盒输入,在一个或多个数据集上分析其行为,并产生多套查询,同时结合解答,以揭示目标记录的敏感属性。QS使用进化搜索技术,在新变异操作者中找到一个能够导致攻击的多端点,在QS上应用一个自动查询,在S上显示一个直观攻击的系统,通过我们所选的直观的直观数据机制显示一个我们所发现的。