通过机器学习估计 g-疏漏 (Estimating g-Leakage via Machine Learning)

from arxiv, This is the extended version of the paper which will appear in the Proceedings of the 2020 ACM SIGSAC Conference on Computer and Communications Security (CCS '20), November 9-13, 2020, Virtual Event, USA

This paper considers the problem of estimating the information leakage of a system in the black-box scenario. It is assumed that the system's internals are unknown to the learner, or anyway too complicated to analyze, and the only available information are pairs of input-output data samples, possibly obtained by submitting queries to the system or provided by a third party. Previous research has mainly focused on counting the frequencies to estimate the input-output conditional probabilities (referred to as frequentist approach), however this method is not accurate when the domain of possible outputs is large. To overcome this difficulty, the estimation of the Bayes error of the ideal classifier was recently investigated using Machine Learning (ML) models and it has been shown to be more accurate thanks to the ability of those models to learn the input-output correspondence. However, the Bayes vulnerability is only suitable to describe one-try attacks. A more general and flexible measure of leakage is the g-vulnerability, which encompasses several different types of adversaries, with different goals and capabilities. In this paper, we propose a novel approach to perform black-box estimation of the g-vulnerability using ML. A feature of our approach is that it does not require to estimate the conditional probabilities, and that it is suitable for a large class of ML algorithms. First, we formally show the learnability for all data distributions. Then, we evaluate the performance via various experiments using k-Nearest Neighbors and Neural Networks. Our results outperform the frequentist approach when the observables domain is large.

翻译：本文探讨了在黑箱情景中估算系统信息泄漏的问题。假设学习者对系统内部误差不了解, 或分析起来过于复杂, 唯一的可用信息是一对输入输出数据样本, 可能是通过向系统提交查询或第三方提供的。先前的研究主要侧重于计算频率, 以估计输入- 输出有条件概率( 被称为常客方法), 但是当可能的输出范围很大时, 这种方法并不准确。为了克服这一困难, 最近利用机器学习( ML) 模型对理想分类器的贝斯误差进行了调查, 并且由于这些模型有能力学习输入- 输出通信, 可能只有向系统查询或由第三方提供。然而, 巴耶斯的脆弱性仅适合描述一次攻击的频率。更一般和灵活的渗漏度是 g- vulner, 它包含几种不同类型的对手, 以及不同的目标和能力。为了克服这一困难, 我们提出了一种新颖的方法, 对理想分类器的贝斯误进行了黑箱估计, 使用机器学习模型的误差值( ML) 常规方法, 要求我们使用一个大的正确性方法, 使用最精确性的方法, 正确性。正确性的方法需要我们使用 malL 正确性。正确性。。