引导攻击:Agnistist Aversarial攻击模型 (Feature Importance Guided Attack: A Model Agnostic Adversarial Attack)

Research in adversarial learning has primarily focused on homogeneous unstructured datasets, which often map into the problem space naturally. Inverting a feature space attack on heterogeneous datasets into the problem space is much more challenging, particularly the task of finding the perturbation to perform. This work presents a formal search strategy: the `Feature Importance Guided Attack' (FIGA), which finds perturbations in the feature space of heterogeneous tabular datasets to produce evasion attacks. We first demonstrate FIGA in the feature space and then in the problem space. FIGA assumes no prior knowledge of the defending model's learning algorithm and does not require any gradient information. FIGA assumes knowledge of the feature representation and the mean feature values of defending model's dataset. FIGA leverages feature importance rankings by perturbing the most important features of the input in the direction of the target class. While FIGA is conceptually similar to other work which uses feature selection processes (e.g., mimicry attacks), we formalize an attack algorithm with three tunable parameters and investigate the strength of FIGA on tabular datasets. We demonstrate the effectiveness of FIGA by evading phishing detection models trained on four different tabular phishing datasets and one financial dataset with an average success rate of 94%. We extend FIGA to the phishing problem space by limiting the possible perturbations to be valid and feasible in the phishing domain. We generate valid adversarial phishing sites that are visually identical to their unperturbed counterpart and use them to attack six tabular ML models achieving a 13.05% average success rate.

翻译：对抗性学习的研究主要侧重于单一的、非结构化的数据集,这些数据集往往自然地映射到问题空间。在问题空间中,将对混杂数据集的地貌空间攻击转换到一个问题空间更具挑战性,特别是寻找要执行的扰动性任务。这项工作提出了一个正式的搜索战略:“FIGA ”在混杂的表格数据集的特征空间中发现了扰动性,以产生逃避攻击。我们首先在特征空间和问题空间空间中展示FIGA。FIGA 假设对防御模型的学习算法没有事先了解,而不需要任何渐变信息。FIGA 假设了解模型的特征表现和捍卫模型数据集的平均值。FIGA 利用对目标类别输入方向的最重要特征进行扰动性评分。FIGA 概念在概念选择过程(例如,mimimicryal 攻击)中与其他工作在概念上相似,我们将攻击算成三个非金枪鱼参数,并调查FIGA的不透明性模型在表A 4 数据探测率上产生的强度。我们用一个平均数据来显示其成功等级数据比率。我们通过不同数据来达到可能达到FIGA 。