Windows malware detectors based on machine learning are vulnerable to adversarial examples, even if the attacker is only given black-box query access to the model. The main drawback of these attacks is that: (i) they are query-inefficient, as they rely on iteratively applying random transformations to the input malware; and (ii) they may also require executing the adversarial malware in a sandbox at each iteration of the optimization process, to ensure that its intrusive functionality is preserved. In this paper, we overcome these issues by presenting a novel family of black-box attacks that are both query-efficient and functionality-preserving, as they rely on the injection of benign content - which will never be executed - either at the end of the malicious file, or within some newly-created sections. Our attacks are formalized as a constrained minimization problem which also enables optimizing the trade-off between the probability of evading detection and the size of the injected payload. We empirically investigate this trade-off on two popular static Windows malware detectors, and show that our black-box attacks can bypass them with only few queries and small payloads, even when they only return the predicted labels. We also evaluate whether our attacks transfer to other commercial antivirus solutions, and surprisingly find that they can evade, on average, more than 12 commercial antivirus engines. We conclude by discussing the limitations of our approach, and its possible future extensions to target malware classifiers based on dynamic analysis.
翻译:以机器学习为基础的视窗恶意软件检测器很容易受到对抗性实例的影响,即使攻击者仅被给予黑盒询问模型的黑盒访问权限,这些袭击的主要缺点是:(一) 它们缺乏查询效率,因为它们依赖于对输入的恶意软件进行迭接随机转换;以及(二) 它们可能还要求在优化过程每次迭代的沙箱中执行对抗性恶意软件,以确保它的侵入功能得到维护。 在本文件中,我们通过展示黑盒袭击的新式系列,这些黑盒袭击既具有查询效率和功能保留,也依赖注入良性内容,而这种内容在恶意文件的末尾,或者在一些新建的章节中,是不会被执行的。我们的攻击被正式化为限制的最小化问题,这也使得在蒸发检测的概率和注入的有效载荷的大小之间实现最佳的交换。我们用两种流行的静态视窗恶意检测器检测器进行这种交易,并表明我们的黑箱袭击能够通过很少的查询和小型有效载荷来绕过它们,因为它们将永远无法执行。 我们只能通过将其他的逆向未来目标转换, 我们只能通过平均的变压性病毒的标签来评估我们未来的目标。