Adversarial attacks expose important vulnerabilities of deep learning models, yet little attention has been paid to settings where data arrives as a stream. In this paper, we formalize the online adversarial attack problem, emphasizing two key elements found in real-world use-cases: attackers must operate under partial knowledge of the target model, and the decisions made by the attacker are irrevocable since they operate on a transient data stream. We first rigorously analyze a deterministic variant of the online threat model by drawing parallels to the well-studied $k$-\textit{secretary problem} and propose \algoname, a simple yet practical algorithm yielding a provably better competitive ratio for $k=2$ over the current best single threshold algorithm. We also introduce the \textit{stochastic $k$-secretary} -- effectively reducing online blackbox attacks to a $k$-secretary problem under noise -- and prove theoretical bounds on the competitive ratios of \textit{any} online algorithms adapted to this setting. Finally, we complement our theoretical results by conducting a systematic suite of experiments on MNIST and CIFAR-10 with both vanilla and robust classifiers, revealing that, by leveraging online secretary algorithms, like \algoname, we can get an online attack success rate close to the one achieved by the optimal offline solution.
翻译:阿德萨里攻击暴露了深层次学习模式的重要脆弱性,然而却很少注意数据作为流流到达的设置。 在本文中,我们正式确定在线对抗性攻击问题,强调现实世界使用情况中发现的两个关键要素:攻击者必须在目标模型的部分知识下行动,攻击者的决定是不可撤销的,因为他们在瞬时数据流上行动。我们首先严格分析在线威胁模型的确定变式,方法是与经过仔细研究的$-$-\textit{秘书问题}平行,并提议一个简单而实用的算法,在目前最好的单一门槛算法上,以美元=2美元产生一个可以想象的更好的竞争比率。我们还引入了“textit{tochaticatic $-$-secretarial}——将在线黑盒攻击有效地降低到在噪音下的一个$-保密问题 -- -- 并证明在符合这一环境的网上算法的竞争比率上有理论约束。最后,我们补充了我们的理论结果,在像MINIST和CIAR10这样的网络攻击率上进行系统性的实验套,通过一个可靠的网络智能智能智能智能升级,我们能够通过一个实现最佳的网络攻击率的升级,通过一个成功的网络智能升级,通过一个成功的智能升级来展示。