Randomized smoothing is a recent and celebrated solution to certify the robustness of any classifier. While it indeed provides a theoretical robustness against adversarial attacks, the dimensionality of current classifiers necessarily imposes Monte Carlo approaches for its application in practice. This paper questions the effectiveness of randomized smoothing as a defense, against state of the art black-box attacks. This is a novel perspective, as previous research works considered the certification as an unquestionable guarantee. We first formally highlight the mismatch between a theoretical certification and the practice of attacks on classifiers. We then perform attacks on randomized smoothing as a defense. Our main observation is that there is a major mismatch in the settings of the RS for obtaining high certified robustness or when defeating black box attacks while preserving the classifier accuracy.
翻译:随机滑动是证明任何分类器坚固性的最新和值得庆幸的解决办法。 虽然它确实提供了理论上的稳健性, 但当前分类器的维度必然迫使蒙特卡洛在实际中应用它。 本文质疑随机滑动作为防御手段的有效性, 质疑最先进的黑箱袭击。 这是一个新颖的观点, 因为以前的研究将认证视为无可置疑的保证。 我们首先正式强调了理论认证与攻击分类器的做法之间的不匹配性。 我们随后对随机滑动进行攻击作为防御。 我们的主要观察是,塞族共和国在获得高度认证的稳健性或者在保护分类器准确性的同时击败黑箱袭击时存在重大不匹配性。