Randomized Smoothing (RS) is a promising method for obtaining robustness certificates by evaluating a base model under noise. In this work we: (i) theoretically motivate why ensembles are a particularly suitable choice as base models for RS, and (ii) empirically confirm this choice, obtaining state of the art results in multiple settings. The key insight of our work is that the reduced variance of ensembles over the perturbations introduced in RS leads to significantly more consistent classifications for a given input, in turn leading to substantially increased certifiable radii for difficult samples. We also introduce key optimizations which enable an up to 50-fold decrease in sample complexity of RS, thus drastically reducing its computational overhead. Experimentally, we show that ensembles of only 3 to 10 classifiers consistently improve on the strongest single model with respect to their average certified radius (ACR) by 5% to 21% on both CIFAR-10 and ImageNet. On the latter, we achieve a state-of-the-art ACR of 1.11. We release all code and models required to reproduce our results upon publication.
翻译:通过在噪音下评估一个基准模型来获得稳健性证明(RS)是一个很有希望的方法。在这项工作中,我们:(一) 理论上激励人们为什么集合是一个特别合适的选择,作为RS的基础模型,以及(二) 经验性地确认这种选择,在多种环境中取得最新的结果。我们工作的关键见解是,在RS引进的扰动中,聚合差异的减少导致对某项输入进行明显一致的分类,反过来又导致对困难样品的可核证辐射量大幅度增加。我们还引入了关键优化,使RS的样本复杂性减少多达50倍,从而大幅度降低其计算间接成本。我们实验性地表明,只有3至10个分类者组成的组合在最强的单一模型上不断改进,其平均经认证的半径(ACR)在CIFAR-10和图像网络上都增加了5%至21%。在后者,我们实现了1.11年的状态的ACRR。我们发布了在出版时复制结果所需的所有代码和模型。