Certified defenses based on convex relaxations are an established technique for training provably robust models. The key component is the choice of relaxation, varying from simple intervals to tight polyhedra. Paradoxically, however, it was empirically observed that training with tighter relaxations can worsen certified robustness. While several methods were designed to partially mitigate this issue, the underlying causes are poorly understood. In this work we investigate the above phenomenon and show that tightness may not be the determining factor for reduced certified robustness. Concretely, we identify two key features of relaxations that impact training dynamics: continuity and sensitivity. We then experimentally demonstrate that these two factors explain the drop in certified robustness when using popular relaxations. Further, we show, for the first time, that it is possible to successfully train with tighter relaxations (i.e., triangle), a result supported by our two properties. Overall, we believe the insights of this work can help drive the systematic discovery of new effective certified defenses.
翻译:基于康韦克斯放松的经认证的防御是训练可观察到的稳健模型的既定技术。 关键组成部分是选择放松, 从简单的间隔到紧凑的聚赫德拉。 但是,从经验上看, 更严格放松的培训可能会加剧经认证的稳健性。 虽然设计了几种方法来部分缓解这一问题,但根本原因并不清楚。 在这项工作中,我们调查了上述现象,并表明紧张性可能不是降低经认证的稳健性的决定因素。 具体地说,我们确定了影响培训动态的放松的两个关键特征: 连续性和敏感性。 然后,我们实验性地证明,这两个因素解释了在使用大众放松时,经认证的稳健性下降的原因。 此外,我们第一次表明,通过更严格放松( 三角) 的训练是成功的,这是我们两个属性所支持的结果。 总的来说, 我们相信, 这项工作的洞察力可以帮助系统发现新的有效的经认证的防御。