Adversarial patches pose a realistic threat model for physical world attacks on autonomous systems via their perception component. Autonomous systems in safety-critical domains such as automated driving should thus contain a fail-safe fallback component that combines certifiable robustness against patches with efficient inference while maintaining high performance on clean inputs. We propose BagCert, a novel combination of model architecture and certification procedure that allows efficient certification. We derive a loss that enables end-to-end optimization of certified robustness against patches of different sizes and locations. On CIFAR10, BagCert certifies 10.000 examples in 43 seconds on a single GPU and obtains 86% clean and 60% certified accuracy against 5x5 patches.
翻译:自动驾驶等安全关键领域的自主系统应包含一个故障安全后退部分,将可验证的稳健性与高效推断的补丁结合起来,同时保持高效的清洁投入的高效性能。我们建议采用BagCert,这是模型架构和认证程序的新型组合,可以有效认证。我们得出一个损失,可以对不同大小和地点的补丁进行端到端的经认证的稳健性优化。在CIFAR10上,BagCert在43秒内验证了单一GPU的10,000个实例,在5x5补丁中获得了86%的清洁和60%的认证准确性。