Backdoor learning is an emerging and important topic of studying the vulnerability of deep neural networks (DNNs). Many pioneering backdoor attack and defense methods are being proposed successively or concurrently, in the status of a rapid arms race. However, we find that the evaluations of new methods are often unthorough to verify their claims and real performance, mainly due to the rapid development, diverse settings, as well as the difficulties of implementation and reproducibility. Without thorough evaluations and comparisons, it is difficult to track the current progress and design the future development roadmap of the literature. To alleviate this dilemma, we build a comprehensive benchmark of backdoor learning, called BackdoorBench. It consists of an extensible modular based codebase (currently including implementations of 8 state-of-the-art (SOTA) attack and 9 SOTA defense algorithms), as well as a standardized protocol of a complete backdoor learning. We also provide comprehensive evaluations of every pair of 8 attacks against 9 defenses, with 5 poisoning ratios, based on 5 models and 4 datasets, thus 8,000 pairs of evaluations in total. We further present analysis from different perspectives about these 8,000 evaluations, studying the effects of attack against defense algorithms, poisoning ratio, model and dataset in backdoor learning. All codes and evaluations of BackdoorBench are publicly available at \url{https://backdoorbench.github.io}.
翻译:后门学习是研究深层神经网络(DNNs)脆弱性的一个新兴重要专题。许多先入为主的后门攻击和防御方法都是连续或同时提出的,处于快速军备竞赛的状态。然而,我们发现,对新方法的评价往往无法核实其主张和实际表现,主要原因是快速发展、环境不同以及执行和再生的困难。没有彻底的评价和比较,很难跟踪当前的进展并设计文献的未来发展路线图。为了缓解这一困境,我们建立了后门学习的全面基准,称为后门的后门攻击和防御方法。它包括一个可扩展模块化的代码库(目前包括实施8次最先进的攻击和9次SOTA防御算法),以及一个完整的后门学习的标准化协议。我们还根据5个模型和4个数据集对每对8次攻击进行全面评估。我们从不同的角度对8 000次的后门学习进行了分析,在对8 000次评价中学习了所有攻击的后门的后门评估。我们从各种分析中学习了所有攻击的后门准则。