The robustness of a deep classifier can be characterized by its margins: the decision boundary's distances to natural data points. However, it is unclear whether existing robust training methods effectively increase the margin for each vulnerable point during training. To understand this, we propose a continuous-time framework for quantifying the relative speed of the decision boundary with respect to each individual point. Through visualizing the moving speed of the decision boundary under Adversarial Training, one of the most effective robust training algorithms, a surprising moving-behavior is revealed: the decision boundary moves away from some vulnerable points but simultaneously moves closer to others, decreasing their margins. To alleviate these conflicting dynamics of the decision boundary, we propose Dynamics-aware Robust Training (DyART), which encourages the decision boundary to engage in movement that prioritizes increasing smaller margins. In contrast to prior works, DyART directly operates on the margins rather than their indirect approximations, allowing for more targeted and effective robustness improvement. Experiments on the CIFAR-10 and Tiny-ImageNet datasets verify that DyART alleviates the conflicting dynamics of the decision boundary and obtains improved robustness under various perturbation sizes compared to the state-of-the-art defenses. Our code is available at https://github.com/Yuancheng-Xu/Dynamics-Aware-Robust-Training.
翻译:深度分类器的稳健性可以用其边际特征来描述:决定边界与自然数据点之间的距离;然而,尚不清楚现有的稳健培训方法是否有效提高了每个脆弱点在培训期间的差幅。为了理解这一点,我们提议了一个连续时间框架,以量化每个点决定边界相对速度的相对速度;在反向培训这一最有效的稳健培训算法下,通过直观决定边界的移动速度,发现一种令人惊讶的移动行为:决定边界从一些脆弱点移开,同时移近其他点,缩小它们的差幅。为缓解决定边界的这些相互冲突的动态,我们提议采用动态-aware robust 培训(DyART),鼓励决策边界参与优先增加较小差幅的移动。与先前的工程相比,DyART直接在边缘运行,而不是间接的近似,从而能够更有针对性和更有效的稳健性改进。对CIFAR-10和Tiny-ImageNet数据集的实验,可以核实DyArt-Artal-Destrang-Destrual-Destrual-Destrual-Destrual-Destrual-Atustrual-comstrations)在不同的状态下,可以改进。