Despite progress in adversarial training (AT), there is a substantial gap between the top-performing and worst-performing classes in many datasets. For example, on CIFAR10, the accuracies for the best and worst classes are 74% and 23%, respectively. We argue that this gap can be reduced by explicitly optimizing for the worst-performing class, resulting in a min-max-max optimization formulation. Our method, called class focused online learning (CFOL), includes high probability convergence guarantees for the worst class loss and can be easily integrated into existing training setups with minimal computational overhead. We demonstrate an improvement to 32% in the worst class accuracy on CIFAR10, and we observe consistent behavior across CIFAR100 and STL10. Our study highlights the importance of moving beyond average accuracy, which is particularly important in safety-critical applications.
翻译:尽管在对抗性培训方面取得了进展,但在许多数据集中,表现最佳和表现最差的班级之间存在巨大差距,例如,在CIFAR10, 最佳和最坏班级的接受率分别为74%和23%。我们争辩说,通过明确优化表现最差班级,可以缩小这一差距,从而形成一个最低和最高最佳的优化配方。我们称之为以班为重点的在线学习(CFOL)的方法,包括了对最坏班级损失的高概率趋同保证,并且可以很容易地纳入现有的培训设施,同时提供最低的计算间接费用。我们显示,在CIFAR10中,最坏班级的准确率提高到32%,我们在CIFAR100和STL10中看到一致的行为。我们的研究强调了超越平均准确性的重要性,这对于安全至关重要。