Contrastive learning (CL) has recently emerged as an effective approach to learning representation in a range of downstream tasks. Central to this approach is the selection of positive (similar) and negative (dissimilar) sets to provide the model the opportunity to `contrast' between data and class representation in the latent space. In this paper, we investigate CL for improving model robustness using adversarial samples. We first designed and performed a comprehensive study to understand how adversarial vulnerability behaves in the latent space. Based on these empirical evidences, we propose an effective and efficient supervised contrastive learning to achieve model robustness against adversarial attacks. Moreover, we propose a new sample selection strategy that optimizes the positive/negative sets by removing redundancy and improving correlation with the anchor. Experiments conducted on benchmark datasets show that our Adversarial Supervised Contrastive Learning (ASCL) approach outperforms the state-of-the-art defenses by $2.6\%$ in terms of the robust accuracy, whilst our ASCL with the proposed selection strategy can further gain $1.4\%$ improvement with only $42.8\%$ positives and $6.3\%$ negatives compared with ASCL without a selection strategy.
翻译:最近,对立学习(CL)已成为在一系列下游任务中学习代表性的一种有效方法。这一方法的核心是选择正(类似)和负(不同)组合,为潜在空间的数据和阶级代表性提供“连接”的示范机会;在本文件中,我们调查CL,利用对抗样品改进模型的稳健性;我们首先设计并开展了一项全面研究,以了解敌对脆弱性在潜在空间中如何表现;根据这些经验证据,我们建议进行有效和高效的监督对比学习,以实现对对抗性攻击的稳健模式。此外,我们提出了一个新的抽样选择战略,通过消除冗余和改进与锚点的关联,优化正/负组合。在基准数据集上进行的实验表明,我们的反向超强反向反向学习(ASCL)方法在稳健的准确性方面比近于最先进的防御能力2.6美元。而我们ACTL与拟议的选择战略可以进一步获得1.4美元改进,只有42.8美元正和6.3美元负值,而与ASCL相比,没有一个反向选择战略。