强力积极学习:强力深深学习模式的抽样有效培训 (Robust Active Learning: Sample-Efficient Training of Robust Deep Learning Models)

Active learning is an established technique to reduce the labeling cost to build high-quality machine learning models. A core component of active learning is the acquisition function that determines which data should be selected to annotate. State-of-the-art acquisition functions -- and more largely, active learning techniques -- have been designed to maximize the clean performance (e.g. accuracy) and have disregarded robustness, an important quality property that has received increasing attention. Active learning, therefore, produces models that are accurate but not robust. In this paper, we propose \emph{robust active learning}, an active learning process that integrates adversarial training -- the most established method to produce robust models. Via an empirical study on 11 acquisition functions, 4 datasets, 6 DNN architectures, and 15105 trained DNNs, we show that robust active learning can produce models with the robustness (accuracy on adversarial examples) ranging from 2.35\% to 63.85\%, whereas standard active learning systematically achieves negligible robustness (less than 0.20\%). Our study also reveals, however, that the acquisition functions that perform well on accuracy are worse than random sampling when it comes to robustness. We, therefore, examine the reasons behind this and devise a new acquisition function that targets both clean performance and robustness. Our acquisition function -- named density-based robust sampling with entropy (DRE) -- outperforms the other acquisition functions (including random) in terms of robustness by up to 24.40\% (3.84\% than random particularly), while remaining competitive on accuracy. Additionally, we prove that DRE is applicable as a test selection metric for model retraining and stands out from all compared functions by up to 8.21\% robustness.

翻译：积极学习是降低标签成本以建立高质量机器学习模式的既定技术。积极学习的核心部分是获取功能,它决定了哪些数据应当被选作注释。最先进的获取功能 -- -- 更重要的是,积极学习技术 -- -- 设计的目的是最大限度地提高清洁性能(例如准确性),忽视了强性,这是人们日益关注的一个重要质量属性。因此,积极学习生成了准确性但不强的模型。在本文中,我们建议了将对抗性培训 -- -- 最成熟的生成强性模型的方法 -- -- 纳入对抗性培训的积极学习过程。对11个获取功能、4个数据集、6个DNNN架构和15105个经过培训的DNNS进行了经验研究,我们表明,强性学习能够产生强性模型(对对抗性实例的准确性能),从2.35 ⁇ 到63.85 ⁇ 至6.8,而标准的主动性学习模式系统化(比强性强性强性(低于0.20 ⁇ ) 。然而,我们的研究还表明,对于准确性的获取功能尤其差于准确性,当我们进行随机性测试时,而有更强性测试性功能。