As AI agents are increasingly used in the real open world with unknowns or novelties, they need the ability to (1) recognize objects that (i) they have learned and (ii) detect items that they have not seen or learned before, and (2) learn the new items incrementally to become more and more knowledgeable and powerful. (1) is called novelty detection or out-of-distribution (OOD) detection and (2) is called class incremental learning (CIL), which is a setting of continual learning (CL). In existing research, OOD detection and CIL are regarded as two completely different problems. This paper theoretically proves that OOD detection actually is necessary for CIL. We first show that CIL can be decomposed into two sub-problems: within-task prediction (WP) and task-id prediction (TP). We then prove that TP is correlated with OOD detection. The key theoretical result is that regardless of whether WP and OOD detection (or TP) are defined explicitly or implicitly by a CIL algorithm, good WP and good OOD detection are necessary and sufficient conditions for good CIL, which unifies novelty or OOD detection and continual learning (CIL, in particular). A good CIL algorithm based on our theory can naturally be used in open world learning, which is able to perform both novelty/OOD detection and continual learning. Based on the theoretical result, new CIL methods are also designed, which outperform strong baselines in terms of CIL accuracy and its continual OOD detection by a large margin.
翻译:随着AI代理在未知或新颖性世界中越来越广泛地使用,它们需要具备以下能力:(1)识别它们已学习到的对象和检测它们以前没有接触过或学习过的对象;(2)逐步学习新的对象,以便变得更加知识渊博和强大。 (1)被称为新颖性检测或分布外(OOD)检测,(2)被称为类别增量学习(CIL),这是连续学习(CL)的一种设置。在现有研究中,OOD检测和CIL被视为两个完全不同的问题。本文从理论上证明了OOD检测实际上对于CIL是必要的。我们首先展示了CIL可以分解为两个子问题:任务内预测(WP)和任务标识预测(TP)。然后我们证明了TP与OOD检测相关。关键的理论结果是,无论WP和OOD检测(或TP)是否由CIL算法明确定义或隐含定义,良好的WP和OOD检测都是良好的CIL的必要和充分条件,从而统一了新颖性或OOD检测和连续学习(特别是CIL)。基于我们的理论的良好CIL算法自然可以用于开放世界学习,可以执行新颖/OOD检测和连续学习。基于这一理论结果,还设计了新的CIL方法,它们在CIL准确性及其持续OOD检测方面都比强大的基线算法表现优异。