In open-world learning, an agent starts with a set of known classes, detects, and manages things that it does not know, and learns them over time from a non-stationary stream of data. Open-world learning is related to but also distinct from a multitude of other learning problems and this paper briefly analyzes the key differences between a wide range of problems including incremental learning, generalized novelty discovery, and generalized zero-shot learning. This paper formalizes various open-world learning problems including open-world learning without labels. These open-world problems can be addressed with modifications to known elements, we present a new framework that enables agents to combine various modules for novelty-detection, novelty-characterization, incremental learning, and instance management to learn new classes from a stream of unlabeled data in an unsupervised manner, survey how to adapt a few state-of-the-art techniques to fit the framework and use them to define seven baselines for performance on the open-world learning without labels problem. We then discuss open-world learning quality and analyze how that can improve instance management. We also discuss some of the general ambiguity issues that occur in open-world learning without labels.
翻译:在开放世界的学习中,一个代理机构先从一系列已知的班级开始,检测和管理它所不知道的事物,然后从非静止的数据流中学习。开放世界的学习与许多其他学习问题相关,但也与许多其他学习问题不同。本文简要分析了一系列广泛的问题之间的关键差异,包括渐进学习、普遍的新发现和普遍零弹式学习。本文件将开放世界的各种学习问题正式化,包括开放世界的无标签学习。这些开放世界的问题可以随着对已知要素的修改而得到解决,我们提出了一个新的框架,使代理机构能够将各种模块结合起来,用于新发现、创新特征描述、渐进学习和实例管理,以便以一种不受监督的方式从无标签数据流中学习新课程,调查如何调整少数最先进的技术,以适应框架,并利用这些技术为开放世界的学习确定七个基准,而没有标签问题。我们然后讨论开放世界学习质量,并分析如何改进实例管理。我们还讨论了在开放世界学习过程中出现的一些普遍模糊问题,而没有标签。