Considering the nature of unlabelled data, it is common for partially labelled training datasets to contain samples that belong to novel categories. Although these so-called observed novel categories exist in the training data, they do not belong to any of the training labels. In contrast, open-sets define novel categories as those unobserved during during training, but present during testing. This research is the first to generalize between observed and unobserved novel categories within a new learning policy called open-set learning with augmented category by exploiting unlabeled data or open-LACU. This study conducts a high-level review on novelty detection so to differentiate between research fields that concern observed novel categories, and the research fields that concern unobserved novel categories. Open-LACU is then introduced as a synthesis of the relevant fields to maintain the advantages of each within a single learning policy. Currently, we are finalising the first open-LACU network which will be combined with this pre-print to be sent for publication.
翻译:考虑到未贴标签数据的性质,部分标签的培训数据集通常包含属于新分类的样本。虽然在培训数据中存在这些所谓的观察新类别,但这些所谓的观察新类别不属于任何培训标签。相反,开放数据集将新类别定义为培训期间未观察到的类别,但在测试期间存在。这一研究是第一个在所谓的开放分类学习的新学习政策(通过开发未贴标签的数据或开放的拉加单位,扩大类别)中将观察和未观察的新类别加以概括的新类别。本研究对新发现领域进行高级别审查,以便区分与所观察到的新类别有关的研究领域和未观察到的新类别有关的研究领域。然后,将开放拉加单位作为相关领域的综合体加以介绍,以便在单一的学习政策中维护每个领域的优势。目前,我们正在最后确定第一个开放拉加U网络,它将与这一预印一起发送出版。