In many applications, data is easy to acquire but expensive and time-consuming to label prominent examples include medical imaging and NLP. This disparity has only grown in recent years as our ability to collect data improves. Under these constraints, it makes sense to select only the most informative instances from the unlabeled pool and request an oracle (e.g., a human expert) to provide labels for those samples. The goal of active learning is to infer the informativeness of unlabeled samples so as to minimize the number of requests to the oracle. Here, we formulate active learning as an open-set recognition problem. In this paradigm, only some of the inputs belong to known classes; the classifier must identify the rest as unknown. More specifically, we leverage variational neural networks (VNNs), which produce high-confidence (i.e., low-entropy) predictions only for inputs that closely resemble the training data. We use the inverse of this confidence measure to select the samples that the oracle should label. Intuitively, unlabeled samples that the VNN is uncertain about are more informative for future training. We carried out an extensive evaluation of our novel, probabilistic formulation of active learning, achieving state-of-the-art results on MNIST, CIFAR-10, and CIFAR-100. Additionally, unlike current active learning methods, our algorithm can learn tasks without the need for task labels. As our experiments show, when the unlabeled pool consists of a mixture of samples from multiple datasets, our approach can automatically distinguish between samples from seen vs. unseen tasks.
翻译:在许多应用中,数据容易获取,但费用昂贵且耗时费时,标签突出的例子包括医疗成像和NLP。随着我们收集数据的能力的提高,这种差异在最近几年才随着数据采集能力的提高而扩大。在这些限制下,我们从未贴标签的库中只选择信息最丰富的实例,并要求一个神器(例如,一位人类专家)为这些样品提供标签是有道理的。积极学习的目的是推断未贴标签的样品的丰富性,以便尽可能减少对甲骨文的要求数量。在这里,我们将积极学习作为开放的识别问题。在这个模式中,只有部分输入属于已知的类别;在这种模式中,分类者必须确定其余部分为未知的。更具体地说,我们利用变异性神经网络(VNNNFs)来选择最丰富的实例,要求一个神器(例如,一位人类专家)来为这些样品提供标签。我们使用这种信任度的反向量度来选择对甲骨骼进行标记的样品。我们非标签的方法可以直截然地区分非标签的样品。 VNNFN的样品是属于已知的,对已知的样品属于已知的类别的,对于未来的样品的样本进行不确证培训的,我们需要学习的,我们进行中的新的,我们进行着一个动态的,我们进行着样的,我们进行着一个动态的,我们进行着样式的飞行的研判的研判。