Supervised representation learning with deep networks tends to overfit the training classes, and generalization to novel classes remains a challenging question. It is common to evaluate a learned embedding on held-out images of the same training classes. In real applications, however, data comes from new sources and novel classes are likely to arise. We hypothesize that incorporating unlabelled images of novel classes into the training set in a semi-supervised fashion is beneficial for the efficient retrieval of novel-class images compared to a vanilla supervised representation. To verify this hypothesis comprehensively, we propose an original evaluation methodology that varies the degree of novelty of the novel classes by partitioning the dataset category-wise, either randomly or semantically, i.e. by minimizing the semantics shared between base and novel classes. This evaluation procedure allows us to train a representation blindly to any novel-class labels and to evaluate the frozen representation on the retrieval of base or novel classes. We find that a vanilla supervised representation falls short on novel-class retrieval, all the more so when the semantic gap is larger. Semi-supervised algorithms partially bridge this performance gap, but much room for improvement remains.
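To make the split-then-freeze protocol concrete, the sketch below illustrates one plausible reading of it in Python: classes are partitioned into base and novel sets either at random or by keeping whole superclasses on one side of the split, and the frozen embedding is scored on retrieval. The superclass mapping (`superclass_of`, e.g. derived from WordNet hypernyms), the greedy packing heuristic, and the Recall@1 metric are illustrative assumptions, not the paper's exact procedure.

```python
import random
from collections import defaultdict

import numpy as np

def random_split(classes, n_base, seed=0):
    """Random class-wise partition into base (labelled) and novel classes."""
    rng = random.Random(seed)
    pool = list(classes)
    rng.shuffle(pool)
    return pool[:n_base], pool[n_base:]

def semantic_split(classes, n_base, superclass_of):
    """Semantic partition: keep each superclass (assumed given by
    `superclass_of`, e.g. WordNet hypernyms) entirely on one side of the
    split, which minimizes the semantics shared between base and novel."""
    groups = defaultdict(list)
    for c in classes:
        groups[superclass_of[c]].append(c)
    base, novel = [], []
    # Greedily assign whole superclasses to the base split until it is full.
    for _, members in sorted(groups.items(), key=lambda kv: len(kv[1]), reverse=True):
        target = base if len(base) + len(members) <= n_base else novel
        target.extend(members)
    return base, novel

def recall_at_1(query_emb, query_lab, gallery_emb, gallery_lab):
    """Retrieval with a frozen representation: rank the gallery by cosine
    similarity and check whether the nearest neighbour shares the query label."""
    q = np.asarray(query_emb, dtype=float)
    g = np.asarray(gallery_emb, dtype=float)
    q = q / np.linalg.norm(q, axis=1, keepdims=True)
    g = g / np.linalg.norm(g, axis=1, keepdims=True)
    nearest = (q @ g.T).argmax(axis=1)
    return float((np.asarray(gallery_lab)[nearest] == np.asarray(query_lab)).mean())
```

Under this reading, the representation is trained with labels from the base split only (plus unlabelled novel images in the semi-supervised setting), then frozen, and `recall_at_1` is computed separately on base-class and novel-class queries to quantify the generalization gap.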