This work is a systematical analysis on the so-called hard class problem in zero-shot learning (ZSL), that is, some unseen classes disproportionally affect the ZSL performances than others, as well as how to remedy the problem by detecting and exploiting hard classes. At first, we report our empirical finding that the hard class problem is a ubiquitous phenomenon and persists regardless of used specific methods in ZSL. Then, we find that high semantic affinity among unseen classes is a plausible underlying cause of hardness and design two metrics to detect hard classes. Finally, two frameworks are proposed to remedy the problem by detecting and exploiting hard classes, one under inductive setting, the other under transductive setting. The proposed frameworks could accommodate most existing ZSL methods to further significantly boost their performances with little efforts. Extensive experiments on three popular benchmarks demonstrate the benefits by identifying and exploiting the hard classes in ZSL.
翻译:这项工作是对零点学习(ZSL)中所谓的硬阶级问题进行系统分析,即一些隐匿阶级对ZSL的表现影响比其他阶级大得多,以及如何通过探测和利用硬阶级来纠正问题。首先,我们报告我们的经验发现,硬阶级问题是一种普遍存在的现象,无论ZSL采用何种具体方法,都持续存在。然后,我们发现,隐匿阶级之间的高语义亲近性是困难的根本原因,我们设计了两种衡量标准来探测硬阶级。最后,提出了两个框架来通过探测和开发硬阶级来纠正问题,一个是潜入式的,另一个是感化式的。提议的框架可以适应大多数现有的ZSLL方法,以很少的努力来大大提升他们的绩效。关于三种流行基准的广泛实验通过查明和利用ZSL的硬阶级来证明其好处。