Generative based strategy has shown great potential in the Generalized Zero-Shot Learning task. However, it suffers severe generalization problem due to lacking of feature diversity for unseen classes to train a good classifier. In this paper, we propose to enhance the generalizability of GZSL models via improving feature diversity of unseen classes. For this purpose, we present a novel Diverse Feature Synthesis (DFS) model. Different from prior works that solely utilize semantic knowledge in the generation process, DFS leverages visual knowledge with semantic one in a unified way, thus deriving class-specific diverse feature samples and leading to robust classifier for recognizing both seen and unseen classes in the testing phase. To simplify the learning, DFS represents visual and semantic knowledge in the aligned space, making it able to produce good feature samples with a low-complexity implementation. Accordingly, DFS is composed of two consecutive generators: an aligned feature generator, transferring semantic and visual representations into aligned features; a synthesized feature generator, producing diverse feature samples of unseen classes in the aligned space. We conduct comprehensive experiments to verify the efficacy of DFS. Results demonstrate its effectiveness to generate diverse features for unseen classes, leading to superior performance on multiple benchmarks. Code will be released upon acceptance.
翻译:在通用零热学习任务中,基于生成的战略显示了巨大的潜力,然而,由于对无法见的班级缺乏特色多样性,无法训练优秀的分类师,因此,它面临着严重的普遍性问题。在本文件中,我们提议通过改进不可见班级的特征多样性,提高GZSL模型的通用性。为此,我们提出了一个新型的多样化地貌合成模型(DFS),不同于以前仅利用生成过程中的语义知识的以往工作,外勤部以统一的方式将视觉知识与语义学知识结合起来,从而产生不同类别的不同特征样本,并导致形成强有力的分类,以在测试阶段识别可见和看不见的班级。为了简化学习,外勤部在统一的空间中代表视觉和语义知识,使其能产生执行不那么兼容性强的良好特征样本。因此,外勤部由两个连续的发电机组成:一个统一的地貌生成器,将语义和视觉表达方式转换成一致的特征;一个综合地貌生成器,生成了统一的空间中看不见班级的不同特征样本。我们进行全面实验,以核实外勤部的功效。结果将显示其有效性,将显示其高端标准。