Generative Dual Adversarial Network for Generalized Zero-shot Learning

This paper studies the problem of generalized zero-shot learning, in which a model is trained on image-label pairs from a set of seen classes and is then tested on classifying new images from both seen and unseen classes. Most previous models try to learn a fixed one-directional mapping between the visual and semantic spaces, while some recently proposed generative methods try to generate image features for unseen classes so that zero-shot learning becomes a traditional fully supervised classification problem. In this paper, we propose a novel model that provides a unified framework for three different approaches: visual->semantic mapping, semantic->visual mapping, and metric learning. Specifically, our proposed model consists of a feature generator that can generate various visual features given class embeddings as input, a regressor that maps each visual feature back to its corresponding class embedding, and a discriminator that learns to evaluate the closeness of an image feature and a class embedding. All three components are trained under a combination of cyclic consistency loss and dual adversarial loss. Experimental results show that our model not only maintains higher accuracy in classifying images from seen classes, but also outperforms existing state-of-the-art models in classifying images from unseen classes.
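The three components described in the abstract can be illustrated with a minimal forward-pass sketch. It is not the authors' implementation. The toy dimensions, the single-layer linear maps, the noise input to the generator, and the exact form of the two adversarial terms are all assumptions made for illustration; the abstract only states that a generator, a regressor, and a discriminator are trained with a cyclic consistency loss and a dual adversarial loss. Plain Python is used to keep the sketch dependency-free, and no training step is shown.

```python
import math
import random

random.seed(0)

# Hypothetical toy sizes; the abstract does not give dimensions.
EMB_DIM, NOISE_DIM, FEAT_DIM = 4, 2, 6


def linear(in_dim, out_dim):
    """Random (out_dim x in_dim) weight matrix stored as nested lists."""
    return [[random.gauss(0.0, 0.1) for _ in range(in_dim)]
            for _ in range(out_dim)]


def matvec(weights, x):
    """Multiply a nested-list matrix by a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def mse(a, b):
    """Mean squared error between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


# Single linear layers stand in for the real networks (assumption).
W_gen = linear(EMB_DIM + NOISE_DIM, FEAT_DIM)  # semantic -> visual
W_reg = linear(FEAT_DIM, EMB_DIM)              # visual -> semantic
W_dis = linear(FEAT_DIM + EMB_DIM, 1)          # closeness of a pair


def generate(class_emb, noise):
    """Generator: class embedding plus noise -> visual feature."""
    return matvec(W_gen, class_emb + noise)


def regress(feature):
    """Regressor: visual feature -> class embedding."""
    return matvec(W_reg, feature)


def discriminate(feature, class_emb):
    """Discriminator: score in (0, 1) for a (feature, embedding) pair."""
    return sigmoid(matvec(W_dis, feature + class_emb)[0])


def sketch_losses(real_feature, class_emb):
    """Return (cyclic, adversarial-generator, adversarial-regressor) terms."""
    noise = [random.gauss(0.0, 1.0) for _ in range(NOISE_DIM)]
    fake_feature = generate(class_emb, noise)

    # Cyclic consistency: embedding -> feature -> embedding should round-trip.
    cyc = mse(regress(fake_feature), class_emb)

    # Assumed "dual" adversarial terms: the discriminator should accept the
    # real pair and reject (generated feature, embedding) as well as
    # (real feature, regressed embedding).
    real_score = discriminate(real_feature, class_emb)
    adv_gen = -math.log(real_score) - math.log(
        1.0 - discriminate(fake_feature, class_emb))
    adv_reg = -math.log(real_score) - math.log(
        1.0 - discriminate(real_feature, regress(real_feature)))
    return cyc, adv_gen, adv_reg


# Made-up inputs, for illustration only.
class_emb = [0.2, -0.1, 0.4, 0.0]
real_feature = [0.5, 0.1, -0.3, 0.2, 0.0, 0.7]
cyc, adv_gen, adv_reg = sketch_losses(real_feature, class_emb)
total = cyc + adv_gen + adv_reg
```

In this sketch the generator covers the semantic->visual direction, the regressor covers the visual->semantic direction, and the discriminator plays the role of a learned metric over feature-embedding pairs, which is how the abstract frames the unification of the three approaches.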


