《多模式贺卡数据集监督说明》 (Weakly Supervised Annotations for Multi-modal Greeting Cards Dataset)

In recent years, there is a growing number of pre-trained models trained on a large corpus of data and yielding good performance on various tasks such as classifying multimodal datasets. These models have shown good performance on natural images but are not fully explored for scarce abstract concepts in images. In this work, we introduce an image/text-based dataset called Greeting Cards. Dataset (GCD) that has abstract visual concepts. In our work, we propose to aggregate features from pretrained images and text embeddings to learn abstract visual concepts from GCD. This allows us to learn the text-modified image features, which combine complementary and redundant information from the multi-modal data streams into a single, meaningful feature. Secondly, the captions for the GCD dataset are computed with the pretrained CLIP-based image captioning model. Finally, we also demonstrate that the proposed the dataset is also useful for generating greeting card images using pre-trained text-to-image generation model.

翻译：近年来,越来越多的经过培训的模型经过培训,掌握了大量数据,在诸如多式联运数据集分类等各种任务上取得了良好的业绩。这些模型在自然图像上表现良好,但对于图像中稀少的抽象概念没有进行充分的探索。在这项工作中,我们引入了一个图像/基于文本的数据集,称为Greeting Cards。具有抽象视觉概念的数据集(GCD)。在我们的工作中,我们提议汇总来自预先培训的图像和文本嵌入的特征,以学习GCD的抽象视觉概念。这使我们能够学习文本修改后的图像特征,将多模式数据流中的补充和冗余信息合并成一个单一的、有意义的特征。第二,GCD数据集的描述是用预先培训的CLIP图像说明模型计算出来的。最后,我们还表明提议的数据集对于使用经过培训的文本到图像生成模型生成贺卡图像也是有益的。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【WWW2020】学习上下文化文档表示用于医疗答案检索，Learning Contextualized Document Representations for Healthcare Answer Retrieval

专知会员服务

26+阅读 · 2020年2月10日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日