争取以快速微调方式制作开放词汇场景图 (Towards Open-vocabulary Scene Graph Generation with Prompt-based Finetuning) - 专知论文

会员服务 ·

0

情景 · 图 · Extensibility · 推断 · 类别 ·

2022 年 8 月 17 日

Towards Open-vocabulary Scene Graph Generation with Prompt-based Finetuning

翻译：争取以快速微调方式制作开放词汇场景图

Tao He,Lianli Gao,Jingkuan Song,Yuan-Fang Li

Scene graph generation (SGG) is a fundamental task aimed at detecting visual relations between objects in an image. The prevailing SGG methods require all object classes to be given in the training set. Such a closed setting limits the practical application of SGG. In this paper, we introduce open-vocabulary scene graph generation, a novel, realistic and challenging setting in which a model is trained on a set of base object classes but is required to infer relations for unseen target object classes. To this end, we propose a two-step method that firstly pre-trains on large amounts of coarse-grained region-caption data and then leverages two prompt-based techniques to finetune the pre-trained model without updating its parameters. Moreover, our method can support inference over completely unseen object classes, which existing methods are incapable of handling. On extensive experiments on three benchmark datasets, Visual Genome, GQA, and Open-Image, our method significantly outperforms recent, strong SGG methods on the setting of Ov-SGG, as well as on the conventional closed SGG.

翻译：场景图形生成( SGG) 是一项基本任务,旨在检测图像中对象之间的视觉关系。流行的 SGG 方法要求所有对象类别在训练组中提供。这种封闭式设置限制了SGG的实际应用。在本文中,我们引入了开放式词汇场景图形生成,这是一个新颖、现实和具有挑战性的设置,在一组基本对象类别上对模型进行了培训,但需要据此推断看不见目标对象类别的关系。为此,我们提出了一个两步方法,首先在大量粗略区域覆盖数据上进行预演,然后利用两种即时技术在不更新参数的情况下对预培训模型进行微调。此外,我们的方法可以支持对现有方法无法处理的完全看不见的物体类别进行推断。在对三个基准数据集(视觉基因组、GQA和Open-Image)进行的广泛实验中,我们的方法大大超越了在设计Ov- SGG 上以及常规封闭的 SGGG 上最近的强大SGG 方法。

0

相关内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

161+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

Cr离子变价对镍基合金氟盐腐蚀行为的影响及机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

2+阅读 · 2014年12月31日

遥感图像融合方法的几何性能研究

国家自然科学基金

1+阅读 · 2013年12月31日

可替代HEMA 的Janus纳米复合材料的合成与功能

国家自然科学基金

0+阅读 · 2013年12月31日

GSK-3β/β-catenin信号通路参与ARDS后认知功能障碍发生的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

钙钛矿结构Cr基氧化物单晶的制备和磁电效应研究

国家自然科学基金

0+阅读 · 2013年12月31日

GSK-3β调控血管平滑肌细胞特异性转录因子Myocardin对动脉粥样硬化斑块形成作用及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

含2-氨基嘧啶π-共轭聚合物的合成及其光、电性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

斑马鱼β-catenin核转运的调控机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

水溶性天然高分子重金属螯合剂的可控合成、结构调控与性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

Exploring Effective Knowledge Transfer for Few-shot Object Detection

Arxiv

0+阅读 · 2022年10月5日

Unbiased Scene Graph Generation using Predicate Similarities

Arxiv

0+阅读 · 2022年10月3日

Prompt Tuning for Graph Neural Networks

Arxiv

0+阅读 · 2022年9月30日

Towards Large-Scale Small Object Detection: Survey and Benchmarks

Arxiv

40+阅读 · 2022年7月28日

Scene Graph Generation: A Comprehensive Survey

Arxiv

26+阅读 · 2022年1月3日

Towards Open World Object Detection

Arxiv

13+阅读 · 2021年3月3日

Few-Shot Graph Classification with Model Agnostic Meta-Learning

Arxiv

23+阅读 · 2020年3月18日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

Text Generation from Knowledge Graphs with Graph Transformers

Arxiv

35+阅读 · 2019年4月4日

Hierarchical Graph Representation Learning with Differentiable Pooling

Hierarchical Graph Representation Learning with Differentiable Pooling

Arxiv

14+阅读 · 2018年6月26日

VIP会员

文章信息

相关主题

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

161+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《机器人弹性物体感知技术研究》227页

《关于乌克兰战争现状的七项当代洞察》译文

OpenAI“开放权重模型”即将进入美军作战体系

《大语言模型在航空发动机系统诊断与维护中的应用研究》

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

相关论文

Exploring Effective Knowledge Transfer for Few-shot Object Detection

Arxiv

0+阅读 · 2022年10月5日

Unbiased Scene Graph Generation using Predicate Similarities

Arxiv

0+阅读 · 2022年10月3日

Prompt Tuning for Graph Neural Networks

Arxiv

0+阅读 · 2022年9月30日

Towards Large-Scale Small Object Detection: Survey and Benchmarks

Arxiv

40+阅读 · 2022年7月28日

Scene Graph Generation: A Comprehensive Survey

Arxiv

26+阅读 · 2022年1月3日

Towards Open World Object Detection

Arxiv

13+阅读 · 2021年3月3日

Few-Shot Graph Classification with Model Agnostic Meta-Learning

Arxiv

23+阅读 · 2020年3月18日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

Text Generation from Knowledge Graphs with Graph Transformers

Arxiv

35+阅读 · 2019年4月4日

Hierarchical Graph Representation Learning with Differentiable Pooling

Hierarchical Graph Representation Learning with Differentiable Pooling

Arxiv

14+阅读 · 2018年6月26日

相关基金

Cr离子变价对镍基合金氟盐腐蚀行为的影响及机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

2+阅读 · 2014年12月31日

遥感图像融合方法的几何性能研究

国家自然科学基金

1+阅读 · 2013年12月31日

可替代HEMA 的Janus纳米复合材料的合成与功能

国家自然科学基金

0+阅读 · 2013年12月31日

GSK-3β/β-catenin信号通路参与ARDS后认知功能障碍发生的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

钙钛矿结构Cr基氧化物单晶的制备和磁电效应研究

国家自然科学基金

0+阅读 · 2013年12月31日

GSK-3β调控血管平滑肌细胞特异性转录因子Myocardin对动脉粥样硬化斑块形成作用及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

含2-氨基嘧啶π-共轭聚合物的合成及其光、电性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

斑马鱼β-catenin核转运的调控机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

水溶性天然高分子重金属螯合剂的可控合成、结构调控与性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员