Contrastive Language-Image Pre-Training with Knowledge Graphs


Knowledge · Graph · Contrastive · Knowledge Graph · Extensibility

October 17, 2022


Xuran Pan, Tianzhu Ye, Dongchen Han, Shiji Song, Gao Huang

From arXiv; accepted by NeurIPS 2022

Recent years have witnessed the fast development of large-scale pre-training frameworks that can extract multi-modal representations in a unified form and achieve promising performance when transferred to downstream tasks. Nevertheless, existing approaches mainly focus on pre-training with simple image-text pairs, while neglecting the semantic connections between concepts from different modalities. In this paper, we propose a knowledge-based pre-training framework, dubbed Knowledge-CLIP, which injects semantic information into the widely used CLIP model. By introducing knowledge-based objectives in the pre-training process and utilizing different types of knowledge graphs as training data, our model can semantically align the representations in vision and language with higher quality, and enhance the reasoning ability across scenarios and modalities. Extensive experiments on various vision-language downstream tasks demonstrate the effectiveness of Knowledge-CLIP compared with the original CLIP and competitive baselines.
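For readers unfamiliar with the baseline the abstract builds on, the sketch below illustrates the symmetric contrastive objective that the original CLIP model uses on image-text pairs. It is not the Knowledge-CLIP objective, which the abstract does not specify. The function names, the toy 2-dimensional embeddings, and the fixed temperature of 0.07 are assumptions made for illustration only.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def log_softmax(row):
    """Numerically stable log-softmax over a list of logits."""
    m = max(row)
    log_sum = m + math.log(sum(math.exp(x - m) for x in row))
    return [x - log_sum for x in row]


def clip_contrastive_loss(image_embs, text_embs, temperature=0.07):
    """Symmetric cross-entropy over a batch; image i is paired with text i.

    The temperature is fixed here for simplicity (an assumption of this sketch).
    """
    images = [l2_normalize(v) for v in image_embs]
    texts = [l2_normalize(v) for v in text_embs]
    n = len(images)
    # logits[i][j] = cosine similarity of image i and text j, divided by temperature
    logits = [
        [sum(a * b for a, b in zip(img, txt)) / temperature for txt in texts]
        for img in images
    ]
    # Image-to-text direction: each row should peak on its diagonal entry.
    loss_i2t = -sum(log_softmax(logits[i])[i] for i in range(n)) / n
    # Text-to-image direction: each column should peak on its diagonal entry.
    columns = [[logits[i][j] for i in range(n)] for j in range(n)]
    loss_t2i = -sum(log_softmax(columns[j])[j] for j in range(n)) / n
    return (loss_i2t + loss_t2i) / 2


# Toy batch (hypothetical embeddings): correctly paired vs. swapped pairs.
embs = [[1.0, 0.0], [0.0, 1.0]]
loss_aligned = clip_contrastive_loss(embs, embs)
loss_shuffled = clip_contrastive_loss(embs, embs[::-1])
print(loss_aligned < loss_shuffled)
```

Correctly paired embeddings yield a loss near zero, while swapped pairs are penalized heavily; this pairwise signal is what the abstract calls pre-training with simple image-text pairs.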

