保留标题信息: 防止快捷键学习 (Keep the Caption Information: Preventing Shortcut Learning in Contrastive Image-Caption Retrieval) - 专知论文

会员服务 ·

0

contrastive · 可约的 · 学成 · 优化器 · 解码 ·

2022 年 4 月 28 日

Keep the Caption Information: Preventing Shortcut Learning in Contrastive Image-Caption Retrieval

翻译：保留标题信息: 防止快捷键学习

Maurits Bleeker,Andrew Yates,Maarten de Rijke

from arxiv, Preprint

To train image-caption retrieval (ICR) methods, contrastive loss functions are a common choice for optimization functions. Unfortunately, contrastive ICR methods are vulnerable to learning shortcuts: decision rules that perform well on the training data but fail to transfer to other testing conditions. We introduce an approach to reduce shortcut feature representations for the ICR task: latent target decoding (LTD). We add an additional decoder to the learning framework to reconstruct the input caption, which prevents the image and caption encoder from learning shortcut features. Instead of reconstructing input captions in the input space, we decode the semantics of the caption in a latent space. We implement the LTD objective as an optimization constraint, to ensure that the reconstruction loss is below a threshold value while primarily optimizing for the contrastive loss. Importantly, LTD does not depend on additional training data or expensive (hard) negative mining strategies. Our experiments show that, unlike reconstructing the input caption, LTD reduces shortcut learning and improves generalizability by obtaining higher recall@k and r-precision scores. Additionally, we show that the evaluation scores benefit from implementing LTD as an optimization constraint instead of a dual loss.

翻译：为了培训图像解码(ICR)方法,对比式损失功能是优化功能的一种常见选择。不幸的是,对比式的ICR方法容易学习捷径:决定规则对培训数据效果良好,但未能转移到其他测试条件。我们引入了一种办法来减少ICR任务的捷径特征显示:潜在目标解码(LTD) 。我们在学习框架中增加了一个解码器,以重建输入标题,使图像和字幕编码编码器无法学习快捷功能。我们不是在输入空间重建输入标题,而是在隐蔽空间解码标题的语义。我们把LTD目标作为一种优化限制,以确保重建损失低于临界值,而主要是优化对比性损失。重要的是,LTD并不依赖额外的培训数据或昂贵(硬)负式采矿战略。我们的实验显示,与重建输入标题不同的是,LTD减少快捷式学习,并通过获取更高的回溯@k和r-precision分数来改进一般性。此外,我们显示,实施LTD作为双重损失最高限值的评价得分。

0

相关内容

contrastive

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

80+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

20篇「ACL2020」最新论文抢先看！看自然语言处理2020在研究什么？

20篇「ACL2020」最新论文抢先看！看自然语言处理2020在研究什么？

专知会员服务

97+阅读 · 2020年4月10日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

TRAF3IP3调控T细胞活性与肿瘤免疫的分子机制

国家自然科学基金

0+阅读 · 2016年12月31日

CyPA/CD147信号通路在蛛网膜下腔出血后早期脑损伤中的作用机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

微通道内非牛顿流体-弹性颗粒液固两相流动特性研究

国家自然科学基金

0+阅读 · 2014年12月31日

PSMA通过TRAF6和TTC3调控前列腺癌细胞自噬在CRPC产生过程中的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

表面层片状梯度纳米结构金属塑性行为与微观机制

国家自然科学基金

0+阅读 · 2013年12月31日

硅溶胶负载芳基磷酸盐杂化成核剂的制备及成核机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

缺血性脑损伤介导的ErbB4胞内结构域分解的分子机制及作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

TRAIL协同IER3调节NF-κB信号通路介导肝癌细胞凋亡的相关机制研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于Decorin基因甲基化调控的非小细胞肺癌转移的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

电磁场驱动纳米管线旋转的实验及数值模拟研究

国家自然科学基金

0+阅读 · 2008年12月31日

DIRECTOR: Generator-Classifiers For Supervised Language Modeling

DIRECTOR: Generator-Classifiers For Supervised Language Modeling

Arxiv

0+阅读 · 2022年6月15日

Enhancing Egocentric 3D Pose Estimation with Third Person Views

Arxiv

0+阅读 · 2022年6月15日

TLDR: Twin Learning for Dimensionality Reduction

Arxiv

0+阅读 · 2022年6月15日

Learning to Reduce Information Bottleneck for Object Detection in Aerial Images

Arxiv

0+阅读 · 2022年6月15日

Low-Rank Hankel Tensor Completion for Traffic Speed Estimation

Arxiv

0+阅读 · 2022年6月14日

On the Role of Channel Capacity in Learning Gaussian Mixture Models

Arxiv

0+阅读 · 2022年6月14日

Using Defect Prediction to Improve the Bug Detection Capability of Search-Based Software Testing

Arxiv

0+阅读 · 2022年6月14日

EnergyMatch: Energy-based Pseudo-Labeling for Semi-Supervised Learning

Arxiv

0+阅读 · 2022年6月13日

STD-NET: Search of Image Steganalytic Deep-learning Architecture via Hierarchical Tensor Decomposition

Arxiv

0+阅读 · 2022年6月12日

Few-shot Learning for Multi-label Intent Detection

Arxiv

21+阅读 · 2020年10月11日

VIP会员

文章信息

相关主题

相关VIP内容

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

80+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

20篇「ACL2020」最新论文抢先看！看自然语言处理2020在研究什么？

20篇「ACL2020」最新论文抢先看！看自然语言处理2020在研究什么？

专知会员服务

97+阅读 · 2020年4月10日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《面向未来：军事应用中基于人工智能融合的场景分析及其对全球安全的影响》

《美陆军特种作战条令》最新102页

《美军条令：斯特赖克步兵步枪排与班作战条令》最新450页

美空军作战实验室通过人工智能和指挥控制技术创新推进杀伤链

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

DIRECTOR: Generator-Classifiers For Supervised Language Modeling

DIRECTOR: Generator-Classifiers For Supervised Language Modeling

Arxiv

0+阅读 · 2022年6月15日

Enhancing Egocentric 3D Pose Estimation with Third Person Views

Arxiv

0+阅读 · 2022年6月15日

TLDR: Twin Learning for Dimensionality Reduction

Arxiv

0+阅读 · 2022年6月15日

Learning to Reduce Information Bottleneck for Object Detection in Aerial Images

Arxiv

0+阅读 · 2022年6月15日

Low-Rank Hankel Tensor Completion for Traffic Speed Estimation

Arxiv

0+阅读 · 2022年6月14日

On the Role of Channel Capacity in Learning Gaussian Mixture Models

Arxiv

0+阅读 · 2022年6月14日

Using Defect Prediction to Improve the Bug Detection Capability of Search-Based Software Testing

Arxiv

0+阅读 · 2022年6月14日

EnergyMatch: Energy-based Pseudo-Labeling for Semi-Supervised Learning

Arxiv

0+阅读 · 2022年6月13日

STD-NET: Search of Image Steganalytic Deep-learning Architecture via Hierarchical Tensor Decomposition

Arxiv

0+阅读 · 2022年6月12日

Few-shot Learning for Multi-label Intent Detection

Arxiv

21+阅读 · 2020年10月11日

相关基金

TRAF3IP3调控T细胞活性与肿瘤免疫的分子机制

国家自然科学基金

0+阅读 · 2016年12月31日

CyPA/CD147信号通路在蛛网膜下腔出血后早期脑损伤中的作用机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

微通道内非牛顿流体-弹性颗粒液固两相流动特性研究

国家自然科学基金

0+阅读 · 2014年12月31日

PSMA通过TRAF6和TTC3调控前列腺癌细胞自噬在CRPC产生过程中的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

表面层片状梯度纳米结构金属塑性行为与微观机制

国家自然科学基金

0+阅读 · 2013年12月31日

硅溶胶负载芳基磷酸盐杂化成核剂的制备及成核机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

缺血性脑损伤介导的ErbB4胞内结构域分解的分子机制及作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

TRAIL协同IER3调节NF-κB信号通路介导肝癌细胞凋亡的相关机制研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于Decorin基因甲基化调控的非小细胞肺癌转移的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

电磁场驱动纳米管线旋转的实验及数值模拟研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员