通过积极竞争的采矿活动对强烈的视听事件歧视 (Robust Audio-Visual Instance Discrimination via Active Contrastive Set Mining) - 专知论文

会员服务 ·

0

MINE · contrastive · 判别器 · 稳健性 · 示例 ·

2022 年 4 月 26 日

Robust Audio-Visual Instance Discrimination via Active Contrastive Set Mining

翻译：通过积极竞争的采矿活动对强烈的视听事件歧视

Hanyu Xuan,Yihong Xu,Shuo Chen,Zhiliang Wu,Jian Yang,Yan Yan,Xavier Alameda-Pineda

from arxiv, 7 pages, 4 figures, accepted at IJCAI 2022

The recent success of audio-visual representation learning can be largely attributed to their pervasive property of audio-visual synchronization, which can be used as self-annotated supervision. As a state-of-the-art solution, Audio-Visual Instance Discrimination (AVID) extends instance discrimination to the audio-visual realm. Existing AVID methods construct the contrastive set by random sampling based on the assumption that the audio and visual clips from all other videos are not semantically related. We argue that this assumption is rough, since the resulting contrastive sets have a large number of faulty negatives. In this paper, we overcome this limitation by proposing a novel Active Contrastive Set Mining (ACSM) that aims to mine the contrastive sets with informative and diverse negatives for robust AVID. Moreover, we also integrate a semantically-aware hard-sample mining strategy into our ACSM. The proposed ACSM is implemented into two most recent state-of-the-art AVID methods and significantly improves their performance. Extensive experiments conducted on both action and sound recognition on multiple datasets show the remarkably improved performance of our method.

翻译：最近视听代表性学习的成功在很大程度上可归因于视听同步这一普遍特性,可用作自我说明的监督。作为一种最先进的解决方案,视听实例歧视(AVID)将实例歧视扩大到视听领域。现有的AVID方法通过随机抽样构建了对比性组合,其依据的假设是,所有其他视频的视听视频片段与语义无关。我们争辩说,这一假设是粗糙的,因为由此产生的对比组合有许多缺点。在本文中,我们通过提出一部新颖的主动反向采掘(ACSM)来克服这一限制,它旨在用丰富的信息和多样的负面反向采掘出反向型组。此外,我们还将一个具有语义觉觉觉的硬抽样采掘战略纳入我们的ACSM战略。拟议的ACSM被实施为两种最新的最新的AVID方法,并大大改进了它们的业绩。在多个数据集上进行的广泛行动和正确识别实验,展示了我们方法的显著改进。

0

相关内容

MINE

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

321+阅读 · 2020年11月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

专知会员服务

18+阅读 · 2019年11月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Fe基块体非晶合金中异质非晶结构及纳米晶形成演变机理

国家自然科学基金

0+阅读 · 2015年12月31日

重椭圆方程的弱有限元方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

新型Plectin-1荧光、MRI靶向分子探针对胰腺癌早期诊断的实验研究

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

基于同步辐射CT的非常规油气储集层微纳孔隙三维多尺度结构研究

国家自然科学基金

0+阅读 · 2013年12月31日

内生非晶复合材料在高速率动态冲击下的力学响应机理

国家自然科学基金

0+阅读 · 2012年12月31日

时空分辨波动光谱光学系统研究及样机研制

国家自然科学基金

0+阅读 · 2012年12月31日

齿轮传动多尺度参数与轮齿裂纹扩展演变关联规律研究

国家自然科学基金

0+阅读 · 2012年12月31日

Vitamin E脂质体纳米颗粒携带siRNA靶向抑制HCV的实验研究

国家自然科学基金

0+阅读 · 2011年12月31日

高分辨大视野的多针孔SPECT成像研究

国家自然科学基金

0+阅读 · 2008年12月31日

ProGCL: Rethinking Hard Negative Mining in Graph Contrastive Learning

Arxiv

0+阅读 · 2022年6月14日

Contrastive Learning for Unsupervised Domain Adaptation of Time Series

Contrastive Learning for Unsupervised Domain Adaptation of Time Series

Arxiv

0+阅读 · 2022年6月13日

2nd Place Solution for ICCV 2021 VIPriors Image Classification Challenge: An Attract-and-Repulse Learning Approach

Arxiv

0+阅读 · 2022年6月13日

Masked Autoencoders are Robust Data Augmentors

Arxiv

0+阅读 · 2022年6月10日

Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions

Arxiv

20+阅读 · 2021年8月30日

Spatially Consistent Representation Learning

Arxiv

14+阅读 · 2021年3月10日

Data Augmentation for Graph Neural Networks

Arxiv

38+阅读 · 2020年12月2日

A Simple Framework for Contrastive Learning of Visual Representations

Arxiv

21+阅读 · 2020年2月13日

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Arxiv

17+阅读 · 2019年9月9日

Linkage Based Face Clustering via Graph Convolution Network

Arxiv

16+阅读 · 2019年3月27日

VIP会员

文章信息

相关主题

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

321+阅读 · 2020年11月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

【ICCV 2019 Toturial】Global Optimization for Geometric Understanding with Provable Guarantees（具有可证明保证的几何理解的全局优化）

专知会员服务

18+阅读 · 2019年11月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

《人工智能辅助决策中的数据可视化：系统性综述》

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

ProGCL: Rethinking Hard Negative Mining in Graph Contrastive Learning

Arxiv

0+阅读 · 2022年6月14日

Contrastive Learning for Unsupervised Domain Adaptation of Time Series

Contrastive Learning for Unsupervised Domain Adaptation of Time Series

Arxiv

0+阅读 · 2022年6月13日

2nd Place Solution for ICCV 2021 VIPriors Image Classification Challenge: An Attract-and-Repulse Learning Approach

Arxiv

0+阅读 · 2022年6月13日

Masked Autoencoders are Robust Data Augmentors

Arxiv

0+阅读 · 2022年6月10日

Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions

Arxiv

20+阅读 · 2021年8月30日

Spatially Consistent Representation Learning

Arxiv

14+阅读 · 2021年3月10日

Data Augmentation for Graph Neural Networks

Arxiv

38+阅读 · 2020年12月2日

A Simple Framework for Contrastive Learning of Visual Representations

Arxiv

21+阅读 · 2020年2月13日

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Arxiv

17+阅读 · 2019年9月9日

Linkage Based Face Clustering via Graph Convolution Network

Arxiv

16+阅读 · 2019年3月27日

相关基金

Fe基块体非晶合金中异质非晶结构及纳米晶形成演变机理

国家自然科学基金

0+阅读 · 2015年12月31日

重椭圆方程的弱有限元方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

新型Plectin-1荧光、MRI靶向分子探针对胰腺癌早期诊断的实验研究

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

基于同步辐射CT的非常规油气储集层微纳孔隙三维多尺度结构研究

国家自然科学基金

0+阅读 · 2013年12月31日

内生非晶复合材料在高速率动态冲击下的力学响应机理

国家自然科学基金

0+阅读 · 2012年12月31日

时空分辨波动光谱光学系统研究及样机研制

国家自然科学基金

0+阅读 · 2012年12月31日

齿轮传动多尺度参数与轮齿裂纹扩展演变关联规律研究

国家自然科学基金

0+阅读 · 2012年12月31日

Vitamin E脂质体纳米颗粒携带siRNA靶向抑制HCV的实验研究

国家自然科学基金

0+阅读 · 2011年12月31日

高分辨大视野的多针孔SPECT成像研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员